Multi-dimensional Data Structures

  • Project: Multi-dimensional Data Structures
  • Client: University of Patras, Greece
  • Github: Multi-dimensional-Data-Structures
  • Description: The train and test dataset is split based on time, and the public/private leaderboard in the test data are split randomly. There is no concept of a person in this dataset. All the row_id's are events, not people. Some of the columns, such as time and accuracy, are intentionally left vague in their definitions. Please consider them as part of the challenge.

    File descriptions:
    1. train.csv, test.csv (https://www.kaggle.com/c/facebook-v-predicting-check-ins/data)
    • row_id: id of the check-in event
    • x y: coordinates
    • accuracy: location accuracy
    • time: timestamp
    • place_id: id of the business, this is the target you are predicting

    2. train_x_y.csv
    x y: coordinates (https://drive.google.com/file/d/151xoFDNtabYaFT6Dxy5D1nXp_-8L1ttq/view?usp=sharing)
  • Technologies: K-D trees, Quad Trees και 2D Range Trees: 2D kNN Queries Implementation