Hello everyone!
We updated the dataset to address the duplicate query_id issue and released v0.2 of the dataset.
Changes in the new version
- Additional `product_id` column in the Task 1 test set.
  - More information about the `product_id` column is available at: 😱 Why is there a `product_id` column in the Task 1 test set ? 😱
- All the CSV files except the sample submission are zipped. But you should still be able to load them in pandas. For example,
    import pandas as pd
    df = pd.read_csv("path/to/test_public-v0.2.csv.zip")
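If you want to sanity-check the zipped loading without the real files, here is a minimal sketch. It builds a tiny stand-in archive first; the file name `toy_test.csv.zip`, the helper `load_zipped_csv`, and the rows are all made up for illustration and are not part of the dataset. It relies on pandas inferring zip compression from the `.zip` suffix, which works when the archive contains exactly one file.

```python
import os
import tempfile
import zipfile

import pandas as pd


def load_zipped_csv(path):
    """Read a CSV stored inside a single-file zip archive (hypothetical helper)."""
    # compression="infer" is the pandas default; it picks zip from the ".zip" suffix.
    return pd.read_csv(path, compression="infer")


# Build a toy archive so the sketch runs on its own.
# The rows below are invented; only the column names echo the post.
tmp_dir = tempfile.mkdtemp()
zip_path = os.path.join(tmp_dir, "toy_test.csv.zip")
with zipfile.ZipFile(zip_path, "w") as zf:
    zf.writestr("toy_test.csv", "query_id,product_id\n0,A1\n1,B2\n1,C3\n")

df = load_zipped_csv(zip_path)

# Quick checks you might run on the real file as well.
has_product_id = "product_id" in df.columns
n_queries = df["query_id"].nunique()
print(df.shape, has_product_id, n_queries)
```

To use it on the real data, point `load_zipped_csv` at your downloaded `test_public-v0.2.csv.zip` instead of the toy archive.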
More details about the dataset are available in the updated post here: 🚀 Datasets Released & Submissions Open 🚀
Best of Luck!