Here are some issues that I faced while getting started with Round 2.
- The dataset downloads with the prefix public_*, however local_evaluation.py uses directories without the public_* prefix.
- There is a spelling mismatch for the unlabelled dataset: currently it is public_unlabeled.zip, whereas it should be public_unlabelled.zip. You will see this once you download the dataset, not while listing it.
I may be wrong; if so, please correct me @shivam @mohanty.
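To see quickly which of these issues apply to your local copy, here is a minimal Python sketch. The directory names are assumptions taken from the renames used later in this post, and check_dataset_layout is a hypothetical helper, not part of the starter kit, so please verify the names against your copy of local_evaluation.py.

```python
from pathlib import Path

# Assumption: names expected by local_evaluation.py (keys) and the names
# produced by the downloaded archives (values), as described in this post.
EXPECTED_TO_DOWNLOADED = {
    "debug": "public_debug",
    "training": "public_training",
    "unlabelled": "public_unlabeled",
    "validation": "public_validation",
}


def check_dataset_layout(data_dir):
    """Return {expected_name: status}; status is 'ok', 'needs_rename' or 'missing'."""
    data_dir = Path(data_dir)
    report = {}
    for expected, downloaded in EXPECTED_TO_DOWNLOADED.items():
        if (data_dir / expected).is_dir():
            report[expected] = "ok"            # already in the expected place
        elif (data_dir / downloaded).is_dir():
            report[expected] = "needs_rename"  # still has the download name
        else:
            report[expected] = "missing"       # not downloaded or extracted yet
    return report
```

Calling check_dataset_layout("data/v0.2-rc4") returns one status per dataset directory, so you can tell whether a rename is still needed before running the evaluation.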
I have put together a Magic Box (based on the Magic Box from Round 1) that makes things easy and gets the repository ready to use. Here are the actions that I am trying to achieve.
- Cloning the repository for Round 2
- Downloading the datasets for Round 2 and putting them in the relevant directories (abiding by the latest local_evaluation.py file)
- Renaming the dataset directories as per the latest local_evaluation.py
Magic Box for Colab
import os

# Skip the setup when the cell is re-run and the data is already in place.
# The path below matches where this cell puts the training data.
try:
    if first_run and os.path.exists("/content/data-purchasing-challenge-2022-starter-kit/data/v0.2-rc4/training"):
        first_run = False
except NameError:
    # first_run is not defined yet, so this is the first execution of the cell.
    first_run = True

if first_run:
    %cd /content/
    !git clone http://gitlab.aicrowd.com/zew/data-purchasing-challenge-2022-starter-kit.git > /dev/null
    %cd data-purchasing-challenge-2022-starter-kit
    !aicrowd dataset list -c data-purchasing-challenge-2022 | grep -e 'v0.2'
    !aicrowd dataset download -c data-purchasing-challenge-2022 *-v0.2-rc4.zip
    !mkdir -p data/v0.2-rc4
    # The quoted wildcard lets unzip process every archive in a single call.
    !mv *.zip data/v0.2-rc4 && cd data/v0.2-rc4 && echo "Extracting dataset" && unzip -q -o '*.zip'
    # Rename the directories to the names used by local_evaluation.py.
    !mv data/v0.2-rc4/public_debug data/v0.2-rc4/debug
    !mv data/v0.2-rc4/public_training data/v0.2-rc4/training
    !mv data/v0.2-rc4/public_unlabeled data/v0.2-rc4/unlabelled
    !mv data/v0.2-rc4/public_validation data/v0.2-rc4/validation
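The mv commands print errors if they are run again after the directories have already been renamed. Since Colab runs Python anyway, the rename step could also be written so that it is safe to repeat. This is only a sketch: rename_dataset_dirs is a hypothetical helper, and the mapping simply mirrors the mv commands above.

```python
from pathlib import Path

# Assumption: same old -> new names as the mv commands in this post.
RENAMES = {
    "public_debug": "debug",
    "public_training": "training",
    "public_unlabeled": "unlabelled",
    "public_validation": "validation",
}


def rename_dataset_dirs(data_dir, renames=RENAMES):
    """Rename dataset directories in data_dir; return the new names actually created."""
    data_dir = Path(data_dir)
    renamed = []
    for old, new in renames.items():
        src, dst = data_dir / old, data_dir / new
        # Only rename when the source exists and the target does not,
        # so running this twice does nothing the second time.
        if src.is_dir() and not dst.exists():
            src.rename(dst)
            renamed.append(new)
    return renamed
```

For example, rename_dataset_dirs("data/v0.2-rc4") returns the list of directories it renamed, and an empty list when there was nothing left to do.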
Magic Box for Local System
#!/bin/bash
# Clone the Round 2 starter kit and enter it.
git clone http://gitlab.aicrowd.com/zew/data-purchasing-challenge-2022-starter-kit.git
cd data-purchasing-challenge-2022-starter-kit || exit 1
# List and download the v0.2-rc4 datasets.
aicrowd dataset list -c data-purchasing-challenge-2022 | grep -e 'v0.2'
aicrowd dataset download -c data-purchasing-challenge-2022 *-v0.2-rc4.zip
mkdir -p data/v0.2-rc4
mv *.zip data/v0.2-rc4
cd data/v0.2-rc4 || exit 1
echo "Extracting dataset"
# The quoted wildcard lets unzip process every archive in a single call.
unzip -q -o '*.zip'
# Rename the directories to the names used by local_evaluation.py.
mv public_debug debug
mv public_training training
mv public_unlabeled unlabelled
mv public_validation validation
Put the above code in magic_box.sh and execute it:
$ bash magic_box.sh
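If the unzip command is not available on your system, the extraction step could be done in Python instead. Here is a minimal sketch using the standard-library zipfile module; extract_all_zips is a hypothetical helper, and it assumes the archives have already been moved into a single directory such as data/v0.2-rc4.

```python
import zipfile
from pathlib import Path


def extract_all_zips(zip_dir):
    """Extract every .zip file found in zip_dir into zip_dir itself.

    Returns the names of the archives that were processed.
    """
    zip_dir = Path(zip_dir)
    extracted = []
    for archive in sorted(zip_dir.glob("*.zip")):
        with zipfile.ZipFile(archive) as zf:
            # Existing files with the same name are overwritten.
            zf.extractall(zip_dir)
        extracted.append(archive.name)
    return extracted
```

Calling extract_all_zips("data/v0.2-rc4") unpacks each archive next to the zip files, which is the same layout the shell script produces before the rename step.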
Please let me know about any improvements or questions in the comments below. I would be glad to help you.