Can we access and read the shared folder training files in the predict.py file where the orignal test data is layed out and this is accesed from production evironment path.
I want to read both the dataset i.e., train and test like this.
Yes, you can access all the files at the same time during evaluation.
The starter kit have all the information about the environment variable, but let me clarify on the environment variables available during evaluations here as well.
AICROWD_TEST_DATA_PATH: Refers to testing_phase2_release.csv file which is used by evaluator to judge your models in testing phase (soon to be made public)
AICROWD_TRAIN_DATA_PATH: Refers to /shared_data/data/training_data/ in which all of training related files are present.
AICROWD_PREDICTIONS_OUTPUT_PATH: Refers to the path at which your code is expected to output final predictions
Now in your codebase, you can simply do something as follows to load both the files:
Your codebase need to read this environment variable i.e. absolute and just write final predictions at that location. The example is in starter kit already as well as in this comment above.
It just says 'AICROWD_PREDICTIONS_OUTPUT_PATH = os.getenv(“AICROWD_PREDICTIONS_OUTPUT_PATH”, “random_prediction.csv”)
'.But what is the default path for that file? For example,does it need to be in shared data or personal folder that I created or any directory and is it expected to have any predefined name for the output file?.Do we need to mention complete path - /x/y/z/predictions.csv.
“While during evaluation this environment variable will be set always and default value wouldn’t be used.” – What does this line mean?Does it write to some other server for evaluation?