Hi @shravankoninti,
Yes, you can access all the files at the same time during evaluation.
The starter kit have all the information about the environment variable, but let me clarify on the environment variables available during evaluations here as well.
-
AICROWD_TEST_DATA_PATH
: Refers to testing_phase2_release.csv file which is used by evaluator to judge your models in testing phase (soon to be made public)
-
AICROWD_TRAIN_DATA_PATH
: Refers to /shared_data/data/training_data/
in which all of training related files are present.
-
AICROWD_PREDICTIONS_OUTPUT_PATH
: Refers to the path at which your code is expected to output final predictions
Now in your codebase, you can simply do something as follows to load both the files:
AICROWD_TRAIN_DATA_PATH = os.getenv("AICROWD_TRAIN_DATA_PATH", "/shared_data/data/training_data/")
AICROWD_TEST_DATA_PATH = os.getenv("AICROWD_TEST_DATA_PATH", "/shared_data/data/testing_data/to_be_added_in_workspace.csv")
AICROWD_PREDICTIONS_OUTPUT_PATH = os.getenv("AICROWD_PREDICTIONS_OUTPUT_PATH", "random_prediction.csv")
train_df = pd.read_csv(AICROWD_TRAIN_DATA_PATH + 'training_data_2015_split_on_outcome.csv')
# Do pre-processing, etc
[...]
test_df = pd.read_csv(AICROWD_TEST_DATA_PATH, index_col=0)
# Make predictions
[...]
# Submit your answer
prediction_df.to_csv(AICROWD_PREDICTIONS_OUTPUT_PATH, index=False)
I hope the example clarifies your doubt.