|
Submission error
|
|
1
|
442
|
April 18, 2024
|
|
Some issues about time limit
|
|
3
|
559
|
April 18, 2024
|
|
Submission failure with empty logs
|
|
1
|
569
|
April 18, 2024
|
|
Docker build failed
|
|
2
|
719
|
April 15, 2024
|
|
Are these evaluation qa values present in qa.json correct?
|
|
1
|
776
|
April 17, 2024
|
|
Tentative Challenge Winners
|
|
1
|
795
|
April 17, 2024
|
|
Can we assume the same websites in both the test and training datasets?
|
|
1
|
470
|
April 17, 2024
|
|
Confusion about using other LLMs
|
|
1
|
606
|
April 16, 2024
|
|
ModuleNotFoundError: No module named 'aicrowd_gym'
|
|
1
|
815
|
April 16, 2024
|
|
Are the four models are competing and ranked in the same track?
|
|
1
|
293
|
April 16, 2024
|
|
Do I need to complete all three tasks?
|
|
2
|
405
|
April 16, 2024
|
|
Pretrained LLM
|
|
1
|
656
|
April 15, 2024
|
|
Can we use other LLM at training stage?
|
|
4
|
1000
|
April 15, 2024
|
|
We Cannot View Submissions
|
|
2
|
593
|
April 15, 2024
|
|
About 'System Logs Comprehensive Rag Task: Inference failed'
|
|
1
|
383
|
April 15, 2024
|
|
Issues about submission LFS file issues
|
|
0
|
708
|
April 13, 2024
|
|
What Does 2 Submission Per Week Mean?
|
|
1
|
524
|
April 13, 2024
|
|
Why Doesn't Leaderboard Show Ranking Score?
|
|
2
|
626
|
April 12, 2024
|
|
Function calling arguments in local_evaluation.py is mismatching with dummy_model's interface
|
|
3
|
647
|
April 12, 2024
|
|
AIcrowd Submission automatically runs again
|
|
2
|
451
|
April 12, 2024
|
|
Unable to create notebook
|
|
0
|
438
|
April 11, 2024
|
|
Error building the docker image
|
|
1
|
917
|
April 11, 2024
|
|
About the 'search results' type
|
|
1
|
504
|
April 11, 2024
|
|
Is there a baseline score for reference?
|
|
2
|
470
|
April 11, 2024
|
|
Total submission times?
|
|
2
|
758
|
April 11, 2024
|
|
Will the Mock API data evolve?
|
|
1
|
705
|
April 10, 2024
|
|
Jailbreaking the judge
|
|
1
|
696
|
April 10, 2024
|
|
Git package upload fails
|
|
2
|
389
|
April 10, 2024
|
|
Does proceeing to Phase 2 in each Track independently?
|
|
1
|
324
|
April 10, 2024
|
|
Can we use the hf version of llama2 models?
|
|
6
|
899
|
April 10, 2024
|