Meta Comprehensive RAG Benchmark: KDD Cup 2-9d1937

Topic	Category	Tags	Replies	Views	Activity
Regarding the limitations on the time taken to generate an answer and the number of tokens	Meta Comprehensive RAG Benchmark: KDD Cup 2-9d1937	bug, question	1	946	April 19, 2024
Submission error	Meta Comprehensive RAG Benchmark: KDD Cup 2-9d1937		1	485	April 18, 2024
Some issues about time limit	Meta Comprehensive RAG Benchmark: KDD Cup 2-9d1937		3	691	April 18, 2024
Submission failure with empty logs	Meta KDD Cup 24 - CRAG - Retrieval Summarization		1	646	April 18, 2024
Are these evaluation qa values present in qa.json correct?	Meta KDD Cup 24 - CRAG - Retrieval Summarization	clarification, bug	1	923	April 17, 2024
Can we assume the same websites in both the test and training datasets?	Meta Comprehensive RAG Benchmark: KDD Cup 2-9d1937	question	1	558	April 17, 2024
Confusion about using other LLMs	Meta Comprehensive RAG Benchmark: KDD Cup 2-9d1937	question	1	706	April 16, 2024
Are the four models are competing and ranked in the same track?	Meta Comprehensive RAG Benchmark: KDD Cup 2-9d1937		1	336	April 16, 2024
Do I need to complete all three tasks?	Meta Comprehensive RAG Benchmark: KDD Cup 2-9d1937		2	471	April 16, 2024
Pretrained LLM	Meta Comprehensive RAG Benchmark: KDD Cup 2-9d1937		1	757	April 15, 2024
Can we use other LLM at training stage?	Meta Comprehensive RAG Benchmark: KDD Cup 2-9d1937		4	1143	April 15, 2024
About 'System Logs Comprehensive Rag Task: Inference failed'	Meta Comprehensive RAG Benchmark: KDD Cup 2-9d1937	question	1	458	April 15, 2024
Issues about submission LFS file issues	Meta Comprehensive RAG Benchmark: KDD Cup 2-9d1937		0	857	April 13, 2024
Function calling arguments in local_evaluation.py is mismatching with dummy_model's interface	Meta Comprehensive RAG Benchmark: KDD Cup 2-9d1937		3	758	April 12, 2024
AIcrowd Submission automatically runs again	Meta Comprehensive RAG Benchmark: KDD Cup 2-9d1937		2	519	April 12, 2024
Error building the docker image	Meta Comprehensive RAG Benchmark: KDD Cup 2-9d1937		1	1044	April 11, 2024
About the 'search results' type	Meta Comprehensive RAG Benchmark: KDD Cup 2-9d1937	question	1	566	April 11, 2024
Is there a baseline score for reference?	Meta Comprehensive RAG Benchmark: KDD Cup 2-9d1937		2	538	April 11, 2024
Total submission times?	Meta Comprehensive RAG Benchmark: KDD Cup 2-9d1937		2	937	April 11, 2024
Will the Mock API data evolve?	Meta Comprehensive RAG Benchmark: KDD Cup 2-9d1937		1	865	April 10, 2024
Jailbreaking the judge	Meta Comprehensive RAG Benchmark: KDD Cup 2-9d1937	bug	1	811	April 10, 2024
Can we use the hf version of llama2 models?	Meta Comprehensive RAG Benchmark: KDD Cup 2-9d1937	question	6	1007	April 10, 2024
Hi, where is the baseline?	Meta Comprehensive RAG Benchmark: KDD Cup 2-9d1937	question, baseline	4	2170	April 9, 2024
No one submits success (leaderboard is empty)? Or submissions are hidden?	Meta Comprehensive RAG Benchmark: KDD Cup 2-9d1937		1	666	April 9, 2024
Data schema have differences between example_data and real data(task1&2)	Meta Comprehensive RAG Benchmark: KDD Cup 2-9d1937	question	3	1032	April 9, 2024
About development set	Meta Comprehensive RAG Benchmark: KDD Cup 2-9d1937		1	656	April 2, 2024
About submission times	Meta Comprehensive RAG Benchmark: KDD Cup 2-9d1937		0	863	March 27, 2024