@nha_nguyen_van : Yes, you can use the hf version of the llama family models.
The recently released baselines also use the same.
Can we use any embedding model as a sentence transformer in the baseline, as long as it's open source?
Yes, thanks. But I have a concern about scores on the leaderboard. Some teams have a missing value of approximately 0.8 even though n_miss=0… That makes the CRAG score low. Can you check again?
@nha_nguyen_van : Thanks for pointing it out. The missing and hallucination columns were swapped by mistake. It's fixed now.
I ran the local evaluation and set a lot of answers to "I don't know". According to the evaluation code, you check whether prediction == "i don't know" or prediction == "i don't known", and if so n_miss += 1. But I still receive n_miss = 0 when submitting, while locally I still have n_miss > 0. Also, in the submission log, some answers were not evaluated successfully; I only have 247/260 (95%) success in Evaluation Progress. Can you check this issue?
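For reference, the local miss-counting check described above can be sketched roughly as follows (a minimal sketch based on the strings quoted in this thread; the function name and normalization are assumptions, not the official evaluation code):

```python
def count_misses(predictions):
    """Count "I don't know"-style answers, mirroring the local
    evaluation check described above (a sketch, not the official code)."""
    # The two phrases quoted in the thread's description of the check.
    miss_phrases = {"i don't know", "i don't known"}
    n_miss = 0
    for prediction in predictions:
        # Normalize case/whitespace before comparing, since model
        # outputs may vary (an assumption about the comparison).
        if prediction.strip().lower() in miss_phrases:
            n_miss += 1
    return n_miss

preds = ["Paris", "I don't know", "i don't know", "42"]
print(count_misses(preds))  # → 2
```

If the server-side evaluator compares the raw string without lowercasing, answers like "I don't know" (capitalized) would not be counted, which could explain a local/remote n_miss mismatch.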