Some Questions

jerry8 · May 15, 2025, 6:29am

Hello, please ask the following questions:
Q1: According to the Truthfulness Score scoring rules in the single-turn QA in the figure, shouldn’t the higher the score, the better the performance? Why is the current ranking higher for lower scores? How to understand this score?

Q2: What conditions must the Potential winners mentioned in the figure meet? Is it the top 10 for each task?

Q3: About # initiate both image and web search API
search_pipeline = UnifiedSearchPipeline(
image_model_name=“openai/clip-vit-large-patch14-336”,
image_hf_dataset_id=“crag-mm-2025/image-search-index-validation”,
text_model_name=“sentence-transformers/all-MiniLM-L6-v2”,
web_hf_dataset_id=“crag-mm-2025/web-search-index-validation”,
)
Are there any restrictions on the use of models in this part, or can only the official models be used?

Thank you for your time！