What dataset should I use to evaluate my task 2 solution?

What dataset should I use to evaluate my task 2 solution?

The Question Answering data used in Task #2 is the same as the one in Task #1. To download the data, please check this link.

Best Regards,
The CRAG Team