Based on my understanding of the competition rules:
The generative model in RAG needs to be based on the LLaMA model.
During the training phase, we are able to utilize public datasets for training, which could be produced by other LLMs. However, if we opt to use additional generated data, we must generate this data using the LLaMA model, and disclose it upon completion.
I wonder if I understand the rules correctly.