Looking at the provided baseline, I found it hard to follow. Therefore I decided to publish my simple baseline.
Not all data were used nor any fine-tune has been done.
It just took more than 1 hour for training and 15 minutes for inference.
Hope it helps newcomers to understand the data and the problem.