F1 score very small

Hi Organizers

Are you sure the scoring metric for this challenge is right? :face_with_raised_eyebrow:
The F1-score on the test set seems very small compared to the cross-validation performance on the training set.

Best regards
Bjørn-J