Exploit like score - 0.998!?

picekl · June 4, 2020, 2:04pm

Dear Organisers,

Seem like @OG_SouL is using the metric exploit I have mentioned previously on the forum.

How can we (the honest participants) be sure that such submissions wont be considered in the competition? For your information, our submissions are constructed to top F1 score as the harmonic mean of precision and recall.

Thank you in advance for your answers.
Lukas

dimitri.fichou · June 4, 2020, 2:32pm

Lukas,
Of course they will be removed.
Best,
Dimitri

picekl · June 4, 2020, 2:39pm

Thank you @dimitri.fichou for clarification. I would also like to hear from the @OG_SouL about their solution. I could be wrong and they might have such solution that is super accurate.

Best,
Lukas

OG_SouL · June 4, 2020, 3:05pm

Hi @picekl, if you look at our first submission, we had got a good overall_precision score but a lower overall_recall score according to the evaluation metric. Therefore, we improved our model to get a better overall_recall score, which we achieved with our subsequent submissions.

But, by then, when we submitted, the ‘overall_recall’ column was removed. After getting confirmation from @dimitri.fichou via mail that overall_precision score is going to be the sole evaluation metric for the competition, we re-trained our model to improve on the precision scores. This is reflected in our last two submissions.

We believe that even if the evaluation metric is modified to consider either the f1 score or mAP (over IoU > 0.5), two of our submissions would excel in that, as they were trained particularly to increase the same.

@dimitri.fichou, it would be great if you could clarify what exactly would be the final evaluation metric. We’ll make another submission and tag that as the ‘Primary Run’.

dimitri.fichou · June 4, 2020, 3:07pm

I was on it, see here: About the evaluation metric (bis)

OG_SouL · June 4, 2020, 3:16pm

Thanks @dimitri.fichou for the clarification. Since we have not tagged any of our submission as a primary run, can you please confirm which one would be considered for deciding the leaderboard (according to new evaluation script) ?

Also, it would be great if we could also see the scores of other participants’ submissions, because we’re only able to see our own scores. @picekl, can you help us out in this?

Thanks.

dimitri.fichou · June 4, 2020, 3:30pm

I don’t really know what to say anymore,
If the submission looks fishy at the end, it will be remove, does that sound good ?
Best,
Dimitri

naveen_narayanan · June 4, 2020, 3:31pm

Can you please make the leaderboard public as @OG_SouL previously suggested.
@dimitri.fichou

dimitri.fichou · June 4, 2020, 3:32pm

For the leaderboard public, I looked and did not found how to do it,
EDIT: did not look well enough, found it

picekl · June 4, 2020, 3:33pm

I’m not sure if this is possible. This is how the CLEF is evaluated -> Secretly.

Sadly, since best score is visible, it’s not the same.

naveen_narayanan · June 4, 2020, 3:55pm

I would like to know how many submissions a team can make ? If there are 3 members, can we make 30 submissions (10 submissions from one account ). Also during registration,it was mentioned that team members should have their username as their team name. But then we were not able to have the same username. Can someone help me out on this case ?
@dimitri.fichou @shivam

picekl · June 4, 2020, 4:03pm

I’m afraid that you are supposed to submit only 10 submissions per team. Using multiple accounts to increase the number of submissions could be considered as rules violation. At least other platforms (e.g. Kaggle) works this way.

Lukas

naveen_narayanan · June 4, 2020, 4:35pm

@picekl Ohh I was not aware of this. But I would like to point out a few things. I don’t think the information that you have mentioned was mentioned anywhere in the challenge webpage. Secondly, we did not have an option to create a team and during registration, we were told to have the team name as the username. Hence logically, if the members of a team are represented with one particular name, then submissions can be made from either or all of the member’s accounts.

picekl · June 4, 2020, 4:55pm

This is up to organisers to decide. From my perspective, it’s really hard to track the number of people under single team and it’s expected that one team will have 10 submissions. If not you will be motivated to accumulate the huge number of “contributors” to increase your number of submissions. In our case we are 3 and we are going to submit only 10 submissions. There is only one who have signed the Eula.

dimitri.fichou · June 4, 2020, 5:24pm

This is definitely the exploit thread…
Please submit under the same username so it’s limited to 10 submissions and fair to the other participants.
Dimitri