I wrote scores from the leaderboard. I can’t check 10K there…
Local scores are a little bit higher than LB, but correlated with LB.
Yeah maybe I’ll check it locally.
I’ve checked it locally.
Using all 10K images is better than my 3K choosing by 0.006. Maybe I can take some of it by changing purchasing algorithm. But still I feel I need to tune my model.
yeah, looks like your model/training pipeline is limiting you. You should be able to get a much bigger improvement from using all labels. Maybe try a bigger model and tune it a bit.
Aah, yes. With the post. I think maybe there are only some ~4000-5000 images good enough for improving the system and therefore adding all 10000 doesn’t make much improvement because the basic model will predict them correctly anyway.
It’s just a theory, not tested. I haven’t touched the purchase part yet.
Thanks for your insight.