@strekalov , I’ve added your error on gitlab comment.
@dipam Could you provide more info about submissions #212367 and #212368, please?
Both are Product Matching: Inference failed
@dipam, could you please check
#212567
#212566
It is failed in step “Build Packages And Env”, however I have changed only NN params.
As to me it is very strange…
@dipam Could you check submission: 213161?
The diagram shows that everything worked fine, but the status is failed:
@dipam , have you changed some settings of the server for inference?
Previously I have faced near 1 failed submitting per day, just rerunning helps.
however today I have changed only NN weights files and thats all, 1 get 1 submit ok, and 4 other weights with same size, same model all same just other epochs - failed.
As to me it is very strange…
Could you pls check it?
#214502
#214483
#214469
If it is timeout, how it could be if other weights are ok, or just resubmitting sometimes helps?
All these timed out, they’re just barely above the time limit. The variation in runtime can be due to the slight variation in CPU type that the AWS nodes we provision can have. Hence resubmitting can sometimes help, but for consistency I suggest trying to bring down the compute time.
I understand that every second might matter, however the organizer has deemed 10 minutes to be a generous time limit for the kind of solutions they are looking for, hence the constraint.
Hi, you can check the explanation on this message above. Hope it helps.
Hi @dipam can you please tell me why my submission # # 216553 failed? I now that its late for submission, but just interested if it’s due to a timeout or something else.
Hello, @dipam
I’m also curious what happened to sumbissions #216545 and #216531
Could you tell please?