I just had a submission fail after all the rollouts seemingly completed successfully. Another submission with slightly different hyperparameters finished successfully and got scored. There doesn’t seem to be anything in the logs to indicate a failure. Does anyone know what could be causing this?

Edit: Submissions [#78231] and [#78257]

Hello @lev_mckinney

Apologies for the confusion. One of the submissions was mistakenly marked as failed due to a network issue on the cluster. The status of the failed submission has been updated.