I’m very interested in how domain-general the architectures in Round 1 were (or if they involved techniques like reward shaping, and if so how much). This will be helpful in order to interpret the significance of their achievment.
Will the architectures be released at some point? And if not, would be it possible to find out, in broad strokes, the extent to which the winner used hardcoded knowledge?