The challenge deadline has been extended to February 8 (to accommodate recent server downtime). Please make your final submissions by then, and start preparing your final evaluation package now. Each team may submit one final submission per track, and the final agent must be runnable with the Orak framework and compatible with 4 games.
Please ensure all eligible entries are submitted with a complete final submission package. All previously shared requirements still apply, including:
As clarified before, 8B models are allowed even if the actual parameter count slightly exceeds 8B (for example, Llama-3.1-8B). If you use multiple models that share identical base weights, the shared base weights should be counted once, and any adapter or LoRA parameters should be counted separately.
Quick question — for agents using an open‑source 8B model via API (no weights hosted), what do we need to submit?
Is a gated HF repo still required, or just docs + reproducibility details?
Qwen3VL-8B-Instruct-Q4_K_M.gguf
still dont understandable what need to be done,
this model is not fine-tuned and we only use it for local inference for prompt.
How you expect it be stored on gated Hugging Face repositories?
i think you doesn’t want us copy to huggingface,
because this model is already open and everybody can download it and deploy to endpoint.
Clarification on final evaluation / “model artifacts”
For the final evaluation, KRAFTON will run your submission on our system (including hidden tests). To be eligible for final rankings, your submission must be fully reproducible end-to-end.
If you fine-tuned or modified weights (LoRA/adapters/merged checkpoints/custom weights), you must provide those artifacts.
If you did not fine-tune, you don’t need to upload base model weights—but you must provide exact model ID + revision, exact format/quantization, and all inference/runtime details (prompts/templates, decoding params, configs, scripts, any extra files/data) needed to reproduce the result.
If we can’t reproduce your results due to missing details/artifacts, the submission won’t count toward final rankings.