Hi,
Is this model allowed? Qwen3-8B has 8.2B total parameters, of which 6.95B are non-embedding parameters.
Thanks.