How Two 13-Year-Olds Distilled DeepSeek-V4 Reasoning into Qwen3.5-2B
Hello everyone! We are two 13-year-old students from Russia, and we want to show QwenSeek-2B! For us, this is a big result! We released the model on Hugging Face, and the 1000+ downloads! We are just incredibly Training Details: https://huggingface.co/datasets/Jackrong/DeepSeek-V4-Distill-8000x We w
ORIGINAL SOURCE →via Dev.to
ADVERTISEMENT
⚡ STAY AHEAD
Events like this, convergence-verified across 689 sources, land in your inbox every Sunday. Free.
GET THE SUNDAY BRIEFING →RELATED · RU
- [CONFLICT] Putin saklanıyor mu?
- [CONFLICT] US Senators Push to Reinstate Russian Oil Sanctions
- [CONFLICT] At least 18 killed in Russian strikes across Ukraine as Zelenskyy denounces Moscow's 'cynicism'
- [PROTEST] Dark clouds, protests and resignations dampen start of 61st Venice Biennale
- [CONFLICT] Russia’s State Duma considers bill barring students from master’s programs outside their undergraduate field
- [CONFLICT] Lavrov, Rubio 'touch base' on current state of international affairs, US-Russia relations