tech · LOW · RU · 2026-05-05 16:54 UTC

How Two 13-Year-Olds Distilled DeepSeek-V4 Reasoning into Qwen3.5-2B

Hello everyone! We are two 13-year-old students from Russia, and we want to show QwenSeek-2B! For us, this is a big result! We released the model on Hugging Face, and it has 1000+ downloads! We are just incredibly …

Training Details: https://huggingface.co/datasets/Jackrong/DeepSeek-V4-Distill-8000x

We w…

