If It Fits, It Sits: Qwen 3.6 35B
ByteShape's ShapeLearn-quantized release of Qwen 3.6 35B in NTP and MTP variants. Clear improvements across all tested hardware: pick the largest ByteShape model that fits, and on GPUs MTP adds another 20–40% token-generation throughput on top.
Read More →