Tencent Hunyuan’s New Generation Turbo S Fast Thinking Model Released
On February 27th, Tencent‘s Hunyuan new generation fast thinking model Turbo S was officially released.
Tencent officials stated that unlike the slow thinking models such as Deepseek R1 and Hunyuan T1 that require “thinking before answering,” Hunyuan Turbo S can achieve “instant response,” output answers more quickly, double the speaking speed, and reduce initial latency by 44%.
In several widely used benchmarks in the industry, Hunyuan Turbo S has shown performance comparable to a series of business models such as DeepSeek V3, GPT 4o, Claude in various fields including knowledge, mathematics, reasoning.
In terms of architecture, the Hunyuan Turbo S adopts the Hybrid-Mamba-Transformer fusion mode, reducing the computational complexity of traditional Transformer structures and decreasing KV-Cache cache usage, achieving a decrease in training and inference costs.
Tencent also stated that the new fusion mode breaks through the difficulty faced by large models with pure Transformer structures in high training and inference costs for long texts. On one hand, it leverages Mamba’s efficient processing of long sequences; on the other hand, it retains the advantage of Transformers in capturing complex contexts, ultimately building a hybrid architecture with dual advantages in memory and computational efficiency.
This is also the first successful application of Mamba architecture without loss on super-large MoE models in industry. Through technical innovation in model architecture, Hunyuan Turbo S significantly reduces deployment costs.
As a flagship model, Hunyuan Turbo S will become the core foundation for Tencent‘s derived models from the Hunyuan series in providing basic capabilities for inference, long texts, code generation, and other derived models.
Based on Turbo S and introducing technologies such as long thought chains, retrieval enhancement, and reinforcement learning,Hunyuan has also launched an inference model T1 with deep thinking capabilities.
From the Tencent announcement, it is learned that currently, developers and enterprise users can now call Tencent Hunyuan Turbo S through API on Tencent Cloud for a free trial within one week.
In terms of pricing, the input price for Turbo S is 0.8 yuan per million tokens, and the output price is 2 yuan per million tokens.
Tencent Yuanbao will gradually launch Hunyuan Turbo S in grayscale. Users can experience using it by selecting the “Hunyuan” model in Yuanbao and turning off deep thinking.
SEE ALSO: Tencent Yuanbao Surpasses Doubao to Become the Second on the Apple App Store Free List