
Shengshu Technology Secures Several Hundred Million Yuan in Series A Funding as Vidu Leads the Multimodal AI Race
Shengshu Technology raises a Series A round of undisclosed size (roughly $30–$125M) to scale its Vidu multimodal AI products, as ARR exceeds $20M and generated videos surpass 400M worldwide.
China’s multimodal AI startup Shengshu Technology said it has closed a Series A round worth “several hundred million yuan,” without disclosing an exact figure. At current exchange rates, that phrase implies roughly $30–$125 million. The round was led by Bohua Capital, with follow-on participation from Baidu Strategic Investment, the Beijing AI Industry Investment Fund, Qiming Venture Partners, DT Capital, Zoyuan Asia, and BV Baidu Ventures; C&D Emerging Investment also joined.
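The dollar range is an estimate, not a disclosed figure, and it depends on two assumptions that do not come from the company: the exchange rate and how “several hundred million” is read. As a rough sketch, assuming a rate of about 7.1 yuan per dollar and reading the phrase as 200 million to 900 million yuan, the arithmetic looks like this:

```python
# Both inputs are assumptions for illustration; neither was disclosed by Shengshu.
CNY_PER_USD = 7.1                    # assumed exchange rate (yuan per US dollar)
LOW_CNY, HIGH_CNY = 200e6, 900e6     # one reading of "several hundred million yuan"


def cny_to_usd_millions(amount_cny: float, rate: float = CNY_PER_USD) -> float:
    """Convert a yuan amount to millions of US dollars at the given rate."""
    return amount_cny / rate / 1e6


low_usd = cny_to_usd_millions(LOW_CNY)
high_usd = cny_to_usd_millions(HIGH_CNY)

# Print the implied range, rounded to the nearest million dollars.
print(f"~${low_usd:.0f}M to ~${high_usd:.0f}M")  # ~$28M to ~$127M
```

Under these assumptions the round would fall between about $28 million and $127 million. A different exchange rate, or a wider reading of “several hundred million,” shifts both ends of the range.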
CEO Dr. Luo Yihang said commercialization of multimodal generation is accelerating and will “reshape” digital-content production over the next three years. Proceeds will fund model R&D and product expansion, bolster user services and industry partnerships, and support global go-to-market efforts.
Shengshu said its Vidu products now serve users in more than 200 countries and regions, with more than 400 million videos generated to date. Over 100 million of those were created with its “reference-guided” core feature, and commercial content accounts for more than 50% of output, which the company cites as evidence that Vidu is being used in production settings. The startup also disclosed annual recurring revenue (ARR) above $20 million eight months after launch.
On September 9, Shengshu released Vidu Q1, a reference-image model that supports up to seven input images, pushing domestic limits on multi-image guidance. The model aims to improve multi-subject consistency and high-fidelity image rendering. Shengshu says Vidu now covers image, video, and audio generation, and that it plans another model update “soon.”