
Alibaba Open-Sources Fun-Audio-Chat, a New End-to-End Voice Interaction Model
Want to read in a language you're more familiar with?
Alibaba has open-sourced its Fun-Audio-Chat model, an 8B parameter end-to-end speech interaction model with integrated Function Call capabilities, aiming to foster innovation in the voice AI ecosystem.
Alibaba has announced the open-source release of Fun-Audio-Chat, a new-generation end-to-end speech interaction model, according to the Tongyi large model team.
The open-source package includes the 8-billion-parameter (8B) model weights, inference code, and detailed examples for Function Call integration, enabling developers to connect the model with external tools and services.
Designed to deliver more natural and fluid voice conversations, Fun-Audio-Chat leverages an end-to-end architecture to better understand and generate continuous speech. The built-in Function Call capability significantly expands the model’s applicability to complex, multi-step tasks. Alibaba said the open-source initiative aims to lower barriers for developers and accelerate innovation across the voice AI ecosystem.
Source : AI daliy




