Home|News|Photo|Opinions|CCYL|Fun|Fashion|Finance|Military|Sports|Employment|University|Travel|Discovery|Video|Games|Autos|Youth Inspring Stories
Photo List Xiaomi OpenSources OmniVoice

【#Tech24H】On May 7, Xiaomi AI Labs NextGeneration Kaldi team launched OmniVoice, which not only achieves toptier performance in Chinese and English scenarios but also surpasses commercial systems in multilingual tasks. It is the industrys first speechcloning TTS model covering hundreds of languages. The model demonstrates strong generalization capabilities for lowresource minor languages, supporting speech synthesis for nearly any language worldwide. OmniVoices most striking breakthrough is its minimalist architecture. Using only a single bidirectional Transformer network, it directly converts text to speech, eliminating redundant structures and steps: no separate modeling of text, no complex hybrid architectures, and no multilevel token predictions. It is the simplest nonautoregressive (NAR) TTS model available today.  [ By Zhang Liyan | Tang Ruohan ]

Editor:Hou Qianqian Source: Youth.cn Time:2026-05-08 16:48:00
PHOTO

About UsContact UsAdvertiseJobsIllegal Information Reporting Send qnb to 10658000 to order Mobile China Youthz

Organized by CCYL and Network Film & TV center of CCYL Copyright@China Youth International. All rights reserved.
信息网络传播视听节目许可证0105108号 京|ICP备11020872号-17 京公网安备110105007246