CTOnews.com, December 4, Meta recently released the AI translation suite Seamless Communication, which consists of four AI models. Meta claims that the AI suite can "accurately reproduce the speaker's mood", achieve simultaneous interpretation with a delay of only 2 seconds, and support nearly 100 language input.
It is reported that Seamless Communication is the research result published by Meta to celebrate the 10th anniversary of the founding of its AI research organization "Fundamental AI Research".
According to Meta, the suite includes the "second generation SeamlessM4T model" for accelerating translation, the interpretation model "Seamless Expressive", and the simultaneous interpretation model "Seamless Streaming". With the integrated model "Seamless", CTOnews.com collates the relevant information as follows:
The SeamlessM4T model claims to be able to automatically associate possible later texts based on what users say during translation in order to speed up translation.
Seamless Expressive is an interpretation model, which claims to solve the problem that "the traditional AI translation can not grasp the user's intonation, pause and light word weight". It can preserve the user's mood, style, speaking speed, pause and rhythm while maintaining the translation quality, thus bringing more "emotional information" to the translated content.
Seamless Streaming is a simultaneous interpretation model, which focuses on speech and text translation with a 2-second delay, and supports interpretation (speech-to-speech translation), dictation translation (speech-to-text translation,S2TT) and automatic speech recognition (Automatic speech recognition, ASR).
On the other hand, the comprehensive model Seamless integrates the above three language models to facilitate the general scenario.
At present, Meta has posted sample videos on GitHub and HuggingFace, which can be viewed by interested friends.
Welcome to subscribe "Shulou Technology Information " to get latest news, interesting things and hot topics in the IT industry, and controls the hottest and latest Internet news, technology news and IT industry trends.
Thank CTOnews.com netizens soft media Xinyou 1985234 for the clue delivery! According to CTOnews.com news on November 19, Lu Weibing, partner, president of Xiaomi Group, president of international department and general manager of Redmi brand, posted on Weibo that there are still big companies.
According to CTOnews.com9 news on March 3, yesterday, the 2022 International Forum on the Development of China's Automobile Industry (Teda) was officially held in Tianjin, focusing on the regulatory reform of the automobile market, the path of building a unified national market and the strategic opportunities of automobile enterprises, so as to help the domestic automobile market.
Thanks to CTOnews.com netizens for the delivery of clues on the way! CTOnews.com, June 29 (Xinhua)-- Google's first foldable phone, the Pixel Fold, launched yesterday, which uses ultra-thin glass (UTG) screen technology provided by Samsung.
Thanks to CTOnews.com netizens Xiao Zhan cut, rain and snow on the way, MissBook, flirtatious Oo clue delivery! According to CTOnews.com news on April 13, recently, some netizens said that non-members need to watch more than 3000 seconds of commercials to watch a play on Cool Meow.
Thanks to CTOnews.com netizens, South China Daniel Wu, Piankesuohuang 4100 eyes, function jacket, Mr. Aviation, 14000 bottom copy, West window, Brother Black fly's left hand clue delivery! CTOnews.com news on September 1, ideal car was announced this afternoon.