Close Menu
Technotification
    Facebook X (Twitter) Instagram
    Facebook X (Twitter) Instagram
    Technotification
    • Home
    • News
    • How To
    • Explained
    • Facts
    • Lists
    • Programming
    • Security
    • Gaming
    Technotification
    Home › Artificial Intelligence › Google AI introduces Translatotron: An end-to-end speech translation model

    Google AI introduces Translatotron: An end-to-end speech translation model

    By Subham KapisweDecember 3, 2022
    Facebook Twitter Reddit LinkedIn
    google Ai platform

    Google is actively integrating Artificial Intelligence to its products these days. Recently, Google AI engineers introduced Translatotron which is an end to end, speech to speech translation model.

    Translatotron proves that a single sequence-to-sequence AI model can directly translate speech from one language into another. In their research paper, the team demonstrated the new speech translation model and successfully obtained high translation quality on two Spanish-to-English datasets.

    Also Read: Top 3 Major Limitations of Artificial Intelligence (AI)

    Google AI introduces Translatotron
    The model architecture of Translatotron

    If we go a little deeper, speech-to-speech translation systems usually consists of three components:

    • Speech Recognition: It used to convert the source speech into text.
    • Machine Translation: It is used for translating the converted text into the target language.
    • Text-to-Speech Synthesis (TTS): It is used to produce speech in the target language from the translated text.

    There are many successful speech-to-speech translation products such as Google Translate powered by such systems.

    Google engineers have been working on this project for almost three years. The story started in 2016 when researchers demonstrated the practicability of using a single sequence-to-sequence model for speech-to-text translation. It also made researchers realized the need for end-to-end speech translation models

    Later, in 2017, the Google AI team showed that such these models can outperform the conventional cascade models. Not only Google, but recently many other proposals have also been made for improving end-to-end speech-to-text translation models.

    Unlike cascaded systems, Translatotron doesn’t rely on an intermediate text representation in either language. It’s based on a sequence-to-sequence network that takes source spectrograms as input and then generates spectrograms of the translated text in the target language.

    The new end-to-end speech translation model works on two separately trained components:

    • Neural vocoder: It converts output spectrograms to time-domain waveforms.
    • Speaker encoder: It maintains the source speaker’s voice in the synthesized translated speech.

    The Google AI engineers validated Translatotron’s translation quality by measuring the BLEU (bilingual evaluation understudy) score, computed with text converted by a speech recognition system. The results might lag behind a traditional cascade system but the team has managed to demonstrate the usefulness of the end-to-end direct speech-to-speech translation.

    Also Read: Google Launches AI Platform For Developers and Data Scientists

    Translatotron retains the original vocal characteristics in the translated speech by including a speaker encoder network and makes the translated speech sound natural.

    The engineers concluded that Translatotron is the first end-to-end model that can directly translate speech from one language into speech in another language and can retain the source voice in the translated speech. They are considering this as a starting point for future research on end-to-end speech-to-speech translation systems.

    Share. Facebook Twitter LinkedIn Tumblr Reddit Telegram WhatsApp
    Subham Kapiswe
    • LinkedIn

    A computer science engineer by education and blogger by profession who loves to write about Programming, Cybersecurity, Blockchain, Artificial Intelligence, Open Source and other latest technologies.

    Related Posts

    NVIDIA GeForce NOW is Finally Coming to India

    January 8, 2025

    Achieving Cryptocurrency Success with Quantum AI

    January 18, 2024

    India’s JioGamesCloud Added 100+ New Games

    October 15, 2023

    Apple’s latest iOS 16.6 Patch Boosts iPhone Privacy & Security

    July 31, 2023

    Multiview Feature Now Available on YouTube Tv

    July 31, 2023

    Threads’ to Recieve DM Support Soon, Confirms Meta Spokesperson

    July 30, 2023
    Lists You May Like

    10 Sites to Watch Free Korean Drama [2025 Edition]

    January 2, 2025

    10 Best RARBG Alternative Sites in April 2025 [Working Links]

    April 1, 2025

    The Pirate Bay Proxy List in 2025 [Updated List]

    January 2, 2025

    10 Best Torrent Search Engine Sites (2025 Edition)

    February 12, 2025

    10 Best GTA V Roleplay Servers in 2025 (Updated List)

    January 6, 2025

    5 Best Torrent Sites for Software in 2025

    January 2, 2025

    1337x Alternatives, Proxies, and Mirror Sites in 2025

    January 2, 2025

    10 Best Torrent Sites for eBooks in 2025 [Working]

    January 2, 2025

    10 Best Anime Torrent Sites in 2025 [Working Sites]

    January 6, 2025

    Top Free Photo Editing Software For PC in 2025

    January 2, 2025
    Pages
    • About
    • Contact
    • Privacy
    • Careers
    Privacy

    Information such as the type of browser being used, its operating system, and your IP address is gathered in order to enhance your online experience.

    © 2013 - 2025 Technotification | All rights reserved.

    Type above and press Enter to search. Press Esc to cancel.