The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).

Date of Patent:

Apr. 16, 2024

Filed:

Dec. 15, 2021

Applicant:

Google LLC, Mountain View, CA (US);

Inventors:

Ye Jia, Mountain View, CA (US);

Michelle Tadmor Ramanovich, Mountain View, CA (US);

Tal Remez, Mountain View, CA (US);

Roi Pomerantz, Mountain View, CA (US);

Assignee:

Google LLC, Mountain View, CA (US);

Attorneys:
Primary Examiner:
Assistant Examiner:
Int. Cl.
CPC ...
G06F 40/58 (2020.01); G10L 13/02 (2013.01); G10L 13/10 (2013.01); G10L 19/16 (2013.01);
U.S. Cl.
CPC ...
G06F 40/58 (2020.01); G10L 13/02 (2013.01); G10L 13/10 (2013.01); G10L 19/16 (2013.01);
Abstract

A direct speech-to-speech translation (S2ST) model includes an encoder configured to receive an input speech representation that corresponds to an utterance spoken by a source speaker in a first language and encode the input speech representation into a hidden feature representation. The S2ST model also includes an attention module configured to generate a context vector that attends to the hidden representation encoded by the encoder. The S2ST model also includes a decoder configured to receive the context vector generated by the attention module and predict a phoneme representation that corresponds to a translation of the utterance in a second different language. The S2ST model also includes a synthesizer configured to receive the context vector and the phoneme representation and generate a translated synthesized speech representation that corresponds to a translation of the utterance spoken in the different second language.
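
The data flow described in the abstract (encoder, attention module, decoder, synthesizer) can be illustrated with a toy sketch. This is not the patented implementation: the function names, the random linear projections, the dot-product attention, the dimensions, and the use of the previous context vector as the next attention query are all assumptions made purely for illustration. The sketch only shows which component consumes which output.

```python
import math
import random
from typing import List, Tuple

Vector = List[float]
Matrix = List[Vector]


def random_matrix(rows: int, cols: int, rng: random.Random) -> Matrix:
    """Hypothetical stand-in for trained weights."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


def project(frames: Matrix, weights: Matrix) -> Matrix:
    """Multiply each frame (a row vector) by a weight matrix."""
    return [
        [sum(x * w for x, w in zip(frame, column)) for column in zip(*weights)]
        for frame in frames
    ]


def softmax(scores: Vector) -> Vector:
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(query: Vector, hidden: Matrix) -> Vector:
    """Dot-product attention (an assumption): one context vector over all hidden frames."""
    weights = softmax([sum(q * h for q, h in zip(query, frame)) for frame in hidden])
    return [sum(w * frame[i] for w, frame in zip(weights, hidden)) for i in range(len(query))]


def translate(
    input_frames: Matrix,
    n_steps: int,
    hidden_dim: int = 8,
    n_phonemes: int = 5,
    n_out: int = 4,
    seed: int = 0,
) -> Tuple[List[int], Matrix]:
    """Run the four components in the order given in the abstract."""
    rng = random.Random(seed)
    w_enc = random_matrix(len(input_frames[0]), hidden_dim, rng)
    w_dec = random_matrix(hidden_dim, n_phonemes, rng)
    w_syn = random_matrix(hidden_dim + n_phonemes, n_out, rng)

    # Encoder: input speech representation -> hidden feature representation.
    hidden = project(input_frames, w_enc)

    query = [0.0] * hidden_dim
    phonemes: List[int] = []
    speech: Matrix = []
    for _ in range(n_steps):
        # Attention module: context vector that attends to the hidden representation.
        context = attend(query, hidden)
        # Decoder: context vector -> phoneme representation (here, a distribution).
        phoneme_probs = softmax(project([context], w_dec)[0])
        # Synthesizer: context vector + phoneme representation -> output speech frame.
        speech.append(project([context + phoneme_probs], w_syn)[0])
        phonemes.append(phoneme_probs.index(max(phoneme_probs)))
        query = context  # toy stand-in for a recurrent decoder state (assumption)
    return phonemes, speech


if __name__ == "__main__":
    frames = [[0.1, 0.2, 0.3], [0.4, 0.5, 0.6], [0.7, 0.8, 0.9]]
    phoneme_ids, speech_frames = translate(frames, n_steps=4)
    print(len(phoneme_ids), "phoneme steps,", len(speech_frames), "output speech frames")
```

Note that the synthesizer in this sketch takes both the context vector and the phoneme representation as input, mirroring the abstract; everything else about the internals is a placeholder.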

