The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Mar. 02, 2021

Filed:

Nov. 16, 2018
Applicant:

Google Llc, Mountain View, CA (US);

Inventors:

Quoc V. Le, Sunnyvale, CA (US);

Minh-Thang Luong, Stanford, CA (US);

Ilya Sutskever, San Francisco, CA (US);

Oriol Vinyals, London, GB;

Wojciech Zaremba, Kluczbork, PL;

Assignee:

Google LLC, Mountain View, CA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 17/28 (2006.01); G06F 40/56 (2020.01); G06N 3/04 (2006.01); G06F 40/44 (2020.01); G06F 40/45 (2020.01); G06F 40/242 (2020.01); G06F 7/02 (2006.01); G06F 7/10 (2006.01); G10L 15/02 (2006.01); G10L 15/16 (2006.01);
U.S. Cl.
CPC ...
G06F 40/56 (2020.01); G06F 7/023 (2013.01); G06F 7/10 (2013.01); G06F 40/242 (2020.01); G06F 40/44 (2020.01); G06F 40/45 (2020.01); G06N 3/0445 (2013.01); G06N 3/0454 (2013.01); G10L 15/02 (2013.01); G10L 15/16 (2013.01);
Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for neural translation systems with rare word processing. One of the methods is a method training a neural network translation system to track the source in source sentences of unknown words in target sentences, in a source language and a target language, respectively and includes deriving alignment data from a parallel corpus, the alignment data identifying, in each pair of source and target language sentences in the parallel corpus, aligned source and target words; annotating the sentences in the parallel corpus according to the alignment data and a rare word model to generate a training dataset of paired source and target language sentences; and training a neural network translation model on the training dataset.


Find Patent Forward Citations

Loading…