The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Apr. 28, 2020

Filed:

Apr. 04, 2018
Applicant:

Electronics and Telecommunications Research Institute, Daejeon, KR;

Inventor:

Jong Hun Shin, Daejeon, KR;

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 17/28 (2006.01); G10L 25/30 (2013.01); G06N 3/04 (2006.01); G06N 3/08 (2006.01); G06F 17/27 (2006.01);
U.S. Cl.
CPC ...
G06F 17/289 (2013.01); G06F 17/277 (2013.01); G06F 17/2827 (2013.01); G06N 3/0445 (2013.01); G06N 3/0454 (2013.01); G06N 3/08 (2013.01); G10L 25/30 (2013.01);
Abstract

The present invention provides a method of generating training data to which explicit word-alignment information is added without impairing sub-word tokens, and a neural machine translation method and apparatus including the method. The method of generating training data includes the steps of: (1) separating basic word boundaries through morphological analysis or named entity recognition of a sentence of a bilingual corpus used for learning; (2) extracting explicit word-alignment information from the sentence of the bilingual corpus used for learning; (3) further dividing the word boundaries separated in step (1) into sub-word tokens; (4) generating new source language training data by using an output from the step (1) and an output from the step (3); and (5) generating new target language training data by using the explicit word-alignment information generated in the step (2) and the target language outputs from the steps (1) and (3).


Find Patent Forward Citations

Loading…