The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
May. 12, 2020

Filed:

Feb. 20, 2018
Applicant:

Beijing Baidu Netcom Science and Technology Co., Ltd., Beijing, CN;

Inventors:

Pengkai Li, Beijing, CN;

Jingzhou He, Beijing, CN;

Zhihong Fu, Beijing, CN;

Xianwei Xin, Beijing, CN;

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 17/27 (2006.01); G06F 17/28 (2006.01); G06N 3/08 (2006.01); G06F 16/951 (2019.01); G06N 3/04 (2006.01); G06F 16/33 (2019.01);
U.S. Cl.
CPC ...
G06F 17/28 (2013.01); G06F 16/3344 (2019.01); G06F 16/951 (2019.01); G06F 17/277 (2013.01); G06F 17/2715 (2013.01); G06F 17/2881 (2013.01); G06N 3/0445 (2013.01); G06N 3/0454 (2013.01); G06N 3/08 (2013.01);
Abstract

The present disclosure discloses a method and apparatus for generating a parallel text in the same language. The method comprises: acquiring a source segmented word sequence and a pre-trained word vector table; determining a source word vector sequence corresponding to the source segmented word sequence, according to the word vector table; importing the source word vector sequence into a first pre-trained recurrent neural network model, to generate an intermediate vector of a preset dimension for characterizing semantics of the source segmented word sequence; importing the intermediate vector into a second pre-trained recurrent neural network model, to generate a target word vector sequence corresponding to the intermediate vector; and determining a target segmented word sequence corresponding to the target word vector sequence according to the word vector table, and determining the target segmented word sequence as a parallel text in the same language corresponding to the source segmented word sequence.


Find Patent Forward Citations

Loading…