The patent badge is an abbreviated version of the USPTO patent document. It covers the following fields: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).

Date of Patent:
Oct. 06, 2020

Filed:
May 14, 2018
Applicant:

Baidu Online Network Technology (Beijing) Co., Ltd., Beijing, CN;

Inventors:

Wei Zou, Beijing, CN;

Xiangang Li, Beijing, CN;

Bin Huang, Beijing, CN;

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 17/28 (2006.01); G10L 15/02 (2006.01); G11B 20/00 (2006.01); G11B 20/10 (2006.01); H04L 12/54 (2013.01); G10L 15/26 (2006.01); G10L 15/22 (2006.01); G06F 40/42 (2020.01); G06F 40/47 (2020.01); G06F 40/51 (2020.01); G10L 15/06 (2013.01);
U.S. Cl.
CPC ...
G10L 15/265 (2013.01); G06F 40/42 (2020.01); G06F 40/47 (2020.01); G06F 40/51 (2020.01); G10L 15/22 (2013.01); G10L 15/26 (2013.01); G10L 15/02 (2013.01); G10L 15/063 (2013.01);
Abstract

An artificial intelligence-based cross-language speech transcription method and apparatus, a device and a readable medium. The method includes pre-processing to-be-transcribed speech data to obtain multiple acoustic features, the to-be-transcribed speech data being represented in a first language; predicting a corresponding translation text after transcription of the speech data according to the multiple acoustic features and a pre-trained cross-language transcription model; wherein the translation text is represented in a second language which is different from the first language. According to the technical solution, it is unnecessary, upon cross-language speech transcription, to perform speech recognition first and then perform machine translation, but to directly perform cross-language transcription according to the pre-trained cross-language transcription model. The technical solution can overcome the problem of error accumulation in the two-step cross-language transcription manner in the prior art, and can effectively improve accuracy and efficiency of the cross-language speech transcription as compared with the prior art.
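The one-step flow described in the abstract can be sketched as follows. This is only an illustrative outline, not the patented implementation: the abstract does not say which acoustic features or model architecture are used, so the per-frame log-energy feature, the frame sizes, and the names `frame_log_energies`, `transcribe_cross_language`, and `dummy_model` are all assumptions made for this example. The stand-in model returns a placeholder string where a pre-trained cross-language transcription model would return text in the second language.

```python
import math
from typing import Callable, List, Sequence


def frame_log_energies(
    samples: Sequence[float], frame_len: int = 400, hop: int = 160
) -> List[float]:
    """Pre-processing step (assumed feature type): split the waveform into
    overlapping frames and compute one log-energy value per frame."""
    features = []
    for start in range(0, len(samples) - frame_len + 1, hop):
        frame = samples[start:start + frame_len]
        energy = sum(x * x for x in frame) / frame_len
        features.append(math.log(energy + 1e-10))  # small floor avoids log(0)
    return features


def transcribe_cross_language(
    samples: Sequence[float], model: Callable[[List[float]], str]
) -> str:
    """Single-step transcription: acoustic features go straight to one model
    that emits second-language text, with no intermediate first-language
    transcript that a separate translation step would have to consume."""
    features = frame_log_energies(samples)
    return model(features)


def dummy_model(features: List[float]) -> str:
    """Hypothetical stand-in for a pre-trained cross-language model."""
    return f"<second-language text predicted from {len(features)} frames>"


if __name__ == "__main__":
    # One second of a 440 Hz tone sampled at 16 kHz as placeholder speech data.
    speech = [math.sin(2 * math.pi * 440 * n / 16000) for n in range(16000)]
    print(transcribe_cross_language(speech, dummy_model))
```

The point of the sketch is the shape of the interface: a two-step system would call a recognizer and then a translator, so recognition errors would feed into translation, whereas here a single model maps features directly to the output text.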

