The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Aug. 06, 2019

Filed:

Sep. 07, 2017
Applicant:

Baidu Usa, Llc, Sunnyvale, CA (US);

Inventors:

Hairong Liu, San Jose, CA (US);

Zhenyao Zhu, Sunnyvale, CA (US);

Sanjeev Satheesh, Sunnyvale, CA (US);

Assignee:

Baidu USA LLC, Sunnyvale, CA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G10L 15/02 (2006.01); G10L 15/06 (2013.01); G10L 15/16 (2006.01); G10L 15/197 (2013.01); G10L 15/04 (2013.01); G10L 15/26 (2006.01); G10L 15/187 (2013.01);
U.S. Cl.
CPC ...
G10L 15/063 (2013.01); G10L 15/04 (2013.01); G10L 15/16 (2013.01); G10L 15/187 (2013.01); G10L 15/26 (2013.01); G10L 15/197 (2013.01); G10L 2015/0636 (2013.01);
Abstract

Described herein are systems and methods for automatic unit selection and target decomposition for sequence labelling. Embodiments include a new loss function called Gram-Connectionist Temporal Classification (CTC) loss that extend the popular CTC loss function criterion to alleviate prior limitations. While preserving the advantages of CTC, Gram-CTC automatically learns the best set of basic units (grams), as well as the most suitable decomposition of target sequences. Unlike CTC, embodiments of Gram-CTC allow a model to output variable number of characters at each time step, which enables the model to capture longer term dependency and improves the computational efficiency. It is also demonstrated that embodiments of Gram-CTC improve CTC in terms of both performance and efficiency on the large vocabulary speech recognition task at multiple scales of data, and that systems that employ an embodiment of Gram-CTC can outperform the state-of-the-art on a standard speech benchmark.


Find Patent Forward Citations

Loading…