The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Jul. 19, 2016

Filed:

Feb. 14, 2014
Applicant:

Tencent Technology (Shenzhen) Company Limited, Shenzhen, CN;

Inventors:

Feng Rao, Shenzhen, CN;

Li Lu, Shenzhen, CN;

Bo Chen, Shenzhen, CN;

Xiang Zhang, Shenzhen, CN;

Shuai Yue, Shenzhen, CN;

Lu Li, Shenzhen, CN;

Assignee:

TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED, Shenzhen, Guangdong Province, CN;

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G10L 15/06 (2013.01); G10L 15/183 (2013.01); G10L 15/197 (2013.01);
U.S. Cl.
CPC ...
G10L 15/063 (2013.01); G10L 15/183 (2013.01); G10L 15/197 (2013.01);
Abstract

A method includes: acquiring data samples; performing categorized sentence mining in the acquired data samples to obtain categorized training samples for multiple categories; building a text classifier based on the categorized training samples; classifying the data samples using the text classifier to obtain a class vocabulary and a corpus for each category; mining the corpus for each category according to the class vocabulary for the category to obtain a respective set of high-frequency language templates; training on the templates for each category to obtain a template-based language model for the category; training on the corpus for each category to obtain a class-based language model for the category; training on the class vocabulary for each category to obtain a lexicon-based language model for the category; building a speech decoder according to an acoustic model, the class-based language model and the lexicon-based language model for any given field, and the data samples.


Find Patent Forward Citations

Loading…