The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Jul. 19, 2016

Filed:

Dec. 17, 2013
Applicant:

Tencent Technology (Shenzhen) Company Limited, Shenzhen, CN;

Inventors:

Duling Lu, Shenzhen, CN;

Lu Li, Shenzhen, CN;

Feng Rao, Shenzhen, CN;

Bo Chen, Shenzhen, CN;

Li Lu, Shenzhen, CN;

Xiang Zhang, Shenzhen, CN;

Eryu Wang, Shenzhen, CN;

Shuai Yue, Shenzhen, CN;

Assignee:

TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED, Shenzhen, Guangdong Province, CN;

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G10L 15/00 (2013.01); G10L 15/06 (2013.01); G06F 17/28 (2006.01); G10L 15/183 (2013.01);
U.S. Cl.
CPC ...
G10L 15/063 (2013.01); G06F 17/28 (2013.01); G10L 15/183 (2013.01);
Abstract

A method and a device for training an acoustic language model, include: conducting word segmentation for training samples in a training corpus using an initial language model containing no word class labels, to obtain initial word segmentation data containing no word class labels; performing word class replacement for the initial word segmentation data containing no word class labels, to obtain first word segmentation data containing word class labels; using the first word segmentation data containing word class labels to train a first language model containing word class labels; using the first language model containing word class labels to conduct word segmentation for the training samples in the training corpus, to obtain second word segmentation data containing word class labels; and in accordance with the second word segmentation data meeting one or more predetermined criteria, using the second word segmentation data containing word class labels to train the acoustic language model.


Find Patent Forward Citations

Loading…