The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Nov. 07, 2017

Filed:

Jan. 06, 2014
Applicant:

Tencent Technology (Shenzhen) Company Limited, Shenzhen, CN;

Inventors:

Haibo Liu, Shenzhen, CN;

Eryu Wang, Shenzhen, CN;

Xiang Zhang, Shenzhen, CN;

Li Lu, Shenzhen, CN;

Shuai Yue, Shenzhen, CN;

Qiuge Liu, Shenzhen, CN;

Bo Chen, Shenzhen, CN;

Jian Liu, Shenzhen, CN;

Lu Li, Shenzhen, CN;

Assignee:

TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED, Shenzhen, Guangdong Province, CN;

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 17/27 (2006.01); G06F 17/28 (2006.01); G10L 15/00 (2013.01); G10L 15/26 (2006.01);
U.S. Cl.
CPC ...
G06F 17/273 (2013.01); G06F 17/2775 (2013.01); G06F 17/2785 (2013.01); G06F 17/289 (2013.01); G10L 15/265 (2013.01);
Abstract

A method of processing information content based on a Chinese language model is performed at a computer, the method including: identifying a plurality of expressions in the information content extracted from a speech input through speech recognition that is queued to be processed; dividing the expressions into a plurality of characteristic units according to semantic features and predetermined characteristics associated with each characteristic unit, each including a subset of the expressions and the predetermined characteristics at least including a respective integer number of expressions that are included in the characteristic unit; extracting, from the Chinese language model, a plurality of probabilities for punctuation marks associated with each characteristic unit; and in accordance with the probabilities, associating a respective punctuation mark with each characteristic unit included in the information content. The method further comprises adding punctuation marks based on a weight determined for each punctuation mark.


Find Patent Forward Citations

Loading…