The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Patent No.:

US 8041566 B1

Date of Patent:

Oct. 18, 2011

Filed:

Nov. 12, 2004

Topic specific models for text formatting and speech recognition

Applicants:

Jochen Peters, Aachen, DE;

Evgeny Matusov, Aachen, DE;

Carsten Meyer, Aachen, DE;

Dietrich Klakow, Saarbrücken, DE;

Inventors:

Jochen Peters, Aachen, DE;

Evgeny Matusov, Aachen, DE;

Carsten Meyer, Aachen, DE;

Dietrich Klakow, Saarbrücken, DE;

Assignee:

Nuance Communications Austria GmbH, Vienna, AT;

Attorney:

Wolf, Greenfield & Sacks, P.C.

Primary Examiner:

Michael N Opsasnick

Int. Cl.

CPC ...

G10L 15/00 (2006.01);

U.S. Cl.

CPC ...

Abstract

The present invention relates to a method, a computer system and a computer program product for speech recognition and/or text formatting by making use of topic specific statistical models. A text document which may be obtained from a first speech recognition pass is subject to segmentation and to an assignment of topic specific models for each obtained section. Each model of the set of models provides statistic information about language model probabilities, about text processing or formatting rules, as e.g. the interpretation of commands for punctuation, formatting, text highlighting or of ambiguous text portions requiring specific formatting, as well as a specific vocabulary being characteristic for each section of the recognized text. Furthermore, other properties of a speech recognition and/or formatting system (such as e.g. settings for the speaking rate) may be encoded in the statistical models. The models themselves are generated on the basis of annotated training data and/or by manual coding. Based on the assignment of models to sections of text an improved speech recognition and/or text formatting procedure is performed.

Find Patent Forward Citations