The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Patent No.:

US 7801727 B1

Date of Patent:

Sep. 21, 2010

Filed:

Feb. 24, 2005

System and methods for acoustic and language modeling for automatic speech recognition with large vocabularies

Applicants:

Ponani Gopalakrishnan, Yorktown Heights, NY (US);

Dimitri Kanevsky, Ossining, NY (US);

Michael Daniel Monkowski, New Windsor, NY (US);

Jan Sedivy, Praha, CZ;

Inventors:

Ponani Gopalakrishnan, Yorktown Heights, NY (US);

Dimitri Kanevsky, Ossining, NY (US);

Michael Daniel Monkowski, New Windsor, NY (US);

Jan Sedivy, Praha, CZ;

Assignee:

Nuance Communications, Inc., Burlington, MA (US);

Attorney:

Wolf, Greenfield & Sacks, P.C.

Primary Examiner:

David R Hudspeth

Assistant Examiner:

Lamont M Spooner

Int. Cl.

CPC ...

G10L 15/04 (2006.01);

U.S. Cl.

CPC ...

Abstract

A method for generating a language component vocabulary VC for a speech recognition system having a language vocabulary V of a plurality of word forms is disclosed. The method includes: partitioning the language vocabulary V into subsets of word forms based on frequencies of occurrence of the respective word forms; and in at least one of the subsets, splitting word forms having frequencies less than a threshold to thereby generate word form components. Also disclosed is a method for use in speech recognition including: splitting an acoustic vocabulary comprising baseforms into baseform components and storing the baseform components; and, performing sound to spelling mapping on the baseform components so as to generate a baseform components to word parts table for use in subsequent decoding of speech. A method for decoding a speech utterance using language model components and acoustic components, includes the steps of: generating from the utterance a stack of baseform component paths; concatenating baseform components in a path to generate concatenated baseforms, when the concatenated baseform components correspond to a baseform found in an acoustic vocabulary; mapping the concatenated baseforms into words; computing language model (LM) scores associated with the words using a language model, and performing further decoding of the utterance based thereupon.

Find Patent Forward Citations