The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.
The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.
Patent No.:
Date of Patent:
Jun. 03, 2014
Filed:
Jun. 23, 2006
Rie Maeda, Tokyo, JP;
Yoshiharu Sato, Yokohama, JP;
Miyuki Seki, Tokyo, JP;
Microsoft Corporation, Redmond, WA (US);
Abstract
Method for creating a language model capable of preventing deterioration of quality caused by the conventional back-off to unigram. Parts-of-speech with the same display and reading are obtained from a storage device (). A cluster () is created by combining the obtained parts-of-speech. The created cluster () is stored in the storage device (). In addition, when an instruction () for dividing the cluster is inputted, the cluster stored in the storage device () is divided () in accordance with to the inputted instruction (). Two of the clusters stored in the storage device are combined (), and a probability of occurrence of the combined clusters in the text corpus is calculated (). The combined cluster is associated with the bigram indicating the calculated probability and stored into the storage device.