The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.
The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.
Patent No.:
Date of Patent:
Jul. 02, 2013
Filed:
Dec. 06, 2007
Jun Wu, Saratoga, CA (US);
Hui Xu, Beijing, CN;
Yifei Zhang, Beijing, CN;
Google Inc., Mountain View, CA (US);
Abstract
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for name detection. A method includes generating a raw name detection model using a collection of family names and an annotated corpus including a collection of n-grams. The method includes applying the raw name detection model to a collection of semi-structured data to form annotated semi-structured data identifying n-grams identifying names and n-grams not identifying names and applying the raw name detection model to a large unannotated corpus to form a large annotated corpus data identifying n-grams of the large unannotated corpus identifying names and n-grams not identifying names. The method includes generating a name detection model, including deriving a name model using the annotated semi-structured data identifying names and the large annotated corpus data identifying names, deriving a not-name model using the semi-structured data not identifying names, and deriving a language model using the large annotated corpus.