The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e., PDF).

Date of Patent:
Dec. 27, 2011

Filed:

Dec. 15, 2010
Applicants:

Enyuan Wu, Bellevue, WA (US);

Alan K. Michael, Monroe, WA (US);

Marcus A. Taylor, Bonney Lake, WA (US);

Beom Seok Oh, Fall City, WA (US);

Shusuke Uehara, Redmond, WA (US);

Inventors:

Enyuan Wu, Bellevue, WA (US);

Alan K. Michael, Monroe, WA (US);

Marcus A. Taylor, Bonney Lake, WA (US);

Beom Seok Oh, Fall City, WA (US);

Shusuke Uehara, Redmond, WA (US);

Assignee:

Microsoft Corporation, Redmond, WA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 17/20 (2006.01); G06F 17/27 (2006.01);
U.S. Cl.
CPC ...
Abstract

Input text may be broken into sentences, or other types of segments, by first detecting exceptions in the input text, and then detecting break positions. Given a segment breaking scheme that comprises a set of break rules and a set of exceptions, a regular expression is created that represents the break rules, and another regular expression is created that represents the exceptions. The input text is analyzed to identify strings that match any exception, and the matching strings are substituted with placeholders that are not likely to occur naturally in the input. The resulting text, with substitutions, is then evaluated to find the positions in the text that match the break rules. Those positions are declared to be segment breaks, and the placeholders are then replaced with the original strings. The result is the original text, with breaks assigned to the appropriate positions in the text.
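
The procedure in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the function name `segment`, the sample break rule, the sample exception list, and the choice of Unicode private-use code points as placeholder delimiters are all assumptions made for this example.

```python
import re

# Assumption: private-use code points delimit placeholders, since they are
# unlikely to occur naturally in the input text.
OPEN, CLOSE = "\ue000", "\ue001"
PLACEHOLDER = re.compile(OPEN + r"(\d+)" + CLOSE)


def segment(text, break_rules, exceptions):
    """Split text at break-rule matches while protecting exception matches."""
    saved = []

    def protect(match):
        # Remember the original string and emit a numbered placeholder.
        saved.append(match.group(0))
        return f"{OPEN}{len(saved) - 1}{CLOSE}"

    # Step 1: substitute every exception match with a placeholder.
    protected = exceptions.sub(protect, text)

    # Step 2: each break-rule match declares a break at its end position.
    cuts = [m.end() for m in break_rules.finditer(protected)]

    # Step 3: slice at the break positions and restore the original strings.
    segments = []
    start = 0
    for cut in cuts + [len(protected)]:
        piece = protected[start:cut]
        if piece:
            restored = PLACEHOLDER.sub(lambda m: saved[int(m.group(1))], piece)
            segments.append(restored)
        start = cut
    return segments


# Hypothetical scheme: break after sentence-final punctuation plus whitespace,
# except after a few common abbreviations.
BREAK_RULES = re.compile(r"(?<=[.!?])\s+")
EXCEPTIONS = re.compile(r"\b(?:Dr|Mr|Mrs|e\.g|i\.e)\.")

TEXT = "Dr. Smith met Mr. Jones. They talked, e.g. about regex! Was it useful?"
SEGMENTS = segment(TEXT, BREAK_RULES, EXCEPTIONS)

for s in SEGMENTS:
    print(repr(s))
```

With these sample rules the text splits into three segments, and concatenating the segments reproduces the input exactly. Without the exception pass, the same break rule would also split after "Dr.", "Mr.", and "e.g.".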

