The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge contains a link to the full patent document (in Adobe Acrobat format, i.e., PDF).

Date of Patent:
Feb. 20, 2018

Filed:
Oct. 03, 2016

Applicant:

Microsoft Technology Licensing, LLC, Redmond, WA (US);

Inventors:

Pranab Mohanty, Redmond, WA (US);

Intaik Park, Bellevue, WA (US);

Kieran Brantner-Magee, Duvall, WA (US);

Lucas Lin, Seattle, WA (US);

Saikat Sen, Issaquah, WA (US);

Korhan Ileri, Seattle, WA (US);

Assignee:
Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 17/27 (2006.01); G06F 17/24 (2006.01);
U.S. Cl.
CPC ...
G06F 17/277 (2013.01); G06F 17/24 (2013.01); G06F 17/2705 (2013.01);
Abstract

Examples for detecting and removing non-natural language within natural language to enhance performing content analysis on the natural language are provided herein. A plurality of terms is identified in a phrase, and a sliding window having a defined length is placed over a first sequence of terms from the plurality of terms. The first sequence of terms includes a first term, a second term, and a third term, the first term and the third term being adjacent to the second term. Based on the first term, the second term, and the third term, a determination is made as to whether the second term represents non-natural language. Upon determining that the second term is non-natural language, the second term is labeled as non-natural language and is removed from the plurality of terms based on determining the second term as non-natural language.
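
The sliding-window procedure in the abstract can be illustrated with a minimal Python sketch. The abstract does not say how the determination for the middle term is made, so the `is_non_natural` heuristic below (a regular expression plus a vowel check that consults the neighboring terms) is purely an assumption for illustration, as are the function names, the whitespace tokenization, and the empty-string padding used at the phrase boundaries. It is not the patented method.

```python
import re
from typing import Callable, List, Tuple

# Hypothetical pattern: URL prefixes, path/markup characters, or digits.
# The abstract does not specify any such rule; this is an illustrative stand-in.
_SUSPICIOUS = re.compile(r"https?://|www\.|[\\/_=<>{}]|\d")


def is_non_natural(first: str, second: str, third: str) -> bool:
    """Toy decision for the middle term, using both adjacent terms as context."""
    if _SUSPICIOUS.search(second):
        return True
    # A vowel-free alphabetic token (e.g. "hdr") is ambiguous on its own,
    # so flag it only when a neighbor already looks machine-generated.
    vowel_free = second.isalpha() and not re.search(r"[aeiouy]", second, re.I)
    neighbor_suspicious = any(_SUSPICIOUS.search(t) for t in (first, third))
    return vowel_free and neighbor_suspicious


def remove_non_natural(
    phrase: str,
    classify: Callable[[str, str, str], bool] = is_non_natural,
) -> Tuple[List[str], List[bool]]:
    """Slide a window of length 3 over the terms and drop flagged middle terms."""
    terms = phrase.split()          # assumption: whitespace tokenization
    padded = [""] + terms + [""]    # assumption: pad so edge terms get a window
    labels = []
    for i in range(len(terms)):
        first, second, third = padded[i:i + 3]
        labels.append(classify(first, second, third))
    kept = [term for term, flagged in zip(terms, labels) if not flagged]
    return kept, labels


if __name__ == "__main__":
    kept, labels = remove_non_natural(r"please open C:\temp\log_01.txt and retry")
    print(kept)
    print(labels)
```

Passing the classifier in as a parameter keeps the windowing logic separate from the decision rule, so a trained model could replace the toy heuristic without changing the loop.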

