The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Jul. 19, 2022

Filed:

Dec. 25, 2018
Applicant:

Microsoft Technology Licensing, Llc, Redmond, WA (US);

Inventors:

Ying Wang, Shanghai, CN;

Min Li, Shanghai, CN;

Mengyan Lu, Shanghai, CN;

Xiaoliang Shi, Shanghai, CN;

Assignee:
Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 9/44 (2018.01); G06F 40/295 (2020.01); G06N 20/10 (2019.01); G06F 16/35 (2019.01); G06F 40/284 (2020.01); G06F 3/12 (2006.01); G06F 8/30 (2018.01); G06F 8/73 (2018.01); G06K 9/62 (2022.01); G06F 16/951 (2019.01); G06F 16/958 (2019.01); G06N 20/00 (2019.01);
U.S. Cl.
CPC ...
G06F 40/295 (2020.01); G06F 3/1298 (2013.01); G06F 8/31 (2013.01); G06F 8/73 (2013.01); G06F 16/355 (2019.01); G06F 16/951 (2019.01); G06F 16/958 (2019.01); G06F 16/986 (2019.01); G06F 40/284 (2020.01); G06K 9/6215 (2013.01); G06N 20/10 (2019.01); G06N 20/00 (2019.01);
Abstract

A coding information extractor disclosed herein uses machine learning approach to extract coding information from documents. An implementation of the coding information extractor is implemented using various computer process instructions including scanning a document to generate a plurality of tokens, determining one or more features of the plurality of tokens using term frequency (TF), inverse document frequency (IDF), and code type similarity features, and determining field type, field name, and field value of the one or more of the tokens using named entity recognition (NER).


Find Patent Forward Citations

Loading…