The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).

Date of Patent:

Sep. 29, 2020

Filed:

Jan. 15, 2020

Applicant:

Innovaccer Inc., San Francisco, CA (US);

Inventors:

Vibhuti Agrawal, Delhi, IN;

Gourav Sanjukta Bhabesh, Baripada, IN;

Assignee:

INNOVACCER INC., San Francisco, CA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 17/27 (2006.01); G06K 9/00 (2006.01); G06F 40/205 (2020.01); G16H 10/60 (2018.01);
U.S. Cl.
CPC ...
G06K 9/00463 (2013.01); G06F 40/205 (2020.01); G06K 9/00469 (2013.01); G16H 10/60 (2018.01);
Abstract

A system and method for extracting relevant data elements from a file for conversion to a tabular format includes a computing device receiving an XML format file having a loop with nested blocks. Each of the blocks has at least one data element. Features are extracted from each data element. These extracted features are processed using a machine learning algorithm to estimate a column header value for the data elements relative to a data schema. With the data element classified, a configuration file is generated to map the column header value to the data elements of the XML file. The configuration file is used to extract the data elements from the XML file to a tabular format. In the healthcare industry, the system and method may be used to extract relevant health information from a clinical document for conversion to a tabular format.
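
The pipeline described in the abstract (extract features from each data element, classify it to a column header, generate a configuration file, then extract to a table) can be illustrated with a minimal sketch. This is not the patented implementation: the sample XML, the `SCHEMA_KEYWORDS` schema, and all function names are hypothetical, and a simple keyword-overlap scorer stands in for the machine learning algorithm, which the abstract does not specify.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample: a loop (<entries>) of repeated blocks with nested data elements.
SAMPLE_XML = """
<document>
  <entries>
    <entry>
      <patient><name>Ann Lee</name><birthTime>1980-02-01</birthTime></patient>
      <observation><code>8480-6</code><value>120</value></observation>
    </entry>
    <entry>
      <patient><name>Raj Rao</name><birthTime>1975-11-23</birthTime></patient>
      <observation><code>8480-6</code><value>135</value></observation>
    </entry>
  </entries>
</document>
"""

# Hypothetical data schema: column header -> feature tokens associated with it.
SCHEMA_KEYWORDS = {
    "patient_name": {"patient", "name"},
    "date_of_birth": {"patient", "birthtime"},
    "observation_code": {"observation", "code"},
    "observation_value": {"observation", "value", "numeric"},
}


def leaf_paths(block, prefix=""):
    """Yield (relative_path, element) for every leaf data element in a block."""
    for child in block:
        path = f"{prefix}/{child.tag}" if prefix else child.tag
        if len(child) == 0:
            yield path, child
        else:
            yield from leaf_paths(child, path)


def extract_features(path, element):
    """Turn one data element into a feature set: path tokens plus a text-shape token."""
    tokens = {t.lower() for t in path.split("/")}
    text = (element.text or "").strip()
    tokens.add("numeric" if text.isdigit() else "text")
    return tokens


def classify(features):
    """Stand-in for the ML step: choose the header with the largest token overlap."""
    return max(SCHEMA_KEYWORDS, key=lambda h: len(SCHEMA_KEYWORDS[h] & features))


def build_config(root, loop_path):
    """Map each estimated column header to the relative path of its data element."""
    first_block = root.find(loop_path)
    return {classify(extract_features(p, e)): p for p, e in leaf_paths(first_block)}


def extract_table(root, loop_path, config):
    """Apply the config to every block in the loop, producing one row per block."""
    return [
        {header: block.findtext(path) for header, path in config.items()}
        for block in root.findall(loop_path)
    ]


root = ET.fromstring(SAMPLE_XML)
config = build_config(root, "entries/entry")
rows = extract_table(root, "entries/entry", config)
```

Here `config` plays the role of the configuration file: once it is built from a single block, the same mapping is reused for every block in the loop, so classification runs once per data element type and not once per row.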

