The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Jan. 17, 2023

Filed:

May. 10, 2019
Applicant:

Iqvia Inc., Danbury, CT (US);

Inventors:

Gwyn Rhys Jones, Sevenoaks, GB;

Nicola Lazzarini, London, GB;

Charikleia Eleftherochorinou, London, GB;

Karolina Katarzyna Dluzniak, Brentford, GB;

Tomass Bernots, Ottawa, CA;

Assignee:

IQVIA Inc., Durham, NC (US);

Attorneys:
Primary Examiner:
Int. Cl.
CPC ...
G06N 20/00 (2019.01); G06F 12/02 (2006.01); G06F 9/54 (2006.01);
U.S. Cl.
CPC ...
G06N 20/00 (2019.01); G06F 9/544 (2013.01); G06F 12/0284 (2013.01); G06F 2212/7202 (2013.01);
Abstract

A parser is deployed early in a machine learning pipeline to read raw data and collect useful statistics about the raw data's content to determine which items of raw data exhibit a proxy for feature importance for the machine learning model. The parser operates at high speeds that approach the disk's absolute throughput while utilizing a small memory footprint. Utilization of the parser enables the machine learning pipeline to receive a fraction of the total raw data that would otherwise be available. Several scans through the data are performed, by which proxies for feature importance are indicated and irrelevant features may be discarded and thereby not forwarded to the machine learning pipeline. This reduces the amount of memory and other hardware resources used at the server and also expedites the machine learning process.


Find Patent Forward Citations

Loading…