The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Jun. 07, 2022

Filed:

Oct. 11, 2018
Applicant:

International Business Machines Corporation, Armonk, NY (US);

Inventors:

Sanjoy Dey, White Plains, NY (US);

Achille B. Fokoue-Nkoutche, White Plains, NY (US);

William S. Spangler, San Martin, CA (US);

Ping Zhang, White Plains, NY (US);

Attorneys:
Primary Examiner:
Int. Cl.
CPC ...
G06N 20/00 (2019.01); G06N 5/02 (2006.01); G06F 16/22 (2019.01); G16B 20/00 (2019.01); G16B 50/30 (2019.01); G16B 40/00 (2019.01);
U.S. Cl.
CPC ...
G06N 20/00 (2019.01); G06F 16/22 (2019.01); G06N 5/022 (2013.01); G16B 20/00 (2019.02); G16B 40/00 (2019.02); G16B 50/30 (2019.02);
Abstract

Mechanisms are provided to implement a genomic database curation (GDC) system. The GDC system generates a ground truth database based on a training subset of datasets from an uncurated large scale genomic database, and label metadata for the training subset. The GDC system trains at least one classification engine of the GDC system based on the training subset and the ground truth database at least by performing a machine learning operation on the at least one classification engine. The GDC system automatically applies the at least one trained classification engine on the uncurated large scale genomic database to generate an automatically curated large scale genomic database. A meta-classifier engine generates an output specifying at least one of significant gene signatures or gene pathways for at least one of diseases or drug agents based on the automatically curated large scale genomic database.


Find Patent Forward Citations

Loading…