The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
May. 31, 2022

Filed:

Dec. 20, 2018
Applicant:

International Business Machines Corporation, Armonk, NY (US);

Inventors:

James H. Kaufman, San Jose, CA (US);

Matthew A. Davis, San Jose, CA (US);

Mark Kunitomi, San Francisco, CA (US);

Kenneth L. Clarkson, Madison, NJ (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 16/90 (2019.01); G06F 16/906 (2019.01); G06F 16/901 (2019.01); G06F 16/907 (2019.01);
U.S. Cl.
CPC ...
G06F 16/906 (2019.01); G06F 16/907 (2019.01); G06F 16/9027 (2019.01);
Abstract

A method is described for automatically correcting metadata errors in a k-mer database. A k-mer database having a self-consistent taxonomy based on genome-genome distance was constructed from a set of sample and reference genomes whose metadata included taxonomic labeling from a reference taxonomy (the standard NCBI taxonomy), which is not based on genetic distance. As a result, genomes of a given taxonomic ID of the self-consistent taxonomy could be separated into clusters based on the differences in the metadata. Genomes of the clusters less than a minimum cluster size Cmin were removed and profiled against the remaining genomes, correcting metadata automatically for those genomes that could be mapped back. The resulting k-mer database showed improved specificity for genetic profiling. Another method is described for identifying and handling chimeric genomes using the self-consistent taxonomy. Another method is described for correcting a classification database.


Find Patent Forward Citations

Loading…