The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Nov. 23, 2021

Filed:

Dec. 07, 2017
Applicant:

International Business Machines Corporation, Armonk, NY (US);

Inventors:

Romeo Kienzler, Allschwil, CH;

Jenny Li, Cary, NC (US);

Stefan Mueck, Cologne, DE;

Stefan Ravizza, Wallisellen, CH;

Attorneys:
Primary Examiner:
Int. Cl.
CPC ...
G16B 30/00 (2019.01); G06F 7/08 (2006.01); G16B 30/10 (2019.01); G16B 30/20 (2019.01); G16B 50/30 (2019.01);
U.S. Cl.
CPC ...
G16B 30/00 (2019.02); G06F 7/08 (2013.01); G16B 30/10 (2019.02); G16B 30/20 (2019.02); G16B 50/30 (2019.02); C12Q 2535/122 (2013.01);
Abstract

A system and machine-implemented method for sorting Next-Generation Sequencing (NGS) reads in O(n) time and space complexity that makes use low sparsity and nearly uniform distribution of the input array. The genome position field in the input array is used to determine the target position of the output array. Duplicate target positions due to n-fold coverage are handled by assigning either overflow buckets to each position or anterior assigning multiple target slots in the output array for each genome position depending on the distribution of reads over the genome and the resulting probability of hitting an already occupied slot. Once every tuple in the input array has been written to the output array, the output array in read through ascending order and each tuple is appended to the end of a final result array.


Find Patent Forward Citations

Loading…