The patent badge is an abbreviated version of the USPTO patent document. It lists the patent number, the date the patent was issued, the date it was filed, the title, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPC classifications, and abstract. The patent badge also links to the full patent document in PDF format.
Patent No.:
Date of Patent: Sep. 03, 2024
Filed: Jun. 16, 2016
Gottfried Wilhelm Leibniz Universitaet Hannover, Hannover, DE;
GOTTFRIED WILHELM LEIBNIZ UNIVERSITAET HANNOVER, Hannover, DE;
Abstract
The present invention relates to a method for compressing genomic data, wherein the genomic data are stored in at least one data file containing at least a plurality of reads produced by a genome sequencing method, and wherein each read includes a mapping position, a CIGAR string, and an actual sequenced nucleotide sequence as a local part of the donor genome. The method comprises the steps of: (1) unwinding a nucleotide sequence of a current read of one of said data files by using the mapping position and the CIGAR string of said current read, said current read having at least one previous read; (2) computing a difference between the unwound nucleotide sequence of said current read and an unwound nucleotide sequence of at least one of said previous reads, said difference containing the differences of the mapping positions and of the nucleotide sequences; (3) passing said computed difference to an entropy coder to compress said difference; (4) encoding said current read by the compressed difference; and (5) repeating the foregoing steps with said current read as one of said previous reads and a following read as a new current read until no more following reads are available.
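The steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. It rests on several assumptions that the abstract does not state: mapping positions are 0-based and the reads are sorted by position, "unwinding" means laying the read out in reference coordinates (deletions become `-`, inserted and soft-clipped bases are dropped, so this sketch is not lossless), and the difference is a position offset plus a list of mismatches and a literal tail. The names `Read`, `unwind`, `diff`, and `encode` are hypothetical. The standard-library `zlib` (DEFLATE, which combines LZ77 with Huffman coding) stands in for the entropy coder.

```python
import re
import zlib
from typing import List, NamedTuple, Tuple


class Read(NamedTuple):
    pos: int    # mapping position on the reference (assumed 0-based)
    cigar: str  # CIGAR string, e.g. "3M1D4M"
    seq: str    # sequenced nucleotides


_CIGAR_OP = re.compile(r"(\d+)([MIDNSHP=X])")


def unwind(read: Read) -> str:
    """Lay the read out in reference coordinates using its CIGAR string."""
    out = []
    i = 0  # index into read.seq
    for length, op in _CIGAR_OP.findall(read.cigar):
        n = int(length)
        if op in "M=X":    # aligned bases: copy them
            out.append(read.seq[i:i + n])
            i += n
        elif op in "DN":   # deleted or skipped reference bases: pad with gaps
            out.append("-" * n)
        elif op in "IS":   # inserted or soft-clipped bases: skipped in this sketch
            i += n
        # H and P consume neither read nor reference bases
    return "".join(out)


def diff(prev: Read, cur: Read) -> Tuple[int, List[Tuple[int, str]], str]:
    """Return (position offset, mismatches in the overlap, non-overlapping tail)."""
    offset = cur.pos - prev.pos
    if offset < 0:
        raise ValueError("reads are assumed to be sorted by mapping position")
    p, c = unwind(prev), unwind(cur)
    overlap = max(0, min(len(p) - offset, len(c)))
    mismatches = [(k, c[k]) for k in range(overlap) if c[k] != p[offset + k]]
    return offset, mismatches, c[overlap:]


def encode(reads: List[Read]) -> bytes:
    """Encode each read as a difference to its predecessor, then compress."""
    lines = []
    prev = None
    for cur in reads:
        if prev is None:
            # The first read has no predecessor and is stored literally.
            lines.append(f"{cur.pos}\t{cur.cigar}\t{unwind(cur)}")
        else:
            offset, mismatches, tail = diff(prev, cur)
            mism_txt = ",".join(f"{k}{base}" for k, base in mismatches)
            lines.append(f"{offset}\t{cur.cigar}\t{mism_txt}\t{tail}")
        prev = cur  # the current read becomes the previous read
    # zlib is a stand-in for the entropy coder named in the abstract.
    return zlib.compress("\n".join(lines).encode("ascii"))


reads = [Read(100, "8M", "ACGTACGT"), Read(102, "3M1D4M", "GTAGTAA")]
print(diff(reads[0], reads[1]))  # (2, [(3, '-')], 'AA')
```

In the usage example the second read overlaps the first by six reference positions and differs only at the deleted base, so its difference record is much shorter than the read itself. This is the redundancy that the entropy coder is meant to exploit.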