The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Jan. 10, 2023

Filed:

Sep. 18, 2018
Applicant:

Genetalks Bio-tech (Changsha) Co., Ltd., Hunan, CN;

Inventors:

Zhuo Song, Hunan, CN;

Gen Li, Hunan, CN;

Pengxia Liu, Hunan, CN;

Zhenguo Wang, Hunan, CN;

Bolun Feng, Hunan, CN;

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 16/22 (2019.01); G16B 30/00 (2019.01); G06F 16/2455 (2019.01); G06F 16/174 (2019.01); G16B 20/00 (2019.01);
U.S. Cl.
CPC ...
G16B 30/00 (2019.02); G06F 16/1744 (2019.01); G06F 16/2228 (2019.01); G06F 16/2455 (2019.01); G16B 20/00 (2019.02);
Abstract

The present invention discloses a gene sequencing data compression preprocessing, compression and decompression method, a system, and a computer-readable medium. The preprocessing method implementation steps include: obtaining reference genome data; obtaining a mapping relationship between a short string K-mer and a prediction character c to obtain a prediction data model Pcontaining any short string K-mer in the positive strand and negative strand of a reference genome and the prediction character c in a corresponding adjacent bit. The compression and decompression methods relate to performing compression/decompression on the basis of the prediction data model PThe system is a computer system including a program for executing the previous method. The computer-readable medium includes a computer program for executing the previous method. The present invention can be oriented towards lossless gene sequencing data compression, provides fully effective information for a high-performance lossless compression and decompression algorithm for gene sequencing data.


Find Patent Forward Citations

Loading…