The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).

Date of Patent:
Jan. 17, 2023

Filed:
Sep. 13, 2018

Applicant:

Dnastar, Inc., Madison, WI (US);

Inventors:

Frederick R. Blattner, Madison, WI (US);

Schuyler F. Baldwin, Madison, WI (US);

Tim J. Durfee, Madison, WI (US);

Madalina Miskoski, Madison, WI (US);

Daniel A. Nash, Madison, WI (US);

Assignee:

DNASTAR, INC., Madison, WI (US);

Attorneys:
Primary Examiner:
Int. Cl.
CPC ...
G16B 30/00 (2019.01); G16B 40/00 (2019.01); G16B 45/00 (2019.01); G16B 50/00 (2019.01);
U.S. Cl.
CPC ...
G16B 30/00 (2019.02); G16B 40/00 (2019.02); G16B 45/00 (2019.02); G16B 50/00 (2019.02);
Abstract

Systems and methods to automatically de novo assemble a set of unordered read sequences into one or more, larger nucleotide sequences are presented. The method involves first creating two identical sets of the reads, dividing each read in both sets into smaller sorted mer sequences and then comparing the mers for each read in set 1 to the mers from each read in set 2 to exhaustively identify overlapping segments. Overlap information is used to construct a modified assembly string graph, traversal of which produces a sorted string graph layout file consisting of all the reads ordered left to right including their approximate starting offset position. The sorted string graph layout file is then processed by a novel multiple sequence alignment system that uses mer matches between all the overlapping reads at a given position to place matching individual bases from each read into columns from which an overall consensus sequence is determined.
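The overlap and layout steps described in the abstract can be illustrated with a small sketch. This is not the patented implementation: the function names, the mer length `k`, the `min_shared` threshold, and the voting scheme for picking a relative offset are all assumptions chosen for illustration, and the sketch ignores reverse complements, sequencing errors, and the string graph itself. It only shows the general idea of splitting reads into sorted mers, comparing the reads against an identical copy of themselves, and deriving approximate starting offsets.

```python
from collections import defaultdict


def mers(read, k):
    """Return the sorted list of (mer, offset) pairs for one read."""
    return sorted((read[i:i + k], i) for i in range(len(read) - k + 1))


def find_overlaps(reads, k=4, min_shared=2):
    """Compare every read with every other read via shared mers.

    Returns {(a, b): shift} with a < b, where `shift` is the estimated
    start of read b relative to the start of read a (hypothetical scheme:
    the most frequently observed offset difference wins).
    """
    # Index every mer once; looking a mer up in this index is equivalent
    # to comparing "set 1" against an identical "set 2" of the reads.
    index = defaultdict(list)
    for rid, read in enumerate(reads):
        for mer, off in mers(read, k):
            index[mer].append((rid, off))

    # Each shared mer votes for one relative shift between two reads.
    votes = defaultdict(lambda: defaultdict(int))
    for hits in index.values():
        for a, off_a in hits:
            for b, off_b in hits:
                if a < b:
                    votes[(a, b)][off_a - off_b] += 1

    overlaps = {}
    for pair, shifts in votes.items():
        shift, count = max(shifts.items(), key=lambda kv: kv[1])
        if count >= min_shared:
            overlaps[pair] = shift
    return overlaps


def approximate_layout(overlaps):
    """Propagate shifts from read 0 to get left-to-right start offsets."""
    offsets = {0: 0}
    changed = True
    while changed:
        changed = False
        for (a, b), shift in overlaps.items():
            if a in offsets and b not in offsets:
                offsets[b] = offsets[a] + shift
                changed = True
            elif b in offsets and a not in offsets:
                offsets[a] = offsets[b] - shift
                changed = True
    return sorted(offsets.items(), key=lambda kv: kv[1])


if __name__ == "__main__":
    reads = ["ACGTTGCA", "TTGCAGGC", "CAGGCTTA"]  # toy data, not from the patent
    overlaps = find_overlaps(reads)
    print(overlaps)                      # {(0, 1): 3, (1, 2): 3}
    print(approximate_layout(overlaps))  # [(0, 0), (1, 3), (2, 6)]
```

In this toy run, each pair of adjacent reads shares two 4-mers that agree on a shift of 3, so the layout places the reads at offsets 0, 3, and 6. A consensus step like the one the abstract describes would then align the bases of the overlapping reads into columns at those positions.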

