The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Aug. 29, 2000

Filed:

Apr. 21, 1998
Applicant:
Inventor:

Juergen Haas, Gaithersburg, MD (US);

Assignee:

Gene Logic, Inc., Gaithersburg, MD (US);

Attorney:
Primary Examiner:
Assistant Examiner:
Int. Cl.
CPC ...
G01N / ; G01N / ; C12Q / ; G06G / ;
U.S. Cl.
CPC ...
364496 ; 435-6 ; 364497 ; 364578 ; 702127 ; 702 30 ;
Abstract

A method and system for computationally analyzing an initial set of patterns in order to identify subsets of patterns, called clusters, that contain common sub-patterns. The patterns of the initial set of patterns are represented as linear sequences of subunits, and the common sub-patterns occur as sub-sequences of subunits within the linear sequences starting at different positions within the different linear sequences. Variations in the offset and in the sequence of subunits within a common sub-pattern are considered in the analysis. In one embodiment, an initial set of oligonucleotide sequences that are produced by various biochemical techniques are computationally analyzed to identify clusters that may correspond to a number of different binding sites for DNA-binding proteins within one or more double-stranded DNA duplexes. The method places each oligonucleotide sequence within a new cluster and calculates an initial information weight matrix for that cluster. Then, other sequences from the initial set of sequences are added to the cluster and the information weight matrix of the cluster is re-computed until the information content of the information weight matrix falls below a threshold value.


Find Patent Forward Citations

Loading…