Method and apparatus for biological sequence comparison

Marr, Thomas G; Chang, William I-Wei

Abstract

A method and apparatus for comparing biological sequences from a known source of sequences with a subject (query) sequence. The apparatus takes as input a set of target similarity levels (such as evolutionary distances in units of PAM), and finds all fragments of known sequences that are similar to the subject sequence at each target similarity level and are long enough to be statistically significant. The device filters out fragments from the known sequences that are too short, or that have a lower average similarity to the subject sequence than is required by each target similarity level. The subject sequence is then compared only to the remaining known sequences to find the best matches. The filtering member divides the subject sequence into overlapping blocks, each block being sufficiently large to contain a minimum-length alignment from a known sequence. For each block, the filter member compares the block with every possible short fragment in the known sequences and determines a best match for each comparison. The determined set of short fragment best matches for the block provides an upper threshold on alignment values. Regions of a certain length from the known sequences that have a mean alignment value upper threshold greater than a target unit score are concatenated to form a union. The current block is compared to the union and provides an indication of best local alignment with the subject sequence.
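
The filtering step outlined in the abstract can be illustrated with a small sketch. This is not the patented procedure: it assumes a simple match/mismatch score in place of PAM-based scoring, treats the best ungapped k-mer match as a loose stand-in for the "upper threshold" on alignment values, and uses hypothetical names (`overlapping_blocks`, `best_fragment_scores`, `candidate_regions`) and parameters (`k`, `window`, `threshold`) that do not appear in the record.

```python
from typing import List, Tuple


def overlapping_blocks(query: str, block_len: int, step: int) -> List[Tuple[int, str]]:
    """Split the query into blocks of block_len starting every `step` residues."""
    if len(query) <= block_len:
        return [(0, query)]
    starts = list(range(0, len(query) - block_len + 1, step))
    if starts[-1] != len(query) - block_len:
        starts.append(len(query) - block_len)  # make sure the tail is covered
    return [(s, query[s:s + block_len]) for s in starts]


def kmer_score(a: str, b: str, match: int = 1, mismatch: int = -1) -> int:
    """Ungapped score of two equal-length fragments (assumed scoring, not PAM)."""
    return sum(match if x == y else mismatch for x, y in zip(a, b))


def best_fragment_scores(block: str, known: str, k: int) -> List[float]:
    """For each k-mer of `known`, the best per-residue score against any k-mer of `block`.

    Assumes len(block) >= k so that the block contains at least one k-mer.
    """
    block_kmers = {block[j:j + k] for j in range(len(block) - k + 1)}
    scores = []
    for i in range(len(known) - k + 1):
        frag = known[i:i + k]
        scores.append(max(kmer_score(frag, bk) for bk in block_kmers) / k)
    return scores


def candidate_regions(scores: List[float], window: int, k: int,
                      threshold: float) -> List[Tuple[int, int]]:
    """Keep windows whose mean score exceeds `threshold`; merge them into a union.

    Returns half-open (start, end) intervals over the known sequence.
    """
    regions: List[Tuple[int, int]] = []
    for i in range(len(scores) - window + 1):
        mean = sum(scores[i:i + window]) / window
        if mean > threshold:
            start, end = i, i + window + k - 1
            if regions and start <= regions[-1][1]:
                regions[-1] = (regions[-1][0], max(regions[-1][1], end))
            else:
                regions.append((start, end))
    return regions


if __name__ == "__main__":
    query = "ACGTACGTAC"
    known = "T" * 8 + "ACGTACGT" + "T" * 8
    for _, block in overlapping_blocks(query, block_len=10, step=5):
        scores = best_fragment_scores(block, known, k=4)
        print(candidate_regions(scores, window=4, k=4, threshold=0.9))
```

Only the surviving regions would then be passed to a full local alignment against the block, which is where the savings over aligning against every known sequence would come from.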

Inventors:

Marr, Thomas G; Chang, William I-Wei (Huntington, NY)

Issue Date: January 1, 1997

Research Org.: Cold Spring Harbor Lab., NY (United States)

OSTI Identifier: 871286

Patent Number(s): 5701256

Assignee: Cold Spring Harbor Laboratory (Cold Spring Harbor, NY)

Patent Classifications (CPCs):

G - PHYSICS; G06 - COMPUTING; G06F - ELECTRIC DIGITAL DATA PROCESSING
G06F16/90344 - {by using string matching techniques}

G - PHYSICS; G16 - INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS; G16B - BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
G16B30/00 - ICT specially adapted for sequence analysis involving nucleotides or amino acids
G16B30/10 - Sequence alignment

DOE Contract Number: FG02-91ER61190

Resource Type: Patent

Country of Publication: United States

Language: English

Subject: method; apparatus; biological; sequence; comparison; comparing; sequences; source; subject; query; takes; input; set; target; similarity; levels; evolutionary; distances; units; pam; fragments; similar; level; statistically; significant; device; filters; average; required; compared; remaining; matches; filtering; divides; overlapping; blocks; block; sufficiently; contain; minimum-length; alignment; filter; compares; fragment; determines; match; determined; provide; upper; threshold; values; regions; length; mean; value; unit; score; concatenated; form; union; current; provides; indication; local; determined set; /702/382/

Citation Formats


Marr, Thomas G, and Chang, William I-Wei. Method and apparatus for biological sequence comparison. United States: N. p., 1997. Web.

Marr, Thomas G, & Chang, William I-Wei. Method and apparatus for biological sequence comparison. United States.

Marr, Thomas G, and Chang, William I-Wei. 1997. "Method and apparatus for biological sequence comparison". United States. https://www.osti.gov/servlets/purl/871286.

                    
@article{osti_871286,

  title        = {Method and apparatus for biological sequence comparison},

  author       = {Marr, Thomas G and Chang, William I-Wei},

  abstractNote = {A method and apparatus for comparing biological sequences from a known source of sequences, with a subject (query) sequence. The apparatus takes as input a set of target similarity levels (such as evolutionary distances in units of PAM), and finds all fragments of known sequences that are similar to the subject sequence at each target similarity level, and are long enough to be statistically significant. The invention device filters out fragments from the known sequences that are too short, or have a lower average similarity to the subject sequence than is required by each target similarity level. The subject sequence is then compared only to the remaining known sequences to find the best matches. The filtering member divides the subject sequence into overlapping blocks, each block being sufficiently large to contain a minimum-length alignment from a known sequence. For each block, the filter member compares the block with every possible short fragment in the known sequences and determines a best match for each comparison. The determined set of short fragment best matches for the block provide an upper threshold on alignment values. Regions of a certain length from the known sequences that have a mean alignment value upper threshold greater than a target unit score are concatenated to form a union. The current block is compared to the union and provides an indication of best local alignment with the subject sequence.},

  place        = {United States},

  year         = {1997},

  month        = jan

}


Works referenced in this record:

The theory and computation of evolutionary distances: Pattern recognition
journal, December 1980

Sellers, Peter H.
Journal of Algorithms, Vol. 1, Issue 4
https://doi.org/10.1016/0196-6774(80)90016-4

A general method applicable to the search for similarities in the amino acid sequence of two proteins
journal, March 1970

Needleman, Saul B.; Wunsch, Christian D.
Journal of Molecular Biology, Vol. 48, Issue 3, p. 443-453
https://doi.org/10.1016/0022-2836(70)90057-4

Theoretical and empirical comparisons of approximate string matching algorithms
book, January 1992

Chang, William I.; Lampe, Jordan
Combinatorial Pattern Matching
https://doi.org/10.1007/3-540-56024-6_14

Fast text searching: allowing errors
journal, October 1992

Wu, Sun; Manber, Udi
Communications of the ACM, Vol. 35, Issue 10
https://doi.org/10.1145/135239.135244

A time-efficient, linear-space local similarity algorithm
journal, September 1991

Huang, Xiaoqiu; Miller, Webb
Advances in Applied Mathematics, Vol. 12, Issue 3
https://doi.org/10.1016/0196-8858(91)90017-D

A subquadratic algorithm for approximate limited expression matching
journal, January 1996

Wu, Sun; Manber, U.; Myers, G.
Algorithmica, Vol. 15, Issue 1
https://doi.org/10.1007/BF01942606

Fast string matching with k differences
journal, August 1988

Landau, Gad M.; Vishkin, Uzi
Journal of Computer and System Sciences, Vol. 37, Issue 1
https://doi.org/10.1016/0022-0000(88)90045-1

An improved algorithm for matching biological sequences
journal, December 1982

Gotoh, Osamu
Journal of Molecular Biology, Vol. 162, Issue 3
https://doi.org/10.1016/0022-2836(82)90398-9

A contig assembly program based on sensitive detection of fragment overlaps
journal, September 1992

Huang, Xiaoqiu
Genomics, Vol. 14, Issue 1
https://doi.org/10.1016/S0888-7543(05)80277-0

Pattern recognition in nucleic acid sequences. I. A general method for finding local homologies and symmetries
journal, January 1982

Goad, Walter B.; Kanehisa, Minoru I.
Nucleic Acids Research, Vol. 10, Issue 1
https://doi.org/10.1093/nar/10.1.247

Algorithmic Advances for Searching Biosequence Databases
book, January 1994

Myers, Eugene W.
Computational Methods in Genome Research
https://doi.org/10.1007/978-1-4615-2451-9_10

Pattern recognition in genetic sequences by mismatch density
journal, July 1984

Sellers, Peter H.
Bulletin of Mathematical Biology, Vol. 46, Issue 4
https://doi.org/10.1007/BF02459499

Finding approximate patterns in strings
journal, March 1985

Ukkonen, Esko
Journal of Algorithms, Vol. 6, Issue 1
https://doi.org/10.1016/0196-6774(85)90023-9

Protein sequence comparison: methods and significance
journal, January 1991

Argos, Patrick; Vingron, Martin; Vogt, Gerhard
Protein Engineering, Design and Selection, Vol. 4, Issue 4
https://doi.org/10.1093/protein/4.4.375

Approximate string matching in sublinear expected time
conference, January 1990

Chang, W. I.; Lawler, E. L.
Proceedings [1990] 31st Annual Symposium on Foundations of Computer Science
https://doi.org/10.1109/FSCS.1990.89530

Identification of common molecular subsequences
journal, March 1981

Smith, T. F.; Waterman, M. S.
Journal of Molecular Biology, Vol. 147, Issue 1, p. 195-197
https://doi.org/10.1016/0022-2836(81)90087-5

A new algorithm for best subsequence alignments with application to tRNA-rRNA comparisons
journal, October 1987

Waterman, Michael S.; Eggert, Mark
Journal of Molecular Biology, Vol. 197, Issue 4
https://doi.org/10.1016/0022-2836(87)90478-5

Searching protein sequence libraries: Comparison of the sensitivity and selectivity of the Smith-Waterman and FASTA algorithms
journal, November 1991

Pearson, William R.
Genomics, Vol. 11, Issue 3
https://doi.org/10.1016/0888-7543(91)90071-L

[20] Mutation data matrix and its uses
book, January 1990

George, David G.; Barker, Winona C.; Hunt, Lois T.
Methods in Enzymology
https://doi.org/10.1016/0076-6879(90)83022-2

Sublinear approximate string matching and biological applications
journal, November 1994

Chang, W. I.; Lawler, E. L.
Algorithmica, Vol. 12, Issue 4-5
https://doi.org/10.1007/BF01185431

A sublinear algorithm for approximate keyword searching
journal, November 1994

Myers, E. W.
Algorithmica, Vol. 12, Issue 4-5
https://doi.org/10.1007/BF01185432

[21] Sensitivity comparison of protein amino acid sequences
book, January 1990

Argos, Patrick; Vingron, Martin
Methods in Enzymology
https://doi.org/10.1016/0076-6879(90)83023-3
