Contamination of sequence databases with adaptor sequences

Yoshikawa, Takeo; Sanders, A R; Detera-Wadleigh, S D

Title: Contamination of sequence databases with adaptor sequences

Journal Article · Sat Feb 01 00:00:00 EST 1997 · American Journal of Human Genetics

OSTI ID:518528

Yoshikawa, Takeo; Sanders, A R; Detera-Wadleigh, S D ^[1]

National Institute of Mental Health, Bethesda, MD (United States)

Because of the exponential increase in the amount of DNA sequences being added to the public databases on a daily basis, it has become imperative to identify sources of contamination rapidly. Previously, contaminations of sequence databases have been reported to alert the scientific community to the problem. These contaminations can be divided into two categories. The first category comprises host sequences that have been difficult for submitters to manage or control. Examples include anomalous sequences derived from Escherichia coli, which are inserted into the chromosomes (and plasmids) of the bacterial hosts. Insertion sequences are highly mobile and are capable of transposing themselves into plasmids during cloning manipulation. Another example of the first category is the infection with yeast genomic DNA or with bacterial DNA of some commercially available cDNA libraries from Clontech. The second category of database contamination is due to the inadvertent inclusion of nonhost sequences. This category includes incorporation of cloning-vector sequences and multicloning sites in the database submission. M13-derived artifacts have been common, since M13-based vectors have been widely used for subcloning DNA fragments. Recognizing this problem, the National Center for Biotechnology Information (NCBI) started to screen, in April 1994, all sequences directly submitted to GenBank, against a set of vector data retrieved from GenBank by use of key-word searches, such as {open_quotes}vector.{close_quotes} In this report, we present evidence for another sequence artifact that is widespread but that, to our knowledge, has not yet been reported. 11 refs., 1 tab.

Cite

Export

Save

OSTI ID:: 518528

Journal Information:: American Journal of Human Genetics, Vol. 60, Issue 2; Other Information: PBD: Feb 1997

Country of Publication:: United States

Language:: English

Similar Records

Transformation of Schwanniomyces occidentalis with an ADE2 gene cloned from S. occidentalis

Journal Article · Thu Dec 01 00:00:00 EST 1988 · J. Bacteriol.; (United States) · OSTI ID:518528

Klein, R D; Favreau, M A

Development of 124 sequence-tagged sites and cytogenetic localization of 217 cosmids for human chromosome 10

Journal Article · Fri Jul 01 00:00:00 EDT 1994 · Genomics · OSTI ID:518528

Zheng, C J; Ma, N S.F.; Dorman, T E

Towards a transcription map of human chromosome 21: Identification of expressed sequences by exon trapping

Journal Article · Thu Sep 01 00:00:00 EDT 1994 · American Journal of Human Genetics · OSTI ID:518528

Chen, H M; Chrast, R; Rossier, C

Related Subjects

55 BIOLOGY AND MEDICINE
BASIC STUDIES
99 MATHEMATICS
COMPUTERS
INFORMATION SCIENCE
MANAGEMENT
LAW
MISCELLANEOUS
DATA BASE MANAGEMENT
CONTAMINATION
EVALUATION
SCREENING
CHROMOSOMES
DNA SEQUENCING
DNA-CLONING
GENETIC MAPPING
PLASMIDS
SAMPLE PREPARATION
ESCHERICHIA COLI
BACTERIA
YEASTS

Title: Contamination of sequence databases with adaptor sequences

Citation Formats

Similar Records

Related Subjects