Title: VecFinder: Automated de novo identification and removal of vector and adapter sequence from genomic datasets

Abstract

High-throughput Sanger sequencing requires DNA to be inserted into bacterial vectors for biological amplification. Adapter or linker oligonucleotides may also be attached to target DNA fragments to facilitate insertion into the vector. These vector and adapter sequences are sequenced along with the target (insert) sequence and represent contamination that must be removed from the dataset prior to analysis. Such contamination can be removed by screening the dataset against the known sequences of the vector and adapter used to generate the data. However, for public or collaborator datasets, information about these contaminant sequences is often incorrect or absent, resulting in incomplete screening. We have developed VecFinder, software that identifies the vector and adapter sequences from the read sequences alone and subsequently removes them. This removes the dependence on the library creators to provide the vector and adapter sequences used for the library. It also automates the previously manual task of identifying and screening the adapter or linker sequence, which can be tedious and time-consuming.
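
The abstract does not describe how VecFinder identifies contaminant sequence, so the following is only an illustrative sketch of one simple way a shared leading sequence could be inferred from reads alone, not the authors' method. It assumes the contaminant sits at the very start of every read with no sequencing errors, and the names `consensus_prefix`, `trim_prefix`, `min_fraction`, and `max_len`, as well as the toy reads, are hypothetical.

```python
from collections import Counter


def consensus_prefix(reads, min_fraction=0.8, max_len=100):
    """Return the leading sequence shared by at least min_fraction of reads.

    Illustrative assumption: vector/adapter contamination appears as an
    identical prefix on most reads, while insert sequence varies.
    """
    consensus = []
    for pos in range(max_len):
        # Collect the base at this position from every read long enough.
        column = [r[pos] for r in reads if len(r) > pos]
        if not column:
            break
        base, count = Counter(column).most_common(1)[0]
        # Stop at the first position where no single base dominates.
        if count / len(reads) < min_fraction:
            break
        consensus.append(base)
    return "".join(consensus)


def trim_prefix(read, contaminant):
    """Remove the inferred contaminant if the read starts with it exactly."""
    if contaminant and read.startswith(contaminant):
        return read[len(contaminant):]
    return read


# Toy reads (made up): a common 13-base prefix followed by distinct inserts.
reads = [
    "GATTACCGAATTCACGTTGCA",
    "GATTACCGAATTCTTGGACCA",
    "GATTACCGAATTCGGCATAAT",
    "GATTACCGAATTCCATGCGTA",
]

contaminant = consensus_prefix(reads)
cleaned = [trim_prefix(r, contaminant) for r in reads]
```

A practical tool would also need to tolerate sequencing errors, variable start offsets, and contamination at the 3' end, none of which this sketch handles.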

Authors:
Zhang, Michael Y.; Tu, Hank; Shapiro, Harris; Platt, Darren
Publication Date:
2007-05-04
Research Org.:
Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)
Sponsoring Org.:
USDOE Office of Science (SC)
OSTI Identifier:
1170608
Report Number(s):
LBNL-6833E
DOE Contract Number:  
DE-AC02-05CH11231
Resource Type:
Conference
Resource Relation:
Conference: The Biology of Genomes - Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, May 8-12, 2007
Country of Publication:
United States
Language:
English
Subject:
99 GENERAL AND MISCELLANEOUS; VecFinder, de novo, vector and adapter sequence, genomic datasets, Sanger sequencing

Citation Formats

Zhang, Michael Y., Tu, Hank, Shapiro, Harris, and Platt, Darren. VecFinder: Automated de novo identification and removal of vector and adapter sequence from genomic datasets. United States: N. p., 2007. Web.
Zhang, Michael Y., Tu, Hank, Shapiro, Harris, & Platt, Darren. (2007). VecFinder: Automated de novo identification and removal of vector and adapter sequence from genomic datasets. United States.
Zhang, Michael Y., Tu, Hank, Shapiro, Harris, and Platt, Darren. 2007. "VecFinder: Automated de novo identification and removal of vector and adapter sequence from genomic datasets". United States. https://www.osti.gov/servlets/purl/1170608.
@article{osti_1170608,
title = {VecFinder: Automated de novo identification and removal of vector and adapter sequence from genomic datasets},
author = {Zhang, Michael Y. and Tu, Hank and Shapiro, Harris and Platt, Darren},
place = {United States},
year = {2007},
month = {5}
}
