skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: ESTPiper – a web-based analysis pipeline for expressed sequence tags

Journal Article · · BMC Genomics
 [1];  [1];  [1];  [1];  [1];  [1]
  1. Indiana Univ., Bloomington, IN (United States). The Center for Genomics and Bioinformatics

Background: EST sequencing projects are increasing in scale and scope as the genome sequencing technologies migrate from core sequencing centers to individual research laboratories. Effectively, generating EST data is no longer a bottleneck for investigators. However, processing large amounts of EST data remains a non-trivial challenge for many. Web-based EST analysis tools are proving to be the most convenient option for biologists when performing their analysis, so these tools must continuously improve on their utility to keep in step with the growing needs of research communities. We have developed a web-based EST analysis pipeline called ESTPiper, which streamlines typical large-scale EST analysis components. Results: The intuitive web interface guides users through each step of base calling, data cleaning, assembly, genome alignment, annotation, analysis of gene ontology (GO), and microarray oligonucleotide probe design. Each step is modularized. Therefore, a user can execute them separately or together in batch mode. In addition, the user has control over the parameters used by the underlying programs. Extensive documentation of ESTPiper's functionality is embedded throughout the web site to facilitate understanding of the required input and interpretation of the computational results. The user can also download intermediate results and port files to separate programs for further analysis. In addition, our server provides a time-stamped description of the run history for reproducibility. The pipeline can also be installed locally, allowing researchers to modify ESTPiper to suit their own needs. Conclusion: ESTPiper streamlines the typical process of EST analysis. The pipeline was initially designed in part to support the Daphnia pulex cDNA sequencing project. A web server hosting ESTPiper is provided at http://estpiper.cgb.indiana.edu/ to now support projects of all size. The software is also freely available from the authors for local installations.

Research Organization:
Indiana Univ., Bloomington, IN (United States); Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Biological and Environmental Research (BER). Biological Systems Science Division
Grant/Contract Number:
AC02-05CH11231
OSTI ID:
1626396
Journal Information:
BMC Genomics, Vol. 10, Issue 1; ISSN 1471-2164
Publisher:
SpringerCopyright Statement
Country of Publication:
United States
Language:
English

References (40)

e2g: an interactive web-based server for efficiently mapping large EST and cDNA sets to genomic sequences journal July 2004
ESTAP—an automated system for the analysis of EST data journal September 2003
preAssemble: a tool for automatic sequencer trace data processing journal January 2006
Complementary DNA sequencing: expressed sequence tags and human genome project journal June 1991
DNA sequence quality trimming and vector removal journal December 2001
ParPEST: a pipeline for EST data analysis based on parallel computing journal December 2005
Base-Calling of Automated Sequencer Traces Using Phred.  I. Accuracy Assessment journal March 1998
Selection of oligonucleotide probes for protein coding sequences journal May 2003
EST2uni: an open, parallel tool for automated EST analysis and database creation, with a data mining web interface and microarray expression data integration journal January 2008
High-throughput functional annotation and data mining with the Blast2GO suite journal April 2008
OREST: the online resource for EST analysis journal May 2008
ECgene: Genome-based EST clustering and gene modeling for alternative splicing journal April 2005
PartiGene—constructing partial genomes journal February 2004
BLAT---The BLAST-Like Alignment Tool journal March 2002
The Gene Ontology project in 2008 journal November 2007
ESTpass: a web-based server for processing and annotating expressed sequence tag (EST) sequences journal May 2007
PROBEmer: a web-based software tool for selecting optimal DNA oligos journal July 2003
Construction and characterization of a rock-cluster-based EST analysis pipeline journal February 2006
ESTAnnotator: a tool for high throughput EST annotation journal July 2003
A hitchhiker's guide to expressed sequence tag (EST) analysis journal May 2006
Gapped BLAST and PSI-BLAST: a new generation of protein database search programs journal September 1997
PipeOnline 2.0: automated EST processing and functional data sorting journal November 2002
Comparative Plant Genomics Resources at PlantGDB journal October 2005
ESTprep: preprocessing cDNA sequence reads journal July 2003
Improving the Arabidopsis genome annotation using maximal transcript alignment assemblies journal October 2003
EST-PAGE—managing and analyzing EST data journal January 2004
OrfPredictor: predicting protein-coding regions in EST-derived sequences journal July 2005
TIGR Gene Indices clustering tools (TGICL): a software system for fast clustering of large EST datasets journal March 2003
ESTWeb: bioinformatics services for EST sequencing projects journal August 2003
EST assembly supported by a draft genome sequence: an analysis of the Chlamydomonas reinhardtii transcriptome journal March 2007
CAP3: A DNA Sequence Assembly Program journal September 1999
EGassembler: online bioinformatics service for large-scale processing, clustering and assembling ESTs and genomic DNA fragments journal July 2006
WebTraceMiner: a web service for processing and mining EST sequence trace files journal May 2007
Gene structure prediction from consensus spliced alignment of multiple ESTs matching the same genomic locus journal February 2004
Refined Annotation of the Arabidopsis Genome by Complete Expressed Sequence Tag Mapping journal June 2003
EST Pipeline System: Detailed and Automated EST Data Processing and Mining journal August 2003
Expressed sequence tags: alternative or complement to whole genome sequences? journal July 2003
Comparative EST Analyses in Plant Systems book January 2005
BLAT---The BLAST-Like Alignment Tool journal March 2002
ESTExplorer: an expressed sequence tag (EST) assembly and annotation platform journal May 2007

Cited By (7)

Why so many unknown genes? Partitioning orphans from a representative transcriptome of the lone star tick Amblyomma americanum journal January 2013
Gene discovery for the bark beetle-vectored fungal tree pathogen Grosmannia clavigera journal October 2010
Inferring bona fide transfrags in RNA-Seq derived-transcriptome assemblies of non-model organisms journal February 2015
The mining of toxin-like polypeptides from EST database by single residue distribution analysis journal January 2011
A survey of well conserved families of C2H2 zinc-finger genes in Daphnia journal April 2010
Bio301: A Web-Based EST Annotation Pipeline That Facilitates Functional Comparison Studies journal November 2012
Differential expression analysis of transcripts related to oil metabolism in maturing seeds of Jatropha curcas L. journal March 2014

Similar Records

MicrobesFlux: a web platform for drafting metabolic models from the KEGG database
Journal Article · Thu Aug 02 00:00:00 EDT 2012 · BMC Systems Biology · OSTI ID:1626396

The Porcelain Crab Transcriptome and PCAD, the Porcelain Crab Microarray and Sequence Database
Journal Article · Wed Jan 27 00:00:00 EST 2010 · Plos One · OSTI ID:1626396

Asc-Seurat: analytical single-cell Seurat-based web application
Journal Article · Thu Nov 18 00:00:00 EST 2021 · BMC Bioinformatics · OSTI ID:1626396