Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

SeqTU: A web server for identification of bacterial transcription units

Journal Article · · Scientific Reports
DOI:https://doi.org/10.1038/srep43925· OSTI ID:1355909
 [1];  [2];  [3];  [4]
  1. Jilin Univ. Jilin (China); Univ. of Georgia, Athens, GA (United States); BioEnergy Science Center, Washington, D.C. (United States); Tianjin Univ., Tianjin (China); South Dakota State University
  2. Broad Institute of MIT and Harvard Univ., Cambridge, MA (United States)
  3. South Dakota State Univ., Brookings, SD (United States)
  4. Jilin Univ., Jilin (China); Univ. of Georgia, Athens, GA (United States); BioEnergy Science Center, Washington, D.C. (United States)
A transcription unit (TU) consists of K ≥ 1 consecutive genes on the same strand of a bacterial genome that are transcribed into a single mRNA molecule under certain conditions. Their identification is an essential step in elucidation of transcriptional regulatory networks. We have recently developed a machine-learning method to accurately identify TUs from RNA-seq data, based on two features of the assembled RNA reads: the continuity and stability of RNA-seq coverage across a genomic region. While good performance was achieved by the method on Escherichia coli and Clostridium thermocellum, substantial work is needed to make the program generally applicable to all bacteria, knowing that the program requires organism specific information. A web server, named SeqTU, was developed to automatically identify TUs with given RNA-seq data of any bacterium using a machine-learning approach. The server consists of a number of utility tools, in addition to TU identification, such as data preparation, data quality check and RNA-read mapping. SeqTU provides a user-friendly interface and automated prediction of TUs from given RNA-seq data. Furthermore, the predicted TUs are displayed intuitively using HTML format along with a graphic visualization of the prediction.
Research Organization:
South Dakota State Univ., Brookings, SD (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Biological and Environmental Research (BER) (SC-23)
Grant/Contract Number:
SC0013632
OSTI ID:
1355909
Journal Information:
Scientific Reports, Journal Name: Scientific Reports Vol. 7; ISSN 2045-2322
Publisher:
Nature Publishing GroupCopyright Statement
Country of Publication:
United States
Language:
English

References (12)

BEDTools: The Swiss-Army Tool for Genome Feature Analysis: BEDTools: the Swiss-Army Tool for Genome Feature Analysis journal September 2014
DBTBS: a database of transcriptional regulation in Bacillus subtilis containing upstream intergenic conservation information journal October 2007
DOOR: a database for prokaryotic operons journal November 2008
OperonDB: a comprehensive database of predicted operons in microbial genomes journal January 2009
DOOR 2.0: presenting operons and their functions through dynamic and integrated views journal November 2013
Computational analysis of bacterial RNA-Seq data journal May 2013
Analysis of strand-specific RNA-seq data using machine learning reveals the structures of transcription units in Clostridium thermocellum journal March 2015
De novo assembly of bacterial transcriptomes from RNA-seq data journal January 2015
Minimal metabolic pathway structure is consistent with associated biomolecular interactions journal July 2014
The transcription unit architecture of the Escherichia coli genome journal November 2009
Fast gapped-read alignment with Bowtie 2 journal March 2012
Deep RNA sequencing of L. monocytogenes reveals overlapping and extensive stationary phase and sigma B-dependent transcriptomes, including multiple highly transcribed noncoding RNAs journal January 2009

Cited By (4)

AtbPpred: A Robust Sequence-Based Prediction of Anti-Tubercular Peptides Using Extremely Randomized Trees journal January 2019
Single-Cell RNA Sequencing of Plant-Associated Bacterial Communities journal October 2019
DOOR: a prokaryotic operon database for genome analyses and functional inference journal July 2017
A New Machine Learning-Based Framework for Mapping Uncertainty Analysis in RNA-Seq Read Alignment and Gene Expression Estimation journal August 2018