DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: SeqTU: A web server for identification of bacterial transcription units

Abstract

A transcription unit (TU) consists of K ≥ 1 consecutive genes on the same strand of a bacterial genome that are transcribed into a single mRNA molecule under certain conditions. Their identification is an essential step in elucidation of transcriptional regulatory networks. We have recently developed a machine-learning method to accurately identify TUs from RNA-seq data, based on two features of the assembled RNA reads: the continuity and stability of RNA-seq coverage across a genomic region. While good performance was achieved by the method on Escherichia coli and Clostridium thermocellum, substantial work is needed to make the program generally applicable to all bacteria, knowing that the program requires organism specific information. A web server, named SeqTU, was developed to automatically identify TUs with given RNA-seq data of any bacterium using a machine-learning approach. The server consists of a number of utility tools, in addition to TU identification, such as data preparation, data quality check and RNA-read mapping. SeqTU provides a user-friendly interface and automated prediction of TUs from given RNA-seq data. Furthermore, the predicted TUs are displayed intuitively using HTML format along with a graphic visualization of the prediction.

Authors:
 [1];  [2];  [3];  [4]
  1. Jilin Univ. Jilin (China); Univ. of Georgia, Athens, GA (United States); BioEnergy Science Center, Washington, D.C. (United States); Tianjin Univ., Tianjin (China)
  2. Broad Institute of MIT and Harvard Univ., Cambridge, MA (United States)
  3. South Dakota State Univ., Brookings, SD (United States)
  4. Jilin Univ., Jilin (China); Univ. of Georgia, Athens, GA (United States); BioEnergy Science Center, Washington, D.C. (United States)
Publication Date:
Research Org.:
South Dakota State Univ., Brookings, SD (United States)
Sponsoring Org.:
USDOE Office of Science (SC), Biological and Environmental Research (BER)
OSTI Identifier:
1355909
Grant/Contract Number:  
SC0013632
Resource Type:
Accepted Manuscript
Journal Name:
Scientific Reports
Additional Journal Information:
Journal Volume: 7; Journal ID: ISSN 2045-2322
Publisher:
Nature Publishing Group
Country of Publication:
United States
Language:
English
Subject:
59 BASIC BIOLOGICAL SCIENCES; bacillus-subtilis; Escherichia coli; database; operons; reveals; door; bioinformatics; computational platforms and environments; software

Citation Formats

Chen, Xin, Chou, Wen -Chi, Ma, Qin, and Xu, Ying. SeqTU: A web server for identification of bacterial transcription units. United States: N. p., 2017. Web. doi:10.1038/srep43925.
Chen, Xin, Chou, Wen -Chi, Ma, Qin, & Xu, Ying. SeqTU: A web server for identification of bacterial transcription units. United States. https://doi.org/10.1038/srep43925
Chen, Xin, Chou, Wen -Chi, Ma, Qin, and Xu, Ying. Tue . "SeqTU: A web server for identification of bacterial transcription units". United States. https://doi.org/10.1038/srep43925. https://www.osti.gov/servlets/purl/1355909.
@article{osti_1355909,
title = {SeqTU: A web server for identification of bacterial transcription units},
author = {Chen, Xin and Chou, Wen -Chi and Ma, Qin and Xu, Ying},
abstractNote = {A transcription unit (TU) consists of K ≥ 1 consecutive genes on the same strand of a bacterial genome that are transcribed into a single mRNA molecule under certain conditions. Their identification is an essential step in elucidation of transcriptional regulatory networks. We have recently developed a machine-learning method to accurately identify TUs from RNA-seq data, based on two features of the assembled RNA reads: the continuity and stability of RNA-seq coverage across a genomic region. While good performance was achieved by the method on Escherichia coli and Clostridium thermocellum, substantial work is needed to make the program generally applicable to all bacteria, knowing that the program requires organism specific information. A web server, named SeqTU, was developed to automatically identify TUs with given RNA-seq data of any bacterium using a machine-learning approach. The server consists of a number of utility tools, in addition to TU identification, such as data preparation, data quality check and RNA-read mapping. SeqTU provides a user-friendly interface and automated prediction of TUs from given RNA-seq data. Furthermore, the predicted TUs are displayed intuitively using HTML format along with a graphic visualization of the prediction.},
doi = {10.1038/srep43925},
journal = {Scientific Reports},
number = ,
volume = 7,
place = {United States},
year = {Tue Mar 07 00:00:00 EST 2017},
month = {Tue Mar 07 00:00:00 EST 2017}
}

Journal Article:
Free Publicly Available Full Text
Publisher's Version of Record

Citation Metrics:
Cited by: 12 works
Citation information provided by
Web of Science

Save / Share:

Works referenced in this record:

De novo assembly of bacterial transcriptomes from RNA-seq data
journal, January 2015


DOOR 2.0: presenting operons and their functions through dynamic and integrated views
journal, November 2013

  • Mao, Xizeng; Ma, Qin; Zhou, Chuan
  • Nucleic Acids Research, Vol. 42, Issue D1
  • DOI: 10.1093/nar/gkt1048

DBTBS: a database of transcriptional regulation in Bacillus subtilis containing upstream intergenic conservation information
journal, October 2007

  • Sierro, Nicolas; Makita, Yuko; de Hoon, Michiel
  • Nucleic Acids Research, Vol. 36, Issue suppl_1
  • DOI: 10.1093/nar/gkm910

OperonDB: a comprehensive database of predicted operons in microbial genomes
journal, January 2009

  • Pertea, M.; Ayanbule, K.; Smedinghoff, M.
  • Nucleic Acids Research, Vol. 37, Issue Database
  • DOI: 10.1093/nar/gkn784

DOOR: a database for prokaryotic operons
journal, November 2008

  • Mao, Fenglou; Dam, Phuongan; Chou, Jacky
  • Nucleic Acids Research, Vol. 37, Issue suppl_1
  • DOI: 10.1093/nar/gkn757

The transcription unit architecture of the Escherichia coli genome
journal, November 2009

  • Cho, Byung-Kwan; Zengler, Karsten; Qiu, Yu
  • Nature Biotechnology, Vol. 27, Issue 11
  • DOI: 10.1038/nbt.1582

Computational analysis of bacterial RNA-Seq data
journal, May 2013

  • McClure, Ryan; Balasubramanian, Divya; Sun, Yan
  • Nucleic Acids Research, Vol. 41, Issue 14
  • DOI: 10.1093/nar/gkt444

Analysis of strand-specific RNA-seq data using machine learning reveals the structures of transcription units in Clostridium thermocellum
journal, March 2015

  • Chou, Wen-Chi; Ma, Qin; Yang, Shihui
  • Nucleic Acids Research, Vol. 43, Issue 10
  • DOI: 10.1093/nar/gkv177

Fast gapped-read alignment with Bowtie 2
journal, March 2012

  • Langmead, Ben; Salzberg, Steven L.
  • Nature Methods, Vol. 9, Issue 4
  • DOI: 10.1038/nmeth.1923

BEDTools: The Swiss-Army Tool for Genome Feature Analysis: BEDTools: the Swiss-Army Tool for Genome Feature Analysis
journal, September 2014


Minimal metabolic pathway structure is consistent with associated biomolecular interactions
journal, July 2014

  • Bordbar, Aarash; Nagarajan, Harish; Lewis, Nathan E.
  • Molecular Systems Biology, Vol. 10, Issue 7
  • DOI: 10.15252/msb.20145243

BEDTools: The Swiss-Army Tool for Genome Feature Analysis: BEDTools: the Swiss-Army Tool for Genome Feature Analysis
journal, September 2014


DBTBS: a database of transcriptional regulation in Bacillus subtilis containing upstream intergenic conservation information
journal, October 2007

  • Sierro, Nicolas; Makita, Yuko; de Hoon, Michiel
  • Nucleic Acids Research, Vol. 36, Issue suppl_1
  • DOI: 10.1093/nar/gkm910

DOOR: a database for prokaryotic operons
journal, November 2008

  • Mao, Fenglou; Dam, Phuongan; Chou, Jacky
  • Nucleic Acids Research, Vol. 37, Issue suppl_1
  • DOI: 10.1093/nar/gkn757

OperonDB: a comprehensive database of predicted operons in microbial genomes
journal, January 2009

  • Pertea, M.; Ayanbule, K.; Smedinghoff, M.
  • Nucleic Acids Research, Vol. 37, Issue Database
  • DOI: 10.1093/nar/gkn784

DOOR 2.0: presenting operons and their functions through dynamic and integrated views
journal, November 2013

  • Mao, Xizeng; Ma, Qin; Zhou, Chuan
  • Nucleic Acids Research, Vol. 42, Issue D1
  • DOI: 10.1093/nar/gkt1048

Computational analysis of bacterial RNA-Seq data
journal, May 2013

  • McClure, Ryan; Balasubramanian, Divya; Sun, Yan
  • Nucleic Acids Research, Vol. 41, Issue 14
  • DOI: 10.1093/nar/gkt444

Analysis of strand-specific RNA-seq data using machine learning reveals the structures of transcription units in Clostridium thermocellum
journal, March 2015

  • Chou, Wen-Chi; Ma, Qin; Yang, Shihui
  • Nucleic Acids Research, Vol. 43, Issue 10
  • DOI: 10.1093/nar/gkv177

De novo assembly of bacterial transcriptomes from RNA-seq data
journal, January 2015


Minimal metabolic pathway structure is consistent with associated biomolecular interactions
journal, July 2014

  • Bordbar, Aarash; Nagarajan, Harish; Lewis, Nathan E.
  • Molecular Systems Biology, Vol. 10, Issue 7
  • DOI: 10.15252/msb.20145243

Works referencing / citing this record:

DOOR: a prokaryotic operon database for genome analyses and functional inference
journal, July 2017

  • Cao, Huansheng; Ma, Qin; Chen, Xin
  • Briefings in Bioinformatics, Vol. 20, Issue 4
  • DOI: 10.1093/bib/bbx088

Single-Cell RNA Sequencing of Plant-Associated Bacterial Communities
journal, October 2019

  • Ma, Qin; Bücking, Heike; Gonzalez Hernandez, Jose L.
  • Frontiers in Microbiology, Vol. 10
  • DOI: 10.3389/fmicb.2019.02452

AtbPpred: A Robust Sequence-Based Prediction of Anti-Tubercular Peptides Using Extremely Randomized Trees
journal, January 2019

  • Manavalan, Balachandran; Basith, Shaherin; Shin, Tae Hwan
  • Computational and Structural Biotechnology Journal, Vol. 17
  • DOI: 10.1016/j.csbj.2019.06.024

Single-Cell RNA Sequencing of Plant-Associated Bacterial Communities
journal, October 2019

  • Ma, Qin; Bücking, Heike; Gonzalez Hernandez, Jose L.
  • Frontiers in Microbiology, Vol. 10
  • DOI: 10.3389/fmicb.2019.02452