DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: SNAPPy: A snakemake pipeline for scalable HIV-1 subtyping by phylogenetic pairing

Abstract

Abstract Human immunodeficiency virus 1 (HIV-1) genome sequencing is routinely done for drug resistance monitoring in hospitals worldwide. Subtyping these extensive datasets of HIV-1 sequences is a critical first step in molecular epidemiology and evolution studies. The clinical relevance of HIV-1 subtypes is increasingly recognized. Several studies suggest subtype-related differences in disease progression, transmission route efficiency, immune evasion, and even therapeutic outcomes. HIV-1 subtyping is mainly done using web-servers. These tools have limitations in scalability and potential noncompliance with data protection legislation. Thus, the aim of this work was to develop an efficient method for large-scale local HIV-1 subtyping. We designed SNAPPy: a snakemake pipeline for scalable HIV-1 subtyping by phylogenetic pairing. It contains several tasks of phylogenetic inference and BLAST queries, which can be executed sequentially or in parallel, taking advantage of multiple-core processing units. Although it was built for subtyping, SNAPPy is also useful to perform extensive HIV-1 alignments. This tool facilitates large-scale sequence-based HIV-1 research by providing a local, resource efficient and scalable alternative for HIV-1 subtyping. It is capable of analyzing full-length genomes or partial HIV-1 genomic regions (GAG, POL, and ENV) and recognizes more than ninety circulating recombinant forms. SNAPPy is freely available at: https://github.com/PMMAraujo/snappy/releases.

Authors:
 [1];  [1]; ORCiD logo [1]
  1. Life and Health Sciences Research institute (ICVS), School of Medicine, University of Minho, Braga, Portugal, ICVS/3B’s - PT Government Associate Laboratory, Braga, Guimarães, Portugal
Publication Date:
Sponsoring Org.:
USDOE Office of Nuclear Energy (NE), Nuclear Fuel Cycle and Supply Chain
OSTI Identifier:
1574552
Grant/Contract Number:  
NORTE-01-0145-FEDER-000013; POCI-01-0145-FEDER-007038; IF/00474/2014; PDE/BDE/113599/2015
Resource Type:
Published Article
Journal Name:
Virus Evolution
Additional Journal Information:
Journal Name: Virus Evolution Journal Volume: 5 Journal Issue: 2; Journal ID: ISSN 2057-1577
Publisher:
Oxford University Press
Country of Publication:
United Kingdom
Language:
English

Citation Formats

Araújo, Pedro M. M., Martins, Joana S., and Osório, Nuno S. SNAPPy: A snakemake pipeline for scalable HIV-1 subtyping by phylogenetic pairing. United Kingdom: N. p., 2019. Web. doi:10.1093/ve/vez050.
Araújo, Pedro M. M., Martins, Joana S., & Osório, Nuno S. SNAPPy: A snakemake pipeline for scalable HIV-1 subtyping by phylogenetic pairing. United Kingdom. https://doi.org/10.1093/ve/vez050
Araújo, Pedro M. M., Martins, Joana S., and Osório, Nuno S. Tue . "SNAPPy: A snakemake pipeline for scalable HIV-1 subtyping by phylogenetic pairing". United Kingdom. https://doi.org/10.1093/ve/vez050.
@article{osti_1574552,
title = {SNAPPy: A snakemake pipeline for scalable HIV-1 subtyping by phylogenetic pairing},
author = {Araújo, Pedro M. M. and Martins, Joana S. and Osório, Nuno S.},
abstractNote = {Abstract Human immunodeficiency virus 1 (HIV-1) genome sequencing is routinely done for drug resistance monitoring in hospitals worldwide. Subtyping these extensive datasets of HIV-1 sequences is a critical first step in molecular epidemiology and evolution studies. The clinical relevance of HIV-1 subtypes is increasingly recognized. Several studies suggest subtype-related differences in disease progression, transmission route efficiency, immune evasion, and even therapeutic outcomes. HIV-1 subtyping is mainly done using web-servers. These tools have limitations in scalability and potential noncompliance with data protection legislation. Thus, the aim of this work was to develop an efficient method for large-scale local HIV-1 subtyping. We designed SNAPPy: a snakemake pipeline for scalable HIV-1 subtyping by phylogenetic pairing. It contains several tasks of phylogenetic inference and BLAST queries, which can be executed sequentially or in parallel, taking advantage of multiple-core processing units. Although it was built for subtyping, SNAPPy is also useful to perform extensive HIV-1 alignments. This tool facilitates large-scale sequence-based HIV-1 research by providing a local, resource efficient and scalable alternative for HIV-1 subtyping. It is capable of analyzing full-length genomes or partial HIV-1 genomic regions (GAG, POL, and ENV) and recognizes more than ninety circulating recombinant forms. SNAPPy is freely available at: https://github.com/PMMAraujo/snappy/releases.},
doi = {10.1093/ve/vez050},
journal = {Virus Evolution},
number = 2,
volume = 5,
place = {United Kingdom},
year = {Tue Nov 19 00:00:00 EST 2019},
month = {Tue Nov 19 00:00:00 EST 2019}
}

Journal Article:
Free Publicly Available Full Text
Publisher's Version of Record
https://doi.org/10.1093/ve/vez050

Citation Metrics:
Cited by: 5 works
Citation information provided by
Web of Science

Save / Share:

Works referenced in this record:

HIV‐1 Subtype D Infection Is Associated with Faster Disease Progression than Subtype A in Spite of Similar Plasma HIV‐1 Loads
journal, April 2007

  • Baeten, Jared M.; Chohan, Bhavna; Lavreys, Ludo
  • The Journal of Infectious Diseases, Vol. 195, Issue 8
  • DOI: 10.1086/512682

Subtype C Is Associated with Increased Vaginal Shedding of HIV‐1
journal, August 2005

  • John‐Stewart, Grace C.; Nduati, Ruth W.; Rousseau, Christine M.
  • The Journal of Infectious Diseases, Vol. 192, Issue 3
  • DOI: 10.1086/431514

A statistical model for HIV-1 sequence classification using the subtype analyser (STAR)
journal, July 2005


Nextflow enables reproducible computational workflows
journal, April 2017

  • Di Tommaso, Paolo; Chatzou, Maria; Floden, Evan W.
  • Nature Biotechnology, Vol. 35, Issue 4
  • DOI: 10.1038/nbt.3820

Rationale and Uses of a Public HIV Drug‐Resistance Database
journal, September 2006

  • Shafer, Robert W.
  • The Journal of Infectious Diseases, Vol. 194, Issue s1
  • DOI: 10.1086/505356

jpHMM: Improving the reliability of recombination prediction in HIV-1
journal, May 2009

  • Schultz, A. -K.; Zhang, M.; Bulla, I.
  • Nucleic Acids Research, Vol. 37, Issue Web Server
  • DOI: 10.1093/nar/gkp371

COMET: adaptive context-based modeling for ultrafast HIV-1 subtype identification
journal, August 2014

  • Struck, Daniel; Lawyer, Glenn; Ternes, Anne-Marie
  • Nucleic Acids Research, Vol. 42, Issue 18
  • DOI: 10.1093/nar/gku739

Biopython: freely available Python tools for computational molecular biology and bioinformatics
journal, March 2009


Implications of HIV diversity for the HIV-1 pandemic
journal, May 2013


Web Resources for HIV Type 1 Genotypic-Resistance Test Interpretation
journal, June 2006

  • Liu, T. F.; Shafer, R. W.
  • Clinical Infectious Diseases, Vol. 42, Issue 11
  • DOI: 10.1086/503914

Recombination in viruses: Mechanisms, methods of study, and evolutionary consequences
journal, March 2015


Origin and Epidemiological History of HIV-1 CRF14_BG
journal, September 2011


Analysis of the history and spread of HIV-1 in Uganda using phylodynamics
journal, July 2015

  • Ragonnet-Cronin, Manon; Ssemwanga, Deogratius; Logue, Christopher H.
  • Journal of General Virology, Vol. 96, Issue 7
  • DOI: 10.1099/vir.0.000107

BLAST+: architecture and applications
journal, January 2009

  • Camacho, Christiam; Coulouris, George; Avagyan, Vahram
  • BMC Bioinformatics, Vol. 10, Issue 1
  • DOI: 10.1186/1471-2105-10-421

An Evolutionary Model-Based Algorithm for Accurate Phylogenetic Breakpoint Mapping and Subtype Prediction in HIV-1
journal, November 2009


Snakemake--a scalable bioinformatics workflow engine
journal, August 2012


Comparative Evaluation of Subtyping Tools for Surveillance of Newly Emerging HIV-1 Strains
journal, July 2017

  • Fabeni, Lavinia; Berno, Giulia; Fokam, Joseph
  • Journal of Clinical Microbiology, Vol. 55, Issue 9
  • DOI: 10.1128/JCM.00656-17

Characterization of a large cluster of HIV-1 A1 infections detected in Portugal and connected to several Western European countries
journal, May 2019

  • Araújo, Pedro M. M.; Carvalho, Alexandre; Pingarilho, Marta
  • Scientific Reports, Vol. 9, Issue 1
  • DOI: 10.1038/s41598-019-43420-2

HIV-1 Nomenclature Proposal
journal, April 2000


Antiretroviral resistance in different HIV-1 subtypes: impact on therapy outcomes and resistance testing interpretation
journal, January 2007


Protease mutation M89I/V is linked to therapy failure in patients infected with the HIV-1 non-B subtypes C, F or G
journal, January 2005


Assessment of automated genotyping protocols as tools for surveillance of HIV-1 genetic diversity
journal, January 2006


Effect of Human Immunodeficiency Virus Type 1 (HIV‐1) Subtype on Disease Progression in Persons from Rakai, Uganda, with Incident HIV‐1 Infection
journal, March 2008

  • Kiwanuka, Noah; Laeyendecker, Oliver; Robb, Merlin
  • The Journal of Infectious Diseases, Vol. 197, Issue 5
  • DOI: 10.1086/527416

Automated subtyping of HIV-1 genetic sequences for clinical and surveillance purposes: Performance evaluation of the new REGA version 3 and seven other tools
journal, October 2013

  • Pineda-Peña, Andrea-Clemencia; Faria, Nuno Rodrigues; Imbrechts, Stijn
  • Infection, Genetics and Evolution, Vol. 19
  • DOI: 10.1016/j.meegid.2013.04.032

Impact of HIV‐1 viral subtype on disease progression and response to antiretroviral therapy
journal, January 2010

  • Easterbrook, Philippa J.; Smith, Mel; Mullen, Jane
  • Journal of the International AIDS Society, Vol. 13, Issue 1
  • DOI: 10.1186/1758-2652-13-4

HIV-1 subtype distribution and its demographic determinants in newly diagnosed patients in Europe suggest highly compartmentalized epidemics
journal, January 2013

  • Abecasis, Ana B.; Wensing, Annemarie MJ; Paraskevis, Dimitris
  • Retrovirology, Vol. 10, Issue 1
  • DOI: 10.1186/1742-4690-10-7

Preferential in-utero transmission of HIV-1 subtype C as compared to HIV-1 subtype A or D
journal, January 2004


MAFFT Multiple Sequence Alignment Software Version 7: Improvements in Performance and Usability
journal, January 2013

  • Katoh, K.; Standley, D. M.
  • Molecular Biology and Evolution, Vol. 30, Issue 4
  • DOI: 10.1093/molbev/mst010

A web-based genotyping resource for viral sequences
journal, July 2004

  • Rozanov, M.; Plikat, U.; Chappey, C.
  • Nucleic Acids Research, Vol. 32, Issue Web Server
  • DOI: 10.1093/nar/gkh419

IQ-TREE: A Fast and Effective Stochastic Algorithm for Estimating Maximum-Likelihood Phylogenies
journal, November 2014

  • Nguyen, Lam-Tung; Schmidt, Heiko A.; von Haeseler, Arndt
  • Molecular Biology and Evolution, Vol. 32, Issue 1
  • DOI: 10.1093/molbev/msu300

ETE 3: Reconstruction, Analysis, and Visualization of Phylogenomic Data
journal, February 2016

  • Huerta-Cepas, Jaime; Serra, François; Bork, Peer
  • Molecular Biology and Evolution, Vol. 33, Issue 6
  • DOI: 10.1093/molbev/msw046