SNAPPy: A snakemake pipeline for scalable HIV-1 subtyping by phylogenetic pairing
Abstract
Abstract Human immunodeficiency virus 1 (HIV-1) genome sequencing is routinely done for drug resistance monitoring in hospitals worldwide. Subtyping these extensive datasets of HIV-1 sequences is a critical first step in molecular epidemiology and evolution studies. The clinical relevance of HIV-1 subtypes is increasingly recognized. Several studies suggest subtype-related differences in disease progression, transmission route efficiency, immune evasion, and even therapeutic outcomes. HIV-1 subtyping is mainly done using web-servers. These tools have limitations in scalability and potential noncompliance with data protection legislation. Thus, the aim of this work was to develop an efficient method for large-scale local HIV-1 subtyping. We designed SNAPPy: a snakemake pipeline for scalable HIV-1 subtyping by phylogenetic pairing. It contains several tasks of phylogenetic inference and BLAST queries, which can be executed sequentially or in parallel, taking advantage of multiple-core processing units. Although it was built for subtyping, SNAPPy is also useful to perform extensive HIV-1 alignments. This tool facilitates large-scale sequence-based HIV-1 research by providing a local, resource efficient and scalable alternative for HIV-1 subtyping. It is capable of analyzing full-length genomes or partial HIV-1 genomic regions (GAG, POL, and ENV) and recognizes more than ninety circulating recombinant forms. SNAPPy is freely available at: https://github.com/PMMAraujo/snappy/releases.
- Authors:
-
- Life and Health Sciences Research institute (ICVS), School of Medicine, University of Minho, Braga, Portugal, ICVS/3B’s - PT Government Associate Laboratory, Braga, Guimarães, Portugal
- Publication Date:
- Sponsoring Org.:
- USDOE Office of Nuclear Energy (NE), Nuclear Fuel Cycle and Supply Chain
- OSTI Identifier:
- 1574552
- Grant/Contract Number:
- NORTE-01-0145-FEDER-000013; POCI-01-0145-FEDER-007038; IF/00474/2014; PDE/BDE/113599/2015
- Resource Type:
- Published Article
- Journal Name:
- Virus Evolution
- Additional Journal Information:
- Journal Name: Virus Evolution Journal Volume: 5 Journal Issue: 2; Journal ID: ISSN 2057-1577
- Publisher:
- Oxford University Press
- Country of Publication:
- United Kingdom
- Language:
- English
Citation Formats
Araújo, Pedro M. M., Martins, Joana S., and Osório, Nuno S. SNAPPy: A snakemake pipeline for scalable HIV-1 subtyping by phylogenetic pairing. United Kingdom: N. p., 2019.
Web. doi:10.1093/ve/vez050.
Araújo, Pedro M. M., Martins, Joana S., & Osório, Nuno S. SNAPPy: A snakemake pipeline for scalable HIV-1 subtyping by phylogenetic pairing. United Kingdom. https://doi.org/10.1093/ve/vez050
Araújo, Pedro M. M., Martins, Joana S., and Osório, Nuno S. Tue .
"SNAPPy: A snakemake pipeline for scalable HIV-1 subtyping by phylogenetic pairing". United Kingdom. https://doi.org/10.1093/ve/vez050.
@article{osti_1574552,
title = {SNAPPy: A snakemake pipeline for scalable HIV-1 subtyping by phylogenetic pairing},
author = {Araújo, Pedro M. M. and Martins, Joana S. and Osório, Nuno S.},
abstractNote = {Abstract Human immunodeficiency virus 1 (HIV-1) genome sequencing is routinely done for drug resistance monitoring in hospitals worldwide. Subtyping these extensive datasets of HIV-1 sequences is a critical first step in molecular epidemiology and evolution studies. The clinical relevance of HIV-1 subtypes is increasingly recognized. Several studies suggest subtype-related differences in disease progression, transmission route efficiency, immune evasion, and even therapeutic outcomes. HIV-1 subtyping is mainly done using web-servers. These tools have limitations in scalability and potential noncompliance with data protection legislation. Thus, the aim of this work was to develop an efficient method for large-scale local HIV-1 subtyping. We designed SNAPPy: a snakemake pipeline for scalable HIV-1 subtyping by phylogenetic pairing. It contains several tasks of phylogenetic inference and BLAST queries, which can be executed sequentially or in parallel, taking advantage of multiple-core processing units. Although it was built for subtyping, SNAPPy is also useful to perform extensive HIV-1 alignments. This tool facilitates large-scale sequence-based HIV-1 research by providing a local, resource efficient and scalable alternative for HIV-1 subtyping. It is capable of analyzing full-length genomes or partial HIV-1 genomic regions (GAG, POL, and ENV) and recognizes more than ninety circulating recombinant forms. SNAPPy is freely available at: https://github.com/PMMAraujo/snappy/releases.},
doi = {10.1093/ve/vez050},
journal = {Virus Evolution},
number = 2,
volume = 5,
place = {United Kingdom},
year = {Tue Nov 19 00:00:00 EST 2019},
month = {Tue Nov 19 00:00:00 EST 2019}
}
https://doi.org/10.1093/ve/vez050
Web of Science
Works referenced in this record:
HIV‐1 Subtype D Infection Is Associated with Faster Disease Progression than Subtype A in Spite of Similar Plasma HIV‐1 Loads
journal, April 2007
- Baeten, Jared M.; Chohan, Bhavna; Lavreys, Ludo
- The Journal of Infectious Diseases, Vol. 195, Issue 8
Subtype C Is Associated with Increased Vaginal Shedding of HIV‐1
journal, August 2005
- John‐Stewart, Grace C.; Nduati, Ruth W.; Rousseau, Christine M.
- The Journal of Infectious Diseases, Vol. 192, Issue 3
The heterosexual human immunodeficiency virus type 1 epidemic in Thailand is caused by an intersubtype (A/E) recombinant of African origin.
journal, January 1996
- Gao, F.; Robertson, D. L.; Morrison, S. G.
- Journal of virology, Vol. 70, Issue 10
A statistical model for HIV-1 sequence classification using the subtype analyser (STAR)
journal, July 2005
- Myers, R. E.; Gale, C. V.; Harrison, A.
- Bioinformatics, Vol. 21, Issue 17
Frequencies of Gag-restricted T-cell escape “footprints” differ across HIV-1 clades A1 and D chronically infected Ugandans irrespective of host HLA B alleles
journal, March 2015
- Serwanga, Jennifer; Nakiboneka, Ritah; Mugaba, Susan
- Vaccine, Vol. 33, Issue 14
Nextflow enables reproducible computational workflows
journal, April 2017
- Di Tommaso, Paolo; Chatzou, Maria; Floden, Evan W.
- Nature Biotechnology, Vol. 35, Issue 4
Rationale and Uses of a Public HIV Drug‐Resistance Database
journal, September 2006
- Shafer, Robert W.
- The Journal of Infectious Diseases, Vol. 194, Issue s1
jpHMM: Improving the reliability of recombination prediction in HIV-1
journal, May 2009
- Schultz, A. -K.; Zhang, M.; Bulla, I.
- Nucleic Acids Research, Vol. 37, Issue Web Server
COMET: adaptive context-based modeling for ultrafast HIV-1 subtype identification
journal, August 2014
- Struck, Daniel; Lawyer, Glenn; Ternes, Anne-Marie
- Nucleic Acids Research, Vol. 42, Issue 18
Biopython: freely available Python tools for computational molecular biology and bioinformatics
journal, March 2009
- Cock, P. J. A.; Antao, T.; Chang, J. T.
- Bioinformatics, Vol. 25, Issue 11
Implications of HIV diversity for the HIV-1 pandemic
journal, May 2013
- Hemelaar, Joris
- Journal of Infection, Vol. 66, Issue 5
Web Resources for HIV Type 1 Genotypic-Resistance Test Interpretation
journal, June 2006
- Liu, T. F.; Shafer, R. W.
- Clinical Infectious Diseases, Vol. 42, Issue 11
Recombination in viruses: Mechanisms, methods of study, and evolutionary consequences
journal, March 2015
- Pérez-Losada, Marcos; Arenas, Miguel; Galán, Juan Carlos
- Infection, Genetics and Evolution, Vol. 30
Origin and Epidemiological History of HIV-1 CRF14_BG
journal, September 2011
- Bártolo, Inês; Abecasis, Ana B.; Borrego, Pedro
- PLoS ONE, Vol. 6, Issue 9
A V106M mutation in HIV-1 clade C viruses exposed to efavirenz confers cross-resistance to non-nucleoside reverse transcriptase inhibitors
journal, January 2003
- Brenner, Bluma; Turner, Dan; Oliveira, Maureen
- AIDS, Vol. 17, Issue 1
Analysis of the history and spread of HIV-1 in Uganda using phylodynamics
journal, July 2015
- Ragonnet-Cronin, Manon; Ssemwanga, Deogratius; Logue, Christopher H.
- Journal of General Virology, Vol. 96, Issue 7
BLAST+: architecture and applications
journal, January 2009
- Camacho, Christiam; Coulouris, George; Avagyan, Vahram
- BMC Bioinformatics, Vol. 10, Issue 1
An Evolutionary Model-Based Algorithm for Accurate Phylogenetic Breakpoint Mapping and Subtype Prediction in HIV-1
journal, November 2009
- Kosakovsky Pond, Sergei L.; Posada, David; Stawiski, Eric
- PLoS Computational Biology, Vol. 5, Issue 11
Snakemake--a scalable bioinformatics workflow engine
journal, August 2012
- Koster, J.; Rahmann, S.
- Bioinformatics, Vol. 28, Issue 19
Comparative Evaluation of Subtyping Tools for Surveillance of Newly Emerging HIV-1 Strains
journal, July 2017
- Fabeni, Lavinia; Berno, Giulia; Fokam, Joseph
- Journal of Clinical Microbiology, Vol. 55, Issue 9
Characterization of a large cluster of HIV-1 A1 infections detected in Portugal and connected to several Western European countries
journal, May 2019
- Araújo, Pedro M. M.; Carvalho, Alexandre; Pingarilho, Marta
- Scientific Reports, Vol. 9, Issue 1
Antiretroviral resistance in different HIV-1 subtypes: impact on therapy outcomes and resistance testing interpretation
journal, January 2007
- Camacho, Ricardo Jorge; Vandamme, Anne-Mieke
- Current Opinion in HIV and AIDS, Vol. 2, Issue 2
Protease mutation M89I/V is linked to therapy failure in patients infected with the HIV-1 non-B subtypes C, F or G
journal, January 2005
- Abecasis, Ana Barroso; Deforche, Koen; Snoeck, Joke
- AIDS, Vol. 19, Issue 16
Assessment of automated genotyping protocols as tools for surveillance of HIV-1 genetic diversity
journal, January 2006
- Gifford, Robert; de Oliveira, Tulio; Rambaut, Andrew
- AIDS, Vol. 20, Issue 11
Effect of Human Immunodeficiency Virus Type 1 (HIV‐1) Subtype on Disease Progression in Persons from Rakai, Uganda, with Incident HIV‐1 Infection
journal, March 2008
- Kiwanuka, Noah; Laeyendecker, Oliver; Robb, Merlin
- The Journal of Infectious Diseases, Vol. 197, Issue 5
Automated subtyping of HIV-1 genetic sequences for clinical and surveillance purposes: Performance evaluation of the new REGA version 3 and seven other tools
journal, October 2013
- Pineda-Peña, Andrea-Clemencia; Faria, Nuno Rodrigues; Imbrechts, Stijn
- Infection, Genetics and Evolution, Vol. 19
Impact of HIV‐1 viral subtype on disease progression and response to antiretroviral therapy
journal, January 2010
- Easterbrook, Philippa J.; Smith, Mel; Mullen, Jane
- Journal of the International AIDS Society, Vol. 13, Issue 1
HIV-1 subtype distribution and its demographic determinants in newly diagnosed patients in Europe suggest highly compartmentalized epidemics
journal, January 2013
- Abecasis, Ana B.; Wensing, Annemarie MJ; Paraskevis, Dimitris
- Retrovirology, Vol. 10, Issue 1
Preferential in-utero transmission of HIV-1 subtype C as compared to HIV-1 subtype A or D
journal, January 2004
- Renjifo, Boris; Gilbert, Peter; Chaplin, Beth
- AIDS, Vol. 18, Issue 12
MAFFT Multiple Sequence Alignment Software Version 7: Improvements in Performance and Usability
journal, January 2013
- Katoh, K.; Standley, D. M.
- Molecular Biology and Evolution, Vol. 30, Issue 4
A web-based genotyping resource for viral sequences
journal, July 2004
- Rozanov, M.; Plikat, U.; Chappey, C.
- Nucleic Acids Research, Vol. 32, Issue Web Server
IQ-TREE: A Fast and Effective Stochastic Algorithm for Estimating Maximum-Likelihood Phylogenies
journal, November 2014
- Nguyen, Lam-Tung; Schmidt, Heiko A.; von Haeseler, Arndt
- Molecular Biology and Evolution, Vol. 32, Issue 1
ETE 3: Reconstruction, Analysis, and Visualization of Phylogenomic Data
journal, February 2016
- Huerta-Cepas, Jaime; Serra, François; Bork, Peer
- Molecular Biology and Evolution, Vol. 33, Issue 6