DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: PyPop: a mature open-source software pipeline for population genomics

Journal Article · · Frontiers in Immunology
 [1];  [2];  [3];  [4];  [5];  [6]
  1. Amber Biology LLC, Cambridge, MA (United States); Ronin Institute, Montclair, NJ (United States); Institute for Globally Distributed Open Research and Education (IGDORE), Cambridge, MA (United States)
  2. Univ. of Vermont, Burlington, VT (United States)
  3. Univ. of California, San Francisco, CA (United States)
  4. Lawrence Livermore National Laboratory (LLNL), Livermore, CA (United States)
  5. Univ. of Vermont, Burlington, VT (United States); Mariani Systems LLC, Hanover, NH (United States)
  6. Amber Biology LLC, Cambridge, MA (United States); Ronin Institute, Montclair, NJ (United States)

Python for Population Genomics (PyPop) is a software package that processes genotype and allele data and performs large-scale population genetic analyses on highly polymorphic multi-locus genotype data. In particular, PyPop tests data conformity to Hardy-Weinberg equilibrium expectations, performs Ewens-Watterson tests for selection, estimates haplotype frequencies, measures linkage disequilibrium, and tests significance. Standardized means of performing these tests is key for contemporary studies of evolutionary biology and population genetics, and these tests are central to genetic studies of disease association as well. Here, we present PyPop 1.0.0, a new major release of the package, which implements new features using the more robust infrastructure of GitHub, and is distributed via the industry-standard Python Package Index. New features include implementation of the asymmetric linkage disequilibrium measures and, of particular interest to the immunogenetics research communities, support for modern nomenclature, including colon-delimited allele names, and improvements to meta-analysis features for aggregating outputs for multiple populations.

Research Organization:
Lawrence Livermore National Laboratory (LLNL), Livermore, CA (United States)
Sponsoring Organization:
USDOE
Grant/Contract Number:
AC52-07NA27344
OSTI ID:
2471181
Journal Information:
Frontiers in Immunology, Journal Name: Frontiers in Immunology Vol. 15; ISSN 1664-3224
Publisher:
Frontiers Research FoundationCopyright Statement
Country of Publication:
United States
Language:
English

References (29)

Urban colonization through multiple genetic lenses: The city‐fox phenomenon revisited journal January 2019
Balancing selection and heterogeneity across the classical human leukocyte antigen loci: A meta-analytic review of 497 population studies journal July 2008
Collection and storage of HLA NGS genotyping data for the 17th International HLA and Immunogenetics Workshop journal February 2018
How natural selection shapes genetic differentiation in the MHC region: A case study with Native Americans journal July 2021
Unrelated Stem Cell Donor HLA Match Likelihood in the US Registry Incorporating HLA-DPB1 Permissive Mismatching journal April 2023
CYP2C9, CYP2D6, G6PD, GCLC, GSTM1 and NAT2 gene polymorphisms and risk of adverse reactions to sulfamethoxazole and ciprofloxacin in San Luis Potosí, Mexico journal September 2019
XXI.—On the Dominance Ratio journal January 1923
High resolution HLA analysis reveals independent class I haplotypes and amino-acid motifs protective for multiple sclerosis journal January 2018
HLA-associated susceptibility to childhood B-cell precursor ALL: definition and role of HLA-DPB1 supertypes journal March 2008
Hematopoietic stem cell donor registry strategies for assigning search determinants and matching relationships journal December 2003
MEGA: Molecular Evolutionary Genetics Analysis software for microcomputers journal January 1994
Gametic Disequilibrium Measures: Proceed With Caution journal October 1987
Systems of Mating. i. the Biometric Relations Between Parent and Offspring journal March 1921
Systems of Mating. ii. the Effects of Inbreeding on the Genetic Composition of a Population journal March 1921
Technical Debt in Computational Science journal November 2015
MHC class I polymorphic Alu insertion (POALIN) allele and haplotype frequencies in the Arabs of the United Arab Emirates and other world populations journal April 2019
PyPop update - a software pipeline for large-scale multilocus population genomics journal April 2007
Nomenclature for factors of the HLA system, 2010 journal April 2010
A Mathematical Theory of Natural and Artificial Selection. part ii the Influence of Partial Self‐Fertilisation, Inbreeding, Assortative Mating, and Selective Fertilisation on the Composition of Mendelian Populations, and on Natural Selection. journal October 1924
Genotype List String: a grammar for describing HLA and KIR genotyping results in a text string journal July 2013
Genotype List String 1.1: Extending the Genotype List String grammar for describing HLA and Killer‐cell Immunoglobulin‐like Receptor genotypes journal June 2023
A T-cell epitope encoded by a subset of HLA-DPB1 alleles determines nonpermissive mismatches for hematologic stem cell transplantation journal October 2003
Significantly higher frequencies of alloreactive CD4+ T cells responding to nonpermissive than to permissive HLA-DPB1 T-cell epitope disparities journal September 2010
Malaria in Venezuela: changes in the complexity of infection reflects the increment in transmission intensity journal May 2020
Limited differentiation among Plasmodium vivax populations from the northwest and to the south Pacific Coast of Colombia: A malaria corridor? journal March 2019
Singularity: Scientific containers for mobility of compute journal May 2017
Conditional Asymmetric Linkage Disequilibrium (ALD): Extending the Biallelic r2 Measure journal July 2014
PyPop: A mature open-source software pipeline for population genomics preprint January 2024
How open science helps researchers succeed journal July 2016

Similar Records

HLA polymorphism in the Havasupai: Evidence for balancing selection
Journal Article · 1993 · American Journal of Human Genetics; (United States) · OSTI ID:5105330

Allelic associations of two polymorphic microsatellites in intron 40 of the human von Willebrand factor gene
Journal Article · 1994 · Proceedings of the National Academy of Sciences of the United States of America; (United States) · OSTI ID:6911071

Violations of the ceiling principle: Exact conditions and statistical evidence
Journal Article · 1993 · American Journal of Human Genetics; (United States) · OSTI ID:5941266