Mutations that Cause Human Disease: A Computational/Experimental Approach

Beernink, P; Barsky, D; Pesavento, B

doi:10.2172/898012


Technical Report · January 11, 2006

DOI: https://doi.org/10.2172/898012 · OSTI ID: 898012


International genome sequencing projects have produced billions of nucleotides (letters) of DNA sequence data, including the complete genome sequences of 74 organisms. These genome sequences have created many new scientific opportunities, including the ability to identify sequence variations among individuals within a species. These genetic differences, known as single nucleotide polymorphisms (SNPs), are particularly important in understanding the genetic basis for disease susceptibility. Since the report of the complete human genome sequence, over two million human SNPs have been identified, including those from a large-scale comparison of an entire chromosome from twenty individuals. Of the protein-coding SNPs (cSNPs), approximately half lead to a single amino acid change in the encoded protein (non-synonymous coding SNPs). Most of these changes are functionally silent, while the remainder negatively impact the protein and sometimes cause human disease. To date, over 550 SNPs have been found to cause single-locus (monogenic) diseases, and many others have been associated with polygenic diseases. SNPs have been linked to specific human diseases, including late-onset Parkinson disease, autism, rheumatoid arthritis and cancer. The ability to accurately predict the effects of these SNPs on protein function would represent a major advance toward understanding these diseases.

To date, several attempts have been made to predict the effects of such mutations. The most successful of these is a computational approach called "Sorting Intolerant From Tolerant" (SIFT). This method uses sequence conservation among many similar proteins to predict which residues in a protein are functionally important. However, it suffers from several limitations. First, a query sequence must have a sufficient number of relatives to infer sequence conservation. Second, the method neither uses nor provides any information on protein structure, which can be used to understand how an amino acid change affects the protein.

The experimental methods that provide the most detailed structural information on proteins are X-ray crystallography and NMR spectroscopy. However, these methods are labor intensive and currently cannot be carried out on a genomic scale. Nonetheless, Structural Genomics projects are being pursued by more than a dozen groups and consortia worldwide, and as a result the number of experimentally determined structures is rising exponentially. Based on the expectation that protein structures will continue to be determined at an ever-increasing rate, reliable structure prediction schemes will become increasingly valuable, leading to information on protein function and disease for many different proteins. Given known genetic variability and experimentally determined protein structures, can we accurately predict the effects of single amino acid substitutions? An objective assessment of this question would involve comparing predicted and experimentally determined structures, a comparison that has not yet been rigorously performed. The completed research leveraged existing expertise at LLNL in computational and structural biology, as well as significant computing resources, to address this question.
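The conservation idea that the abstract attributes to SIFT can be illustrated with a small sketch. This is not the SIFT algorithm and not the authors' method; it is a simplified stand-in that scores each column of an alignment by Shannon entropy and flags a substitution when the column is highly conserved and the new residue was never observed there. The toy alignment, the function names and the 0.8 threshold are all hypothetical choices made for illustration.

```python
import math
from collections import Counter

# Hypothetical toy alignment of related protein fragments (not real data).
ALIGNMENT = [
    "MKCAL",
    "MRCAV",
    "MKCGI",
    "MHCAL",
]


def column_conservation(alignment):
    """Return one score per column in [0, 1]; 1.0 means the column is invariant."""
    n_seqs = len(alignment)
    length = len(alignment[0])
    if any(len(seq) != length for seq in alignment):
        raise ValueError("all aligned sequences must have the same length")
    max_entropy = math.log2(20)  # entropy of 20 equally likely amino acids
    scores = []
    for col in range(length):
        counts = Counter(seq[col] for seq in alignment)
        entropy = -sum(
            (c / n_seqs) * math.log2(c / n_seqs) for c in counts.values()
        )
        scores.append(1.0 - entropy / max_entropy)
    return scores


def flag_substitution(alignment, position, new_residue, threshold=0.8):
    """Flag a substitution at a conserved column if the residue is unseen there.

    The threshold is an arbitrary illustrative cutoff, not a published value.
    """
    score = column_conservation(alignment)[position]
    observed = {seq[position] for seq in alignment}
    return score >= threshold and new_residue not in observed


if __name__ == "__main__":
    for pos, score in enumerate(column_conservation(ALIGNMENT)):
        print(pos, round(score, 3))
    print(flag_substitution(ALIGNMENT, 2, "S"))
```

With only four sequences the scores are unreliable, which mirrors the first limitation noted above: a query needs enough relatives for conservation to be inferred. The sketch also uses no structural information, which is the second limitation.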


Research Organization: Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States)

Sponsoring Organization: USDOE

DOE Contract Number: W-7405-ENG-48

OSTI ID: 898012

Report Number(s): UCRL-TR-218296; TRN: US200705%%522

Country of Publication: United States

Language: English

Similar Records

Final Report - DE-FG03-97ER62385

Technical Report · January 2, 2003

Nickerson, Deborah A

SNPs in putative regulatory regions identified by human mouse comparative sequencing and transcription factor binding site data

Journal Article · January 1, 2002 · Mammalian Genome

Banerjee, Poulabi; Bahlo, Melanie; Schwartz, Jody R; +5 more

Identification of functional elements and regulatory circuits by Drosophila modENCODE

Journal Article · December 22, 2010 · Science

Roy, Sushmita; Ernst, Jason; Kharchenko, Peter V; +93 more

Related Subjects

59 BASIC BIOLOGICAL SCIENCES
AMINO ACIDS
BIOLOGY
CHROMOSOMES
CRYSTALLOGRAPHY
DISEASES
DNA
FORECASTING
GENETIC VARIABILITY
GENETICS
MUTATIONS
NEOPLASMS
NUCLEOTIDES
PROTEIN STRUCTURE
PROTEINS
RESIDUES
RHEUMATIC DISEASES
SORTING
SPECTROSCOPY
