Sample records for large structural variants

  1. Structure-based design of combinatorial mutagenesis libraries

    PubMed Central

    Verma, Deeptak; Grigoryan, Gevorg; Bailey-Kellogg, Chris

    2015-01-01

    The development of protein variants with improved properties (thermostability, binding affinity, catalytic activity, etc.) has greatly benefited from the application of high-throughput screens evaluating large, diverse combinatorial libraries. At the same time, since only a very limited portion of sequence space can be experimentally constructed and tested, an attractive possibility is to use computational protein design to focus libraries on a productive portion of the space. We present a general-purpose method, called “Structure-based Optimization of Combinatorial Mutagenesis” (SOCoM), which can optimize arbitrarily large combinatorial mutagenesis libraries directly based on structural energies of their constituents. SOCoM chooses both positions and substitutions, employing a combinatorial optimization framework based on library-averaged energy potentials in order to avoid explicitly modeling every variant in every possible library. In case study applications to green fluorescent protein, β-lactamase, and lipase A, SOCoM optimizes relatively small, focused libraries whose variants achieve energies comparable to or better than previous library design efforts, as well as larger libraries (previously not designable by structure-based methods) whose variants cover greater diversity while still maintaining substantially better energies than would be achieved by representative random library approaches. By allowing the creation of large-scale combinatorial libraries based on structural calculations, SOCoM promises to increase the scope of applicability of computational protein design and improve the hit rate of discovering beneficial variants. While designs presented here focus on variant stability (predicted by total energy), SOCoM can readily incorporate other structure-based assessments, such as the energy gap between alternative conformational or bound states. PMID:25611189

  2. Structure-based design of combinatorial mutagenesis libraries.

    PubMed

    Verma, Deeptak; Grigoryan, Gevorg; Bailey-Kellogg, Chris

    2015-05-01

    The development of protein variants with improved properties (thermostability, binding affinity, catalytic activity, etc.) has greatly benefited from the application of high-throughput screens evaluating large, diverse combinatorial libraries. At the same time, since only a very limited portion of sequence space can be experimentally constructed and tested, an attractive possibility is to use computational protein design to focus libraries on a productive portion of the space. We present a general-purpose method, called "Structure-based Optimization of Combinatorial Mutagenesis" (SOCoM), which can optimize arbitrarily large combinatorial mutagenesis libraries directly based on structural energies of their constituents. SOCoM chooses both positions and substitutions, employing a combinatorial optimization framework based on library-averaged energy potentials in order to avoid explicitly modeling every variant in every possible library. In case study applications to green fluorescent protein, β-lactamase, and lipase A, SOCoM optimizes relatively small, focused libraries whose variants achieve energies comparable to or better than previous library design efforts, as well as larger libraries (previously not designable by structure-based methods) whose variants cover greater diversity while still maintaining substantially better energies than would be achieved by representative random library approaches. By allowing the creation of large-scale combinatorial libraries based on structural calculations, SOCoM promises to increase the scope of applicability of computational protein design and improve the hit rate of discovering beneficial variants. While designs presented here focus on variant stability (predicted by total energy), SOCoM can readily incorporate other structure-based assessments, such as the energy gap between alternative conformational or bound states. © 2015 The Protein Society.

  3. Genomic Rearrangements in Arabidopsis Considered as Quantitative Traits.

    PubMed

    Imprialou, Martha; Kahles, André; Steffen, Joshua G; Osborne, Edward J; Gan, Xiangchao; Lempe, Janne; Bhomra, Amarjit; Belfield, Eric; Visscher, Anne; Greenhalgh, Robert; Harberd, Nicholas P; Goram, Richard; Hein, Jotun; Robert-Seilaniantz, Alexandre; Jones, Jonathan; Stegle, Oliver; Kover, Paula; Tsiantis, Miltos; Nordborg, Magnus; Rätsch, Gunnar; Clark, Richard M; Mott, Richard

    2017-04-01

    To understand the population genetics of structural variants and their effects on phenotypes, we developed an approach to mapping structural variants that segregate in a population sequenced at low coverage. We avoid calling structural variants directly. Instead, the evidence for a potential structural variant at a locus is indicated by variation in the counts of short reads that map anomalously to that locus. These structural variant traits are treated as quantitative traits and mapped genetically, analogously to a gene expression study. Association between a structural variant trait at one locus and genotypes at a distant locus indicates the origin and target of a transposition. Using ultra-low-coverage (0.3×) population sequence data from 488 recombinant inbred Arabidopsis thaliana genomes, we identified 6502 segregating structural variants. Remarkably, 25% of these were transpositions. While many structural variants cannot be delineated precisely, we validated 83% of 44 predicted transposition breakpoints by polymerase chain reaction. We show that specific structural variants may be causative for quantitative trait loci for germination and resistance to infection by the fungus Albugo laibachii, isolate Nc14. Further, we show that the phenotypic heritability attributable to read-mapping anomalies differs from, and, in the case of time to germination and bolting, exceeds that due to standard genetic variation. Genes within structural variants are also more likely to be silenced or dysregulated. This approach complements the prevalent strategy of structural variant discovery in fewer individuals sequenced at high coverage. It is generally applicable to large populations sequenced at low coverage, and is particularly suited to mapping transpositions. Copyright © 2017 by the Genetics Society of America.

  4. RAPTR-SV: a hybrid method for the detection of structural variants

    USDA-ARS's Scientific Manuscript database

    Motivation: Identification of Structural Variants (SV) in sequence data results in a large number of false positive calls using existing software, which overburdens subsequent validation. Results: Simulations using RAPTR-SV and another software package that uses a similar algorithm for SV detection...

  5. G2S: a web-service for annotating genomic variants on 3D protein structures.

    PubMed

    Wang, Juexin; Sheridan, Robert; Sumer, S Onur; Schultz, Nikolaus; Xu, Dong; Gao, Jianjiong

    2018-06-01

    Accurately mapping and annotating genomic locations on 3D protein structures is a key step in structure-based analysis of genomic variants detected by recent large-scale sequencing efforts. There are several mapping resources currently available, but none of them provides a web API (Application Programming Interface) that supports programmatic access. We present G2S, a real-time web API that provides automated mapping of genomic variants on 3D protein structures. G2S can align genomic locations of variants, protein locations, or protein sequences to protein structures and retrieve the mapped residues from structures. The G2S API uses a REST-inspired design and can be used by various clients such as web browsers, command terminals, programming languages and other bioinformatics tools for bringing 3D structures into genomic variant analysis. The webserver and source code are freely available at https://g2s.genomenexus.org. Contact: g2s@genomenexus.org. Supplementary data are available at Bioinformatics online.

  6. Identification of causal genes for complex traits.

    PubMed

    Hormozdiari, Farhad; Kichaev, Gleb; Yang, Wen-Yun; Pasaniuc, Bogdan; Eskin, Eleazar

    2015-06-15

    Although genome-wide association studies (GWAS) have identified thousands of variants associated with common diseases and complex traits, only a handful of these variants are validated to be causal. We consider 'causal variants' as variants which are responsible for the association signal at a locus. As opposed to association studies that benefit from linkage disequilibrium (LD), the main challenge in identifying causal variants at associated loci lies in distinguishing among the many closely correlated variants due to LD. This is particularly important for model organisms such as inbred mice, where LD extends much further than in human populations, resulting in large stretches of the genome with significantly associated variants. Furthermore, these model organisms are highly structured and require correction for population structure to remove potential spurious associations. In this work, we propose CAVIAR-Gene (CAusal Variants Identification in Associated Regions), a novel method that is able to operate across large LD regions of the genome while also correcting for population structure. A key feature of our approach is that it provides as output a minimally sized set of genes that captures the genes which harbor causal variants with probability ρ. Through extensive simulations, we demonstrate that our method not only speeds up computation, but also has an average of 10% higher recall rate compared with the existing approaches. We validate our method using real mouse high-density lipoprotein (HDL) data and show that CAVIAR-Gene is able to identify Apoa2 (a gene known to harbor causal variants for HDL), while reducing the number of genes that need to be tested for functionality by a factor of 2. Software is freely available for download at genetics.cs.ucla.edu/caviar. © The Author 2015. Published by Oxford University Press.
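
    The idea of a minimally sized gene set that captures the causal genes with probability ρ can be illustrated with a deliberately simplified sketch. It assumes that a posterior probability per gene is already available and that exactly one gene at the locus is causal, so the probabilities simply add; the abstract does not say that CAVIAR-Gene works this way. The function name `rho_gene_set`, the gene names other than Apoa2, and all numbers are hypothetical.

```python
def rho_gene_set(posteriors, rho=0.95):
    """Greedy sketch: smallest set of genes whose summed posterior reaches rho.

    Assumption (not from the abstract): exactly one causal gene per locus,
    so per-gene posterior probabilities are mutually exclusive and additive.
    """
    # Rank genes from most to least probable.
    ranked = sorted(posteriors.items(), key=lambda kv: kv[1], reverse=True)
    chosen, total = [], 0.0
    for gene, prob in ranked:
        if total >= rho:
            break
        chosen.append(gene)
        total += prob
    return chosen, total


# Illustrative posteriors only; these are not results from the paper.
example = {"Apoa2": 0.70, "GeneB": 0.20, "GeneC": 0.06, "GeneD": 0.04}
genes, mass = rho_gene_set(example, rho=0.85)
print(genes, round(mass, 2))
```

    Taking genes in descending order of probability gives the smallest set under the single-causal-gene assumption; with several causal genes per locus the set would have to be built over joint configurations instead.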

  7. Identification of causal genes for complex traits

    PubMed Central

    Hormozdiari, Farhad; Kichaev, Gleb; Yang, Wen-Yun; Pasaniuc, Bogdan; Eskin, Eleazar

    2015-01-01

    Motivation: Although genome-wide association studies (GWAS) have identified thousands of variants associated with common diseases and complex traits, only a handful of these variants are validated to be causal. We consider ‘causal variants’ as variants which are responsible for the association signal at a locus. As opposed to association studies that benefit from linkage disequilibrium (LD), the main challenge in identifying causal variants at associated loci lies in distinguishing among the many closely correlated variants due to LD. This is particularly important for model organisms such as inbred mice, where LD extends much further than in human populations, resulting in large stretches of the genome with significantly associated variants. Furthermore, these model organisms are highly structured and require correction for population structure to remove potential spurious associations. Results: In this work, we propose CAVIAR-Gene (CAusal Variants Identification in Associated Regions), a novel method that is able to operate across large LD regions of the genome while also correcting for population structure. A key feature of our approach is that it provides as output a minimally sized set of genes that captures the genes which harbor causal variants with probability ρ. Through extensive simulations, we demonstrate that our method not only speeds up computation, but also has an average of 10% higher recall rate compared with the existing approaches. We validate our method using real mouse high-density lipoprotein (HDL) data and show that CAVIAR-Gene is able to identify Apoa2 (a gene known to harbor causal variants for HDL), while reducing the number of genes that need to be tested for functionality by a factor of 2. Availability and implementation: Software is freely available for download at genetics.cs.ucla.edu/caviar. Contact: eeskin@cs.ucla.edu PMID:26072484

  8. Read count-based method for high-throughput allelic genotyping of transposable elements and structural variants.

    PubMed

    Kuhn, Alexandre; Ong, Yao Min; Quake, Stephen R; Burkholder, William F

    2015-07-08

    Like other structural variants, transposable element insertions can be highly polymorphic across individuals. Their functional impact, however, remains poorly understood. Current genome-wide approaches for genotyping insertion-site polymorphisms based on targeted or whole-genome sequencing remain very expensive and can lack accuracy, hence new large-scale genotyping methods are needed. We describe a high-throughput method for genotyping transposable element insertions and other types of structural variants that can be assayed by breakpoint PCR. The method relies on next-generation sequencing of multiplex, site-specific PCR amplification products and read count-based genotype calls. We show that this method is flexible, efficient (it does not require rounds of optimization), cost-effective and highly accurate. This method can benefit a wide range of applications from the routine genotyping of animal and plant populations to the functional study of structural variants in humans.
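
    A read count-based genotype call of the kind described above might look roughly like the following sketch. It assumes that, for each assayed site, the reads have already been split into those supporting the insertion allele and those supporting the reference (empty-site) allele. The depth cutoff and allele-fraction thresholds are hypothetical placeholders, not values from the paper.

```python
def call_genotype(ins_reads, ref_reads, min_depth=20, low=0.2, high=0.8):
    """Call a diploid genotype from allele-specific read counts.

    All thresholds are illustrative assumptions, not published parameters.
    """
    total = ins_reads + ref_reads
    if total < min_depth:
        return "no_call"  # too few reads to call this site reliably
    fraction = ins_reads / total  # share of reads supporting the insertion
    if fraction < low:
        return "ref/ref"
    if fraction > high:
        return "ins/ins"
    return "ref/ins"


for counts in [(2, 198), (95, 105), (180, 4), (3, 5)]:
    print(counts, call_genotype(*counts))
```

    In practice the thresholds would presumably be calibrated per site, since amplification efficiency can differ between the two PCR products.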

  9. A high-quality human reference panel reveals the complexity and distribution of genomic structural variants.

    PubMed

    Hehir-Kwa, Jayne Y; Marschall, Tobias; Kloosterman, Wigard P; Francioli, Laurent C; Baaijens, Jasmijn A; Dijkstra, Louis J; Abdellaoui, Abdel; Koval, Vyacheslav; Thung, Djie Tjwan; Wardenaar, René; Renkens, Ivo; Coe, Bradley P; Deelen, Patrick; de Ligt, Joep; Lameijer, Eric-Wubbo; van Dijk, Freerk; Hormozdiari, Fereydoun; Uitterlinden, André G; van Duijn, Cornelia M; Eichler, Evan E; de Bakker, Paul I W; Swertz, Morris A; Wijmenga, Cisca; van Ommen, Gert-Jan B; Slagboom, P Eline; Boomsma, Dorret I; Schönhuth, Alexander; Ye, Kai; Guryev, Victor

    2016-10-06

    Structural variation (SV) represents a major source of differences between individual human genomes and has been linked to disease phenotypes. However, the majority of studies neither provide a global view of the full spectrum of these variants nor integrate them into reference panels of genetic variation. Here, we analyse whole genome sequencing data of 769 individuals from 250 Dutch families, and provide a haplotype-resolved map of 1.9 million genome variants across 9 different variant classes, including novel forms of complex indels, and retrotransposition-mediated insertions of mobile elements and processed RNAs. A large proportion are previously underreported variants sized between 21 and 100 bp. We detect 4 megabases of novel sequence, encoding 11 new transcripts. Finally, we show 191 known, trait-associated SNPs to be in strong linkage disequilibrium with SVs and demonstrate that our panel facilitates accurate imputation of SVs in unrelated individuals.

  10. Dynamic response analysis of structure under time-variant interval process model

    NASA Astrophysics Data System (ADS)

    Xia, Baizhan; Qin, Yuan; Yu, Dejie; Jiang, Chao

    2016-10-01

    Owing to aggressive environmental factors, variation of the dynamic load, degradation of material properties and wear of machine surfaces, the parameters of a structure are distinctly time-variant. The typical model for time-variant uncertainties is the random process model, which is constructed from a large number of samples. In this work, we propose a time-variant interval process model that can effectively handle time-variant uncertainties with limited information. Two methods are then presented for the dynamic response analysis of the structure under the time-variant interval process model. The first is the direct Monte Carlo method (DMCM), whose computational burden is relatively high. The second is the Monte Carlo method based on the Chebyshev polynomial expansion (MCM-CPE), whose computational efficiency is high. In MCM-CPE, the dynamic response of the structure is approximated by Chebyshev polynomials, which can be calculated efficiently, and the variational range of the dynamic response is then estimated from the samples yielded by the Monte Carlo method. To address the dependency phenomenon of interval operations, affine arithmetic is integrated into the Chebyshev polynomial expansion. The computational effectiveness and efficiency of MCM-CPE are verified by two numerical examples: a spring-mass-damper system and a shell structure.
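
    The contrast between direct Monte Carlo and Monte Carlo on a Chebyshev surrogate can be sketched on a much simpler problem than the one in the paper. The example below assumes a single time-invariant interval parameter (the stiffness k of a spring-mass-damper) and uses the closed-form steady-state amplitude under harmonic load as the response. It omits both the time-variant interval process and the affine arithmetic step, and all parameter values are hypothetical.

```python
import numpy as np
from numpy.polynomial import Chebyshev


def amplitude(k, m=1.0, c=0.4, omega=2.0, f0=1.0):
    """Steady-state displacement amplitude of a harmonically forced oscillator."""
    return f0 / np.sqrt((k - m * omega**2) ** 2 + (c * omega) ** 2)


def response_bounds(k_lo, k_hi, degree=12, n_samples=20000, seed=0):
    """Estimate the response range over k in [k_lo, k_hi] in two ways."""
    rng = np.random.default_rng(seed)
    samples = rng.uniform(k_lo, k_hi, n_samples)

    # Surrogate: Chebyshev interpolant built from degree + 1 model evaluations.
    surrogate = Chebyshev.interpolate(amplitude, degree, domain=[k_lo, k_hi])
    approx = surrogate(samples)  # cheap polynomial evaluations

    # Reference: direct Monte Carlo, one model evaluation per sample.
    direct = amplitude(samples)
    return (approx.min(), approx.max()), (direct.min(), direct.max())


surrogate_range, direct_range = response_bounds(5.0, 7.0)
print("surrogate:", surrogate_range)
print("direct:   ", direct_range)
```

    Here the model itself is cheap, so nothing is saved; the benefit of the surrogate would appear when each model evaluation is an expensive structural simulation and only degree + 1 of them are needed to build the polynomial.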

  11. Molecular models of NS3 protease variants of the Hepatitis C virus.

    PubMed

    da Silveira, Nelson J F; Arcuri, Helen A; Bonalumi, Carlos E; de Souza, Fátima P; Mello, Isabel M V G C; Rahal, Paula; Pinho, João R R; de Azevedo, Walter F

    2005-01-21

    Hepatitis C virus (HCV) currently infects approximately three percent of the world population. In view of the lack of vaccines against HCV, there is an urgent need for an efficient treatment of the disease by an effective antiviral drug. Rational drug design has not been the primary way of discovering major therapeutics. Nevertheless, there are reports of success in the development of inhibitors using a structure-based approach. The NS3 protease variants are one of the possible targets for drug development against HCV. Based on the three-dimensional structure of these variants we expect to identify new NS3 protease inhibitors. In order to speed up the modeling process, all NS3 protease variant models were generated in a Beowulf cluster. The potential of structural bioinformatics for the development of new antiviral drugs is discussed. The atomic coordinates of crystallographic structures 1CU1 and 1DY9 were used as starting models for modeling of the NS3 protease variant structures. The NS3 protease variant structures are composed of six subdomains, which occur in sequence along the polypeptide chain. The protease domain exhibits the dual beta-barrel fold that is common among members of the chymotrypsin serine protease family. The helicase domain contains two structurally related beta-alpha-beta subdomains and a third subdomain of seven helices and three short beta strands. The latter domain is usually referred to as the helicase alpha-helical subdomain. The rmsd value of bond lengths and bond angles, the average G-factor and Verify 3D values are presented for NS3 protease variant structures. This project increases the certainty that homology modeling is a useful tool in structural biology and that it can be very valuable in annotating genome sequence information and contributing to the structural and functional genomics of viruses. The structural models will be used to guide future efforts in the structure-based drug design of a new generation of NS3 protease variant inhibitors. All models in the database are publicly accessible via our interactive website, providing us with a large number of structural models for use in protein-ligand docking analysis.

  12. Identification of rare X-linked neuroligin variants by massively parallel sequencing in males with autism spectrum disorder.

    PubMed

    Steinberg, Karyn Meltz; Ramachandran, Dhanya; Patel, Viren C; Shetty, Amol C; Cutler, David J; Zwick, Michael E

    2012-09-28

    Autism spectrum disorder (ASD) is highly heritable, but the genetic risk factors for it remain largely unknown. Although structural variants with large effect sizes may explain up to 15% of ASD, genome-wide association studies have failed to uncover common single nucleotide variants with large effects on phenotype. The focus within ASD genetics is now shifting to the examination of rare sequence variants of modest effect, which is most often achieved via exome selection and sequencing. This strategy has indeed identified some rare candidate variants; however, the approach does not capture the full spectrum of genetic variation that might contribute to the phenotype. We surveyed two loci with known rare variants that contribute to ASD, the X-linked neuroligin genes, by performing massively parallel Illumina sequencing of the coding and noncoding regions from these genes in males from families with multiplex autism. We annotated all variant sites and functionally tested a subset to identify other rare mutations contributing to ASD susceptibility. We found seven rare variants at evolutionarily conserved sites in our study population. Functional analyses of the three 3' UTR variants did not show statistically significant effects on the expression of NLGN3 and NLGN4X. In addition, we identified two NLGN3 intronic variants located within conserved transcription factor binding sites that could potentially affect gene regulation. These data demonstrate the power of massively parallel, targeted sequencing studies of affected individuals for identifying rare, potentially disease-contributing variation. However, they also point out the challenges and limitations of current methods of direct functional testing of rare variants and the difficulties of identifying alleles with modest effects.

  13. Identification of rare X-linked neuroligin variants by massively parallel sequencing in males with autism spectrum disorder

    PubMed Central

    2012-01-01

    Background: Autism spectrum disorder (ASD) is highly heritable, but the genetic risk factors for it remain largely unknown. Although structural variants with large effect sizes may explain up to 15% of ASD, genome-wide association studies have failed to uncover common single nucleotide variants with large effects on phenotype. The focus within ASD genetics is now shifting to the examination of rare sequence variants of modest effect, which is most often achieved via exome selection and sequencing. This strategy has indeed identified some rare candidate variants; however, the approach does not capture the full spectrum of genetic variation that might contribute to the phenotype. Methods: We surveyed two loci with known rare variants that contribute to ASD, the X-linked neuroligin genes, by performing massively parallel Illumina sequencing of the coding and noncoding regions from these genes in males from families with multiplex autism. We annotated all variant sites and functionally tested a subset to identify other rare mutations contributing to ASD susceptibility. Results: We found seven rare variants at evolutionarily conserved sites in our study population. Functional analyses of the three 3’ UTR variants did not show statistically significant effects on the expression of NLGN3 and NLGN4X. In addition, we identified two NLGN3 intronic variants located within conserved transcription factor binding sites that could potentially affect gene regulation. Conclusions: These data demonstrate the power of massively parallel, targeted sequencing studies of affected individuals for identifying rare, potentially disease-contributing variation. However, they also point out the challenges and limitations of current methods of direct functional testing of rare variants and the difficulties of identifying alleles with modest effects. PMID:23020841

  14. A FRMD7 variant in a Japanese family causes congenital nystagmus.

    PubMed

    Kohmoto, Tomohiro; Okamoto, Nana; Satomura, Shigeko; Naruto, Takuya; Komori, Takahide; Hashimoto, Toshiaki; Imoto, Issei

    2015-01-01

    Idiopathic congenital nystagmus (ICN) is a genetically heterogeneous eye movement disorder that causes a large proportion of childhood visual impairment. Here we describe a missense variant (p.L292P) within a mutation-rich region of FRMD7 detected in three affected male siblings in a Japanese family with X-linked ICN. Combining sequence analysis and results from structural and functional predictions, we report p.L292P as a variant potentially disrupting FRMD7 function associated with X-linked ICN.

  15. A FRMD7 variant in a Japanese family causes congenital nystagmus

    PubMed Central

    Kohmoto, Tomohiro; Okamoto, Nana; Satomura, Shigeko; Naruto, Takuya; Komori, Takahide; Hashimoto, Toshiaki; Imoto, Issei

    2015-01-01

    Idiopathic congenital nystagmus (ICN) is a genetically heterogeneous eye movement disorder that causes a large proportion of childhood visual impairment. Here we describe a missense variant (p.L292P) within a mutation-rich region of FRMD7 detected in three affected male siblings in a Japanese family with X-linked ICN. Combining sequence analysis and results from structural and functional predictions, we report p.L292P as a variant potentially disrupting FRMD7 function associated with X-linked ICN. PMID:27081518

  16. DangerTrack: A scoring system to detect difficult-to-assess regions.

    PubMed

    Dolgalev, Igor; Sedlazeck, Fritz; Busby, Ben

    2017-01-01

    Over recent years, multiple groups have shown that a large number of structural variants, repeats, or problems with the underlying genome assembly have dramatic effects on the mapping, calling, and overall reliability of single nucleotide polymorphism calls. This project endeavored to develop an easy-to-use track for looking at structural variant and repeat regions. This track, DangerTrack, can be displayed alongside the existing Genome Reference Consortium assembly tracks to warn clinicians and biologists when variants of interest may be incorrectly called, of dubious quality, or on an insertion or copy number expansion. While mapping and variant calling can be automated, it is our opinion that when these regions are of interest to a particular clinical or research group, they warrant a careful examination, potentially involving localized reassembly. DangerTrack is available at https://github.com/DCGenomics/DangerTrack.

  17. VAT: a computational framework to functionally annotate variants in personal genomes within a cloud-computing environment.

    PubMed

    Habegger, Lukas; Balasubramanian, Suganthi; Chen, David Z; Khurana, Ekta; Sboner, Andrea; Harmanci, Arif; Rozowsky, Joel; Clarke, Declan; Snyder, Michael; Gerstein, Mark

    2012-09-01

    The functional annotation of variants obtained through sequencing projects is generally assumed to be a simple intersection of genomic coordinates with genomic features. However, complexities arise for several reasons, including the differential effects of a variant on alternatively spliced transcripts, as well as the difficulty in assessing the impact of small insertions/deletions and large structural variants. Taking these factors into consideration, we developed the Variant Annotation Tool (VAT) to functionally annotate variants from multiple personal genomes at the transcript level as well as obtain summary statistics across genes and individuals. VAT also allows visualization of the effects of different variants, integrates allele frequencies and genotype data from the underlying individuals and facilitates comparative analysis between different groups of individuals. VAT can either be run through a command-line interface or as a web application. Finally, in order to enable on-demand access and to minimize unnecessary transfers of large data files, VAT can be run as a virtual machine in a cloud-computing environment. VAT is implemented in C and PHP. The VAT web service, Amazon Machine Image, source code and detailed documentation are available at vat.gersteinlab.org.

  18. Different structural stability and toxicity of PrP(ARR) and PrP(ARQ) sheep prion protein variants.

    PubMed

    Paludi, Domenico; Thellung, Stefano; Chiovitti, Katia; Corsaro, Alessandro; Villa, Valentina; Russo, Claudio; Ianieri, Adriana; Bertsch, Uwe; Kretzschmar, Hans A; Aceto, Antonio; Florio, Tullio

    2007-12-01

    The polymorphisms at amino acid residues 136, 154, and 171 in ovine prion protein (PrP) have been associated with different susceptibility to scrapie: animals expressing PrP(ARQ) [PrP(Ala136/Arg154/Gln171)] show vulnerability, whereas those that express PrP(ARR) [PrP(Ala136/Arg154/Arg171)] are resistant to scrapie. The aim of this study was to evaluate the in vitro toxic effects of PrP(ARR) and PrP(ARQ) variants in relation to their structural characteristics. We show that both peptides cause cell death by inducing apoptosis but, unexpectedly, the scrapie-resistant PrP(ARR) form was more toxic than the scrapie-susceptible PrP(ARQ) variant. Moreover, the alpha-helical conformation of PrP(ARR) was less stable than that of PrP(ARQ), and the structural determinants responsible for these different conformational stabilities were characterized by spectroscopic analysis. We observed that PrP toxicity was inversely related to protein structural stability, the unfolded conformation being more toxic than the native one. However, the PrP(ARQ) variant displays a higher propensity to form large aggregates than PrP(ARR). Interestingly, in the presence of small amounts of PrP(ARR), PrP(ARQ) aggregability was reduced to levels similar to that of PrP(ARR). Thus, in contrast to PrP(ARR) toxicity, scrapie transmissibility seems to reside in the more stable conformation of PrP(ARQ) that allows the formation of large amyloid fibrils.

  19. Meta-analysis of gene-level tests for rare variant association.

    PubMed

    Liu, Dajiang J; Peloso, Gina M; Zhan, Xiaowei; Holmen, Oddgeir L; Zawistowski, Matthew; Feng, Shuang; Nikpay, Majid; Auer, Paul L; Goel, Anuj; Zhang, He; Peters, Ulrike; Farrall, Martin; Orho-Melander, Marju; Kooperberg, Charles; McPherson, Ruth; Watkins, Hugh; Willer, Cristen J; Hveem, Kristian; Melander, Olle; Kathiresan, Sekar; Abecasis, Gonçalo R

    2014-02-01

    The majority of reported complex disease associations for common genetic variants have been identified through meta-analysis, a powerful approach that enables the use of large sample sizes while protecting against common artifacts due to population structure and repeated small-sample analyses sharing individual-level data. As the focus of genetic association studies shifts to rare variants, genes and other functional units are becoming the focus of analysis. Here we propose and evaluate new approaches for performing meta-analysis of rare variant association tests, including burden tests, weighted burden tests, variable-threshold tests and tests that allow variants with opposite effects to be grouped together. We show that our approach retains useful features from single-variant meta-analysis approaches and demonstrate its use in a study of blood lipid levels in ∼18,500 individuals genotyped with exome arrays.

  20. VAT: a computational framework to functionally annotate variants in personal genomes within a cloud-computing environment

    PubMed Central

    Habegger, Lukas; Balasubramanian, Suganthi; Chen, David Z.; Khurana, Ekta; Sboner, Andrea; Harmanci, Arif; Rozowsky, Joel; Clarke, Declan; Snyder, Michael; Gerstein, Mark

    2012-01-01

    Summary: The functional annotation of variants obtained through sequencing projects is generally assumed to be a simple intersection of genomic coordinates with genomic features. However, complexities arise for several reasons, including the differential effects of a variant on alternatively spliced transcripts, as well as the difficulty in assessing the impact of small insertions/deletions and large structural variants. Taking these factors into consideration, we developed the Variant Annotation Tool (VAT) to functionally annotate variants from multiple personal genomes at the transcript level as well as obtain summary statistics across genes and individuals. VAT also allows visualization of the effects of different variants, integrates allele frequencies and genotype data from the underlying individuals and facilitates comparative analysis between different groups of individuals. VAT can either be run through a command-line interface or as a web application. Finally, in order to enable on-demand access and to minimize unnecessary transfers of large data files, VAT can be run as a virtual machine in a cloud-computing environment. Availability and Implementation: VAT is implemented in C and PHP. The VAT web service, Amazon Machine Image, source code and detailed documentation are available at vat.gersteinlab.org. Contact: lukas.habegger@yale.edu or mark.gerstein@yale.edu Supplementary Information: Supplementary data are available at Bioinformatics online. PMID:22743228

  1. Quantitative Mass Spectrometry Reveals Changes in Histone H2B Variants as Cells Undergo Inorganic Arsenic-Mediated Cellular Transformation

    PubMed Central

    Rea, Matthew; Jiang, Tingting; Eleazer, Rebekah; Eckstein, Meredith; Marshall, Alan G.; Fondufe-Mittendorf, Yvonne N.

    2016-01-01

    Exposure to inorganic arsenic, a ubiquitous environmental toxic metalloid, leads to carcinogenesis. However, the mechanism is unknown. Several studies have shown that inorganic arsenic exposure alters specific gene expression patterns, possibly through alterations in chromatin structure. While most studies on understanding the mechanism of chromatin-mediated gene regulation have focused on histone post-translational modifications, the role of histone variants remains largely unknown. Incorporation of histone variants alters the functional properties of chromatin. To understand the global dynamics of chromatin structure and function in arsenic-mediated carcinogenesis, analysis of the histone variants incorporated into the nucleosome and their covalent modifications is required. Here we report the first global mass spectrometric analysis of histone H2B variants as cells undergo arsenic-mediated epithelial to mesenchymal transition. We used electron capture dissociation-based top-down tandem mass spectrometry analysis validated with quantitative reverse transcription real-time polymerase chain reaction to identify changes in the expression levels of H2B variants in inorganic arsenic-mediated epithelial-mesenchymal transition. We identified changes in the expression levels of specific histone H2B variants in two cell types, which are dependent on dose and length of exposure of inorganic arsenic. In particular, we found increases in H2B variants H2B1H/1K/1C/1J/1O and H2B2E/2F, and significant decreases in H2B1N/1D/1B as cells undergo inorganic arsenic-mediated epithelial-mesenchymal transition. The analysis of these histone variants provides a first step toward an understanding of the functional significance of the diversity of histone structures, especially in inorganic arsenic-mediated gene expression and carcinogenesis. PMID:27169413

  2. Copy number variation at the 7q11.23 segmental duplications is a susceptibility factor for the Williams-Beuren syndrome deletion

    PubMed Central

    Cuscó, Ivon; Corominas, Roser; Bayés, Mònica; Flores, Raquel; Rivera-Brugués, Núria; Campuzano, Victoria; Pérez-Jurado, Luis A.

    2008-01-01

    Large copy number variants (CNVs) have been recently found as structural polymorphisms of the human genome of still unknown biological significance. CNVs are significantly enriched in regions with segmental duplications or low-copy repeats (LCRs). Williams-Beuren syndrome (WBS) is a neurodevelopmental disorder caused by a heterozygous deletion of contiguous genes at 7q11.23 mediated by nonallelic homologous recombination (NAHR) between large flanking LCRs and facilitated by a structural variant of the region, a ∼2-Mb paracentric inversion present in 20%–25% of WBS-transmitting progenitors. We now report that eight out of 180 (4.44%) WBS-transmitting progenitors are carriers of a CNV, displaying a chromosome with large deletion of LCRs. The prevalence of this CNV among control individuals and non-transmitting progenitors is much lower (1%, n = 600), thus indicating that it is a predisposing factor for the WBS deletion (odds ratio 4.6-fold, P = 0.002). LCR duplications were found in 2.22% of WBS-transmitting progenitors but also in 1.16% of controls, which implies a non–statistically significant increase in WBS-transmitting progenitors. We have characterized the organization and breakpoints of these CNVs, encompassing ∼100–300 kb of genomic DNA and containing several pseudogenes but no functional genes. Additional structural variants of the region have also been defined, all generated by NAHR between different blocks of segmental duplications. Our data further illustrate the highly dynamic structure of regions rich in segmental duplications, such as the WBS locus, and indicate that large CNVs can act as susceptibility alleles for disease-associated genomic rearrangements in the progeny. PMID:18292220
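
    The reported odds ratio can be checked with a short calculation. The sketch below assumes that "1%, n = 600" corresponds to 6 carriers among 600 controls and non-transmitting progenitors; that count is an assumption, since the abstract gives only the percentage. The function name is hypothetical.

```python
def odds_ratio(carriers_a, total_a, carriers_b, total_b):
    """Odds ratio of carrying the CNV in group A relative to group B."""
    non_carriers_a = total_a - carriers_a
    non_carriers_b = total_b - carriers_b
    return (carriers_a * non_carriers_b) / (non_carriers_a * carriers_b)

# 8 of 180 WBS-transmitting progenitors versus an assumed 6 of 600 controls.
wbs_or = odds_ratio(8, 180, 6, 600)
print(f"{wbs_or:.1f}")  # prints 4.6
```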

  3. A Commonly Carried Genetic Variant in the Delta Opioid Receptor Gene, OPRD1, is Associated with Smaller Regional Brain Volumes: Replication in Elderly and Young Populations

    PubMed Central

    Roussotte, Florence F.; Jahanshad, Neda; Hibar, Derrek P.; Sowell, Elizabeth R.; Kohannim, Omid; Barysheva, Marina; Hansell, Narelle K.; McMahon, Katie L.; de Zubicaray, Greig I.; Montgomery, Grant W.; Martin, Nicholas G.; Wright, Margaret J.; Toga, Arthur W.; Jack, Clifford R.; Weiner, Michael W.; Thompson, Paul M.

    2014-01-01

    Delta opioid receptors are implicated in a variety of psychiatric and neurological disorders. These receptors play a key role in the reinforcing properties of drugs of abuse, and polymorphisms in OPRD1 (the gene encoding delta opioid receptors) are associated with drug addiction. Delta opioid receptors are also involved in protecting neurons against hypoxic and ischemic stress. Here, we first examined a large sample of 738 elderly participants with neuroimaging and genetic data from the Alzheimer’s Disease Neuroimaging Initiative. We hypothesized that common variants in OPRD1 would be associated with differences in brain structure, particularly in regions relevant to addictive and neurodegenerative disorders. One very common variant (rs678849) predicted differences in regional brain volumes. We replicated the association of this single-nucleotide polymorphism with regional tissue volumes in a large sample of young participants in the Queensland Twin Imaging study. Although the same allele was associated with reduced volumes in both cohorts, the brain regions affected differed between the two samples. In healthy elderly, exploratory analyses suggested that the genotype associated with reduced brain volumes in both cohorts may also predict cerebrospinal fluid levels of neurodegenerative biomarkers, but this requires confirmation. If opiate receptor genetic variants are related to individual differences in brain structure, genotyping of these variants may be helpful when designing clinical trials targeting delta opioid receptors to treat neurological disorders. PMID:23427138

  4. Role of H1 Linker Histones in Mammalian Development and Stem Cell Differentiation

    PubMed Central

    Pan, Chenyi; Fan, Yuhong

    2016-01-01

    H1 linker histones are key chromatin architectural proteins facilitating the formation of higher order chromatin structures. The H1 family constitutes the most heterogeneous group of histone proteins, with eleven non-allelic H1 variants in mammals. H1 variants differ in their biochemical properties and exhibit significant sequence divergence from one another, yet most of them are highly conserved during evolution from mouse to human. H1 variants are differentially regulated during development and their cellular compositions undergo dramatic changes in embryogenesis, gametogenesis, tissue maturation and cellular differentiation. As a group, H1 histones are essential for mouse development and proper stem cell differentiation. Here we summarize our current knowledge on the expression and functions of H1 variants in mammalian development and stem cell differentiation. Their diversity, sequence conservation, complex expression and distinct functions suggest that H1s mediate chromatin reprogramming and contribute to the large variations and complexity of chromatin structure and gene expression in the mammalian genome. PMID:26689747

  5. Characterization of SNPs in the dopamine-β-hydroxylase gene providing new insights into its structure-function relationship.

    PubMed

    Punchaichira, Toyanji Joseph; Dey, Sanjay Kumar; Mukhopadhyay, Anirban; Kundu, Suman; Thelma, B K

    2017-07-01

    Dopamine-β-hydroxylase (DBH, EC 1.14.17.1), an oxido-reductase that catalyses the conversion of dopamine to norepinephrine, is largely expressed in sympathetic neurons and adrenal medulla. Several regulatory and structural variants in DBH associated with various neuropsychiatric, cardiovascular diseases and a few that may determine enzyme activity have also been identified. Due to paucity of studies on functional characterization of DBH variants, its structure-function relationship is poorly understood. The purpose of the study was to characterize five non-synonymous (ns) variants that were prioritized either based on previous association studies or Sorting Tolerant From Intolerant (SIFT) algorithm. The DBH ORF with wild type (WT) and site-directed mutagenized variants were transfected into HEK293 cells to generate transient and stable lines expressing these variant enzymes. Activity was determined by UPLC-PDA and corresponding quantity by MRM HR on a TripleTOF 5600 MS respectively of spent media from stable cell lines. Homospecific activity computed for the WT and variant proteins showed a marginal decrease in A318S, W544S and R549C variants. In transient cell lines, differential secretion was observed in the case of L317P, W544S and R549C. Secretory defect in L317P was confirmed by localization in ER. R549C exhibited both decreased homospecific activity and differential secretion. Of note, all the variants were seen to be destabilizing based on in silico folding analysis and molecular dynamics (MD) simulation, lending support to our experimental observations. These novel genotype-phenotype correlations in this gene of considerable pharmacological relevance have implications for dopamine-related disorders.

  6. Computational design of chimeric protein libraries for directed evolution.

    PubMed

    Silberg, Jonathan J; Nguyen, Peter Q; Stevenson, Taylor

    2010-01-01

    The best approach for creating libraries of functional proteins with large numbers of nondisruptive amino acid substitutions is protein recombination, in which structurally related polypeptides are swapped among homologous proteins. Unfortunately, as more distantly related proteins are recombined, the fraction of variants having a disrupted structure increases. One way to enrich the fraction of folded and potentially interesting chimeras in these libraries is to use computational algorithms to anticipate which structural elements can be swapped without disturbing the integrity of a protein's structure. Herein, we describe how the algorithm Schema uses the sequences and structures of the parent proteins recombined to predict the structural disruption of chimeras, and we outline how dynamic programming can be used to find libraries with a range of amino acid substitution levels that are enriched in variants with low Schema disruption.
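
    A minimal sketch of the disruption count described above, assuming the usual definition in which a residue-residue contact is broken when the pair of amino acids at the two positions in the chimera does not occur together at those positions in any parent. The function name, toy sequences and contact list are hypothetical, and aligned, equal-length sequences are assumed.

```python
def schema_disruption(chimera, parents, contacts):
    """Count contacts (i, j) whose residue pair is absent from every parent."""
    broken = 0
    for i, j in contacts:
        pair = (chimera[i], chimera[j])
        # A contact is preserved if some parent has the same pair at (i, j).
        if not any((p[i], p[j]) == pair for p in parents):
            broken += 1
    return broken

# Toy example: positions 0-1 come from parent 1, positions 2-3 from parent 2.
parents = ["ACDE", "AGDK"]
chimera = "ACDK"
contacts = [(0, 3), (1, 3), (1, 2)]  # hypothetical structural contacts
disruption = schema_disruption(chimera, parents, contacts)
print(disruption)  # prints 1: only contact (1, 3) pairs C with K
```

    Lower counts suggest chimeras more likely to retain the parental fold, which is the quantity a library design would try to keep small.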

  7. A Large Cohort of Hemoglobin Variants in Thailand: Molecular Epidemiological Study and Diagnostic Consideration

    PubMed Central

    Srivorakun, Hataichanok; Singha, Kritsada; Fucharoen, Goonnapa; Sanchaisuriya, Kanokwan; Fucharoen, Supan

    2014-01-01

    Background: Hemoglobin (Hb) variants are structurally inherited changes of globin chains. Accurate diagnoses of these variants are important for planning of appropriate management and genetic counseling. Since no epidemiological study has been conducted before, we have investigated frequencies, molecular and hematological features of Hb variants found in a large cohort of Thai subjects. Materials and Methods: The study was conducted on 26,013 unrelated subjects from all geographical parts of Thailand over a period of 11 years, from January 2002 to December 2012. Hb analysis was done by high performance liquid chromatography (HPLC) or capillary electrophoresis (CE). Mutations causing Hb variants were identified using PCR and related techniques. Results: Among 26,013 subjects investigated, 636 (2.4%) were found to carry Hb variants. Of these 636 subjects, 142 (22.4%) carried α-chain variants with 13 different mutations. The remaining included 451 (70.9%) cases with 16 β-chain variants, 37 (5.8%) cases with Hb Lepore (δβ-hybrid Hb) and 6 (0.9%) cases with a single δ-chain variant. The most common α-globin chain variant was Hb Q-Thailand (α74GAC-CAC, Asp-His), which was found in 101 cases (15.8%). For β-globin chain variants, Hb Hope (β136GGT-GAT, Gly-Asp) and Hb Tak (β146+AC, Ter-Thr) were the two most common, found in 121 (19.0%) and 90 (14.2%) cases, respectively. Seven Hb variants had never before been found in the Thai population. Hb analysis profiles on HPLC or CE of these variants were illustrated to guide presumptive diagnostics. Conclusions: Hb variants are common and heterogeneous in the Thai population. With the variety of thalassemias and hemoglobinopathies in the population, interactions between them leading to complex syndromes are common and make diagnosis difficult in routine practice. Knowledge of the spectrum, molecular basis, genotype-phenotype correlation and diagnostic features should prove useful for prevention and control of the diseases in the region. PMID:25244406

  8. The role of the interactome in the maintenance of deleterious variability in human populations

    PubMed Central

    Garcia-Alonso, Luz; Jiménez-Almazán, Jorge; Carbonell-Caballero, Jose; Vela-Boza, Alicia; Santoyo-López, Javier; Antiñolo, Guillermo; Dopazo, Joaquin

    2014-01-01

    Recent genomic projects have revealed the existence of an unexpectedly large amount of deleterious variability in the human genome. Several hypotheses have been proposed to explain such an apparently high mutational load. However, the mechanisms by which deleterious mutations in some genes cause a pathological effect but are apparently innocuous in other genes remain largely unknown. This study searched for deleterious variants in the 1,000 genomes populations, as well as in a newly sequenced population of 252 healthy Spanish individuals. In addition, variants causative of monogenic diseases and somatic variants from 41 chronic lymphocytic leukaemia patients were analysed. The deleterious variants found were analysed in the context of the interactome to understand the role of network topology in the maintenance of the observed mutational load. Our results suggest that one of the mechanisms whereby the effect of these deleterious variants on the phenotype is suppressed could be related to the configuration of the protein interaction network. Most of the deleterious variants observed in healthy individuals are concentrated in peripheral regions of the interactome, in combinations that preserve their connectivity, and have a marginal effect on interactome integrity. On the contrary, likely pathogenic cancer somatic deleterious variants tend to occur in internal regions of the interactome, often with associated structural consequences. Finally, variants causative of monogenic diseases seem to occupy an intermediate position. Our observations suggest that the real pathological potential of a variant might be more a systems property rather than an intrinsic property of individual proteins. PMID:25261458

  9. The role of the interactome in the maintenance of deleterious variability in human populations.

    PubMed

    Garcia-Alonso, Luz; Jiménez-Almazán, Jorge; Carbonell-Caballero, Jose; Vela-Boza, Alicia; Santoyo-López, Javier; Antiñolo, Guillermo; Dopazo, Joaquin

    2014-09-26

    Recent genomic projects have revealed the existence of an unexpectedly large amount of deleterious variability in the human genome. Several hypotheses have been proposed to explain such an apparently high mutational load. However, the mechanisms by which deleterious mutations in some genes cause a pathological effect but are apparently innocuous in other genes remain largely unknown. This study searched for deleterious variants in the 1,000 genomes populations, as well as in a newly sequenced population of 252 healthy Spanish individuals. In addition, variants causative of monogenic diseases and somatic variants from 41 chronic lymphocytic leukaemia patients were analysed. The deleterious variants found were analysed in the context of the interactome to understand the role of network topology in the maintenance of the observed mutational load. Our results suggest that one of the mechanisms whereby the effect of these deleterious variants on the phenotype is suppressed could be related to the configuration of the protein interaction network. Most of the deleterious variants observed in healthy individuals are concentrated in peripheral regions of the interactome, in combinations that preserve their connectivity, and have a marginal effect on interactome integrity. On the contrary, likely pathogenic cancer somatic deleterious variants tend to occur in internal regions of the interactome, often with associated structural consequences. Finally, variants causative of monogenic diseases seem to occupy an intermediate position. Our observations suggest that the real pathological potential of a variant might be more a systems property rather than an intrinsic property of individual proteins. © 2014 The Authors. Published under the terms of the CC BY 4.0 license.

  10. Human Apolipoprotein A-I Natural Variants: Molecular Mechanisms Underlying Amyloidogenic Propensity

    PubMed Central

    Ramella, Nahuel A.; Schinella, Guillermo R.; Ferreira, Sergio T.; Prieto, Eduardo D.; Vela, María E.; Ríos, José Luis

    2012-01-01

    Human apolipoprotein A-I (apoA-I)-derived amyloidosis can present with either wild-type (Wt) protein deposits in atherosclerotic plaques or as a hereditary form in which apoA-I variants deposit causing multiple organ failure. More than 15 single amino acid replacement amyloidogenic apoA-I variants have been described, but the molecular mechanisms involved in amyloid-associated pathology remain largely unknown. Here, we have investigated by fluorescence and biochemical approaches the stabilities and propensities to aggregate of two disease-associated apoA-I variants, apoA-IGly26Arg, associated with polyneuropathy and kidney dysfunction, and apoA-ILys107-0, implicated in amyloidosis in severe atherosclerosis. Results showed that both variants share common structural properties including decreased stability compared to Wt apoA-I and a more flexible structure that gives rise to formation of partially folded states. Interestingly, however, distinct features appear to determine their pathogenic mechanisms. ApoA-ILys107-0 has an increased propensity to aggregate at physiological pH and in a pro-inflammatory microenvironment than Wt apoA-I, whereas apoA-IGly26Arg elicited macrophage activation, thus stimulating local chronic inflammation. Our results strongly suggest that some natural mutations in apoA-I variants elicit protein tendency to aggregate, but in addition the specific interaction of different variants with macrophages may contribute to cellular stress and toxicity in hereditary amyloidosis. PMID:22952757

  11. Whole genome comparison between table and wine grapes reveals a comprehensive catalog of structural variants

    PubMed Central

    2014-01-01

    Background Grapevine (Vitis vinifera L.) is the most important Mediterranean fruit crop, used to produce both wine and spirits as well as table grape and raisins. Wine and table grape cultivars represent two divergent germplasm pools with different origins and domestication history, as well as differential characteristics for berry size, cluster architecture and berry chemical profile, among others. ‘Sultanina’ plays a pivotal role in modern table grape breeding providing the main source of seedlessness. This cultivar is also one of the most planted for fresh consumption and raisins production. Given its importance, we sequenced it and implemented a novel strategy for the de novo assembly of its highly heterozygous genome. Results Our approach produced a draft genome of 466 Mb, recovering 82% of the genes present in the grapevine reference genome; in addition, we identified 240 novel genes. A large number of structural variants and SNPs were identified. Among them, 45 (21 SNPs and 24 INDELs) were experimentally confirmed in ‘Sultanina’ and six SNPs in other 23 table grape varieties. Transposable elements corresponded to ca. 80% of the repetitive sequences involved in structural variants and more than 2,000 genes were affected in their structure by these variants. Some of these genes are likely involved in embryo development, suggesting that they may contribute to seedlessness, a key trait for table grapes. Conclusions This work produced the first structural variants and SNPs catalog for grapevine, constituting a novel and very powerful tool for genomic studies in this key fruit crop, particularly useful to support marker assisted breeding in table grapes. PMID:24397443

  12. Genetic association of marbling score with intragenic nucleotide variants at selection signals of the bovine genome.

    PubMed

    Ryu, J; Lee, C

    2016-04-01

    Selection signals of Korean cattle might be attributed largely to artificial selection for meat quality. Rapidly increased intragenic markers of newly annotated genes in the bovine genome would help overcome limited findings of genetic markers associated with meat quality at the selection signals in a previous study. The present study examined genetic associations of marbling score (MS) with intragenic nucleotide variants at selection signals of Korean cattle. A total of 39 092 nucleotide variants of 407 Korean cattle were utilized in the association analysis. A total of 129 variants were selected within newly annotated genes in the bovine genome. Their genetic associations were analyzed using the mixed model with random polygenic effects based on identical-by-state genetic relationships among animals in order to control for spurious associations produced by population structure. Genetic associations of MS were found (P < 3.88×10⁻⁴) with six intragenic nucleotide variants on bovine autosomes 3 (cache domain containing 1, CACHD1), 5 (like-glycosyltransferase, LARGE), 16 (cell division cycle 42 binding protein kinase alpha, CDC42BPA) and 21 (snurportin 1, SNUPN; protein tyrosine phosphatase, non-receptor type 9, PTPN9; chondroitin sulfate proteoglycan 4, CSPG4). In particular, the genetic associations with CDC42BPA and LARGE were confirmed using an independent data set of Korean cattle. The results implied that allele frequencies of functional variants and their proximity variants have been augmented by directional selection for greater MS and remain selection signals in the bovine genome. Further studies of fine mapping would be useful to incorporate favorable alleles in marker-assisted selection for MS of Korean cattle.
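
    The significance threshold quoted above happens to coincide with a Bonferroni correction of a 0.05 family-wise error rate over the 129 selected variants. The abstract does not state how the threshold was derived, so the sketch below is only an arithmetic check under that assumption.

```python
# Assumption: family-wise alpha of 0.05 divided by the 129 tested variants.
alpha = 0.05
n_tests = 129
threshold = alpha / n_tests
print(f"{threshold:.2e}")  # prints 3.88e-04
```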

  13. Discovery, genotyping and characterization of structural variation and novel sequence at single nucleotide resolution from de novo genome assemblies on a population scale.

    PubMed

    Liu, Siyang; Huang, Shujia; Rao, Junhua; Ye, Weijian; Krogh, Anders; Wang, Jun

    2015-01-01

    Comprehensive recognition of genomic variation in one individual is important for understanding disease and developing personalized medication and treatment. Many tools based on DNA re-sequencing exist for identification of single nucleotide polymorphisms, small insertions and deletions (indels) as well as large deletions. However, these approaches consistently display a substantial bias against the recovery of complex structural variants and novel sequence in individual genomes and do not provide interpretation information such as the annotation of ancestral state and formation mechanism. We present a novel approach implemented in a single software package, AsmVar, to discover, genotype and characterize different forms of structural variation and novel sequence from population-scale de novo genome assemblies up to nucleotide resolution. Application of AsmVar to several human de novo genome assemblies captures a wide spectrum of structural variants and novel sequences present in the human population in high sensitivity and specificity. Our method provides a direct solution for investigating structural variants and novel sequences from de novo genome assemblies, facilitating the construction of population-scale pan-genomes. Our study also highlights the usefulness of the de novo assembly strategy for definition of genome structure.

  14. The UK10K project identifies rare variants in health and disease.

    PubMed

    Walter, Klaudia; Min, Josine L; Huang, Jie; Crooks, Lucy; Memari, Yasin; McCarthy, Shane; Perry, John R B; Xu, ChangJiang; Futema, Marta; Lawson, Daniel; Iotchkova, Valentina; Schiffels, Stephan; Hendricks, Audrey E; Danecek, Petr; Li, Rui; Floyd, James; Wain, Louise V; Barroso, Inês; Humphries, Steve E; Hurles, Matthew E; Zeggini, Eleftheria; Barrett, Jeffrey C; Plagnol, Vincent; Richards, J Brent; Greenwood, Celia M T; Timpson, Nicholas J; Durbin, Richard; Soranzo, Nicole

    2015-10-01

    The contribution of rare and low-frequency variants to human traits is largely unexplored. Here we describe insights from sequencing whole genomes (low read depth, 7×) or exomes (high read depth, 80×) of nearly 10,000 individuals from population-based and disease collections. In extensively phenotyped cohorts we characterize over 24 million novel sequence variants, generate a highly accurate imputation reference panel and identify novel alleles associated with levels of triglycerides (APOB), adiponectin (ADIPOQ) and low-density lipoprotein cholesterol (LDLR and RGAG1) from single-marker and rare variant aggregation tests. We describe population structure and functional annotation of rare and low-frequency variants, use the data to estimate the benefits of sequencing for association studies, and summarize lessons from disease-specific collections. Finally, we make available an extensive resource, including individual-level genetic and phenotypic data and web-based tools to facilitate the exploration of association results.

  15. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Radhakrishnan, Bala; Gorti, Sarma; Babu, Suresh Sudharsanam

    Here, we present phase field simulations incorporating energy contributions due to thermodynamics, and anisotropic interfacial and strain energies, to demonstrate the nucleation and growth of multiple variants of alpha from beta in Ti-6Al-4V under isothermal conditions. The simulations focused on the effect of thermodynamic driving force and nucleation rate on the morphology of the transformed alpha assuming that the partitioning of V between beta and alpha is negligible for short isothermal holds. The results indicate that a high nucleation rate favors the formation of the basket-weave structure. However, at a lower nucleation rate the simulations show the intragranular nucleation of a colony structure by an autocatalytic nucleation mechanism adjacent to a pre-existing alpha variant. New side-plates of the same variant appear to nucleate progressively and grow to form the colony. The isothermal simulation results are used to offer a possible explanation for the transition from a largely basket weave structure to a colony structure inside narrow layer bands occurring during continuous heating and cooling conditions encountered during laser additive manufacturing of Ti-6Al-4V.

  16. Evaluation of Dynamic Characteristics of the Footbridge with Integral Abutments

    NASA Astrophysics Data System (ADS)

    Pańtak, Marek; Jarek, Bogusław

    2017-09-01

    The paper presents the results of dynamic field tests and numerical analysis of a footbridge designed as a three-span composite structure with integral abutments. The adopted design solution, which achieved high resistance of the structure to dynamic loads and met the comfort-of-use criteria with a large reserve, is characterized. For comparative purposes, numerical analyses of three construction variants of the footbridge are presented: F-1, construction with integral abutments (the realized variant); F-2, construction with girders anchored in the abutments by means of tension rocker bearings; and F-3, construction with concrete side spans.

  17. Predicting primary progressive aphasias with support vector machine approaches in structural MRI data.

    PubMed

    Bisenius, Sandrine; Mueller, Karsten; Diehl-Schmid, Janine; Fassbender, Klaus; Grimmer, Timo; Jessen, Frank; Kassubek, Jan; Kornhuber, Johannes; Landwehrmeyer, Bernhard; Ludolph, Albert; Schneider, Anja; Anderl-Straub, Sarah; Stuke, Katharina; Danek, Adrian; Otto, Markus; Schroeter, Matthias L

    2017-01-01

    Primary progressive aphasia (PPA) encompasses three subtypes, nonfluent/agrammatic variant PPA, semantic variant PPA, and logopenic variant PPA, which are characterized by distinct patterns of language difficulties and regional brain atrophy. To validate the potential of structural magnetic resonance imaging data for early individual diagnosis, we used support vector machine classification on grey matter density maps obtained by voxel-based morphometry analysis to discriminate PPA subtypes (44 patients: 16 nonfluent/agrammatic variant PPA, 17 semantic variant PPA, 11 logopenic variant PPA) from 20 healthy controls (matched for sample size, age, and gender) in the cohort of the multi-center study of the German consortium for frontotemporal lobar degeneration. Here, we compared a whole-brain with a meta-analysis-based disease-specific regions-of-interest approach for support vector machine classification. We also used support vector machine classification to discriminate the three PPA subtypes from each other. Whole brain support vector machine classification enabled a very high accuracy between 91 and 97% for identifying specific PPA subtypes vs. healthy controls, and 78/95% for the discrimination between semantic variant vs. nonfluent/agrammatic or logopenic PPA variants. Accuracy was low (55%) only for the discrimination between nonfluent/agrammatic and logopenic PPA variants. Interestingly, the regions that contributed the most to the support vector machine classification of patients corresponded largely to the regions that were atrophic in these patients as revealed by group comparisons. Although the whole brain approach also took into account regions that were not covered in the regions-of-interest approach, both approaches showed similar accuracies due to the disease-specificity of the selected networks. In conclusion, support vector machine classification of multi-center structural magnetic resonance imaging data enables prediction of PPA subtypes with very high accuracy, paving the way for its application in clinical settings.

  18. Screening of the Filamin C Gene in a Large Cohort of Hypertrophic Cardiomyopathy Patients.

    PubMed

    Gómez, Juan; Lorca, Rebeca; Reguero, Julian R; Morís, César; Martín, María; Tranche, Salvador; Alonso, Belén; Iglesias, Sara; Alvarez, Victoria; Díaz-Molina, Beatriz; Avanzas, Pablo; Coto, Eliecer

    2017-04-01

    Recent exome sequencing studies identified filamin C (FLNC) as a candidate gene for hypertrophic cardiomyopathy (HCM). Our aim was to determine the rate of FLNC candidate variants in a large cohort of HCM patients who were also sequenced for the main sarcomere genes. A total of 448 HCM patients were next generation-sequenced (semiconductor chip technology) for the MYH7, MYBPC3, TNNT2, TNNI3, ACTC1, TNNC1, MYL2, MYL3, TPM1, and FLNC genes. We also sequenced 450 healthy controls from the same population. Based on the reported population frequencies, bioinformatic criteria, and familial segregation, we identified 20 FLNC candidate variants (13 new; 1 nonsense; and 19 missense) in 22 patients. Compared with the patients, only 1 of the controls' missense variants was nonreported (P=0.007; Fisher exact probability test). Based on the familial segregation and the reported functional studies, 6 of the candidate variants (in 7 patients) were finally classified as likely pathogenic, 10 as variants of uncertain significance, and 4 as likely benign. We provide compelling evidence of the involvement of FLNC in the development of HCM. Most of the FLNC variants were associated with mild forms of HCM and a reduced penetrance, with few affected relatives in the families to confirm the segregation. Our work, together with that of others who found FLNC variants among patients with dilated and restrictive cardiomyopathies, points to this gene as an important cause of structural cardiomyopathies. © 2017 American Heart Association, Inc.

  19. Comparison of gene-based rare variant association mapping methods for quantitative traits in a bovine population with complex familial relationships.

    PubMed

    Zhang, Qianqian; Guldbrandtsen, Bernt; Calus, Mario P L; Lund, Mogens Sandø; Sahana, Goutam

    2016-08-17

    There is growing interest in the role of rare variants in the variation of complex traits due to increasing evidence that rare variants are associated with quantitative traits. However, association methods that are commonly used for mapping common variants are not effective to map rare variants. Besides, livestock populations have large half-sib families and the occurrence of rare variants may be confounded with family structure, which makes it difficult to disentangle their effects from family mean effects. We compared the power of methods that are commonly applied in human genetics to map rare variants in cattle using whole-genome sequence data and simulated phenotypes. We also studied the power of mapping rare variants using linear mixed models (LMM), which are the method of choice to account for both family relationships and population structure in cattle. We observed that the power of the LMM approach was low for mapping a rare variant (defined as those that have frequencies lower than 0.01) with a moderate effect (5 to 8 % of phenotypic variance explained by multiple rare variants that vary from 5 to 21 in number) contributing to a QTL with a sample size of 1000. In contrast, across the scenarios studied, statistical methods that are specialized for mapping rare variants increased power regardless of whether multiple rare variants or a single rare variant underlie a QTL. Different methods for combining rare variants in the test single nucleotide polymorphism set resulted in similar power irrespective of the proportion of total genetic variance explained by the QTL. However, when the QTL variance is very small (only 0.1 % of the total genetic variance), these specialized methods for mapping rare variants and LMM generally had no power to map the variants within a gene with sample sizes of 1000 or 5000. 
We observed that the methods that combine multiple rare variants within a gene into a meta-variant generally had greater power to map rare variants compared to LMM. Therefore, it is recommended to use rare variant association mapping methods to map rare genetic variants that affect quantitative traits in livestock, such as bovine populations.
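
    The idea of combining multiple rare variants within a gene into a single meta-variant can be illustrated with a minimal burden-score sketch. This is not the authors' implementation: the function names, the toy genotype matrix, and the choice of a simple sum of minor-allele counts are assumptions made for illustration. Only the rare-variant cutoff (frequency below 0.01) is taken from the abstract.

```python
# Minimal sketch of a burden-style "meta-variant" (illustrative only).
# Assumption: genotypes are coded as minor-allele counts (0, 1 or 2),
# one row per individual and one column per variant within a gene.

def allele_frequencies(genotypes):
    """Return the frequency of the counted allele for each variant."""
    n_individuals = len(genotypes)
    n_variants = len(genotypes[0])
    return [
        sum(row[j] for row in genotypes) / (2 * n_individuals)
        for j in range(n_variants)
    ]

def burden_scores(genotypes, maf_threshold=0.01):
    """Collapse all rare variants into one score per individual.

    Returns (scores, rare_indices), where scores[i] is the number of
    rare minor alleles carried by individual i.
    """
    freqs = allele_frequencies(genotypes)
    rare = [j for j, f in enumerate(freqs) if 0 < f < maf_threshold]
    scores = [sum(row[j] for j in rare) for row in genotypes]
    return scores, rare

# Hypothetical toy data: 100 individuals, 3 variants.
# Variants 0 and 1 are singletons (frequency 1/200 = 0.005);
# variant 2 is common (40 heterozygous carriers, frequency 0.2).
genotypes = [[0, 0, 1 if i < 40 else 0] for i in range(100)]
genotypes[3][0] = 1
genotypes[7][1] = 1

scores, rare = burden_scores(genotypes)

if __name__ == "__main__":
    print("rare variant columns:", rare)
    print("carriers of the meta-variant:",
          [i for i, s in enumerate(scores) if s > 0])
```

    In a full analysis the resulting score would be tested against the phenotype, for example as a fixed effect in a mixed model that accounts for family structure; that step is omitted here.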

  20. regSNPs-splicing: a tool for prioritizing synonymous single-nucleotide substitution.

    PubMed

    Zhang, Xinjun; Li, Meng; Lin, Hai; Rao, Xi; Feng, Weixing; Yang, Yuedong; Mort, Matthew; Cooper, David N; Wang, Yue; Wang, Yadong; Wells, Clark; Zhou, Yaoqi; Liu, Yunlong

    2017-09-01

    While synonymous single-nucleotide variants (sSNVs) have largely been unstudied, since they do not alter protein sequence, mounting evidence suggests that they may affect RNA conformation, splicing, and the stability of nascent-mRNAs to promote various diseases. Accurately prioritizing deleterious sSNVs from a pool of neutral ones can significantly improve our ability to select functional genetic variants identified from various genome-sequencing projects, and, therefore, advance our understanding of disease etiology. In this study, we develop a computational algorithm to prioritize sSNVs based on their impact on mRNA splicing and protein function. In addition to genomic features that potentially affect splicing regulation, our proposed algorithm also includes dozens of structural features that characterize the functions of alternatively spliced exons on protein function. Our systematic evaluation on thousands of sSNVs suggests that several structural features, including intrinsic disorder protein scores, solvent accessible surface areas, protein secondary structures, and known and predicted protein family domains, show significant differences between disease-causing and neutral sSNVs. Our results suggest that the protein structure features offer an added dimension of information while distinguishing disease-causing and neutral synonymous variants. The inclusion of structural features increases the predictive accuracy for functional sSNV prioritization.

  1. Rare variation facilitates inferences of fine-scale population structure in humans.

    PubMed

    O'Connor, Timothy D; Fu, Wenqing; Mychaleckyj, Josyf C; Logsdon, Benjamin; Auer, Paul; Carlson, Christopher S; Leal, Suzanne M; Smith, Joshua D; Rieder, Mark J; Bamshad, Michael J; Nickerson, Deborah A; Akey, Joshua M

    2015-03-01

    Understanding the genetic structure of human populations has important implications for the design and interpretation of disease mapping studies and reconstructing human evolutionary history. To date, inferences of human population structure have primarily been made with common variants. However, recent large-scale resequencing studies have shown an abundance of rare variation in humans, which may be particularly useful for making inferences of fine-scale population structure. To this end, we used an information theory framework and extensive coalescent simulations to rigorously quantify the informativeness of rare and common variation to detect signatures of fine-scale population structure. We show that rare variation affords unique insights into patterns of recent population structure. Furthermore, to empirically assess our theoretical findings, we analyzed high-coverage exome sequences in 6,515 European and African American individuals. As predicted, rare variants are more informative than common polymorphisms in revealing a distinct cluster of European-American individuals, and subsequent analyses demonstrate that these individuals are likely of Ashkenazi Jewish ancestry. Our results provide new insights into the population structure using rare variation, which will be an important factor to account for in rare variant association studies. © The Author 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  2. Deep whole-genome sequencing of 90 Han Chinese genomes.

    PubMed

    Lan, Tianming; Lin, Haoxiang; Zhu, Wenjuan; Laurent, Tellier Christian Asker Melchior; Yang, Mengcheng; Liu, Xin; Wang, Jun; Wang, Jian; Yang, Huanming; Xu, Xun; Guo, Xiaosen

    2017-09-01

    Next-generation sequencing provides a high-resolution insight into human genetic information. However, the focus of previous studies has primarily been on low-coverage data due to the high cost of sequencing. Although the 1000 Genomes Project and the Haplotype Reference Consortium have both provided powerful reference panels for imputation, low-frequency and novel variants remain difficult to discover and call with accuracy on the basis of low-coverage data. Deep sequencing provides an optimal solution for the problem of these low-frequency and novel variants. Although whole-exome sequencing is also a viable choice for exome regions, it cannot account for noncoding regions, sometimes resulting in the absence of important, causal variants. For Han Chinese populations, the majority of variants have been discovered based upon low-coverage data from the 1000 Genomes Project. However, high-coverage, whole-genome sequencing data are limited for any population, and a large amount of low-frequency, population-specific variants remain uncharacterized. We have performed whole-genome sequencing at a high depth (∼×80) of 90 unrelated individuals of Chinese ancestry, collected from the 1000 Genomes Project samples, including 45 Northern Han Chinese and 45 Southern Han Chinese samples. Eighty-three of these 90 have been sequenced by the 1000 Genomes Project. We have identified 12 568 804 single nucleotide polymorphisms, 2 074 210 short InDels, and 26 142 structural variations from these 90 samples. Compared to the Han Chinese data from the 1000 Genomes Project, we have found 7 000 629 novel variants with low frequency (defined as minor allele frequency < 5%), including 5 813 503 single nucleotide polymorphisms, 1 169 199 InDels, and 17 927 structural variants. Using deep sequencing data, we have built a greatly expanded spectrum of genetic variation for the Han Chinese genome. 
Compared to the 1000 Genomes Project, these Han Chinese deep sequencing data enhance the characterization of a large number of low-frequency, novel variants. This will be a valuable resource for promoting Chinese genetics research and medical development. Additionally, it will provide a valuable supplement to the 1000 Genomes Project, as well as to other human genome projects. © The Authors 2017. Published by Oxford University Press.
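
    The variant counts reported in this abstract can be cross-checked with a few lines of arithmetic. The sketch below uses only numbers quoted above. The per-class ratios it prints are a derived illustration (novel low-frequency variants divided by all variants of that class), not figures stated by the authors, and the variable names are hypothetical.

```python
# Counts copied from the abstract (90 Han Chinese genomes, ~80x depth).
total = {"SNP": 12_568_804, "InDel": 2_074_210, "SV": 26_142}
novel_low_freq = {"SNP": 5_813_503, "InDel": 1_169_199, "SV": 17_927}
reported_novel_total = 7_000_629

def novel_fraction(total_counts, novel_counts):
    """Share of each variant class that is novel and low-frequency."""
    return {k: novel_counts[k] / total_counts[k] for k in total_counts}

# The three per-class novel counts should add up to the reported total.
novel_sum = sum(novel_low_freq.values())
fractions = novel_fraction(total, novel_low_freq)

if __name__ == "__main__":
    print("sum matches reported total:", novel_sum == reported_novel_total)
    for variant_class, share in fractions.items():
        print(f"{variant_class}: {share:.1%} novel low-frequency")
```

    The per-class counts sum exactly to the reported 7 000 629 novel variants, and the derived ratios suggest that the share of novel calls rises from SNPs to InDels to structural variants.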

  3. Multidimensional structure-function relationships in human β-cardiac myosin from population-scale genetic variation

    PubMed Central

    Homburger, Julian R.; Green, Eric M.; Caleshu, Colleen; Sunitha, Margaret S.; Taylor, Rebecca E.; Ruppel, Kathleen M.; Metpally, Raghu Prasad Rao; Colan, Steven D.; Michels, Michelle; Day, Sharlene M.; Olivotto, Iacopo; Bustamante, Carlos D.; Dewey, Frederick E.; Ho, Carolyn Y.; Spudich, James A.; Ashley, Euan A.

    2016-01-01

    Myosin motors are the fundamental force-generating elements of muscle contraction. Variation in the human β-cardiac myosin heavy chain gene (MYH7) can lead to hypertrophic cardiomyopathy (HCM), a heritable disease characterized by cardiac hypertrophy, heart failure, and sudden cardiac death. How specific myosin variants alter motor function or clinical expression of disease remains incompletely understood. Here, we combine structural models of myosin from multiple stages of its chemomechanical cycle, exome sequencing data from two population cohorts of 60,706 and 42,930 individuals, and genetic and phenotypic data from 2,913 patients with HCM to identify regions of disease enrichment within β-cardiac myosin. We first developed computational models of the human β-cardiac myosin protein before and after the myosin power stroke. Then, using a spatial scan statistic modified to analyze genetic variation in protein 3D space, we found significant enrichment of disease-associated variants in the converter, a kinetic domain that transduces force from the catalytic domain to the lever arm to accomplish the power stroke. Focusing our analysis on surface-exposed residues, we identified a larger region significantly enriched for disease-associated variants that contains both the converter domain and residues on a single flat surface on the myosin head described as the myosin mesa. Notably, patients with HCM with variants in the enriched regions have earlier disease onset than patients who have HCM with variants elsewhere. Our study provides a model for integrating protein structure, large-scale genetic sequencing, and detailed phenotypic data to reveal insight into time-shifted protein structures and genetic disease. PMID:27247418

  4. Hundreds of variants clustered in genomic loci and biological pathways affect human height

    PubMed Central

    Lango Allen, Hana; Estrada, Karol; Lettre, Guillaume; Berndt, Sonja I.; Weedon, Michael N.; Rivadeneira, Fernando; Willer, Cristen J.; Jackson, Anne U.; Vedantam, Sailaja; Raychaudhuri, Soumya; Ferreira, Teresa; Wood, Andrew R.; Weyant, Robert J.; Segrè, Ayellet V.; Speliotes, Elizabeth K.; Wheeler, Eleanor; Soranzo, Nicole; Park, Ju-Hyun; Yang, Jian; Gudbjartsson, Daniel; Heard-Costa, Nancy L.; Randall, Joshua C.; Qi, Lu; Smith, Albert Vernon; Mägi, Reedik; Pastinen, Tomi; Liang, Liming; Heid, Iris M.; Luan, Jian'an; Thorleifsson, Gudmar; Winkler, Thomas W.; Goddard, Michael E.; Lo, Ken Sin; Palmer, Cameron; Workalemahu, Tsegaselassie; Aulchenko, Yurii S.; Johansson, Åsa; Zillikens, M.Carola; Feitosa, Mary F.; Esko, Tõnu; Johnson, Toby; Ketkar, Shamika; Kraft, Peter; Mangino, Massimo; Prokopenko, Inga; Absher, Devin; Albrecht, Eva; Ernst, Florian; Glazer, Nicole L.; Hayward, Caroline; Hottenga, Jouke-Jan; Jacobs, Kevin B.; Knowles, Joshua W.; Kutalik, Zoltán; Monda, Keri L.; Polasek, Ozren; Preuss, Michael; Rayner, Nigel W.; Robertson, Neil R.; Steinthorsdottir, Valgerdur; Tyrer, Jonathan P.; Voight, Benjamin F.; Wiklund, Fredrik; Xu, Jianfeng; Zhao, Jing Hua; Nyholt, Dale R.; Pellikka, Niina; Perola, Markus; Perry, John R.B.; Surakka, Ida; Tammesoo, Mari-Liis; Altmaier, Elizabeth L.; Amin, Najaf; Aspelund, Thor; Bhangale, Tushar; Boucher, Gabrielle; Chasman, Daniel I.; Chen, Constance; Coin, Lachlan; Cooper, Matthew N.; Dixon, Anna L.; Gibson, Quince; Grundberg, Elin; Hao, Ke; Junttila, M. 
Juhani; Kaplan, Lee M.; Kettunen, Johannes; König, Inke R.; Kwan, Tony; Lawrence, Robert W.; Levinson, Douglas F.; Lorentzon, Mattias; McKnight, Barbara; Morris, Andrew P.; Müller, Martina; Ngwa, Julius Suh; Purcell, Shaun; Rafelt, Suzanne; Salem, Rany M.; Salvi, Erika; Sanna, Serena; Shi, Jianxin; Sovio, Ulla; Thompson, John R.; Turchin, Michael C.; Vandenput, Liesbeth; Verlaan, Dominique J.; Vitart, Veronique; White, Charles C.; Ziegler, Andreas; Almgren, Peter; Balmforth, Anthony J.; Campbell, Harry; Citterio, Lorena; De Grandi, Alessandro; Dominiczak, Anna; Duan, Jubao; Elliott, Paul; Elosua, Roberto; Eriksson, Johan G.; Freimer, Nelson B.; Geus, Eco J.C.; Glorioso, Nicola; Haiqing, Shen; Hartikainen, Anna-Liisa; Havulinna, Aki S.; Hicks, Andrew A.; Hui, Jennie; Igl, Wilmar; Illig, Thomas; Jula, Antti; Kajantie, Eero; Kilpeläinen, Tuomas O.; Koiranen, Markku; Kolcic, Ivana; Koskinen, Seppo; Kovacs, Peter; Laitinen, Jaana; Liu, Jianjun; Lokki, Marja-Liisa; Marusic, Ana; Maschio, Andrea; Meitinger, Thomas; Mulas, Antonella; Paré, Guillaume; Parker, Alex N.; Peden, John F.; Petersmann, Astrid; Pichler, Irene; Pietiläinen, Kirsi H.; Pouta, Anneli; Ridderstråle, Martin; Rotter, Jerome I.; Sambrook, Jennifer G.; Sanders, Alan R.; Schmidt, Carsten Oliver; Sinisalo, Juha; Smit, Jan H.; Stringham, Heather M.; Walters, G.Bragi; Widen, Elisabeth; Wild, Sarah H.; Willemsen, Gonneke; Zagato, Laura; Zgaga, Lina; Zitting, Paavo; Alavere, Helene; Farrall, Martin; McArdle, Wendy L.; Nelis, Mari; Peters, Marjolein J.; Ripatti, Samuli; van Meurs, Joyce B.J.; Aben, Katja K.; Ardlie, Kristin G; Beckmann, Jacques S.; Beilby, John P.; Bergman, Richard N.; Bergmann, Sven; Collins, Francis S.; Cusi, Daniele; den Heijer, Martin; Eiriksdottir, Gudny; Gejman, Pablo V.; Hall, Alistair S.; Hamsten, Anders; Huikuri, Heikki V.; Iribarren, Carlos; Kähönen, Mika; Kaprio, Jaakko; Kathiresan, Sekar; Kiemeney, Lambertus; Kocher, Thomas; Launer, Lenore J.; Lehtimäki, Terho; Melander, Olle; Mosley, 
Tom H.; Musk, Arthur W.; Nieminen, Markku S.; O'Donnell, Christopher J.; Ohlsson, Claes; Oostra, Ben; Palmer, Lyle J.; Raitakari, Olli; Ridker, Paul M.; Rioux, John D.; Rissanen, Aila; Rivolta, Carlo; Schunkert, Heribert; Shuldiner, Alan R.; Siscovick, David S.; Stumvoll, Michael; Tönjes, Anke; Tuomilehto, Jaakko; van Ommen, Gert-Jan; Viikari, Jorma; Heath, Andrew C.; Martin, Nicholas G.; Montgomery, Grant W.; Province, Michael A.; Kayser, Manfred; Arnold, Alice M.; Atwood, Larry D.; Boerwinkle, Eric; Chanock, Stephen J.; Deloukas, Panos; Gieger, Christian; Grönberg, Henrik; Hall, Per; Hattersley, Andrew T.; Hengstenberg, Christian; Hoffman, Wolfgang; Lathrop, G.Mark; Salomaa, Veikko; Schreiber, Stefan; Uda, Manuela; Waterworth, Dawn; Wright, Alan F.; Assimes, Themistocles L.; Barroso, Inês; Hofman, Albert; Mohlke, Karen L.; Boomsma, Dorret I.; Caulfield, Mark J.; Cupples, L.Adrienne; Erdmann, Jeanette; Fox, Caroline S.; Gudnason, Vilmundur; Gyllensten, Ulf; Harris, Tamara B.; Hayes, Richard B.; Jarvelin, Marjo-Riitta; Mooser, Vincent; Munroe, Patricia B.; Ouwehand, Willem H.; Penninx, Brenda W.; Pramstaller, Peter P.; Quertermous, Thomas; Rudan, Igor; Samani, Nilesh J.; Spector, Timothy D.; Völzke, Henry; Watkins, Hugh; Wilson, James F.; Groop, Leif C.; Haritunians, Talin; Hu, Frank B.; Kaplan, Robert C.; Metspalu, Andres; North, Kari E.; Schlessinger, David; Wareham, Nicholas J.; Hunter, David J.; O'Connell, Jeffrey R.; Strachan, David P.; Wichmann, H.-Erich; Borecki, Ingrid B.; van Duijn, Cornelia M.; Schadt, Eric E.; Thorsteinsdottir, Unnur; Peltonen, Leena; Uitterlinden, André; Visscher, Peter M.; Chatterjee, Nilanjan; Loos, Ruth J.F.; Boehnke, Michael; McCarthy, Mark I.; Ingelsson, Erik; Lindgren, Cecilia M.; Abecasis, Gonçalo R.; Stefansson, Kari; Frayling, Timothy M.; Hirschhorn, Joel N

    2010-01-01

    Most common human traits and diseases have a polygenic pattern of inheritance: DNA sequence variants at many genetic loci influence phenotype. Genome-wide association (GWA) studies have identified >600 variants associated with human traits, but these typically explain small fractions of phenotypic variation, raising questions about the utility of further studies. Here, using 183,727 individuals, we show that hundreds of genetic variants, in at least 180 loci, influence adult height, a highly heritable and classic polygenic trait. The large number of loci reveals patterns with important implications for genetic studies of common human diseases and traits. First, the 180 loci are not random, but instead are enriched for genes that are connected in biological pathways (P=0.016), and that underlie skeletal growth defects (P<0.001). Second, the likely causal gene is often located near the most strongly associated variant: in 13 of 21 loci containing a known skeletal growth gene, that gene was closest to the associated variant. Third, at least 19 loci have multiple independently associated variants, suggesting that allelic heterogeneity is a frequent feature of polygenic traits, that comprehensive explorations of already-discovered loci should discover additional variants, and that an appreciable fraction of associated loci may have been identified. Fourth, associated variants are enriched for likely functional effects on genes, being over-represented amongst variants that alter amino acid structure of proteins and expression levels of nearby genes. Our data explain ∼10% of the phenotypic variation in height, and we estimate that unidentified common variants of similar effect sizes would increase this figure to ∼16% of phenotypic variation (∼20% of heritable variation). 
Although additional approaches are needed to fully dissect the genetic architecture of polygenic human traits, our findings indicate that GWA studies can identify large numbers of loci that implicate biologically relevant genes and pathways. PMID:20881960
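
    The two ways of expressing explained variance in this abstract (as a share of phenotypic variation and as a share of heritable variation) are linked by the heritability of the trait. The short sketch below works backwards from the quoted figures. The implied heritability of about 0.8 is derived here from those figures and is not a value stated in the abstract, and the function name is hypothetical.

```python
def implied_heritability(share_of_phenotypic, share_of_heritable):
    """Heritability implied when the same variants explain the two shares."""
    return share_of_phenotypic / share_of_heritable

# Figures quoted in the abstract: ~16% of phenotypic variation is
# described as ~20% of heritable variation.
h2 = implied_heritability(0.16, 0.20)

# Under that heritability, the ~10% of phenotypic variation currently
# explained corresponds to this share of heritable variation.
current_share_of_heritable = 0.10 / h2

if __name__ == "__main__":
    print(f"implied heritability: {h2:.2f}")
    print(f"currently explained share of heritable variation: "
          f"{current_share_of_heritable:.1%}")
```

    Under these assumptions the identified loci account for roughly one eighth of the heritable variation in height.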

  5. Genetic Structures of Copy Number Variants Revealed by Genotyping Single Sperm

    PubMed Central

    Luo, Minjie; Cui, Xiangfeng; Fredman, David; Brookes, Anthony J.; Azaro, Marco A.; Greenawalt, Danielle M.; Hu, Guohong; Wang, Hui-Yun; Tereshchenko, Irina V.; Lin, Yong; Shentu, Yue; Gao, Richeng; Shen, Li; Li, Honghua

    2009-01-01

    Background: Copy number variants (CNVs) occupy a significant portion of the human genome and may have important roles in meiotic recombination, human genome evolution and gene expression. Many genetic diseases may be underlain by CNVs. However, because of the presence of their multiple copies, variability in copy numbers and the diploidy of the human genome, detailed genetic structure of CNVs cannot be readily studied by available techniques. Methodology/Principal Findings: Single sperm samples were used as the primary subjects for the study so that CNV haplotypes in the sperm donors could be studied individually. Forty-eight CNVs characterized in a previous study were analyzed using a microarray-based high-throughput genotyping method after multiplex amplification. Seventeen single nucleotide polymorphisms (SNPs) were also included as controls. Two single-base variants, either allelic or paralogous, could be discriminated for all markers. Microarray data were used to resolve SNP alleles and CNV haplotypes, to quantitatively assess the numbers and compositions of the paralogous segments in each CNV haplotype. Conclusions/Significance: This is the first study of the genetic structure of CNVs on a large scale. Resulting information may help understand evolution of the human genome, gain insight into many genetic processes, and discriminate between CNVs and SNPs. The highly sensitive high-throughput experimental system with haploid sperm samples as subjects may be used to facilitate detailed large-scale CNV analysis. PMID:19384415

  6. Biochemical analyses are instrumental in identifying the impact of mutations on holo and/or apo-forms and on the region(s) of alanine:glyoxylate aminotransferase variants associated with Primary Hyperoxaluria Type I

    PubMed Central

    Oppici, Elisa; Montioli, Riccardo; Lorenzetto, Antonio; Bianconi, Silvia; Borri Voltattorni, Carla; Cellini, Barbara

    2012-01-01

    Primary Hyperoxaluria Type I (PH1) is a disorder of glyoxylate metabolism caused by mutations in the human AGXT gene encoding liver peroxisomal alanine:glyoxylate aminotransferase (AGT), a pyridoxal 5′-phosphate (PLP) dependent enzyme. Previous investigations highlighted that, although PH1 is characterized by a significant variability in terms of enzymatic phenotype, the majority of the pathogenic variants are believed to share both structural and functional defects, as mainly revealed by data on AGT activity and expression level in crude cellular extracts. However, the knowledge of the defects of the AGT variants at a protein level is still poor. We therefore performed a side-by-side comparison between normal AGT and nine purified recombinant pathogenic variants in terms of catalytic activity, coenzyme binding mode and affinity, spectroscopic features, oligomerization, and thermal stability of both the holo- and apo-forms. Notably, we chose four variants in which the mutated residues are located in the large domain of AGT either within the active site and interacting with the coenzyme or in its proximity, and five variants in which the mutated residues are distant from the active site either in the large or in the small domain. Overall, this integrated analysis of enzymatic activity, spectroscopic and stability information is used to (i) reassess previous data obtained with crude cellular extracts, (ii) establish which form(s) (i.e. holoenzyme and/or apoenzyme) and region(s) (i.e. active site microenvironment, large and/or small domain) of the protein are affected by each mutation, and (iii) suggest the possible therapeutic approach for patients bearing the examined mutations. PMID:22018727

  7. Biochemical analyses are instrumental in identifying the impact of mutations on holo and/or apo-forms and on the region(s) of alanine:glyoxylate aminotransferase variants associated with primary hyperoxaluria type I.

    PubMed

    Oppici, Elisa; Montioli, Riccardo; Lorenzetto, Antonio; Bianconi, Silvia; Borri Voltattorni, Carla; Cellini, Barbara

    2012-01-01

    Primary Hyperoxaluria Type I (PH1) is a disorder of glyoxylate metabolism caused by mutations in the human AGXT gene encoding liver peroxisomal alanine:glyoxylate aminotransferase (AGT), a pyridoxal 5'-phosphate (PLP) dependent enzyme. Previous investigations highlighted that, although PH1 is characterized by a significant variability in terms of enzymatic phenotype, the majority of the pathogenic variants are believed to share both structural and functional defects, as mainly revealed by data on AGT activity and expression level in crude cellular extracts. However, the knowledge of the defects of the AGT variants at a protein level is still poor. We therefore performed a side-by-side comparison between normal AGT and nine purified recombinant pathogenic variants in terms of catalytic activity, coenzyme binding mode and affinity, spectroscopic features, oligomerization, and thermal stability of both the holo- and apo-forms. Notably, we chose four variants in which the mutated residues are located in the large domain of AGT either within the active site and interacting with the coenzyme or in its proximity, and five variants in which the mutated residues are distant from the active site either in the large or in the small domain. Overall, this integrated analysis of enzymatic activity, spectroscopic and stability information is used to (i) reassess previous data obtained with crude cellular extracts, (ii) establish which form(s) (i.e. holoenzyme and/or apoenzyme) and region(s) (i.e. active site microenvironment, large and/or small domain) of the protein are affected by each mutation, and (iii) suggest the possible therapeutic approach for patients bearing the examined mutations. Copyright © 2011 Elsevier Inc. All rights reserved.

  8. Homozygous missense mutation in the LMAN2L gene segregates with intellectual disability in a large consanguineous Pakistani family.

    PubMed

    Rafiullah, Rafiullah; Aslamkhan, Muhammad; Paramasivam, Nagarajan; Thiel, Christian; Mustafa, Ghulam; Wiemann, Stefan; Schlesner, Matthias; Wade, Rebecca C; Rappold, Gudrun A; Berkel, Simone

    2016-02-01

    Intellectual disability (ID) is a neurodevelopmental disorder affecting 1%-3% of the population worldwide. It is characterised by high phenotypic and genetic heterogeneity and in most cases the underlying cause of the disorder is unknown. In our study we investigated a large consanguineous family from Baluchistan, Pakistan, comprising seven affected individuals with a severe form of autosomal recessive ID (ARID) and epilepsy, to elucidate a putative genetic cause. Whole exome sequencing (WES) of a trio, including a child with ID and epilepsy and its healthy parents that were part of this large family, revealed a homozygous missense variant p.R53Q in the lectin mannose-binding 2-like (LMAN2L) gene. This homozygous variant was co-segregating in the family with the phenotype of severe ID and infantile epilepsy; unaffected family members were heterozygous variant carriers. The variant was predicted to be pathogenic by five different in silico programmes and further three-dimensional structure modelling of the protein suggests that variant p.R53Q may impair protein-protein interaction. LMAN2L (OMIM: 609552) encodes for the lectin, mannose-binding 2-like protein which is a cargo receptor in the endoplasmic reticulum important for glycoprotein transport. Genome-wide association studies have identified an association of LMAN2L to different neuropsychiatric disorders. This is the first report linking LMAN2L to a phenotype of severe ARID and seizures, indicating that the deleterious homozygous p.R53Q variant very likely causes the disorder. Published by the BMJ Publishing Group Limited.

  9. Filtering genetic variants and placing informative priors based on putative biological function.

    PubMed

    Friedrichs, Stefanie; Malzahn, Dörthe; Pugh, Elizabeth W; Almeida, Marcio; Liu, Xiao Qing; Bailey, Julia N

    2016-02-03

    High-density genetic marker data, especially sequence data, imply an immense multiple testing burden. This can be ameliorated by filtering genetic variants, exploiting or accounting for correlations between variants, jointly testing variants, and by incorporating informative priors. Priors can be based on biological knowledge or predicted variant function, or even be used to integrate gene expression or other omics data. Based on Genetic Analysis Workshop (GAW) 19 data, this article discusses diversity and usefulness of functional variant scores provided, for example, by PolyPhen2, SIFT, or RegulomeDB annotations. Incorporating functional scores into variant filters or weights and adjusting the significance level for correlations between variants yielded significant associations with blood pressure traits in a large family study of Mexican Americans (GAW19 data set). Marker rs218966 in gene PHF14 and rs9836027 in MAP4 significantly associated with hypertension; additionally, rare variants in SNUPN significantly associated with systolic blood pressure. Variant weights strongly influenced the power of kernel methods and burden tests. Apart from variant weights in test statistics, prior weights may also be used when combining test statistics or to informatively weight p values while controlling false discovery rate (FDR). Indeed, power improved when gene expression data for FDR-controlled informative weighting of association test p values of genes was used. Finally, approaches exploiting variant correlations included identity-by-descent mapping and the optimal strategy for joint testing rare and common variants, which was observed to depend on linkage disequilibrium structure.
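
    One common way to weight p values informatively while controlling FDR is the weighted Benjamini-Hochberg procedure, in which each p value is divided by a prior weight (weights averaging 1) before the usual step-up rule is applied. The sketch below illustrates that general technique. It is not taken from the article, and the p values, weights, and function name are hypothetical.

```python
def weighted_bh(p_values, weights, alpha=0.05):
    """Weighted Benjamini-Hochberg step-up procedure.

    Weights are rescaled to average 1, each p value is divided by its
    weight, and the standard BH rule is applied to the weighted values.
    Returns the sorted indices of rejected hypotheses.
    """
    m = len(p_values)
    mean_w = sum(weights) / m
    norm_w = [w / mean_w for w in weights]
    # A zero weight means the hypothesis can never be rejected.
    q = [min(1.0, p / w) if w > 0 else 1.0
         for p, w in zip(p_values, norm_w)]
    order = sorted(range(m), key=lambda i: q[i])
    k = 0  # largest rank whose weighted p value passes its threshold
    for rank, i in enumerate(order, start=1):
        if q[i] <= alpha * rank / m:
            k = rank
    return sorted(order[:k])

# Hypothetical gene-level p values and prior weights (for example,
# weights could be derived from gene expression data).
p_values = [0.001, 0.03, 0.04, 0.5]
uniform = [1.0, 1.0, 1.0, 1.0]
informative = [1.0, 2.0, 0.5, 0.5]

unweighted_hits = weighted_bh(p_values, uniform)
weighted_hits = weighted_bh(p_values, informative)

if __name__ == "__main__":
    print("unweighted rejections:", unweighted_hits)
    print("weighted rejections:  ", weighted_hits)
```

    In this toy example, up-weighting the second test lets it pass the threshold, which mirrors the abstract's observation that informative weights can improve power. The weights must be chosen independently of the p values they are applied to.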

  10. An integrated map of structural variation in 2,504 human genomes.

    PubMed

    Sudmant, Peter H; Rausch, Tobias; Gardner, Eugene J; Handsaker, Robert E; Abyzov, Alexej; Huddleston, John; Zhang, Yan; Ye, Kai; Jun, Goo; Fritz, Markus Hsi-Yang; Konkel, Miriam K; Malhotra, Ankit; Stütz, Adrian M; Shi, Xinghua; Casale, Francesco Paolo; Chen, Jieming; Hormozdiari, Fereydoun; Dayama, Gargi; Chen, Ken; Malig, Maika; Chaisson, Mark J P; Walter, Klaudia; Meiers, Sascha; Kashin, Seva; Garrison, Erik; Auton, Adam; Lam, Hugo Y K; Mu, Xinmeng Jasmine; Alkan, Can; Antaki, Danny; Bae, Taejeong; Cerveira, Eliza; Chines, Peter; Chong, Zechen; Clarke, Laura; Dal, Elif; Ding, Li; Emery, Sarah; Fan, Xian; Gujral, Madhusudan; Kahveci, Fatma; Kidd, Jeffrey M; Kong, Yu; Lameijer, Eric-Wubbo; McCarthy, Shane; Flicek, Paul; Gibbs, Richard A; Marth, Gabor; Mason, Christopher E; Menelaou, Androniki; Muzny, Donna M; Nelson, Bradley J; Noor, Amina; Parrish, Nicholas F; Pendleton, Matthew; Quitadamo, Andrew; Raeder, Benjamin; Schadt, Eric E; Romanovitch, Mallory; Schlattl, Andreas; Sebra, Robert; Shabalin, Andrey A; Untergasser, Andreas; Walker, Jerilyn A; Wang, Min; Yu, Fuli; Zhang, Chengsheng; Zhang, Jing; Zheng-Bradley, Xiangqun; Zhou, Wanding; Zichner, Thomas; Sebat, Jonathan; Batzer, Mark A; McCarroll, Steven A; Mills, Ryan E; Gerstein, Mark B; Bashir, Ali; Stegle, Oliver; Devine, Scott E; Lee, Charles; Eichler, Evan E; Korbel, Jan O

    2015-10-01

    Structural variants are implicated in numerous diseases and make up the majority of varying nucleotides among human genomes. Here we describe an integrated set of eight structural variant classes comprising both balanced and unbalanced variants, which we constructed using short-read DNA sequencing data and statistically phased onto haplotype blocks in 26 human populations. Analysing this set, we identify numerous gene-intersecting structural variants exhibiting population stratification and describe naturally occurring homozygous gene knockouts that suggest the dispensability of a variety of human genes. We demonstrate that structural variants are enriched on haplotypes identified by genome-wide association studies and exhibit enrichment for expression quantitative trait loci. Additionally, we uncover appreciable levels of structural variant complexity at different scales, including genic loci subject to clusters of repeated rearrangement and complex structural variants with multiple breakpoints likely to have formed through individual mutational events. Our catalogue will enhance future studies into structural variant demography, functional impact and disease association.

  11. A quadratically regularized functional canonical correlation analysis for identifying the global structure of pleiotropy with NGS data

    PubMed Central

    Zhu, Yun; Fan, Ruzong; Xiong, Momiao

    2017-01-01

    Investigating the pleiotropic effects of genetic variants can increase statistical power, provide important information to achieve deep understanding of the complex genetic structures of disease, and offer powerful tools for designing effective treatments with fewer side effects. However, the current multiple phenotype association analysis paradigm lacks breadth (number of phenotypes and genetic variants jointly analyzed at the same time) and depth (hierarchical structure of phenotype and genotypes). A key issue for high dimensional pleiotropic analysis is to effectively extract informative internal representation and features from high dimensional genotype and phenotype data. To explore correlation information of genetic variants, effectively reduce data dimensions, and overcome critical barriers in advancing the development of novel statistical methods and computational algorithms for genetic pleiotropic analysis, we proposed a new statistical method referred to as a quadratically regularized functional CCA (QRFCCA) for association analysis which combines three approaches: (1) quadratically regularized matrix factorization, (2) functional data analysis and (3) canonical correlation analysis (CCA). Large-scale simulations show that the QRFCCA has a much higher power than that of the ten competing statistics while retaining the appropriate type 1 errors. To further evaluate performance, the QRFCCA and ten other statistics are applied to the whole genome sequencing dataset from the TwinsUK study. We identify a total of 79 genes with rare variants and 67 genes with common variants significantly associated with the 46 traits using QRFCCA. The results show that the QRFCCA substantially outperforms the ten other statistics. PMID:29040274

  12. MutaBind estimates and interprets the effects of sequence variants on protein-protein interactions.

    PubMed

    Li, Minghui; Simonetti, Franco L; Goncearenco, Alexander; Panchenko, Anna R

    2016-07-08

    Proteins engage in highly selective interactions with their macromolecular partners. Sequence variants that alter protein binding affinity may cause significant perturbations or complete abolishment of function, potentially leading to diseases. There exists a persistent need to develop a mechanistic understanding of impacts of variants on proteins. To address this need we introduce a new computational method MutaBind to evaluate the effects of sequence variants and disease mutations on protein interactions and calculate the quantitative changes in binding affinity. The MutaBind method uses molecular mechanics force fields, statistical potentials and fast side-chain optimization algorithms. The MutaBind server maps mutations on a structural protein complex, calculates the associated changes in binding affinity, determines the deleterious effect of a mutation, estimates the confidence of this prediction and produces a mutant structural model for download. MutaBind can be applied to a large number of problems, including determination of potential driver mutations in cancer and other diseases, elucidation of the effects of sequence variants on protein fitness in evolution and protein design. MutaBind is available at http://www.ncbi.nlm.nih.gov/projects/mutabind/. Published by Oxford University Press on behalf of Nucleic Acids Research 2016. This work is written by (a) US Government employee(s) and is in the public domain in the US.

  13. Localized structural frustration for evaluating the impact of sequence variants

    PubMed Central

    Kumar, Sushant; Clarke, Declan; Gerstein, Mark

    2016-01-01

    Population-scale sequencing is increasingly uncovering large numbers of rare single-nucleotide variants (SNVs) in coding regions of the genome. The rarity of these variants makes it challenging to evaluate their deleteriousness with conventional phenotype–genotype associations. Protein structures provide a way of addressing this challenge. Previous efforts have focused on globally quantifying the impact of SNVs on protein stability. However, local perturbations may severely impact protein functionality without strongly disrupting global stability (e.g. in relation to catalysis or allostery). Here, we describe a workflow in which localized frustration, quantifying unfavorable local interactions, is employed as a metric to investigate such effects. Using this workflow on the Protein Databank, we find that frustration produces many immediately intuitive results: for instance, disease-related SNVs create stronger changes in localized frustration than non-disease related variants, and rare SNVs tend to disrupt local interactions to a larger extent than common variants. Less obviously, we observe that somatic SNVs associated with oncogenes and tumor suppressor genes (TSGs) induce very different changes in frustration. In particular, those associated with TSGs change the frustration more in the core than the surface (by introducing loss-of-function events), whereas those associated with oncogenes manifest the opposite pattern, creating gain-of-function events. PMID:27915290

  14. SvABA: genome-wide detection of structural variants and indels by local assembly.

    PubMed

    Wala, Jeremiah A; Bandopadhayay, Pratiti; Greenwald, Noah F; O'Rourke, Ryan; Sharpe, Ted; Stewart, Chip; Schumacher, Steve; Li, Yilong; Weischenfeldt, Joachim; Yao, Xiaotong; Nusbaum, Chad; Campbell, Peter; Getz, Gad; Meyerson, Matthew; Zhang, Cheng-Zhong; Imielinski, Marcin; Beroukhim, Rameen

    2018-04-01

    Structural variants (SVs), including small insertion and deletion variants (indels), are challenging to detect through standard alignment-based variant calling methods. Sequence assembly offers a powerful approach to identifying SVs, but is difficult to apply at scale genome-wide for SV detection due to its computational complexity and the difficulty of extracting SVs from assembly contigs. We describe SvABA, an efficient and accurate method for detecting SVs from short-read sequencing data using genome-wide local assembly with low memory and computing requirements. We evaluated SvABA's performance on the NA12878 human genome and in simulated and real cancer genomes. SvABA demonstrates superior sensitivity and specificity across a large spectrum of SVs and substantially improves detection performance for variants in the 20-300 bp range, compared with existing methods. SvABA also identifies complex somatic rearrangements with chains of short (<1000 bp) templated-sequence insertions copied from distant genomic regions. We applied SvABA to 344 cancer genomes from 11 cancer types and found that short templated-sequence insertions occur in ∼4% of all somatic rearrangements. Finally, we demonstrate that SvABA can identify sites of viral integration and cancer driver alterations containing medium-sized (50-300 bp) SVs. © 2018 Wala et al.; Published by Cold Spring Harbor Laboratory Press.

  15. Prospects and limitations of full-text index structures in genome analysis

    PubMed Central

    Vyverman, Michaël; De Baets, Bernard; Fack, Veerle; Dawyndt, Peter

    2012-01-01

    The combination of incessant advances in sequencing technology producing large amounts of data and innovative bioinformatics approaches, designed to cope with this data flood, has led to new interesting results in the life sciences. Given the magnitude of sequence data to be processed, many bioinformatics tools rely on efficient solutions to a variety of complex string problems. These solutions include fast heuristic algorithms and advanced data structures, generally referred to as index structures. Although the importance of index structures is generally known to the bioinformatics community, the design and potency of these data structures, as well as their properties and limitations, are less understood. Moreover, the last decade has seen a boom in the number of variant index structures featuring complex and diverse memory-time trade-offs. This article brings a comprehensive state-of-the-art overview of the most popular index structures and their recently developed variants. Their features, interrelationships, the trade-offs they impose, but also their practical limitations, are explained and compared. PMID:22584621
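
    As a concrete illustration of what an index structure is, the following minimal Python sketch builds a suffix array, one of the classic full-text indexes, and uses binary search to locate a pattern. It is a toy example under stated assumptions, not taken from the article: the function names `build_suffix_array` and `find_occurrences` and the sample string are hypothetical, and the naive construction is far slower than the algorithms used in practice.

```python
from typing import List


def build_suffix_array(text: str) -> List[int]:
    """Return the start positions of all suffixes of `text` in lexicographic order.

    Naive construction by sorting suffixes directly; adequate only for toy inputs.
    """
    return sorted(range(len(text)), key=lambda i: text[i:])


def find_occurrences(text: str, sa: List[int], pattern: str) -> List[int]:
    """Return the sorted start positions of `pattern` in `text` using suffix array `sa`."""
    m = len(pattern)
    # Lower bound: first suffix whose first m characters are >= pattern.
    lo, hi = 0, len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] < pattern:
            lo = mid + 1
        else:
            hi = mid
    start = lo
    # Upper bound: first suffix whose first m characters are > pattern.
    hi = len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] <= pattern:
            lo = mid + 1
        else:
            hi = mid
    return sorted(sa[start:lo])


# Hypothetical toy sequence, used only for illustration.
genome = "GATTACAGATTACA"
sa = build_suffix_array(genome)
hits = find_occurrences(genome, sa, "TTA")
```

    The memory-time trade-off mentioned in the abstract is visible even here: the array stores one integer per text position, and in exchange each query needs only a logarithmic number of string comparisons instead of a full scan.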

  16. Phase field simulations of autocatalytic formation of alpha lamellar colonies in Ti-6Al-4V

    DOE PAGES

    Radhakrishnan, Bala; Gorti, Sarma; Babu, Suresh Sudharsanam

    2016-09-13

    Here, we present phase field simulations incorporating energy contributions due to thermodynamics, and anisotropic interfacial and strain energies, to demonstrate the nucleation and growth of multiple variants of alpha from beta in Ti-6Al-4V under isothermal conditions. The simulations focused on the effect of thermodynamic driving force and nucleation rate on the morphology of the transformed alpha assuming that the partitioning of V between beta and alpha is negligible for short isothermal holds. The results indicate that a high nucleation rate favors the formation of the basket-weave structure. However, at a lower nucleation rate the simulations show the intragranular nucleation of a colony structure by an autocatalytic nucleation mechanism adjacent to a pre-existing alpha variant. New side-plates of the same variant appear to nucleate progressively and grow to form the colony. The isothermal simulation results are used to offer a possible explanation for the transition from a largely basket weave structure to a colony structure inside narrow layer bands occurring during continuous heating and cooling conditions encountered during laser additive manufacturing of Ti-6Al-4V.

  17. Molecular Mechanism of Wide Photoabsorption Spectral Shifts of Color Variants of Human Cellular Retinol Binding Protein II.

    PubMed

    Cheng, Cheng; Kamiya, Motoshi; Uchida, Yoshihiro; Hayashi, Shigehiko

    2015-10-21

    Color variants of human cellular retinol binding protein II (hCRBPII) created by protein engineering were recently shown to exhibit anomalously wide photoabsorption spectral shifts over ∼200 nm across the visible region. The remarkable phenomenon provides a unique opportunity to gain insight into the molecular basis of the color tuning of retinal binding proteins for understanding of color vision as well as for engineering of novel color variants of retinal binding photoreceptor proteins employed in optogenetics. Here, we report a theoretical investigation of the molecular mechanism underlying the anomalously wide spectral shifts of the color variants of hCRBPII. Computational modeling of the color variants with hybrid molecular simulations of free energy geometry optimization succeeded in reproducing the experimentally observed wide spectral shifts, and revealed that protein flexibility, through which the active site structure of the protein and bound water molecules is altered by remote mutations, plays a significant role in inducing the large spectral shifts.

  18. De novo assembly and next-generation sequencing to analyse full-length gene variants from codon-barcoded libraries.

    PubMed

    Cho, Namjin; Hwang, Byungjin; Yoon, Jung-ki; Park, Sangun; Lee, Joongoo; Seo, Han Na; Lee, Jeewon; Huh, Sunghoon; Chung, Jinsoo; Bang, Duhee

    2015-09-21

    Interpreting epistatic interactions is crucial for understanding evolutionary dynamics of complex genetic systems and unveiling structure and function of genetic pathways. Although high resolution mapping of en masse variant libraries enables molecular biologists to address genotype-phenotype relationships, long-read sequencing technology remains indispensable to assess functional relationships between mutations that lie far apart. Here, we introduce JigsawSeq for multiplexed sequence identification of pooled gene variant libraries by combining a codon-based molecular barcoding strategy and de novo assembly of short-read data. We first validate JigsawSeq on small sub-pools and observe high precision and recall at various experimental settings. With extensive simulations, we then apply JigsawSeq to large-scale gene variant libraries to show that our method can be reliably scaled using next-generation sequencing. JigsawSeq may serve as a rapid screening tool for functional genomics and offer the opportunity to explore evolutionary trajectories of protein variants.

  19. Genetic influences on schizophrenia and subcortical brain volumes: large-scale proof-of-concept and roadmap for future studies

    PubMed Central

    Anttila, Verneri; Hibar, Derrek P; van Hulzen, Kimm J E; Arias-Vasquez, Alejandro; Smoller, Jordan W; Nichols, Thomas E; Neale, Michael C; McIntosh, Andrew M; Lee, Phil; McMahon, Francis J; Meyer-Lindenberg, Andreas; Mattheisen, Manuel; Andreassen, Ole A; Gruber, Oliver; Sachdev, Perminder S; Roiz-Santiañez, Roberto; Saykin, Andrew J; Ehrlich, Stefan; Mather, Karen A; Turner, Jessica A; Schwarz, Emanuel; Thalamuthu, Anbupalam; Shugart, Yin Yao; Ho, Yvonne YW; Martin, Nicholas G; Wright, Margaret J

    2016-01-01

    Schizophrenia is a devastating psychiatric illness with high heritability. Brain structure and function differ, on average, between schizophrenia cases and healthy individuals. As common genetic associations are emerging for both schizophrenia and brain imaging phenotypes, we can now use genome-wide data to investigate genetic overlap. Here we integrated results from common variant studies of schizophrenia (33,636 cases, 43,008 controls) and volumes of several (mainly subcortical) brain structures (11,840 subjects). We did not find evidence of genetic overlap between schizophrenia risk and subcortical volume measures either at the level of common variant genetic architecture or for single genetic markers. The current study provides proof-of-concept (albeit based on a limited set of structural brain measures), and defines a roadmap for future studies investigating the genetic covariance between structural/functional brain phenotypes and risk for psychiatric disorders. PMID:26854805

  20. Genetic influences on schizophrenia and subcortical brain volumes: large-scale proof of concept.

    PubMed

    Franke, Barbara; Stein, Jason L; Ripke, Stephan; Anttila, Verneri; Hibar, Derrek P; van Hulzen, Kimm J E; Arias-Vasquez, Alejandro; Smoller, Jordan W; Nichols, Thomas E; Neale, Michael C; McIntosh, Andrew M; Lee, Phil; McMahon, Francis J; Meyer-Lindenberg, Andreas; Mattheisen, Manuel; Andreassen, Ole A; Gruber, Oliver; Sachdev, Perminder S; Roiz-Santiañez, Roberto; Saykin, Andrew J; Ehrlich, Stefan; Mather, Karen A; Turner, Jessica A; Schwarz, Emanuel; Thalamuthu, Anbupalam; Shugart, Yin Yao; Ho, Yvonne Yw; Martin, Nicholas G; Wright, Margaret J; O'Donovan, Michael C; Thompson, Paul M; Neale, Benjamin M; Medland, Sarah E; Sullivan, Patrick F

    2016-03-01

    Schizophrenia is a devastating psychiatric illness with high heritability. Brain structure and function differ, on average, between people with schizophrenia and healthy individuals. As common genetic associations are emerging for both schizophrenia and brain imaging phenotypes, we can now use genome-wide data to investigate genetic overlap. Here we integrated results from common variant studies of schizophrenia (33,636 cases, 43,008 controls) and volumes of several (mainly subcortical) brain structures (11,840 subjects). We did not find evidence of genetic overlap between schizophrenia risk and subcortical volume measures either at the level of common variant genetic architecture or for single genetic markers. These results provide a proof of concept (albeit based on a limited set of structural brain measures) and define a roadmap for future studies investigating the genetic covariance between structural or functional brain phenotypes and risk for psychiatric disorders.

  1. Assessing the effects of common variation in the FOXP2 gene on human brain structure.

    PubMed

    Hoogman, Martine; Guadalupe, Tulio; Zwiers, Marcel P; Klarenbeek, Patricia; Francks, Clyde; Fisher, Simon E

    2014-01-01

    The FOXP2 transcription factor is one of the most well-known genes to have been implicated in developmental speech and language disorders. Rare mutations disrupting the function of this gene have been described in different families and cases. In a large three-generation family carrying a missense mutation, neuroimaging studies revealed significant effects on brain structure and function, most notably in the inferior frontal gyrus, caudate nucleus, and cerebellum. After the identification of rare disruptive FOXP2 variants impacting on brain structure, several reports proposed that common variants at this locus may also have detectable effects on the brain, extending beyond disorder into normal phenotypic variation. These neuroimaging genetics studies used groups of between 14 and 96 participants. The current study assessed effects of common FOXP2 variants on neuroanatomy using voxel-based morphometry (VBM) and volumetric techniques in a sample of >1300 people from the general population. In a first targeted stage we analyzed single nucleotide polymorphisms (SNPs) claimed to have effects in prior smaller studies (rs2253478, rs12533005, rs2396753, rs6980093, rs7784315, rs17137124, rs10230558, rs7782412, rs1456031), beginning with regions proposed in the relevant papers, then assessing impact across the entire brain. In the second gene-wide stage, we tested all common FOXP2 variation, focusing on volumetry of those regions most strongly implicated from analyses of rare disruptive mutations. Despite using a sample that is more than 10 times that used for prior studies of common FOXP2 variation, we found no evidence for effects of SNPs on variability in neuroanatomy in the general population. Thus, the impact of this gene on brain structure may be largely limited to extreme cases of rare disruptive alleles. Alternatively, effects of common variants at this gene exist but are too subtle to be detected with standard volumetric techniques.

  2. Large-scale analyses of common and rare variants identify 12 new loci associated with atrial fibrillation

    PubMed Central

    Christophersen, Ingrid E.; Rienstra, Michiel; Roselli, Carolina; Yin, Xiaoyan; Geelhoed, Bastiaan; Barnard, John; Lin, Honghuang; Arking, Dan E.; Smith, Albert V.; Albert, Christine M.; Chaffin, Mark; Tucker, Nathan R.; Li, Molong; Klarin, Derek; Bihlmeyer, Nathan A; Low, Siew-Kee; Weeke, Peter E.; Müller-Nurasyid, Martina; Smith, J. Gustav; Brody, Jennifer A.; Niemeijer, Maartje N.; Dörr, Marcus; Trompet, Stella; Huffman, Jennifer; Gustafsson, Stefan; Schurman, Claudia; Kleber, Marcus E.; Lyytikäinen, Leo-Pekka; Seppälä, Ilkka; Malik, Rainer; Horimoto, Andrea R. V. R.; Perez, Marco; Sinisalo, Juha; Aeschbacher, Stefanie; Thériault, Sébastien; Yao, Jie; Radmanesh, Farid; Weiss, Stefan; Teumer, Alexander; Choi, Seung Hoan; Weng, Lu-Chen; Clauss, Sebastian; Deo, Rajat; Rader, Daniel J.; Shah, Svati; Sun, Albert; Hopewell, Jemma C.; Debette, Stephanie; Chauhan, Ganesh; Yang, Qiong; Worrall, Bradford B.; Paré, Guillaume; Kamatani, Yoichiro; Hagemeijer, Yanick P.; Verweij, Niek; Siland, Joylene E.; Kubo, Michiaki; Smith, Jonathan D.; Van Wagoner, David R.; Bis, Joshua C.; Perz, Siegfried; Psaty, Bruce M.; Ridker, Paul M.; Magnani, Jared W.; Harris, Tamara B.; Launer, Lenore J.; Shoemaker, M. Benjamin; Padmanabhan, Sandosh; Haessler, Jeffrey; Bartz, Traci M.; Waldenberger, Melanie; Lichtner, Peter; Arendt, Marina; Krieger, Jose E.; Kähönen, Mika; Risch, Lorenz; Mansur, Alfredo J.; Peters, Annette; Smith, Blair H.; Lind, Lars; Scott, Stuart A.; Lu, Yingchang; Bottinger, Erwin B.; Hernesniemi, Jussi; Lindgren, Cecilia M.; Wong, Jorge; Huang, Jie; Eskola, Markku; Morris, Andrew P.; Ford, Ian; Reiner, Alex P.; Delgado, Graciela; Chen, Lin Y.; Chen, Yii-Der Ida; Sandhu, Roopinder K.; Li, Man; Boerwinkle, Eric; Eisele, Lewin; Lannfelt, Lars; Rost, Natalia; Anderson, Christopher D.; Taylor, Kent D.; Campbell, Archie; Magnusson, Patrik K.; Porteous, David; Hocking, Lynne J.; Vlachopoulou, Efthymia; Pedersen, Nancy L.; Nikus, Kjell; Orho-Melander, Marju; Hamsten, Anders; Heeringa, Jan; Denny, Joshua C.; Kriebel, Jennifer; Darbar, Dawood; Newton-Cheh, Christopher; Shaffer, Christian; Macfarlane, Peter W.; Heilmann, Stefanie; Almgren, Peter; Huang, Paul L.; Sotoodehnia, Nona; Soliman, Elsayed Z.; Uitterlinden, Andre G.; Hofman, Albert; Franco, Oscar H.; Völker, Uwe; Jöckel, Karl-Heinz; Sinner, Moritz F.; Lin, Henry J.; Guo, Xiuqing; Dichgans, Martin; Ingelsson, Erik; Kooperberg, Charles; Melander, Olle; Loos, Ruth J. F.; Laurikka, Jari; Conen, David; Rosand, Jonathan; van der Harst, Pim; Lokki, Marja-Liisa; Kathiresan, Sekar; Pereira, Alexandre; Jukema, J. Wouter; Hayward, Caroline; Rotter, Jerome I.; März, Winfried; Lehtimäki, Terho; Stricker, Bruno H.; Chung, Mina K.; Felix, Stephan B.; Gudnason, Vilmundur; Alonso, Alvaro; Roden, Dan M.; Kääb, Stefan; Chasman, Daniel I.; Heckbert, Susan R.; Benjamin, Emelia J.; Tanaka, Toshihiro; Lunetta, Kathryn L.; Lubitz, Steven A.; Ellinor, Patrick T.

    2017-01-01

    Atrial fibrillation affects more than 33 million people worldwide and increases the risk of stroke, heart failure, and death.1,2 Fourteen genetic loci have been associated with atrial fibrillation in European and Asian ancestry groups.3–7 To further define the genetic basis of atrial fibrillation, we performed large-scale, multi-racial meta-analyses of common and rare variant association studies. The genome-wide association studies (GWAS) included 18,398 individuals with atrial fibrillation and 91,536 referents; the exome-wide association studies (ExWAS) and rare variant association studies (RVAS) involved 22,806 cases and 132,612 referents. We identified 12 novel genetic loci that exceeded genome-wide significance, implicating genes involved in cardiac electrical and structural remodeling. Our results nearly double the number of known genetic loci for atrial fibrillation, provide insights into the molecular basis of atrial fibrillation, and may facilitate new potential targets for drug discovery.8 PMID:28416818

  3. Chromosomal microarray analysis as the first-tier test for the identification of pathogenic copy number variants in chromosome 9 pericentric regions and its challenge.

    PubMed

    Wang, Jia-Chi; Boyar, Fatih Z

    2016-01-01

    Chromosomal microarray analysis (CMA) has been recommended and practiced routinely in the large reference laboratories of U.S.A. as the first-tier test for the postnatal evaluation of individuals with intellectual disability, autism spectrum disorders, and/or multiple congenital anomalies. Using CMA as a diagnostic tool and without a routine setting of fluorescence in situ hybridization with labeled bacterial artificial chromosome probes (BAC-FISH) in the large reference laboratories becomes a challenge in the characterization of chromosome 9 pericentric region. This region has a very complex genomic structure and contains a variety of heterochromatic and euchromatic polymorphic variants. These variants were usually studied by G-banding, C-banding and BAC-FISH analysis. Chromosomal microarray analysis (CMA) was not recommended since it may lead to false positive results. Here, we presented a cohort of four cases, in which high-resolution CMA was used as the first-tier test or simultaneously with G-banding analysis on the proband to identify pathogenic copy number variants (CNVs) in the whole genome. CMA revealed large pathogenic CNVs from chromosome 9 in 3 cases which also revealed different G-banding patterns between the two chromosome 9 homologues. Although we demonstrated that high-resolution CMA played an important role in the identification of pathogenic copy number variants in chromosome 9 pericentric regions, the lack of BAC-FISH analysis or other useful tools renders significant challenges in the characterization of chromosome 9 pericentric regions. None; it is not a clinical trial, and the cases were retrospectively collected and analyzed.

  4. Minimization of vibration in elastic beams with time-variant boundary conditions

    NASA Technical Reports Server (NTRS)

    Amirouche, F. M. L.; Xie, Mingjun

    1992-01-01

    This paper presents an innovative method for minimizing the vibration of structures with time-variant boundary conditions (supports). The elastic body is modeled in two ways: (1) the first model is a letter seven type beam with a movable mass not to exceed the lower tip; (2) the second model has an arm that is a hollow beam with an inside mass with adjustable position. The complete solutions to both problems are carried out where the body is undergoing large rotation. The quasi-static procedure is used for the time-variant boundary conditions. The method developed employs partial differential equations governing the motion of the beam, including the effects of rigid-body motion, time-variant boundary conditions, and calculus of variations. The analytical solution is developed using Laplace and Fourier transforms. Examples of elastic robotic arms are given to illustrate the effectiveness of the methods developed.

  5. Extreme Entropy-Enthalpy Compensation in a Drug Resistant Variant of HIV-1 Protease

    PubMed Central

    King, Nancy M.; Prabu-Jeyabalan, Moses; Bandaranayake, Rajintha M.; Nalam, Madhavi N. L.; Nalivaika, Ellen A.; Özen, Ayşegül; Haliloglu, Türkan; Yılmaz, Neşe Kurt; Schiffer, Celia A.

    2012-01-01

    The development of HIV-1 protease inhibitors has been the historic paradigm of rational structure-based drug design, where structural and thermodynamic analyses have assisted in the discovery of novel inhibitors. While the total enthalpy and entropy change upon binding determine the affinity, often the thermodynamics are considered in terms of inhibitor properties only. In the current study, profound changes are observed in the binding thermodynamics of a drug resistant variant compared to wild-type HIV-1 protease, irrespective of the inhibitor bound. This variant (Flap+) has a combination of flap and active site mutations and exhibits extremely large entropy-enthalpy compensation compared to wild-type protease, 5–15 kcal/mol, while losing only 1–3 kcal/mol in total binding free energy for any of six FDA approved inhibitors. Although entropy-enthalpy compensation has been previously observed for a variety of systems, never have changes of this magnitude been reported. The co-crystal structures of Flap+ protease with four of the inhibitors were determined and compared with complexes of both the wildtype protease and another drug resistant variant that does not exhibit this energetic compensation. Structural changes conserved across the Flap+ complexes, which are more pronounced for the flaps covering the active site, likely contribute to the thermodynamic compensation. The finding that drug resistant mutations can profoundly modulate the relative thermodynamic properties of a therapeutic target independent of the inhibitor presents a new challenge for rational drug design. PMID:22712830
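
    The notion of entropy-enthalpy compensation in this abstract can be made explicit with the standard binding free energy relation. The worked numbers below are purely illustrative assumptions chosen to fall inside the ranges quoted above (5–15 kcal/mol of compensation, 1–3 kcal/mol net loss); the abstract does not state which term becomes unfavorable, so the signs are hypothetical.

```latex
% Binding free energy and its change between variant and wild type
\Delta G = \Delta H - T\,\Delta S
\qquad
\Delta\Delta G = \Delta\Delta H - T\,\Delta\Delta S,
\quad \text{where } \Delta\Delta X = \Delta X_{\text{variant}} - \Delta X_{\text{wild type}}

% Illustrative (hypothetical) values in kcal/mol:
\Delta\Delta H = +10, \qquad -T\,\Delta\Delta S = -8
\quad\Longrightarrow\quad
\Delta\Delta G = (+10) + (-8) = +2
```

    In this hypothetical case two large, opposing changes nearly cancel, so the net loss in binding free energy is small even though the individual enthalpic and entropic terms shift by several kcal/mol.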

  6. A 5000-Fold Increase in the Specificity of a Bacterial Phosphotriesterase for Malathion through Combinatorial Active Site Mutagenesis

    PubMed Central

    Naqvi, Tatheer; Warden, Andrew C.; French, Nigel; Sugrue, Elena; Carr, Paul D.; Jackson, Colin J.; Scott, Colin

    2014-01-01

    Phosphotriesterases (PTEs) have been isolated from a range of bacterial species, including Agrobacterium radiobacter (PTEAr), and are efficient enzymes with broad substrate ranges. The turnover rate of PTEAr for the common organophosphorous insecticide malathion is lower than expected based on its physical properties, principally the pKa of its leaving group. In this study, we rationalise the turnover rate of PTEAr for malathion using computational docking of the substrate into a high resolution crystal structure of the enzyme, suggesting that malathion is too large for the PTEAr binding pocket. Protein engineering through combinatorial active site saturation testing (CASTing) was then used to increase the rate of malathion turnover. Variants from a CASTing library in which Ser308 and Tyr309 were mutated yielded variants with increased activity towards malathion. The most active PTEAr variant carried Ser308Leu and Tyr309Ala substitutions, which resulted in a ca. 5000-fold increase in kcat/KM for malathion. X-ray crystal structures for the PTEAr Ser308Leu/Tyr309Ala variant demonstrate that the access to the binding pocket was enhanced by the replacement of the bulky Tyr309 residue with the smaller alanine residue. PMID:24721933

  7. Improving Disease Prediction by Incorporating Family Disease History in Risk Prediction Models with Large-Scale Genetic Data.

    PubMed

    Gim, Jungsoo; Kim, Wonji; Kwak, Soo Heon; Choi, Hosik; Park, Changyi; Park, Kyong Soo; Kwon, Sunghoon; Park, Taesung; Won, Sungho

    2017-11-01

    Despite the many successes of genome-wide association studies (GWAS), the known susceptibility variants identified by GWAS have modest effect sizes, leading to notable skepticism about the effectiveness of building a risk prediction model from large-scale genetic data. However, in contrast to genetic variants, the family history of diseases has been largely accepted as an important risk factor in clinical diagnosis and risk prediction. Nevertheless, the complicated structures of the family history of diseases have limited their application in clinical practice. Here, we developed a new method that enables incorporation of the general family history of diseases with a liability threshold model, and propose a new analysis strategy for risk prediction with penalized regression analysis that incorporates both large numbers of genetic variants and clinical risk factors. Application of our model to type 2 diabetes in the Korean population (1846 cases and 1846 controls) demonstrated that single-nucleotide polymorphisms accounted for 32.5% of the variation explained by the predicted risk scores in the test data set, and incorporation of family history led to an additional 6.3% improvement in prediction. Our results illustrate that family medical history provides valuable information on the variation of complex diseases and improves prediction performance. Copyright © 2017 by the Genetics Society of America.

  8. CanvasDB: a local database infrastructure for analysis of targeted- and whole genome re-sequencing projects

    PubMed Central

    Ameur, Adam; Bunikis, Ignas; Enroth, Stefan; Gyllensten, Ulf

    2014-01-01

    CanvasDB is an infrastructure for management and analysis of genetic variants from massively parallel sequencing (MPS) projects. The system stores SNP and indel calls in a local database, designed to handle very large datasets, to allow for rapid analysis using simple commands in R. Functional annotations are included in the system, making it suitable for direct identification of disease-causing mutations in human exome- (WES) or whole-genome sequencing (WGS) projects. The system has a built-in filtering function implemented to simultaneously take into account variant calls from all individual samples. This enables advanced comparative analysis of variant distribution between groups of samples, including detection of candidate causative mutations within family structures and genome-wide association by sequencing. In most cases, these analyses are executed within just a matter of seconds, even when there are several hundreds of samples and millions of variants in the database. We demonstrate the scalability of canvasDB by importing the individual variant calls from all 1092 individuals present in the 1000 Genomes Project into the system, over 4.4 billion SNPs and indels in total. Our results show that canvasDB makes it possible to perform advanced analyses of large-scale WGS projects on a local server. Database URL: https://github.com/UppsalaGenomeCenter/CanvasDB PMID:25281234
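
    The cross-sample filtering idea described above (finding candidate causative mutations within a family) can be sketched in a few lines of Python. This is a hedged illustration of the general technique only: it does not use or reproduce the CanvasDB R interface, and the function `candidate_variants`, the sample names, and the variant calls are all hypothetical.

```python
from typing import Dict, List, Set, Tuple

# A variant is keyed by (chromosome, position, reference allele, alternate allele).
Variant = Tuple[str, int, str, str]


def candidate_variants(
    calls: Dict[str, Set[Variant]],
    affected: List[str],
    unaffected: List[str],
) -> List[Variant]:
    """Return variants called in every affected sample and in no unaffected sample."""
    if not affected:
        return []
    # Keep only variants shared by all affected samples.
    shared = set.intersection(*(calls[s] for s in affected))
    # Discard anything also seen in an unaffected sample.
    for sample in unaffected:
        shared -= calls[sample]
    return sorted(shared)


# Hypothetical per-sample variant calls for a small family.
calls = {
    "child": {("chr1", 1000, "A", "G"), ("chr2", 500, "C", "T"), ("chr3", 42, "G", "GA")},
    "sibling": {("chr1", 1000, "A", "G"), ("chr3", 42, "G", "GA")},
    "mother": {("chr3", 42, "G", "GA")},
    "father": set(),
}
candidates = candidate_variants(
    calls, affected=["child", "sibling"], unaffected=["mother", "father"]
)
```

    Set operations keep the sketch short; a database-backed system would express the same filter as a query over indexed tables so that it scales to millions of variants.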

  9. CanvasDB: a local database infrastructure for analysis of targeted- and whole genome re-sequencing projects.

    PubMed

    Ameur, Adam; Bunikis, Ignas; Enroth, Stefan; Gyllensten, Ulf

    2014-01-01

    CanvasDB is an infrastructure for management and analysis of genetic variants from massively parallel sequencing (MPS) projects. The system stores SNP and indel calls in a local database, designed to handle very large datasets, to allow for rapid analysis using simple commands in R. Functional annotations are included in the system, making it suitable for direct identification of disease-causing mutations in human exome- (WES) or whole-genome sequencing (WGS) projects. The system has a built-in filtering function implemented to simultaneously take into account variant calls from all individual samples. This enables advanced comparative analysis of variant distribution between groups of samples, including detection of candidate causative mutations within family structures and genome-wide association by sequencing. In most cases, these analyses are executed within just a matter of seconds, even when there are several hundreds of samples and millions of variants in the database. We demonstrate the scalability of canvasDB by importing the individual variant calls from all 1092 individuals present in the 1000 Genomes Project into the system, over 4.4 billion SNPs and indels in total. Our results show that canvasDB makes it possible to perform advanced analyses of large-scale WGS projects on a local server. Database URL: https://github.com/UppsalaGenomeCenter/CanvasDB. © The Author(s) 2014. Published by Oxford University Press.

  10. De novo design of the hydrophobic core of ubiquitin.

    PubMed Central

    Lazar, G. A.; Desjarlais, J. R.; Handel, T. M.

    1997-01-01

    We have previously reported the development and evaluation of a computational program to assist in the design of hydrophobic cores of proteins. In an effort to investigate the role of core packing in protein structure, we have used this program, referred to as Repacking of Cores (ROC), to design several variants of the protein ubiquitin. Nine ubiquitin variants containing from three to eight hydrophobic core mutations were constructed, purified, and characterized in terms of their stability and their ability to adopt a uniquely folded native-like conformation. In general, designed ubiquitin variants are more stable than control variants in which the hydrophobic core was chosen randomly. However, in contrast to previous results with 434 cro, all designs are destabilized relative to the wild-type (WT) protein. This raises the possibility that beta-sheet structures have more stringent packing requirements than alpha-helical proteins. A more striking observation is that all variants, including random controls, adopt fairly well-defined conformations, regardless of their stability. This result supports conclusions from the cro studies that non-core residues contribute significantly to the conformational uniqueness of these proteins while core packing largely affects protein stability and has less impact on the nature or uniqueness of the fold. Concurrent with the above work, we used stability data on the nine ubiquitin variants to evaluate and improve the predictive ability of our core packing algorithm. Additional versions of the program were generated that differ in potential function parameters and sampling of side chain conformers. Reasonable correlations between experimental and predicted stabilities suggest the program will be useful in future studies to design variants with stabilities closer to that of the native protein. Taken together, the present study provides further clarification of the role of specific packing interactions in protein structure and stability, and demonstrates the benefit of using systematic computational methods to predict core packing arrangements for the design of proteins. PMID:9194177

  11. Computational Redesign of Acyl-ACP Thioesterase with Improved Selectivity toward Medium-Chain-Length Fatty Acids

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Grisewood, Matthew J.; Hernández-Lozada, Néstor J.; Thoden, James B.

    Enzyme and metabolic engineering offer the potential to develop biocatalysts for converting natural resources to a wide range of chemicals. To broaden the scope of potential products beyond natural metabolites, methods of engineering enzymes to accept alternative substrates and/or perform novel chemistries must be developed. DNA synthesis can create large libraries of enzyme-coding sequences, but most biochemistries lack a simple assay to screen for promising enzyme variants. Our solution to this challenge is structure-guided mutagenesis, in which optimization algorithms select the best sequences from libraries based on specified criteria (i.e., binding selectivity). We demonstrate this approach by identifying medium-chain (C8–C12) acyl-ACP thioesterases through structure-guided mutagenesis. Medium-chain fatty acids, which are products of thioesterase-catalyzed hydrolysis, are limited in natural abundance, compared to long-chain fatty acids; the limited supply leads to high costs of C6–C10 oleochemicals such as fatty alcohols, amines, and esters. Here, we applied computational tools to tune substrate binding of the highly active ‘TesA thioesterase in Escherichia coli. We used the IPRO algorithm to design thioesterase variants with enhanced C12 or C8 specificity, while maintaining high activity. After four rounds of structure-guided mutagenesis, we identified 3 variants with enhanced production of dodecanoic acid (C12) and 27 variants with enhanced production of octanoic acid (C8). The top variants reached up to 49% C12 and 50% C8 while exceeding native levels of total free fatty acids. A comparably sized library created by random mutagenesis failed to identify promising mutants. The chain length-preference of ‘TesA and the best mutant were confirmed in vitro using acyl-CoA substrates. Molecular dynamics simulations, confirmed by resolved crystal structures, of ‘TesA variants suggest that hydrophobic forces govern ‘TesA substrate specificity. Finally, we expect the design rules that we uncovered and the thioesterase variants that we identified will be useful to metabolic engineering projects aimed at sustainable production of medium-chain-length oleochemicals.

  12. Computational Redesign of Acyl-ACP Thioesterase with Improved Selectivity toward Medium-Chain-Length Fatty Acids

    DOE PAGES

    Grisewood, Matthew J.; Hernández-Lozada, Néstor J.; Thoden, James B.; ...

    2017-04-20

    Enzyme and metabolic engineering offer the potential to develop biocatalysts for converting natural resources to a wide range of chemicals. To broaden the scope of potential products beyond natural metabolites, methods of engineering enzymes to accept alternative substrates and/or perform novel chemistries must be developed. DNA synthesis can create large libraries of enzyme-coding sequences, but most biochemistries lack a simple assay to screen for promising enzyme variants. Our solution to this challenge is structure-guided mutagenesis, in which optimization algorithms select the best sequences from libraries based on specified criteria (i.e., binding selectivity). We demonstrate this approach by identifying medium-chain (C8–C12) acyl-ACP thioesterases through structure-guided mutagenesis. Medium-chain fatty acids, which are products of thioesterase-catalyzed hydrolysis, are limited in natural abundance, compared to long-chain fatty acids; the limited supply leads to high costs of C6–C10 oleochemicals such as fatty alcohols, amines, and esters. Here, we applied computational tools to tune substrate binding of the highly active ‘TesA thioesterase in Escherichia coli. We used the IPRO algorithm to design thioesterase variants with enhanced C12 or C8 specificity, while maintaining high activity. After four rounds of structure-guided mutagenesis, we identified 3 variants with enhanced production of dodecanoic acid (C12) and 27 variants with enhanced production of octanoic acid (C8). The top variants reached up to 49% C12 and 50% C8 while exceeding native levels of total free fatty acids. A comparably sized library created by random mutagenesis failed to identify promising mutants. The chain length-preference of ‘TesA and the best mutant were confirmed in vitro using acyl-CoA substrates. Molecular dynamics simulations, confirmed by resolved crystal structures, of ‘TesA variants suggest that hydrophobic forces govern ‘TesA substrate specificity. Finally, we expect the design rules that we uncovered and the thioesterase variants that we identified will be useful to metabolic engineering projects aimed at sustainable production of medium-chain-length oleochemicals.

  13. Extensive Diversity of Prion Strains Is Defined by Differential Chaperone Interactions and Distinct Amyloidogenic Regions

    PubMed Central

    Stein, Kevin C.; True, Heather L.

    2014-01-01

    Amyloidogenic proteins associated with a variety of unrelated diseases are typically capable of forming several distinct self-templating conformers. In prion diseases, these different structures, called prion strains (or variants), confer dramatic variation in disease pathology and transmission. Aggregate stability has been found to be a key determinant of the diverse pathological consequences of different prion strains. Yet, it remains largely unclear what other factors might account for the widespread phenotypic variation seen with aggregation-prone proteins. Here, we examined a set of yeast prion variants of the [RNQ+] prion that differ in their ability to induce the formation of another yeast prion called [PSI+]. Remarkably, we found that the [RNQ+] variants require different, non-contiguous regions of the Rnq1 protein for both prion propagation and [PSI+] induction. This included regions outside of the canonical prion-forming domain of Rnq1. Remarkably, such differences did not result in variation in aggregate stability. Our analysis also revealed a striking difference in the ability of these [RNQ+] variants to interact with the chaperone Sis1. Thus, our work shows that the differential influence of various amyloidogenic regions and interactions with host cofactors are critical determinants of the phenotypic consequences of distinct aggregate structures. This helps reveal the complex interdependent factors that influence how a particular amyloid structure may dictate disease pathology and progression. PMID:24811344

  14. Localized structural frustration for evaluating the impact of sequence variants.

    PubMed

    Kumar, Sushant; Clarke, Declan; Gerstein, Mark

    2016-12-01

    Population-scale sequencing is increasingly uncovering large numbers of rare single-nucleotide variants (SNVs) in coding regions of the genome. The rarity of these variants makes it challenging to evaluate their deleteriousness with conventional phenotype-genotype associations. Protein structures provide a way of addressing this challenge. Previous efforts have focused on globally quantifying the impact of SNVs on protein stability. However, local perturbations may severely impact protein functionality without strongly disrupting global stability (e.g. in relation to catalysis or allostery). Here, we describe a workflow in which localized frustration, quantifying unfavorable local interactions, is employed as a metric to investigate such effects. Using this workflow on the Protein Databank, we find that frustration produces many immediately intuitive results: for instance, disease-related SNVs create stronger changes in localized frustration than non-disease related variants, and rare SNVs tend to disrupt local interactions to a larger extent than common variants. Less obviously, we observe that somatic SNVs associated with oncogenes and tumor suppressor genes (TSGs) induce very different changes in frustration. In particular, those associated with TSGs change the frustration more in the core than the surface (by introducing loss-of-function events), whereas those associated with oncogenes manifest the opposite pattern, creating gain-of-function events. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  15. Resistance to malaria through structural variation of red blood cell invasion receptors

    PubMed Central

    Leffler, Ellen M.; Band, Gavin; Busby, George B.J.; Kivinen, Katja; Le, Quang Si; Clarke, Geraldine M.; Bojang, Kalifa A.; Conway, David J.; Jallow, Muminatou; Sisay-Joof, Fatoumatta; Bougouma, Edith C.; Mangano, Valentina D.; Modiano, David; Sirima, Sodiomon B.; Achidi, Eric; Apinjoh, Tobias O.; Marsh, Kevin; Ndila, Carolyne M.; Peshu, Norbert; Williams, Thomas N.; Drakeley, Chris; Manjurano, Alphaxard; Reyburn, Hugh; Riley, Eleanor; Kachala, David; Molyneux, Malcolm; Nyirongo, Vysaul; Taylor, Terrie; Thornton, Nicole; Tilley, Louise; Grimsley, Shane; Drury, Eleanor; Stalker, Jim; Cornelius, Victoria; Hubbart, Christina; Jeffreys, Anna E.; Rowlands, Kate; Rockett, Kirk A.; Spencer, Chris C.A.; Kwiatkowski, Dominic P.

    2017-01-01

    The malaria parasite Plasmodium falciparum invades human red blood cells via interactions between host and parasite surface proteins. By analyzing genome sequence data from human populations, including 1269 individuals from sub-Saharan Africa, we identify a diverse array of large copy number variants affecting the host invasion receptor genes GYPA and GYPB. We find that a nearby association with severe malaria is explained by a complex structural rearrangement involving the loss of GYPB and gain of two GYPB-A hybrid genes, which encode a serologically distinct blood group antigen known as Dantu. This variant reduces the risk of severe malaria by 40% and has recently risen in frequency in parts of Kenya, yet it appears to be absent from west Africa. These findings link structural variation of red blood cell invasion receptors with natural resistance to severe malaria. PMID:28522690

  16. Resistance to malaria through structural variation of red blood cell invasion receptors.

    PubMed

    Leffler, Ellen M; Band, Gavin; Busby, George B J; Kivinen, Katja; Le, Quang Si; Clarke, Geraldine M; Bojang, Kalifa A; Conway, David J; Jallow, Muminatou; Sisay-Joof, Fatoumatta; Bougouma, Edith C; Mangano, Valentina D; Modiano, David; Sirima, Sodiomon B; Achidi, Eric; Apinjoh, Tobias O; Marsh, Kevin; Ndila, Carolyne M; Peshu, Norbert; Williams, Thomas N; Drakeley, Chris; Manjurano, Alphaxard; Reyburn, Hugh; Riley, Eleanor; Kachala, David; Molyneux, Malcolm; Nyirongo, Vysaul; Taylor, Terrie; Thornton, Nicole; Tilley, Louise; Grimsley, Shane; Drury, Eleanor; Stalker, Jim; Cornelius, Victoria; Hubbart, Christina; Jeffreys, Anna E; Rowlands, Kate; Rockett, Kirk A; Spencer, Chris C A; Kwiatkowski, Dominic P

    2017-06-16

    The malaria parasite Plasmodium falciparum invades human red blood cells by a series of interactions between host and parasite surface proteins. By analyzing genome sequence data from human populations, including 1269 individuals from sub-Saharan Africa, we identify a diverse array of large copy-number variants affecting the host invasion receptor genes GYPA and GYPB. We find that a nearby association with severe malaria is explained by a complex structural rearrangement involving the loss of GYPB and gain of two GYPB-A hybrid genes, which encode a serologically distinct blood group antigen known as Dantu. This variant reduces the risk of severe malaria by 40% and has recently increased in frequency in parts of Kenya, yet it appears to be absent from west Africa. These findings link structural variation of red blood cell invasion receptors with natural resistance to severe malaria. Copyright © 2017, American Association for the Advancement of Science.

  17. The Role of Constitutional Copy Number Variants in Breast Cancer

    PubMed Central

    Walker, Logan C.; Wiggins, George A.R.; Pearson, John F.

    2015-01-01

    Constitutional copy number variants (CNVs) include inherited and de novo deviations from a diploid state at a defined genomic region. These variants contribute significantly to genetic variation and disease in humans, including breast cancer susceptibility. Identification of genetic risk factors for breast cancer in recent years has been dominated by the use of genome-wide technologies, such as single nucleotide polymorphism (SNP)-arrays, with a significant focus on single nucleotide variants. To date, these large datasets have been underutilised for generating genome-wide CNV profiles despite offering a massive resource for assessing the contribution of these structural variants to breast cancer risk. Technical challenges remain in determining the location and distribution of CNVs across the human genome due to the accuracy of computational prediction algorithms and resolution of the array data. Moreover, better methods are required for interpreting the functional effect of newly discovered CNVs. In this review, we explore current and future application of SNP array technology to assess rare and common CNVs in association with breast cancer risk in humans. PMID:27600231

  18. G23D: Online tool for mapping and visualization of genomic variants on 3D protein structures.

    PubMed

    Solomon, Oz; Kunik, Vered; Simon, Amos; Kol, Nitzan; Barel, Ortal; Lev, Atar; Amariglio, Ninette; Somech, Raz; Rechavi, Gidi; Eyal, Eran

    2016-08-26

    Evaluation of the possible implications of genomic variants is an increasingly important task in the current high throughput sequencing era. Structural information however is still not routinely exploited during this evaluation process. The main reasons can be attributed to the partial structural coverage of the human proteome and the lack of tools which conveniently convert genomic positions, which are the frequent output of genomic pipelines, to proteins and structure coordinates. We present G23D, a tool for conversion of human genomic coordinates to protein coordinates and protein structures. G23D allows mapping of genomic positions/variants on evolutionary related (and not only identical) protein three dimensional (3D) structures as well as on theoretical models. By doing so it significantly extends the space of variants for which structural insight is feasible. To facilitate interpretation of the variant consequence, pathogenic variants, functional sites and polymorphism sites are displayed on protein sequence and structure diagrams alongside the input variants. G23D also provides modeling of the mutant structure, analysis of intra-protein contacts and instant access to functional predictions and predictions of thermo-stability changes. G23D is available at http://www.sheba-cancer.org.il/G23D . G23D extends the fraction of variants for which structural analysis is applicable and provides better and faster accessibility for structural data to biologists and geneticists who routinely work with genomic information.

  19. Dual allosteric activation mechanisms in monomeric human glucokinase

    PubMed Central

    Whittington, A. Carl; Larion, Mioara; Bowler, Joseph M.; Ramsey, Kristen M.; Brüschweiler, Rafael; Miller, Brian G.

    2015-01-01

    Cooperativity in human glucokinase (GCK), the body’s primary glucose sensor and a major determinant of glucose homeostatic diseases, is fundamentally different from textbook models of allostery because GCK is monomeric and contains only one glucose-binding site. Prior work has demonstrated that millisecond timescale order-disorder transitions within the enzyme’s small domain govern cooperativity. Here, using limited proteolysis, we map the site of disorder in unliganded GCK to a 30-residue active-site loop that closes upon glucose binding. Positional randomization of the loop, coupled with genetic selection in a glucokinase-deficient bacterium, uncovers a hyperactive GCK variant with substantially reduced cooperativity. Biochemical and structural analysis of this loop variant and GCK variants associated with hyperinsulinemic hypoglycemia reveal two distinct mechanisms of enzyme activation. In α-type activation, glucose affinity is increased, the proteolytic susceptibility of the active site loop is suppressed and the 1H-13C heteronuclear multiple quantum coherence (HMQC) spectrum of 13C-Ile–labeled enzyme resembles the glucose-bound state. In β-type activation, glucose affinity is largely unchanged, proteolytic susceptibility of the loop is enhanced, and the 1H-13C HMQC spectrum reveals no perturbation in ensemble structure. Leveraging both activation mechanisms, we engineer a fully noncooperative GCK variant, whose functional properties are indistinguishable from other hexokinase isozymes, and which displays a 100-fold increase in catalytic efficiency over wild-type GCK. This work elucidates specific structural features responsible for generating allostery in a monomeric enzyme and suggests a general strategy for engineering cooperativity into proteins that lack the structural framework typical of traditional allosteric systems. PMID:26283387

  20. Dual allosteric activation mechanisms in monomeric human glucokinase.

    PubMed

    Whittington, A Carl; Larion, Mioara; Bowler, Joseph M; Ramsey, Kristen M; Brüschweiler, Rafael; Miller, Brian G

    2015-09-15

    Cooperativity in human glucokinase (GCK), the body's primary glucose sensor and a major determinant of glucose homeostatic diseases, is fundamentally different from textbook models of allostery because GCK is monomeric and contains only one glucose-binding site. Prior work has demonstrated that millisecond timescale order-disorder transitions within the enzyme's small domain govern cooperativity. Here, using limited proteolysis, we map the site of disorder in unliganded GCK to a 30-residue active-site loop that closes upon glucose binding. Positional randomization of the loop, coupled with genetic selection in a glucokinase-deficient bacterium, uncovers a hyperactive GCK variant with substantially reduced cooperativity. Biochemical and structural analysis of this loop variant and GCK variants associated with hyperinsulinemic hypoglycemia reveal two distinct mechanisms of enzyme activation. In α-type activation, glucose affinity is increased, the proteolytic susceptibility of the active site loop is suppressed and the 1H-13C heteronuclear multiple quantum coherence (HMQC) spectrum of 13C-Ile-labeled enzyme resembles the glucose-bound state. In β-type activation, glucose affinity is largely unchanged, proteolytic susceptibility of the loop is enhanced, and the 1H-13C HMQC spectrum reveals no perturbation in ensemble structure. Leveraging both activation mechanisms, we engineer a fully noncooperative GCK variant, whose functional properties are indistinguishable from other hexokinase isozymes, and which displays a 100-fold increase in catalytic efficiency over wild-type GCK. This work elucidates specific structural features responsible for generating allostery in a monomeric enzyme and suggests a general strategy for engineering cooperativity into proteins that lack the structural framework typical of traditional allosteric systems.

  1. Influence of population diversity on neurovirulence potential of plaque purified L-Zagreb variants.

    PubMed

    Ivancic-Jelecki, Jelena; Forcic, Dubravko; Jagusic, Maja; Kosutic-Gulija, Tanja; Mazuran, Renata; Balija, Maja Lang; Isakov, Ofer; Shomron, Noam

    2016-04-29

    Despite continuing research efforts, determinants of mumps virus virulence are still largely unknown. One consequence of this is the difficulty of striking a balance between efficacy and safety of live attenuated mumps vaccines. Among mumps vaccine strains associated with the occurrence of postvaccinal aseptic meningitis is L-Zagreb, developed by further attenuation of vaccine strain L-3. Starting from an archived L-Zagreb sample with a suboptimal neuroattenuation score, we isolated different viral variants and compared their genetic and phenotypic properties in a search for neurovirulence markers. Six different L-Zagreb variants were isolated by plaque purification. Their neurovirulence status was determined by a rat-based neurovirulence test; population structure was determined by deep sequencing. We isolated one well neuroattenuated viral variant, two marginally neuroattenuated, and three insufficiently neuroattenuated. No genetic markers of neurovirulence could be identified. None of the variants had detectable amounts of defective interfering particles. Two characteristics set insufficiently neuroattenuated variants apart from less-neurovirulent ones: elevated variability level in regions 1293-3314, 5363-7773 and 9382-11657, and/or elevated number of mutations present in frequencies ≥ 1%. The most neurovirulent variants possessed both of these features. Distinctive heterogeneity profiles were obtained for insufficiently neuroattenuated L-Zagreb variants. No markers that would discriminate between marginally and well neuroattenuated variants were identified. The findings of this study may serve as a guideline during development of an improved L3/L-Zagreb vaccine strain. Copyright © 2016 Elsevier Ltd. All rights reserved.

  2. Molecular Dynamics of CYP2D6 Polymorphisms in the Absence and Presence of a Mechanism-Based Inactivator Reveals Changes in Local Flexibility and Dominant Substrate Access Channels

    PubMed Central

    de Waal, Parker W.; Sunden, Kyle F.; Furge, Laura Lowe

    2014-01-01

    Cytochrome P450 enzymes (CYPs) represent an important enzyme superfamily involved in metabolism of many endogenous and exogenous small molecules. CYP2D6 is responsible for ∼15% of CYP-mediated drug metabolism and exhibits large phenotypic diversity within CYPs with over 100 different allelic variants. Many of these variants lead to functional changes in enzyme activity and substrate selectivity. Herein, a molecular dynamics comparative analysis of four different variants of CYP2D6 was performed. The comparative analysis included simulations with and without SCH 66712, a ligand that is also a mechanism-based inactivator, in order to investigate the possible structural basis of CYP2D6 inactivation. Analysis of protein stability highlighted significantly altered flexibility in both proximal and distal residues from the variant residues. In the absence of SCH 66712, *34, *17-2, and *17-3 displayed more flexibility than *1, and *53 displayed more rigidity. SCH 66712 binding reversed flexibility in *17-2 and *17-3, though *53 remained largely rigid. Throughout simulations with docked SCH 66712, ligand orientation within the heme-binding pocket was consistent with previously identified sites of metabolism and measured binding energies. Subsequent tunnel analysis of substrate access, egress, and solvent channels displayed varied bottle-neck radii. Taken together, our results indicate that SCH 66712 should inactivate these allelic variants, although varied flexibility and substrate binding-pocket accessibility may alter its interaction abilities. PMID:25286176

  3. Real-Time Nonlocal Means-Based Despeckling.

    PubMed

    Breivik, Lars Hofsoy; Snare, Sten Roar; Steen, Erik Normann; Solberg, Anne H Schistad

    2017-06-01

    In this paper, we propose a multiscale nonlocal means-based despeckling method for medical ultrasound. The multiscale approach leads to large computational savings and improves despeckling results over single-scale iterative approaches. We present two variants of the method. The first, denoted multiscale nonlocal means (MNLM), yields uniform robust filtering of speckle both in structured and homogeneous regions. The second, denoted unnormalized MNLM (UMNLM), is more conservative in regions of structure assuring minimal disruption of salient image details. Due to the popularity of anisotropic diffusion-based methods in the despeckling literature, we review the connection between anisotropic diffusion and iterative variants of NLM. These iterative variants in turn relate to our multiscale variant. As part of our evaluation, we conduct a simulation study making use of ground truth phantoms generated from clinical B-mode ultrasound images. We evaluate our method against a set of popular methods from the despeckling literature on both fine and coarse speckle noise. In terms of computational efficiency, our method outperforms the other considered methods. Quantitatively on simulations and on a tissue-mimicking phantom, our method is found to be competitive with the state-of-the-art. On clinical B-mode images, our method is found to effectively smooth speckle while preserving low-contrast and highly localized salient image detail.

  4. Mapping cis- and trans-regulatory effects across multiple tissues in twins

    PubMed Central

    Grundberg, Elin; Small, Kerrin S.; Hedman, Åsa K.; Nica, Alexandra C.; Buil, Alfonso; Keildson, Sarah; Bell, Jordana T.; Yang, Tsun-Po; Meduri, Eshwar; Barrett, Amy; Nisbett, James; Sekowska, Magdalena; Wilk, Alicja; Shin, So-Youn; Glass, Daniel; Travers, Mary; Min, Josine L.; Ring, Sue; Ho, Karen; Thorleifsson, Gudmar; Kong, Augustine; Thorsteindottir, Unnur; Ainali, Chrysanthi; Dimas, Antigone S.; Hassanali, Neelam; Ingle, Catherine; Knowles, David; Krestyaninova, Maria; Lowe, Christopher E.; Di Meglio, Paola; Montgomery, Stephen B.; Parts, Leopold; Potter, Simon; Surdulescu, Gabriela; Tsaprouni, Loukia; Tsoka, Sophia; Bataille, Veronique; Durbin, Richard; Nestle, Frank O.; O’Rahilly, Stephen; Soranzo, Nicole; Lindgren, Cecilia M.; Zondervan, Krina T.; Ahmadi, Kourosh R.; Schadt, Eric E.; Stefansson, Kari; Smith, George Davey; McCarthy, Mark I.; Deloukas, Panos; Dermitzakis, Emmanouil T.; Spector, Tim D.

    2013-01-01

    Sequence-based variation in gene expression is a key driver of disease risk. Common variants regulating expression in cis have been mapped in many eQTL studies typically in single tissues from unrelated individuals. Here, we present a comprehensive analysis of gene expression across multiple tissues conducted in a large set of mono- and dizygotic twins that allows systematic dissection of genetic (cis and trans) and non-genetic effects on gene expression. Using identity-by-descent estimates, we show that at least 40% of the total heritable cis-effect on expression cannot be accounted for by common cis-variants, a finding which exposes the contribution of low frequency and rare regulatory variants with respect to both transcriptional regulation and complex trait susceptibility. We show that a substantial proportion of gene expression heritability is trans to the structural gene and identify several replicating trans-variants which act predominantly in a tissue-restricted manner and may regulate the transcription of many genes. PMID:22941192

  5. Fine-scale patterns of population stratification confound rare variant association tests.

    PubMed

    O'Connor, Timothy D; Kiezun, Adam; Bamshad, Michael; Rich, Stephen S; Smith, Joshua D; Turner, Emily; Leal, Suzanne M; Akey, Joshua M

    2013-01-01

    Advances in next-generation sequencing technology have enabled systematic exploration of the contribution of rare variation to Mendelian and complex diseases. Although it is well known that population stratification can generate spurious associations with common alleles, its impact on rare variant association methods remains poorly understood. Here, we performed exhaustive coalescent simulations with demographic parameters calibrated from exome sequence data to evaluate the performance of nine rare variant association methods in the presence of fine-scale population structure. We find that all methods have an inflated spurious association rate for parameter values that are consistent with levels of differentiation typical of European populations. For example, at a nominal significance level of 5%, some test statistics have a spurious association rate as high as 40%. Finally, we empirically assess the impact of population stratification in a large data set of 4,298 European American exomes. Our results have important implications for the design, analysis, and interpretation of rare variant genome-wide association studies.

  6. Cancer genetics meets biomolecular mechanism-bridging an age-old gulf.

    PubMed

    González-Sánchez, Juan Carlos; Raimondi, Francesco; Russell, Robert B

    2018-02-01

    Increasingly available genomic sequencing data are exploited to identify genes and variants contributing to diseases, particularly cancer. Traditionally, methods to find such variants have relied heavily on allele frequency and/or familial history, often neglecting to consider any mechanistic understanding of their functional consequences. Thus, while the set of known cancer-related genes has increased, for many, their mechanistic role in the disease is not completely understood. This issue highlights a wide gap between the disciplines of genetics, which largely aims to correlate genetic events with phenotype, and molecular biology, which ultimately aims at a mechanistic understanding of biological processes. Fortunately, new methods and several systematic studies have proved illuminating for many disease genes and variants by integrating sequencing with mechanistic data, including biomolecular structures and interactions. These have provided new interpretations for known mutations and suggested new disease-relevant variants and genes. Here, we review these approaches and discuss particular examples where these have had a profound impact on the understanding of human cancers. © 2018 Federation of European Biochemical Societies.

  7. Disruption of the Putative Vascular Leak Peptide Sequence in the Stabilized Ricin Vaccine Candidate RTA1-33/44-198

    PubMed Central

    Janosi, Laszlo; Compton, Jaimee R.; Legler, Patricia M.; Steele, Keith E.; Davis, Jon M.; Matyas, Gary R.; Millard, Charles B.

    2013-01-01

    Vitetta and colleagues identified and characterized a putative vascular leak peptide (VLP) consensus sequence in recombinant ricin toxin A-chain (RTA) that contributed to dose-limiting human toxicity when RTA was administered intravenously in large quantities during chemotherapy. We disrupted this potentially toxic site within the more stable RTA1-33/44-198 vaccine immunogen and determined the impact of these mutations on protein stability, structure and protective immunogenicity using an experimental intranasal ricin challenge model in BALB/c mice to determine if the mutations were compatible. Single amino acid substitutions at the positions corresponding with RTA D75 (to A, or N) and V76 (to I, or M) had minor effects on the apparent protein melting temperature of RTA1-33/44-198 but all four variants retained greater apparent stability than the parent RTA. Moreover, each VLP(−) variant tested provided protection comparable with that of RTA1-33/44-198 against supralethal intranasal ricin challenge as judged by animal survival and several biomarkers. To understand better how VLP substitutions and mutations near the VLP site impact epitope structure, we introduced a previously described thermal stabilizing disulfide bond (R48C/T77C) along with the D75N or V76I substitutions in RTA1-33/44-198. The D75N mutation was compatible with the adjacent stabilizing R48C/T77C disulfide bond and the Tm was unaffected, whereas the V76I mutation was less compatible with the adjacent disulfide bond involving C77. A crystal structure of the RTA1-33/44-198 R48C/T77C/D75N variant showed that the structural integrity of the immunogen was largely conserved and that a stable immunogen could be produced from E. coli. We conclude that it is feasible to disrupt the VLP site in RTA1-33/44-198 with little or no impact on apparent protein stability or protective efficacy in mice and such variants can be stabilized further by introduction of a disulfide bond. PMID:23364220

  8. Comparison of N- and O-linked glycosylation patterns of ebolavirus glycoproteins.

    PubMed

    Collar, Amanda L; Clarke, Elizabeth C; Anaya, Eduardo; Merrill, Denise; Yarborough, Sarah; Anthony, Scott M; Kuhn, Jens H; Merle, Christine; Theisen, Manfred; Bradfute, Steven B

    2017-02-01

    Ebolaviruses are emerging pathogens that cause severe and often fatal viral hemorrhagic fevers. Four distinct ebolaviruses are known to cause Ebola virus disease in humans. The ebolavirus envelope glycoprotein (GP1,2) is heavily glycosylated, but the precise glycosylation patterns of ebolaviruses are largely unknown. Here we demonstrate that approximately 50 different N-glycan structures are present in GP1,2 derived from the four pathogenic ebolaviruses, including high mannose, hybrid, and bi-, tri-, and tetra-antennary complex glycans with and without fucose and sialic acid. The overall N-glycan composition is similar between the different ebolavirus GP1,2s. In contrast, the amount and type of O-glycan structures varies widely between ebolavirus GP1,2s. Notably, this O-glycan dissimilarity is also present between two variants of Ebola virus, the original Yambuku variant and the Makona variant responsible for the most recent Western African epidemic. The data presented here should serve as the foundation for future ebolaviral entry and immunogenicity studies. Copyright © 2016 Elsevier Inc. All rights reserved.

  9. Quantitative Missense Variant Effect Prediction Using Large-Scale Mutagenesis Data.

    PubMed

    Gray, Vanessa E; Hause, Ronald J; Luebeck, Jens; Shendure, Jay; Fowler, Douglas M

    2018-01-24

    Large datasets describing the quantitative effects of mutations on protein function are becoming increasingly available. Here, we leverage these datasets to develop Envision, which predicts the magnitude of a missense variant's molecular effect. Envision combines 21,026 variant effect measurements from nine large-scale experimental mutagenesis datasets, a hitherto untapped training resource, with a supervised, stochastic gradient boosting learning algorithm. Envision outperforms other missense variant effect predictors both on large-scale mutagenesis data and on an independent test dataset comprising 2,312 TP53 variants whose effects were measured using a low-throughput approach. This dataset was never used for hyperparameter tuning or model training and thus serves as an independent validation set. Envision prediction accuracy is also more consistent across amino acids than other predictors. Finally, we demonstrate that Envision's performance improves as more large-scale mutagenesis data are incorporated. We precompute Envision predictions for every possible single amino acid variant in human, mouse, frog, zebrafish, fruit fly, worm, and yeast proteomes (https://envision.gs.washington.edu/). Copyright © 2017 Elsevier Inc. All rights reserved.

  10. Different disease-causing mutations in transthyretin trigger the same conformational conversion.

    PubMed

    Steward, Robert E; Armen, Roger S; Daggett, Valerie

    2008-03-01

    Transthyretin (TTR)-containing amyloid fibrils are deposited in cardiac tissue as a natural consequence of aging. A large number of inherited mutations lead to amyloid diseases by accelerating TTR deposition in other organs. Amyloid formation is preceded by a disruption of the quaternary structure of TTR and conformational changes in the monomer. To study conformational changes preceding the formation of amyloid, we performed molecular dynamics simulations of the wild-type monomer, amyloidogenic variants (V30M, L55P, V122I) and a protective variant (T119M) at neutral and low pH. At low pH, the D strand dissociated from the beta-sheet to expose the A strand, consistent with experimental studies. In amyloidogenic variants and in the wild-type at low pH, there was a conformational change in the beta-sheets into alpha-sheet via peptide bond flips that was not observed at neutral pH in the wild-type monomer. The same residues participated in conversion in each amyloidogenic variant simulation, originating in the G strand between residues 106 and 109, with accelerated conversion at low pH. The T119M protective variant changed the local conformation of the H strand and suppressed the conversion observed in amyloidogenic variants.

  11. New family of graphene-based organic semiconductors: An investigation of photon-induced electronic structure manipulation in half-fluorinated graphene

    NASA Astrophysics Data System (ADS)

    Walter, Andrew L.; Sahin, Hasan; Kang, Jun; Jeon, Ki-Joon; Bostwick, Aaron; Horzum, Seyda; Moreschini, Luca; Chang, Young Jun; Peeters, Francois M.; Horn, Karsten; Rotenberg, Eli

    2016-02-01

    The application of graphene to electronic and optoelectronic devices is limited by the absence of reliable semiconducting variants of this material. A promising candidate in this respect is graphene oxide, with a band gap on the order of ~5 eV; however, this has a finite density of states at the Fermi level. Here, we examine the electronic structure of three variants of half-fluorinated carbon on SiC(0001), i.e., the (6√3×6√3)R30° C/SiC "buffer layer," graphene on this (6√3×6√3)R30° C/SiC buffer layer, and graphene decoupled from the SiC substrate by hydrogen intercalation. Using angle-resolved photoemission, core level photoemission, and x-ray absorption, we show that the electronic, chemical, and physical structure of all three variants is remarkably similar, exhibiting a large band gap and a vanishing density of states at the Fermi level. These results are explained in terms of first-principles calculations. This material thus appears very suitable for applications, even more so since it is prepared on a processing-friendly substrate. We also investigate two separate UV photon-induced modifications of the electronic structure that transform the insulating samples (6.2-eV band gap) into semiconducting (~2.5-eV band gap) and metallic regions, respectively.

  12. Atomic structures of corkscrew-forming segments of SOD1 reveal varied oligomer conformations.

    PubMed

    Sangwan, Smriti; Sawaya, Michael R; Murray, Kevin A; Hughes, Michael P; Eisenberg, David S

    2018-02-17

    The aggregation cascade of disease-related amyloidogenic proteins, terminating in insoluble amyloid fibrils, involves intermediate oligomeric states. The structural and biochemical details of these oligomers have been largely unknown. Here we report crystal structures of variants of the cytotoxic oligomer-forming segment residues 28-38 of the ALS-linked protein, SOD1. The crystal structures reveal three different architectures: corkscrew oligomeric structure, nontwisting curved sheet structure and a steric zipper proto-filament structure. Our work highlights the polymorphism of the segment 28-38 of SOD1 and identifies the molecular features of amyloidogenic entities. © 2018 The Protein Society.

  13. Oncodomains: A protein domain-centric framework for analyzing rare variants in tumor samples

    PubMed Central

    Peterson, Thomas A.; Park, Junyong

    2017-01-01

    The fight against cancer is hindered by its highly heterogeneous nature. Genome-wide sequencing studies have shown that individual malignancies contain many mutations that range from those commonly found in tumor genomes to rare somatic variants present only in a small fraction of lesions. Such rare somatic variants dominate the landscape of genomic mutations in cancer, yet efforts to correlate somatic mutations found in one or few individuals with functional roles have been largely unsuccessful. Traditional methods for identifying somatic variants that drive cancer are ‘gene-centric’ in that they consider only somatic variants within a particular gene and make no comparison to other similar genes in the same family that may play a similar role in cancer. In this work, we present oncodomain hotspots, a new ‘domain-centric’ method for identifying clusters of somatic mutations across entire gene families using protein domain models. Our analysis confirms that our approach creates a framework for leveraging structural and functional information encapsulated by protein domains into the analysis of somatic variants in cancer, enabling the assessment of even rare somatic variants by comparison to similar genes. Our results reveal a vast landscape of somatic variants that act at the level of domain families altering pathways known to be involved with cancer such as protein phosphorylation, signaling, gene regulation, and cell metabolism. Due to oncodomain hotspots’ unique ability to assess rare variants, we expect our method to become an important tool for the analysis of sequenced tumor genomes, complementing existing methods. PMID:28426665

  14. Predicting Gene Structure Changes Resulting from Genetic Variants via Exon Definition Features.

    PubMed

    Majoros, William H; Holt, Carson; Campbell, Michael S; Ware, Doreen; Yandell, Mark; Reddy, Timothy E

    2018-04-25

    Genetic variation that disrupts gene function by altering gene splicing between individuals can substantially influence traits and disease. In those cases, accurately predicting the effects of genetic variation on splicing can be highly valuable for investigating the mechanisms underlying those traits and diseases. While methods have been developed to generate high quality computational predictions of gene structures in reference genomes, the same methods perform poorly when used to predict the potentially deleterious effects of genetic changes that alter gene splicing between individuals. Underlying that discrepancy in predictive ability are the common assumptions by reference gene finding algorithms that genes are conserved, well-formed, and produce functional proteins. We describe a probabilistic approach for predicting recent changes to gene structure that may or may not conserve function. The model is applicable to both coding and noncoding genes, and can be trained on existing gene annotations without requiring curated examples of aberrant splicing. We apply this model to the problem of predicting altered splicing patterns in the genomes of individual humans, and we demonstrate that performing gene-structure prediction without relying on conserved coding features is feasible. The model predicts an unexpected abundance of variants that create de novo splice sites, an observation supported by both simulations and empirical data from RNA-seq experiments. While these de novo splice variants are commonly misinterpreted by other tools as coding or noncoding variants of little or no effect, we find that in some cases they can have large effects on splicing activity and protein products, and we propose that they may commonly act as cryptic factors in disease. The software is available from geneprediction.org/SGRF. Contact: bmajoros@duke.edu. Supplementary information is available at Bioinformatics online.

  15. Sex-dependent association of common variants of microcephaly genes with brain structure.

    PubMed

    Rimol, Lars M; Agartz, Ingrid; Djurovic, Srdjan; Brown, Andrew A; Roddey, J Cooper; Kähler, Anna K; Mattingsdal, Morten; Athanasiu, Lavinia; Joyner, Alexander H; Schork, Nicholas J; Halgren, Eric; Sundet, Kjetil; Melle, Ingrid; Dale, Anders M; Andreassen, Ole A

    2010-01-05

    Loss-of-function mutations in the genes associated with primary microcephaly (MCPH) reduce human brain size by about two-thirds, without producing gross abnormalities in brain organization or physiology and leaving other organs largely unaffected [Woods CG, et al. (2005) Am J Hum Genet 76:717-728]. There is also evidence suggesting that MCPH genes have evolved rapidly in primates and humans and have been subjected to selection in recent human evolution [Vallender EJ, et al. (2008) Trends Neurosci 31:637-644]. Here, we show that common variants of MCPH genes account for some of the common variation in brain structure in humans, independently of disease status. We investigated the correlations of SNPs from four MCPH genes with brain morphometry phenotypes obtained with MRI. We found significant, sex-specific associations between common, nonexonic, SNPs of the genes CDK5RAP2, MCPH1, and ASPM, with brain volume or cortical surface area in an ethnically homogenous Norwegian discovery sample (n = 287), including patients with mental illness. The most strongly associated SNP findings were replicated in an independent North American sample (n = 656), which included patients with dementia. These results are consistent with the view that common variation in brain structure is associated with genetic variants located in nonexonic, presumably regulatory, regions.

  16. Hsp40 function in yeast prion propagation: Amyloid diversity necessitates chaperone functional complexity.

    PubMed

    Sporn, Zachary A; Hines, Justin K

    2015-01-01

    Yeast prions are heritable protein-based elements, most of which are formed of amyloid aggregates that rely on the action of molecular chaperones for transmission to progeny. Prions can form distinct amyloid structures, known as 'strains' in mammalian systems, that dictate both pathological progression and cross-species infection barriers. In yeast these same amyloid structural polymorphisms, called 'variants', dictate the intensity of prion-associated phenotypes and stability in mitosis. We recently reported that [PSI(+)] prion variants differ in the fundamental domain requirements for one chaperone, the Hsp40/J-protein Sis1, which are mutually exclusive between 2 different yeast prions, demonstrating a functional plurality for Sis1. Here we extend that analysis to incorporate additional data that collectively support the hypothesis that Sis1 has multiple functional roles that can be accomplished by distinct sets of domains. These functions are differentially required by distinct prions and prion variants. We also present new data regarding Hsp104-mediated prion elimination and show that some Sis1 functions, but not all, are conserved in the human homolog Hdj1/DNAJB1. Importantly, of the 10 amyloid-based prions identified to date in Saccharomyces cerevisiae, the chaperone requirements of only 4 are known, leaving a great diversity of amyloid structures, and likely modes of amyloid-chaperone interaction, largely unexplored.

  17. Theoretical foundations of spatially-variant mathematical morphology part ii: gray-level images.

    PubMed

    Bouaynaya, Nidhal; Schonfeld, Dan

    2008-05-01

    In this paper, we develop a spatially-variant (SV) mathematical morphology theory for gray-level signals and images in the Euclidean space. The proposed theory preserves the geometrical concept of the structuring function, which provides the foundation of classical morphology and is essential in signal and image processing applications. We define the basic SV gray-level morphological operators (i.e., SV gray-level erosion, dilation, opening, and closing) and investigate their properties. We demonstrate the ubiquity of SV gray-level morphological systems by deriving a kernel representation for a large class of systems, called V-systems, in terms of the basic SV gray-level morphological operators. A V-system is defined to be a gray-level operator, which is invariant under gray-level (vertical) translations. Particular attention is focused on the class of SV flat gray-level operators. The kernel representation for increasing V-systems is a generalization of Maragos' kernel representation for increasing and translation-invariant function-processing systems. A representation of V-systems in terms of their kernel elements is established for increasing and upper-semi-continuous V-systems. This representation unifies a large class of spatially-variant linear and non-linear systems under the same mathematical framework. Finally, simulation results show the potential power of the general theory of gray-level spatially-variant mathematical morphology in several image analysis and computer vision applications.

  18. Rare variants and cardiovascular disease.

    PubMed

    Wain, Louise V

    2014-09-01

    Cardiovascular disease (CVD) is a leading cause of mortality and morbidity in the Western world. Large genome-wide association studies (GWASs) of coronary artery disease, myocardial infarction, stroke and dilated cardiomyopathy have identified a number of common genetic variants with modest effects on disease risk. Similarly, studies of important modifiable risk factors of CVD have identified a large number of predominantly common variant associations, for example, with blood pressure and blood lipid levels. In each case, despite the often large numbers of loci identified, only a small proportion of the phenotypic variance is explained. It has been hypothesised that rare variants with large effects may account for some of the missing variance but large-scale studies of rare variation are in their infancy for cardiovascular traits and have yet to produce fruitful results. Studies of monogenic CVDs, inherited disorders believed to be entirely driven by individual rare mutations, have highlighted genes that play a key role in disease aetiology. In this review, we discuss how findings from studies of rare variants in monogenic disease and GWAS of predominantly common variants are converging to provide further insight into biological disease mechanisms. © The Author 2014. Published by Oxford University Press. All rights reserved. For permissions, please email: journals.permissions@oup.com.

  19. The correlation of fragmentation and structure of a protein

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wu, Qinyuan; Cheng, Xueheng; Van Orden, S.

    1995-12-31

    Characterization of proteins of similar structures is important to understanding the biological function of the proteins and the processes with which they are involved. Cytochrome c variants typically have similar sequences, and have similar conformations in solution with almost identical absorption spectra and redox potentials. The authors chose cytochrome c's from bovine, tuna, rabbit and horse as a model system in studying large biomolecules using MS^n of multiply charged ions generated from electrospray ionization (ESI).

  20. Direct Calculation of Protein Fitness Landscapes through Computational Protein Design

    PubMed Central

    Au, Loretta; Green, David F.

    2016-01-01

    Naturally selected amino-acid sequences or experimentally derived ones are often the basis for understanding how protein three-dimensional conformation and function are determined by primary structure. Such sequences for a protein family comprise only a small fraction of all possible variants, however, representing the fitness landscape with limited scope. Explicitly sampling and characterizing alternative, unexplored protein sequences would directly identify fundamental reasons for sequence robustness (or variability), and we demonstrate that computational methods offer an efficient mechanism toward this end, on a large scale. The dead-end elimination and A∗ search algorithms were used here to find all low-energy single mutant variants, and corresponding structures of a G-protein heterotrimer, to measure changes in structural stability and binding interactions to define a protein fitness landscape. We established consistency between these algorithms with known biophysical and evolutionary trends for amino-acid substitutions, and could thus recapitulate known protein side-chain interactions and predict novel ones. PMID:26745411

  1. Mapping and phasing of structural variation in patient genomes using nanopore sequencing.

    PubMed

    Cretu Stancu, Mircea; van Roosmalen, Markus J; Renkens, Ivo; Nieboer, Marleen M; Middelkamp, Sjors; de Ligt, Joep; Pregno, Giulia; Giachino, Daniela; Mandrile, Giorgia; Espejo Valle-Inclan, Jose; Korzelius, Jerome; de Bruijn, Ewart; Cuppen, Edwin; Talkowski, Michael E; Marschall, Tobias; de Ridder, Jeroen; Kloosterman, Wigard P

    2017-11-06

    Despite improvements in genomics technology, the detection of structural variants (SVs) from short-read sequencing still poses challenges, particularly for complex variation. Here we analyse the genomes of two patients with congenital abnormalities using the MinION nanopore sequencer and a novel computational pipeline-NanoSV. We demonstrate that nanopore long reads are superior to short reads with regard to detection of de novo chromothripsis rearrangements. The long reads also enable efficient phasing of genetic variations, which we leveraged to determine the parental origin of all de novo chromothripsis breakpoints and to resolve the structure of these complex rearrangements. Additionally, genome-wide surveillance of inherited SVs reveals novel variants, missed in short-read data sets, a large proportion of which are retrotransposon insertions. We provide a first exploration of patient genome sequencing with a nanopore sequencer and demonstrate the value of long-read sequencing in mapping and phasing of SVs for both clinical and research applications.

  2. Structures of MART-1 26/27-35 Peptide/HLA-A2 Complexes Reveal a Remarkable Disconnect between Antigen Structural Homology and T Cell Recognition

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Borbulevych, Oleg Y; Insaidoo, Francis K; Baxter, Tiffany K

    2008-09-17

    Small structural changes in peptides presented by major histocompatibility complex (MHC) molecules often result in large changes in immunogenicity, supporting the notion that T cell receptors are exquisitely sensitive to antigen structure. Yet there are striking examples of TCR recognition of structurally dissimilar ligands. The resulting unpredictability of how T cells will respond to different or modified antigens impacts both our understanding of the physical bases for TCR specificity as well as efforts to engineer peptides for immunomodulation. In cancer immunotherapy, epitopes and variants derived from the MART-1/Melan-A protein are widely used as clinical vaccines. Two overlapping epitopes spanning amino acid residues 26 through 35 are of particular interest: numerous clinical studies have been performed using variants of the MART-1 26-35 decamer, although only the 27-35 nonamer has been found on the surface of targeted melanoma cells. Here, we show that the 26-35 and 27-35 peptides adopt strikingly different conformations when bound to HLA-A2. Nevertheless, clonally distinct MART-1 26/27-35-reactive T cells show broad cross-reactivity towards these ligands. Simultaneously, however, many of the cross-reactive T cells remain unable to recognize anchor-modified variants with very subtle structural differences. These dichotomous observations challenge our thinking about how structural information on unligated peptide/MHC complexes should be best used when addressing questions of TCR specificity. Our findings also indicate that caution is warranted in the design of immunotherapeutics based on the MART-1 26/27-35 epitopes, as neither cross-reactivity nor selectivity is predictable based on the analysis of the structures alone.

  3. Structures of MART-1 26/27–35 Peptide/HLA-A2 Complexes Reveal a Remarkable Disconnect between Antigen Structural Homology and T Cell Recognition

    PubMed Central

    Borbulevych, Oleg Y.; Insaidoo, Francis K.; Baxter, Tiffany K.; Powell, Daniel J.; Johnson, Laura A.; Restifo, Nicholas P.; Baker, Brian M.

    2007-01-01

    Small structural changes in peptides presented by major histocompatibility complex (MHC) molecules often result in large changes in immunogenicity, supporting the notion that T cell receptors are exquisitely sensitive to antigen structure. Yet there are striking examples of TCR recognition of structurally dissimilar ligands. The resulting unpredictability of how T cells will respond to different or modified antigens impacts both our understanding of the physical bases for TCR specificity as well as efforts to engineer peptides for immunomodulation. In cancer immunotherapy, epitopes and variants derived from the MART-1/Melan-A protein are widely used as clinical vaccines. Two overlapping epitopes spanning amino acid residues 26 through 35 are of particular interest: numerous clinical studies have been performed using variants of the MART-1 26–35 decamer, although only the 27–35 nonamer has been found on the surface of targeted melanoma cells. Here, we show that the 26–35 and 27–35 peptides adopt strikingly different conformations when bound to HLA-A2. Nevertheless, clonally distinct MART-1 26/27–35-reactive T cells show broad cross-reactivity towards these ligands. Simultaneously, however, many of the cross-reactive T cells remain unable to recognize anchor-modified variants with very subtle structural differences. These dichotomous observations challenge our thinking about how structural information on unligated peptide/MHC complexes should be best used when addressing questions of TCR specificity. Our findings also indicate that caution is warranted in the design of immunotherapeutics based on the MART-1 26/27–35 epitopes, as neither cross-reactivity nor selectivity is predictable based on the analysis of the structures alone. PMID:17719062

  4. Prediction of protein tertiary structure to low resolution: performance for a large and structurally diverse test set.

    PubMed

    Eyrich, V A; Standley, D M; Friesner, R A

    1999-05-14

    We report the tertiary structure predictions for 95 proteins ranging in size from 17 to 160 residues starting from known secondary structure. Predictions are obtained from global minimization of an empirical potential function followed by the application of a refined atomic overlap potential. The minimization strategy employed represents a variant of the Monte Carlo plus minimization scheme of Li and Scheraga applied to a reduced model of the protein chain. For all of the cases except beta-proteins larger than 75 residues, a native-like structure, usually 4-6 Å root-mean-square deviation from the native, is located. For beta-proteins larger than 75 residues, the energy gap between native-like structures and the lowest energy structures produced in the simulation is large, so that low RMSD structures are not generated starting from an unfolded state. This is attributed to the lack of an explicit hydrogen bond term in the potential function, which we hypothesize is necessary to stabilize large assemblies of beta-strands. Copyright 1999 Academic Press.

  5. The Evolution and Functional Impact of Human Deletion Variants Shared with Archaic Hominin Genomes

    PubMed Central

    Lin, Yen-Lung; Pavlidis, Pavlos; Karakoc, Emre; Ajay, Jerry; Gokcumen, Omer

    2015-01-01

    Allele sharing between modern and archaic hominin genomes has been variously interpreted to have originated from ancestral genetic structure or through non-African introgression from archaic hominins. However, evolution of polymorphic human deletions that are shared with archaic hominin genomes has yet to be studied. We identified 427 polymorphic human deletions that are shared with archaic hominin genomes, approximately 87% of which originated before the Human–Neandertal divergence (ancient) and only approximately 9% of which have been introgressed from Neandertals (introgressed). Recurrence, incomplete lineage sorting between human and chimp lineages, and hominid-specific insertions constitute the remaining approximately 4% of allele sharing between humans and archaic hominins. We observed that ancient deletions correspond to more than 13% of all common (>5% allele frequency) deletion variation among modern humans. Our analyses indicate that the genomic landscapes of both ancient and introgressed deletion variants were primarily shaped by purifying selection, eliminating large and exonic variants. We found 17 exonic deletions that are shared with archaic hominin genomes, including those leading to three fusion transcripts. The affected genes are involved in metabolism of external and internal compounds, growth and sperm formation, as well as susceptibility to psoriasis and Crohn’s disease. Our analyses suggest that these “exonic” deletion variants have evolved through different adaptive forces, including balancing and population-specific positive selection. Our findings reveal that genomic structural variants that are shared between humans and archaic hominin genomes are common among modern humans and can influence biomedically and evolutionarily important phenotypes. PMID:25556237

  6. Dual Modifications of α-Galactosylceramide Synergize to Promote Activation of Human Invariant Natural Killer T Cells and Stimulate Anti-tumor Immunity.

    PubMed

    Chennamadhavuni, Divya; Saavedra-Avila, Noemi Alejandra; Carreño, Leandro J; Guberman-Pfeffer, Matthew J; Arora, Pooja; Yongqing, Tang; Koay, Hui-Fern; Godfrey, Dale I; Keshipeddy, Santosh; Richardson, Stewart K; Sundararaj, Srinivasan; Lo, Jae Ho; Wen, Xiangshu; Gascón, José A; Yuan, Weiming; Rossjohn, Jamie; Le Nours, Jérôme; Porcelli, Steven A; Howell, Amy R

    2018-05-17

    Glycosylceramides that activate CD1d-restricted invariant natural killer T (iNKT) cells have potential therapeutic applications for augmenting immune responses against cancer and infections. Previous studies using mouse models identified sphinganine variants of α-galactosylceramide as promising iNKT cell activators that stimulate cytokine responses with a strongly proinflammatory bias. However, the activities of sphinganine variants in mice have generally not translated well to studies of human iNKT cell responses. Here, we show that strongly proinflammatory and anti-tumor iNKT cell responses were achieved in mice by a variant of α-galactosylceramide that combines a sphinganine base with a hydrocinnamoyl ester on C6″ of the sugar. Importantly, the activities observed with this variant were largely preserved for human iNKT cell responses. Structural and in silico modeling studies provided a mechanistic basis for these findings and suggested basic principles for capturing useful properties of sphinganine analogs of synthetic iNKT cell activators in the design of immunotherapeutic agents. Copyright © 2018 Elsevier Ltd. All rights reserved.

  7. Genetic Relationships Between Schizophrenia, Bipolar Disorder, and Schizoaffective Disorder

    PubMed Central

    Cardno, Alastair G.

    2014-01-01

    There is substantial evidence for partial overlap of genetic influences on schizophrenia and bipolar disorder, with family, twin, and adoption studies showing a genetic correlation between the disorders of around 0.6. Results of genome-wide association studies are consistent with commonly occurring genetic risk variants, contributing to both the shared and nonshared aspects, while studies of large, rare chromosomal structural variants, particularly copy number variants, show a stronger influence on schizophrenia than bipolar disorder to date. Schizoaffective disorder has been less investigated but shows substantial familial overlap with both schizophrenia and bipolar disorder. A twin analysis is consistent with genetic influences on schizoaffective episodes being entirely shared with genetic influences on schizophrenic and manic episodes, while association studies suggest the possibility of some relatively specific genetic influences on broadly defined schizoaffective disorder, bipolar subtype. Further insights into genetic relationships between these disorders are expected as studies continue to increase in sample size and in technical and analytical sophistication, information on phenotypes beyond clinical diagnoses is increasingly incorporated, and approaches such as next-generation sequencing identify additional types of genetic risk variant. PMID:24567502

  8. [Clinical and morphological variants of diverticular disease in colon].

    PubMed

    Levchenko, S V; Lazebnik, L B; Potapova, V B; Rogozina, V A

    2013-01-01

    We present the results of a two-stage study. The first stage was a retrospective analysis of 3682 X-ray examinations of the large bowel conducted in 2002-2004 to define the structure of colon disease and to determine gender differences. The second stage was a prospective study from 2003 to 2012 in which 486 patients with diverticular disease were regularly observed. The following parameters were assessed: dynamics of complaints, quality of life, and clinical symptoms. Multiple X-ray and endoscopic examinations were performed to assess the number and size of diverticula and changes of the colon mucosa, and to compare X-ray and endoscopic methods in the prognosis of complications. As a result of our research, two basic clinical and morphological variants of diverticular disease (DD) of the colon were identified: IBD-like DD and DD with an ischemic component. The variants differ in pain characteristics, presence of accompanying diseases, quality-of-life parameters, and the morphological findings in the colon mucosa. We suppose that different etiopathogenetic factors underlying the two variants influence the disease prognosis and the selection of treatment.

  9. Sequence variants of the DFNB31 gene among Usher syndrome patients of diverse origin

    PubMed Central

    Aller, Elena; Jaijo, Teresa; van Wijk, Erwin; Ebermann, Inga; Kersten, Ferry; García-García, Gema; Voesenek, Krysta; Aparisi, María José; Hoefsloot, Lies; Cremers, Cor; Díaz-Llopis, Manuel; Pennings, Ronald; Bolz, Hanno J.; Kremer, Hannie; Millán, José M.

    2010-01-01

    Purpose: It has been demonstrated that mutations in deafness, autosomal recessive 31 (DFNB31), the gene encoding whirlin, are responsible for nonsyndromic hearing loss (NSHL; DFNB31) and Usher syndrome type II (USH2D). We screened DFNB31 in a large cohort of patients with different clinical subtypes of Usher syndrome (USH) to determine the prevalence of DFNB31 mutations among USH patients. Methods: DFNB31 was screened in 149 USH2, 29 USH1, six atypical USH, and 11 unclassified USH patients from diverse ethnic backgrounds. Mutation detection was performed by direct sequencing of all coding exons. Results: We identified 38 different variants among 195 patients. Most variants were clearly polymorphic, but at least two out of the 15 nonsynonymous variants (p.R350W and p.R882S) are predicted to impair whirlin structure and function, suggesting eventual pathogenicity. No putatively pathogenic mutation was found in the second allele of patients with these mutations. Conclusions: DFNB31 is not a major cause of USH. PMID:20352026

  10. Structural analysis of two length variants of the rDNA intergenic spacer from Eruca sativa.

    PubMed

    Lakshmikumaran, M; Negi, M S

    1994-03-01

    Restriction enzyme analysis of the rRNA genes of Eruca sativa indicated the presence of many length variants within a single plant and also between different cultivars which is unusual for most crucifers studied so far. Two length variants of the rDNA intergenic spacer (IGS) from a single individual E. sativa (cv. Itsa) plant were cloned and characterized. The complete nucleotide sequences of both the variants (3 kb and 4 kb) were determined. The intergenic spacer contains three families of tandemly repeated DNA sequences denoted as A, B and C. However, the long (4 kb) variant shows the presence of an additional repeat, denoted as D, which is a duplication of a 224 bp sequence just upstream of the putative transcription initiation site. Repeat units belonging to the three different families (A, B and C) were in the size range of 22 to 30 bp. Such short repeat elements are present in the IGS of most of the crucifers analysed so far. Sequence analysis of the variants (3 kb and 4 kb) revealed that the length heterogeneity of the spacer is located at three different regions and is due to the varying copy numbers of repeat units belonging to families A and B. Length variation of the spacer is also due to the presence of a large duplication (D repeats) in the 4 kb variant which is absent in the 3 kb variant. The putative transcription initiation site was identified by comparisons with the rDNA sequences from other plant species.

  11. A novel approach to determine primary stability of acetabular press-fit cups.

    PubMed

    Weißmann, Volker; Boss, Christian; Bader, Rainer; Hansmann, Harald

    2018-04-01

    Today hip cups are used in a large variety of design variants and in increasing numbers of units. Their development is steadily progressing. In addition to conventional manufacturing methods for hip cups, additive methods, in particular, play an increasingly important role as development progresses. The present paper describes a modified cup model developed based on a commercially available press-fit cup (Allofit 54/JJ). The press-fit cup was designed in two variants and manufactured using selective laser melting (SLM). Variant 1 (Ti) was modeled on the Allofit cup using an adapted process technology. Variant 2 (Ti-S) was provided with a porous load bearing structure on its surface. In addition to the typical (complete) geometry, both variants were also manufactured and tested in a reduced shape where only the press-fit area was formed. To assess the primary stability of the press-fit cups in the artificial bone cavity, pull-out and lever-out tests were carried out. Exact fit conditions and two-millimeter press-fit were investigated. The closed-cell PU foam used as an artificial bone cavity was mechanically characterized to exclude any influence on the results of the investigation. The pull-out forces of the Ti variant (complete: 526 N, reduced: 468 N) and the Ti-S variant (complete: 548 N, reduced: 526 N) as well as the lever-out moments of the Ti variant (complete: 10 Nm, reduced: 9.8 Nm) and the Ti-S variant (complete: 9 Nm, reduced: 7.9 Nm) show no significant differences in the results between complete and reduced cups. The results show that the use of reduced cups in a press-fit design is possible within the scope of development work. Copyright © 2018 Elsevier Ltd. All rights reserved.

  12. The Personal Genome Project Canada: findings from whole genome sequences of the inaugural 56 participants

    PubMed Central

    Reuter, Miriam S.; Walker, Susan; Thiruvahindrapuram, Bhooma; Whitney, Joe; Cohn, Iris; Sondheimer, Neal; Yuen, Ryan K.C.; Trost, Brett; Paton, Tara A.; Pereira, Sergio L.; Herbrick, Jo-Anne; Wintle, Richard F.; Merico, Daniele; Howe, Jennifer; MacDonald, Jeffrey R.; Lu, Chao; Nalpathamkalam, Thomas; Sung, Wilson W.L.; Wang, Zhuozhi; Patel, Rohan V.; Pellecchia, Giovanna; Wei, John; Strug, Lisa J.; Bell, Sherilyn; Kellam, Barbara; Mahtani, Melanie M.; Bassett, Anne S.; Bombard, Yvonne; Weksberg, Rosanna; Shuman, Cheryl; Cohn, Ronald D.; Stavropoulos, Dimitri J.; Bowdin, Sarah; Hildebrandt, Matthew R.; Wei, Wei; Romm, Asli; Pasceri, Peter; Ellis, James; Ray, Peter; Meyn, M. Stephen; Monfared, Nasim; Hosseini, S. Mohsen; Joseph-George, Ann M.; Keeley, Fred W.; Cook, Ryan A.; Fiume, Marc; Lee, Hin C.; Marshall, Christian R.; Davies, Jill; Hazell, Allison; Buchanan, Janet A.; Szego, Michael J.; Scherer, Stephen W.

    2018-01-01

    BACKGROUND: The Personal Genome Project Canada is a comprehensive public data resource that integrates whole genome sequencing data and health information. We describe genomic variation identified in the initial recruitment cohort of 56 volunteers. METHODS: Volunteers were screened for eligibility and provided informed consent for open data sharing. Using blood DNA, we performed whole genome sequencing and identified all possible classes of DNA variants. A genetic counsellor explained the implication of the results to each participant. RESULTS: Whole genome sequencing of the first 56 participants identified 207 662 805 sequence variants and 27 494 copy number variations. We analyzed a prioritized disease-associated data set (n = 1606 variants) according to standardized guidelines, and interpreted 19 variants in 14 participants (25%) as having obvious health implications. Six of these variants (e.g., in BRCA1 or mosaic loss of an X chromosome) were pathogenic or likely pathogenic. Seven were risk factors for cancer, cardiovascular or neurobehavioural conditions. Four other variants — associated with cancer, cardiac or neurodegenerative phenotypes — remained of uncertain significance because of discrepancies among databases. We also identified a large structural chromosome aberration and a likely pathogenic mitochondrial variant. There were 172 recessive disease alleles (e.g., 5 individuals carried mutations for cystic fibrosis). Pharmacogenomics analyses revealed another 3.9 potentially relevant genotypes per individual. INTERPRETATION: Our analyses identified a spectrum of genetic variants with potential health impact in 25% of participants. When also considering recessive alleles and variants with potential pharmacologic relevance, all 56 participants had medically relevant findings. Although access is mostly limited to research, whole genome sequencing can provide specific and novel information with the potential of major impact for health care. PMID:29431110

  13. Relation of genomic variants for Alzheimer disease dementia to common neuropathologies

    PubMed Central

    Yu, Lei; Buchman, Aron S.; Schneider, Julie A.; De Jager, Philip L.; Bennett, David A.

    2016-01-01

    Objective: To investigate the associations of previously reported Alzheimer disease (AD) dementia genomic variants with common neuropathologies. Methods: This is a postmortem study including 1,017 autopsied participants from 2 clinicopathologic cohorts. Analyses focused on 22 genomic variants associated with AD dementia in large-scale case-control genome-wide association study (GWAS) meta-analyses. The neuropathologic traits of interest were a pathologic diagnosis of AD according to NIA-Reagan criteria, macroscopic and microscopic infarcts, Lewy bodies (LB), and hippocampal sclerosis. For each variant, multiple logistic regression was used to investigate its association with neuropathologic traits, adjusting for age, sex, and subpopulation structure. We also conducted power analyses to estimate the sample sizes required to detect genome-wide significance (p < 5 × 10−8) for pathologic AD for all variants. Results: APOE ε4 allele was associated with greater odds of pathologic AD (odds ratio [OR] 3.82, 95% confidence interval [CI] 2.67–5.46, p = 1.9 × 10−13), while ε2 allele was associated with lower odds of pathologic AD (OR 0.42, 95% CI 0.30–0.61, p = 3.1 × 10−6). Four additional genomic variants including rs6656401 (CR1), rs1476679 (ZCWPW1), rs35349669 (INPP5D), and rs17125944 (FERMT2) had p values less than 0.05. Remarkably, half of the previously reported AD dementia variants are not likely to be detected for association with pathologic AD with a sample size in excess of the largest GWAS meta-analyses of AD dementia. Conclusions: Many recently discovered genomic variants for AD dementia are not associated with the pathology of AD. Some genomic variants for AD dementia appear to be associated with other common neuropathologies. PMID:27371493

  14. Relation of genomic variants for Alzheimer disease dementia to common neuropathologies.

    PubMed

    Farfel, Jose M; Yu, Lei; Buchman, Aron S; Schneider, Julie A; De Jager, Philip L; Bennett, David A

    2016-08-02

    To investigate the associations of previously reported Alzheimer disease (AD) dementia genomic variants with common neuropathologies. This is a postmortem study including 1,017 autopsied participants from 2 clinicopathologic cohorts. Analyses focused on 22 genomic variants associated with AD dementia in large-scale case-control genome-wide association study (GWAS) meta-analyses. The neuropathologic traits of interest were a pathologic diagnosis of AD according to NIA-Reagan criteria, macroscopic and microscopic infarcts, Lewy bodies (LB), and hippocampal sclerosis. For each variant, multiple logistic regression was used to investigate its association with neuropathologic traits, adjusting for age, sex, and subpopulation structure. We also conducted power analyses to estimate the sample sizes required to detect genome-wide significance (p < 5 × 10(-8)) for pathologic AD for all variants. APOE ε4 allele was associated with greater odds of pathologic AD (odds ratio [OR] 3.82, 95% confidence interval [CI] 2.67-5.46, p = 1.9 × 10(-13)), while ε2 allele was associated with lower odds of pathologic AD (OR 0.42, 95% CI 0.30-0.61, p = 3.1 × 10(-6)). Four additional genomic variants including rs6656401 (CR1), rs1476679 (ZCWPW1), rs35349669 (INPP5D), and rs17125944 (FERMT2) had p values less than 0.05. Remarkably, half of the previously reported AD dementia variants are not likely to be detected for association with pathologic AD with a sample size in excess of the largest GWAS meta-analyses of AD dementia. Many recently discovered genomic variants for AD dementia are not associated with the pathology of AD. Some genomic variants for AD dementia appear to be associated with other common neuropathologies. © 2016 American Academy of Neurology.

  15. The Personal Genome Project Canada: findings from whole genome sequences of the inaugural 56 participants.

    PubMed

    Reuter, Miriam S; Walker, Susan; Thiruvahindrapuram, Bhooma; Whitney, Joe; Cohn, Iris; Sondheimer, Neal; Yuen, Ryan K C; Trost, Brett; Paton, Tara A; Pereira, Sergio L; Herbrick, Jo-Anne; Wintle, Richard F; Merico, Daniele; Howe, Jennifer; MacDonald, Jeffrey R; Lu, Chao; Nalpathamkalam, Thomas; Sung, Wilson W L; Wang, Zhuozhi; Patel, Rohan V; Pellecchia, Giovanna; Wei, John; Strug, Lisa J; Bell, Sherilyn; Kellam, Barbara; Mahtani, Melanie M; Bassett, Anne S; Bombard, Yvonne; Weksberg, Rosanna; Shuman, Cheryl; Cohn, Ronald D; Stavropoulos, Dimitri J; Bowdin, Sarah; Hildebrandt, Matthew R; Wei, Wei; Romm, Asli; Pasceri, Peter; Ellis, James; Ray, Peter; Meyn, M Stephen; Monfared, Nasim; Hosseini, S Mohsen; Joseph-George, Ann M; Keeley, Fred W; Cook, Ryan A; Fiume, Marc; Lee, Hin C; Marshall, Christian R; Davies, Jill; Hazell, Allison; Buchanan, Janet A; Szego, Michael J; Scherer, Stephen W

    2018-02-05

    The Personal Genome Project Canada is a comprehensive public data resource that integrates whole genome sequencing data and health information. We describe genomic variation identified in the initial recruitment cohort of 56 volunteers. Volunteers were screened for eligibility and provided informed consent for open data sharing. Using blood DNA, we performed whole genome sequencing and identified all possible classes of DNA variants. A genetic counsellor explained the implication of the results to each participant. Whole genome sequencing of the first 56 participants identified 207 662 805 sequence variants and 27 494 copy number variations. We analyzed a prioritized disease-associated data set ( n = 1606 variants) according to standardized guidelines, and interpreted 19 variants in 14 participants (25%) as having obvious health implications. Six of these variants (e.g., in BRCA1 or mosaic loss of an X chromosome) were pathogenic or likely pathogenic. Seven were risk factors for cancer, cardiovascular or neurobehavioural conditions. Four other variants - associated with cancer, cardiac or neurodegenerative phenotypes - remained of uncertain significance because of discrepancies among databases. We also identified a large structural chromosome aberration and a likely pathogenic mitochondrial variant. There were 172 recessive disease alleles (e.g., 5 individuals carried mutations for cystic fibrosis). Pharmacogenomics analyses revealed another 3.9 potentially relevant genotypes per individual. Our analyses identified a spectrum of genetic variants with potential health impact in 25% of participants. When also considering recessive alleles and variants with potential pharmacologic relevance, all 56 participants had medically relevant findings. Although access is mostly limited to research, whole genome sequencing can provide specific and novel information with the potential of major impact for health care. © 2018 Joule Inc. or its licensors.

  16. Determination of disease phenotypes and pathogenic variants from exome sequence data in the CAGI 4 gene panel challenge.

    PubMed

    Kundu, Kunal; Pal, Lipika R; Yin, Yizhou; Moult, John

    2017-09-01

    The use of gene panel sequence for diagnostic and prognostic testing is now widespread, but there are so far few objective tests of methods to interpret these data. We describe the design and implementation of a gene panel sequencing data analysis pipeline (VarP) and its assessment in a CAGI4 community experiment. The method was applied to clinical gene panel sequencing data of 106 patients, with the goal of determining which of 14 disease classes each patient has and the corresponding causative variant(s). The disease class was correctly identified for 36 cases, including 10 where the original clinical pipeline did not find causative variants. For a further seven cases, we found strong evidence of an alternative disease to that tested. Many of the potentially causative variants are missense, with no previous association with disease, and these proved the hardest to correctly assign pathogenicity or otherwise. Post analysis showed that three-dimensional structure data could have helped for up to half of these cases. Over-reliance on HGMD annotation led to a number of incorrect disease assignments. We used a largely ad hoc method to assign probabilities of pathogenicity for each variant, and there is much work still to be done in this area. © 2017 The Authors. Human Mutation published by Wiley Periodicals, Inc.

  17. The histone variant H2A.Bbd is enriched at sites of DNA synthesis

    PubMed Central

    Sansoni, Viola; Casas-Delucchi, Corella S.; Rajan, Malini; Schmidt, Andreas; Bönisch, Clemens; Thomae, Andreas W.; Staege, Martin S.; Hake, Sandra B.; Cardoso, M. Cristina; Imhof, Axel

    2014-01-01

    Histone variants play an important role in shaping the mammalian epigenome and their aberrant expression is frequently observed in several types of cancer. However, the mechanisms that mediate their function and the composition of the variant-containing chromatin are still largely unknown. A proteomic interrogation of chromatin containing the different H2A variants macroH2A.1.2, H2A.Bbd and H2A revealed a strikingly different protein composition. Gene ontology analysis reveals a strong enrichment of splicing factors as well as components of the mammalian replisome in H2A.Bbd-containing chromatin. We find H2A.Bbd localizing transiently to sites of DNA synthesis during S-phase and during DNA repair. Cells that express H2A.Bbd have a shortened S-phase and are more susceptible to DNA damage, two phenotypes that are also observed in human Hodgkin's lymphoma cells that aberrantly express this variant. Based on our experiments we conclude that H2A.Bbd is targeted to newly synthesized DNA during replication and DNA repair. The transient incorporation of H2A.Bbd may be due to the intrinsic instability of nucleosomes carrying this variant or a faster chromatin loading. This potentially leads to a disturbance of the existing chromatin structure, which may have effects on cell cycle regulation and DNA damage sensitivity. PMID:24753410

  18. Adaptation and major chromosomal changes in populations of Saccharomyces cerevisiae.

    PubMed

    Adams, J; Puskas-Rozsa, S; Simlar, J; Wilke, C M

    1992-07-01

    Thirteen independent populations of Saccharomyces cerevisiae (nine haploid and four diploid) were maintained in continuous culture for up to approximately 1000 generations, with growth limited by the concentration of organic phosphates in medium buffered at pH 6. Analysis of clones isolated from these populations showed that a number (17) of large-scale chromosomal-length variants and rearrangements were present in the populations at their termination. Nine of the 16 yeast chromosomes were involved in such changes. Few of the changes could be explained by copy-number increases in the structural loci for acid phosphatase. Several considerations concerning the nature and frequency of the chromosome-length variants observed lead us to conclude that they are selectively advantageous.

  19. An abundance of rare functional variants in 202 drug target genes sequenced in 14,002 people

    PubMed Central

    Nelson, Matthew R.; Wegmann, Daniel; Ehm, Margaret G.; Kessner, Darren; St. Jean, Pamela; Verzilli, Claudio; Shen, Judong; Tang, Zhengzheng; Bacanu, Silviu-Alin; Fraser, Dana; Warren, Liling; Aponte, Jennifer; Zawistowski, Matthew; Liu, Xiao; Zhang, Hao; Zhang, Yong; Li, Jun; Li, Yun; Li, Li; Woollard, Peter; Topp, Simon; Hall, Matthew D.; Nangle, Keith; Wang, Jun; Abecasis, Gonçalo; Cardon, Lon R.; Zöllner, Sebastian; Whittaker, John C.; Chissoe, Stephanie L.; Novembre, John; Mooser, Vincent

    2015-01-01

    Rare genetic variants contribute to complex disease risk; however, the abundance of rare variants in human populations remains unknown. We explored this spectrum of variation by sequencing 202 genes encoding drug targets in 14,002 individuals. We find rare variants are abundant (one every 17 bases) and geographically localized, such that even with large sample sizes, rare variant catalogs will be largely incomplete. We used the observed patterns of variation to estimate population growth parameters, the proportion of variants in a given frequency class that are putatively deleterious, and mutation rates for each gene. Overall we conclude that, due to rapid population growth and weak purifying selection, human populations harbor an abundance of rare variants, many of which are deleterious and have relevance to understanding disease risk. PMID:22604722

  20. A statistical approach to detection of copy number variations in PCR-enriched targeted sequencing data.

    PubMed

    Demidov, German; Simakova, Tamara; Vnuchkova, Julia; Bragin, Anton

    2016-10-22

    Multiplex polymerase chain reaction (PCR) is a common enrichment technique for targeted massive parallel sequencing (MPS) protocols. MPS is widely used in biomedical research and clinical diagnostics as a fast and accurate tool for the detection of short genetic variations. However, identification of larger variations such as structural variants and copy number variations (CNVs) remains a challenge for targeted MPS. Some approaches and tools for structural variant detection have been proposed, but they have limitations and often require datasets of a certain type, size and expected number of amplicons affected by CNVs. In this paper, we describe a novel algorithm for high-resolution germinal CNV detection in PCR-enriched targeted sequencing data and present an accompanying tool. We have developed a machine learning algorithm for the detection of large duplications and deletions in targeted sequencing data generated with a PCR-based enrichment step. We have performed verification studies and established the algorithm's sensitivity and specificity. We have compared the developed tool with other available methods applicable to the described data and found that it performs better. We showed that our method has high specificity and sensitivity for high-resolution copy number detection in targeted sequencing data using a large cohort of samples.

  1. Characterization of Antibacterial and Hemolytic Activity of Synthetic Pandinin 2 Variants and Their Inhibition against Mycobacterium tuberculosis

    PubMed Central

    Rodríguez, Alexis; Villegas, Elba; Montoya-Rosales, Alejandra; Rivas-Santiago, Bruno; Corzo, Gerardo

    2014-01-01

    Containing and treating Mycobacterium tuberculosis and other bacteria that cause infectious diseases requires new types of antibiotics. Pandinin 2 (Pin2) is a highly hemolytic scorpion venom antimicrobial peptide that has a central proline residue. This residue forms a structural “kink” linked to its pore-forming activity towards human erythrocytes. In this work, the residue Pro14 of Pin2 was both substituted and flanked using glycine residues (P14G and P14GPG) based on the low hemolytic activities of antimicrobial peptides with structural motifs Gly and GlyProGly such as magainin 2 and ponericin G1, respectively. The two Pin2 variants showed antimicrobial activity against E. coli, S. aureus, and M. tuberculosis. However, Pin2 [GPG] was less hemolytic (30%) than the Pin2 [G] variant. In addition, based on the primary structure of Pin2 [G] and Pin2 [GPG], two short peptide variants were designed and chemically synthesized with attention to their physicochemical properties such as hydrophobicity and propensity to adopt alpha-helical conformations. The aim of designing these two short antimicrobial peptides was to avoid the cost associated with the synthesis of long peptides. The short Pin2 variants, named Pin2 [14] and Pin2 [17], showed antibiotic activity against E. coli and M. tuberculosis. Moreover, Pin2 [14] caused only 25% hemolysis of human erythrocytes at concentrations as high as 100 µM, while the peptide Pin2 [17] did not show any hemolytic effect at the same concentration. Furthermore, on a molar basis these short antimicrobial peptides had better activity against multidrug-resistant M. tuberculosis than the conventional antibiotics ethambutol, isoniazid and rifampicin. Therefore, Pin2 [14] and Pin2 [17] have the potential to be used as alternative antibiotics and anti-tuberculosis agents with reduced hemolytic effects. PMID:25019413

  2. Identification of missing variants by combining multiple analytic pipelines.

    PubMed

    Ren, Yingxue; Reddy, Joseph S; Pottier, Cyril; Sarangi, Vivekananda; Tian, Shulan; Sinnwell, Jason P; McDonnell, Shannon K; Biernacka, Joanna M; Carrasquillo, Minerva M; Ross, Owen A; Ertekin-Taner, Nilüfer; Rademakers, Rosa; Hudson, Matthew; Mainzer, Liudmila Sergeevna; Asmann, Yan W

    2018-04-16

    After decades of identifying risk factors using array-based genome-wide association studies (GWAS), genetic research of complex diseases has shifted to sequencing-based rare variants discovery. This requires large sample sizes for statistical power and has brought up questions about whether the current variant calling practices are adequate for large cohorts. It is well-known that there are discrepancies between variants called by different pipelines, and that using a single pipeline always misses true variants exclusively identifiable by other pipelines. Nonetheless, it is common practice today to call variants by one pipeline due to computational cost and assume that false negative calls are a small percent of total. We analyzed 10,000 exomes from the Alzheimer's Disease Sequencing Project (ADSP) using multiple analytic pipelines consisting of different read aligners and variant calling strategies. We compared variants identified by using two aligners in 50, 100, 200, 500, 1000, and 1952 samples; and compared variants identified by adding single-sample genotyping to the default multi-sample joint genotyping in 50, 100, 500, 2000, 5000 and 10,000 samples. We found that using a single pipeline missed increasing numbers of high-quality variants correlated with sample sizes. By combining two read aligners and two variant calling strategies, we rescued 30% of pass-QC variants at sample size of 2000, and 56% at 10,000 samples. The rescued variants had higher proportions of low frequency (minor allele frequency [MAF] 1-5%) and rare (MAF < 1%) variants, which are the very type of variants of interest. In 660 Alzheimer's disease cases with earlier onset ages of ≤65, 4 out of 13 (31%) previously-published rare pathogenic and protective mutations in APP, PSEN1, and PSEN2 genes were undetected by the default one-pipeline approach but recovered by the multi-pipeline approach.
Identification of the complete variant set from sequencing data is the prerequisite of genetic association analyses. The current analytic practice of calling genetic variants from sequencing data using a single bioinformatics pipeline is no longer adequate with the increasingly large projects. The number and percentage of quality variants that passed quality filters but are missed by the one-pipeline approach rapidly increased with sample size.

  3. Use of stabilizing mutations to engineer a charged group within a ligand-binding hydrophobic cavity in T4 lysozyme.

    PubMed

    Liu, Lijun; Baase, Walter A; Michael, Miya M; Matthews, Brian W

    2009-09-22

    Both large-to-small and nonpolar-to-polar mutations in the hydrophobic core of T4 lysozyme cause significant loss in stability. By including supplementary stabilizing mutations we constructed a variant that combines the cavity-creating substitution Leu99 → Ala with the buried charge mutant Met102 → Glu. Crystal structure determination confirmed that this variant has a large cavity with the side chain of Glu102 located within the cavity wall. The cavity includes a large disk-shaped region plus a bulge. The disk-like region is essentially nonpolar, similar to L99A, while the Glu102 substituent is located in the vicinity of the bulge. Three ordered water molecules bind within this part of the cavity and appear to stabilize the conformation of Glu102. Glu102 has an estimated pKa of about 5.5-6.5, suggesting that it is at least partially charged in the crystal structure. The polar ligands pyridine, phenol and aniline bind within the cavity, and crystal structures of the complexes show one or two water molecules to be retained. Nonpolar ligands of appropriate shape can also bind in the cavity and in some cases exclude all three water molecules. This disrupts the hydrogen-bond network and causes the Glu102 side chain to move away from the ligand by up to 0.8 Å where it remains buried in a completely nonpolar environment. Isothermal titration calorimetry revealed that the binding of these compounds stabilizes the protein by 4-6 kcal/mol. For both polar and nonpolar ligands the binding is enthalpically driven. Large negative changes in entropy adversely balance the binding of the polar ligands, whereas entropy has little effect on the nonpolar ligand binding.

  4. Rare coding variants in the phospholipase D3 gene confer risk for Alzheimer's disease

    NASA Astrophysics Data System (ADS)

    2014-01-01

    Genome-wide association studies (GWAS) have identified several risk variants for late-onset Alzheimer's disease (LOAD). These common variants have replicable but small effects on LOAD risk and generally do not have obvious functional effects. Low-frequency coding variants, not detected by GWAS, are predicted to include functional variants with larger effects on risk. To identify low-frequency coding variants with large effects on LOAD risk, we carried out whole-exome sequencing (WES) in 14 large LOAD families and follow-up analyses of the candidate variants in several large LOAD case-control data sets. A rare variant in PLD3 (phospholipase D3; Val232Met) segregated with disease status in two independent families and doubled risk for Alzheimer's disease in seven independent case-control series with a total of more than 11,000 cases and controls of European descent. Gene-based burden analyses in 4,387 cases and controls of European descent and 302 African American cases and controls, with complete sequence data for PLD3, reveal that several variants in this gene increase risk for Alzheimer's disease in both populations. PLD3 is highly expressed in brain regions that are vulnerable to Alzheimer's disease pathology, including hippocampus and cortex, and is expressed at significantly lower levels in neurons from Alzheimer's disease brains compared to control brains. Overexpression of PLD3 leads to a significant decrease in intracellular amyloid-β precursor protein (APP) and extracellular Aβ42 and Aβ40 (the 42- and 40-residue isoforms of the amyloid-β peptide), and knockdown of PLD3 leads to a significant increase in extracellular Aβ42 and Aβ40. Together, our genetic and functional data indicate that carriers of PLD3 coding variants have a twofold increased risk for LOAD and that PLD3 influences APP processing. This study provides an example of how densely affected families may help to identify rare variants with large effects on risk for disease or other complex traits.

  5. Rare coding variants in the phospholipase D3 gene confer risk for Alzheimer's disease.

    PubMed

    Cruchaga, Carlos; Karch, Celeste M; Jin, Sheng Chih; Benitez, Bruno A; Cai, Yefei; Guerreiro, Rita; Harari, Oscar; Norton, Joanne; Budde, John; Bertelsen, Sarah; Jeng, Amanda T; Cooper, Breanna; Skorupa, Tara; Carrell, David; Levitch, Denise; Hsu, Simon; Choi, Jiyoon; Ryten, Mina; Sassi, Celeste; Bras, Jose; Gibbs, Raphael J; Hernandez, Dena G; Lupton, Michelle K; Powell, John; Forabosco, Paola; Ridge, Perry G; Corcoran, Christopher D; Tschanz, JoAnn T; Norton, Maria C; Munger, Ronald G; Schmutz, Cameron; Leary, Maegan; Demirci, F Yesim; Bamne, Mikhil N; Wang, Xingbin; Lopez, Oscar L; Ganguli, Mary; Medway, Christopher; Turton, James; Lord, Jenny; Braae, Anne; Barber, Imelda; Brown, Kristelle; Pastor, Pau; Lorenzo-Betancor, Oswaldo; Brkanac, Zoran; Scott, Erick; Topol, Eric; Morgan, Kevin; Rogaeva, Ekaterina; Singleton, Andy; Hardy, John; Kamboh, M Ilyas; George-Hyslop, Peter St; Cairns, Nigel; Morris, John C; Kauwe, John S K; Goate, Alison M

    2014-01-23

    Genome-wide association studies (GWAS) have identified several risk variants for late-onset Alzheimer's disease (LOAD). These common variants have replicable but small effects on LOAD risk and generally do not have obvious functional effects. Low-frequency coding variants, not detected by GWAS, are predicted to include functional variants with larger effects on risk. To identify low-frequency coding variants with large effects on LOAD risk, we carried out whole-exome sequencing (WES) in 14 large LOAD families and follow-up analyses of the candidate variants in several large LOAD case-control data sets. A rare variant in PLD3 (phospholipase D3; Val232Met) segregated with disease status in two independent families and doubled risk for Alzheimer's disease in seven independent case-control series with a total of more than 11,000 cases and controls of European descent. Gene-based burden analyses in 4,387 cases and controls of European descent and 302 African American cases and controls, with complete sequence data for PLD3, reveal that several variants in this gene increase risk for Alzheimer's disease in both populations. PLD3 is highly expressed in brain regions that are vulnerable to Alzheimer's disease pathology, including hippocampus and cortex, and is expressed at significantly lower levels in neurons from Alzheimer's disease brains compared to control brains. Overexpression of PLD3 leads to a significant decrease in intracellular amyloid-β precursor protein (APP) and extracellular Aβ42 and Aβ40 (the 42- and 40-residue isoforms of the amyloid-β peptide), and knockdown of PLD3 leads to a significant increase in extracellular Aβ42 and Aβ40. Together, our genetic and functional data indicate that carriers of PLD3 coding variants have a twofold increased risk for LOAD and that PLD3 influences APP processing. This study provides an example of how densely affected families may help to identify rare variants with large effects on risk for disease or other complex traits.

  6. Structural assemblies of the di- and oligomeric G-protein coupled receptor TGR5 in live cells: an MFIS-FRET and integrative modelling study

    NASA Astrophysics Data System (ADS)

    Greife, Annemarie; Felekyan, Suren; Ma, Qijun; Gertzen, Christoph G. W.; Spomer, Lina; Dimura, Mykola; Peulen, Thomas O.; Wöhler, Christina; Häussinger, Dieter; Gohlke, Holger; Keitel, Verena; Seidel, Claus A. M.

    2016-11-01

    TGR5 is the first identified bile acid-sensing G-protein coupled receptor, which has emerged as a potential therapeutic target for metabolic disorders. So far, structural and multimerization properties are largely unknown for TGR5. We used a combined strategy applying cellular biology, Multiparameter Image Fluorescence Spectroscopy (MFIS) for quantitative FRET analysis, and integrative modelling to obtain structural information about dimerization and higher-order oligomerization assemblies of TGR5 wildtype (wt) and Y111 variants fused to fluorescent proteins. Residue 111 is located in transmembrane helix 3 within the highly conserved ERY motif. Co-immunoprecipitation and MFIS-FRET measurements with gradually increasing acceptor to donor concentrations showed that TGR5 wt forms higher-order oligomers, a process disrupted in TGR5 Y111A variants. From the concentration dependence of the MFIS-FRET data we conclude that higher-order oligomers - likely with a tetramer organization - are formed from dimers, the smallest unit suggested for TGR5 Y111A variants. Higher-order oligomers likely have a linear arrangement with interaction sites involving transmembrane helix 1 and helix 8 as well as transmembrane helix 5. The latter interaction is suggested to be disrupted by the Y111A mutation. The proposed model of TGR5 oligomer assembly broadens our view of possible oligomer patterns and affinities of class A GPCRs.

  7. Structural variants of yeast prions show conformer-specific requirements for chaperone activity

    PubMed Central

    Stein, Kevin C.; True, Heather L.

    2016-01-01

    Molecular chaperones monitor protein homeostasis and defend against the misfolding and aggregation of proteins that is associated with protein conformational disorders. In these diseases, a variety of different aggregate structures can form. These are called prion strains, or variants, in prion diseases, and cause variation in disease pathogenesis. Here, we use variants of the yeast prions [RNQ+] and [PSI+] to explore the interactions of chaperones with distinct aggregate structures. We found that prion variants show striking variation in their relationship with Hsp40s. Specifically, the yeast Hsp40 Sis1, and its human ortholog Hdj1, had differential capacities to process prion variants, suggesting that Hsp40 selectivity has likely changed through evolution. We further show that such selectivity involves different domains of Sis1, with some prion conformers having a greater dependence on particular Hsp40 domains. Moreover, [PSI+] variants were more sensitive to certain alterations in Hsp70 activity as compared to [RNQ+] variants. Collectively, our data indicate that distinct chaperone machinery is required, or has differential capacity, to process different aggregate structures. Elucidating the intricacies of chaperone-client interactions, and how these are altered by particular client structures, will be crucial to understanding how this system can go awry in disease and contribute to pathological variation. PMID:25060529

  8. Rare genetic variants and the risk of cancer.

    PubMed

    Bodmer, Walter; Tomlinson, Ian

    2010-06-01

    There are good reasons to expect that common genetic variants do not explain all of the inherited risk of the common cancers, not least of these being the relatively low proportion of familial relative risk that common cancer SNPs currently explain. One promising source of the unexplained risk is rare, low-penetrance genetic variants, a class that ranges from low-frequency polymorphisms (allele frequency < 5%) through subpolymorphic variants (frequency 0.1-1.0%) to very low frequency or 'private' variants with frequencies of 0.1% or less. Examples of rare cancer variants include breast cancer susceptibility loci CHEK2, BRIP1 and PALB2. There are considerable challenges associated with the discovery and testing of rare predisposition alleles, many of which are illustrated by the issues associated with variants of unknown significance in the Mendelian cancer predisposition genes. However, whilst cost constraints remain, the technological barriers to rare variant discovery and large-scale genotyping no longer exist. If each individual carries many disease-causing rare variants, the so-called missing heritability of cancer might largely be explained. Whether or not rare variants do end up filling the heritability gap, it is imperative to look for them alongside common variants.

  9. Macroscopic inhomogeneous deformation behavior arising in single crystal Ni-Mn-Ga foils under tensile loading

    NASA Astrophysics Data System (ADS)

    Murasawa, Go; Yeduru, Srinivasa R.; Kohl, Manfred

    2016-12-01

    This study investigated macroscopic inhomogeneous deformation occurring in single-crystal Ni-Mn-Ga foils under uniaxial tensile loading. Two types of single-crystal Ni-Mn-Ga foil samples were examined: as-received and after thermo-mechanical training. Local strain and the strain field were measured under tensile loading using laser speckle and digital image correlation. The as-received sample showed a strongly inhomogeneous, intermittent strain field under progressive deformation, whereas the trained sample showed a homogeneous strain field over the whole specimen surface. The as-received sample is in a mainly polycrystalline-like state composed of domain structures, with many domain boundaries and large domain structures in the body. This structure would cause intermittent nucleation of large local strain bands. The trained sample, by contrast, is in an ideal single-crystalline state with a preferential orientation of transformation variants, because almost all domain boundaries and large domain structures vanish during thermo-mechanical training. As a result, macroscopic homogeneous deformation occurs on the trained sample surface during deformation.

  10. Atypical face shape and genomic structural variants in epilepsy

    PubMed Central

    Chinthapalli, Krishna; Bartolini, Emanuele; Novy, Jan; Suttie, Michael; Marini, Carla; Falchi, Melania; Fox, Zoe; Clayton, Lisa M. S.; Sander, Josemir W.; Guerrini, Renzo; Depondt, Chantal; Hennekam, Raoul; Hammond, Peter

    2012-01-01

    Many pathogenic structural variants of the human genome are known to cause facial dysmorphism. During the past decade, pathogenic structural variants have also been found to be an important class of genetic risk factor for epilepsy. In other fields, face shape has been assessed objectively using 3D stereophotogrammetry and dense surface models. We hypothesized that computer-based analysis of 3D face images would detect subtle facial abnormality in people with epilepsy who carry pathogenic structural variants as determined by chromosome microarray. In 118 children and adults attending three European epilepsy clinics, we used an objective measure called Face Shape Difference to show that those with pathogenic structural variants have a significantly more atypical face shape than those without such variants. This is true when analysing the whole face, or the periorbital region or the perinasal region alone. We then tested the predictive accuracy of our measure in a second group of 63 patients. Using a minimum threshold to detect face shape abnormalities with pathogenic structural variants, we found high sensitivity (4/5, 80% for whole face; 3/5, 60% for periorbital and perinasal regions) and specificity (45/58, 78% for whole face and perinasal regions; 40/58, 69% for periorbital region). We show that the results do not seem to be affected by facial injury, facial expression, intellectual disability, drug history or demographic differences. Finally, we use bioinformatics tools to explore relationships between facial shape and gene expression within the developing forebrain. Stereophotogrammetry and dense surface models are powerful, objective, non-contact methods of detecting relevant face shape abnormalities. We demonstrate that they are useful in identifying atypical face shape in adults or children with structural variants, and they may give insights into the molecular genetics of facial development. PMID:22975390

  11. SNPnexus: assessing the functional relevance of genetic variation to facilitate the promise of precision medicine.

    PubMed

    Dayem Ullah, Abu Z; Oscanoa, Jorge; Wang, Jun; Nagano, Ai; Lemoine, Nicholas R; Chelala, Claude

    2018-05-11

    Broader functional annotation of genetic variation is a valuable means for prioritising phenotypically-important variants in further disease studies and large-scale genotyping projects. We developed SNPnexus to meet this need by assessing the potential significance of known and novel SNPs on the major transcriptome, proteome, regulatory and structural variation models. Since its previous release in 2012, we have made significant improvements to the annotation categories and updated the query and data viewing systems. The most notable changes include broader functional annotation of noncoding variants and expanding annotations to the most recent human genome assembly GRCh38/hg38. SNPnexus has now integrated rich resources from ENCODE and Roadmap Epigenomics Consortium to map and annotate the noncoding variants onto different classes of regulatory regions and noncoding RNAs as well as providing their predicted functional impact from eight popular non-coding variant scoring algorithms and computational methods. A novel functionality offered now is the support for neo-epitope predictions from leading tools to facilitate its use in immunotherapeutic applications. These updates to SNPnexus are in preparation for its future expansion towards a fully comprehensive computational workflow for disease-associated variant prioritization from sequencing data, placing its users at the forefront of translational research. SNPnexus is freely available at http://www.snp-nexus.org.

  12. Large differences in proportions of harmful and benign amino acid substitutions between proteins and diseases.

    PubMed

    Schaafsma, Gerard C P; Vihinen, Mauno

    2017-07-01

    Genes and proteins are known to have differences in their sensitivity to alterations. Despite numerous sequencing studies, proportions of harmful and harmless substitutions are not known for proteins and groups of proteins. To address this question, we predicted the outcome for all possible single amino acid substitutions (AASs) in nine representative protein groups by using the PON-P2 method. The effects on 996 proteins were studied and vast differences were noticed. Proteins in the cancer group harbor the largest proportion of harmful variants (42.1%), whereas the non-disease group of proteins not known to have a disease association and not involved in the housekeeping functions had the lowest number of harmful variants (4.2%). Differences in the proportions of the harmful and benign variants are wide within each group, but they still show clear differences between the groups. Frequently appearing protein domains show a wide spectrum of variant frequencies, whereas no major protein structural class-specific differences were noticed. AAS types in the original and variant residues showed distinctive patterns, which are shared by all the protein groups. The observations are relevant for understanding genetic bases of diseases, variation interpretation, and for the development of methods for that purpose. © 2017 Wiley Periodicals, Inc.

  13. Active Narrow-Band Vibration Isolation of Large Engineering Structures

    NASA Technical Reports Server (NTRS)

    Rahman, Zahidul; Spanos, John

    1994-01-01

    We present a narrow-band tracking control method using a variant of the Least Mean Squares (LMS) algorithm to isolate slowly changing periodic disturbances from engineering structures. The advantage of the algorithm is that it has a simple architecture and is relatively easy to implement while it can isolate disturbances on the order of 40-50 dB over decades of frequency band. We also present the results of an experiment conducted on a flexible truss structure. The average disturbance rejection achieved is over 40 dB over the frequency band of 5 Hz to 50 Hz.
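
    The narrow-band idea in the abstract above can be illustrated with a textbook two-weight LMS canceller. This is a minimal sketch, not the authors' variant of the algorithm: it assumes the disturbance frequency is known, and the function names (`lms_narrowband_cancel`, `rejection_db`) and the step size are illustrative choices that do not come from the record.

```python
import math


def lms_narrowband_cancel(signal, freq_hz, sample_rate_hz, step_size=0.01):
    """Cancel a sinusoidal disturbance of known frequency with two LMS weights.

    A sine/cosine reference pair at the disturbance frequency is weighted
    and subtracted from the measured signal; the weights adapt so that the
    residual (error) is driven toward zero.
    """
    w_sin, w_cos = 0.0, 0.0
    residual = []
    for n, measured in enumerate(signal):
        phase = 2.0 * math.pi * freq_hz * n / sample_rate_hz
        x_sin, x_cos = math.sin(phase), math.cos(phase)
        estimate = w_sin * x_sin + w_cos * x_cos
        error = measured - estimate
        # Standard LMS update: move each weight along error * reference.
        w_sin += 2.0 * step_size * error * x_sin
        w_cos += 2.0 * step_size * error * x_cos
        residual.append(error)
    return residual


def rejection_db(before, after):
    """Disturbance rejection in dB, computed from RMS levels."""
    def rms(values):
        return math.sqrt(sum(v * v for v in values) / len(values))
    return 20.0 * math.log10(rms(before) / max(rms(after), 1e-300))


# Usage: a 10 Hz disturbance with unknown phase, sampled at 1 kHz.
fs = 1000.0
disturbance = [math.sin(2.0 * math.pi * 10.0 * n / fs + 0.7) for n in range(5000)]
residual = lms_narrowband_cancel(disturbance, 10.0, fs)
print(round(rejection_db(disturbance[-1000:], residual[-1000:]), 1), "dB")
```

    A smaller step size narrows the effective notch and slows adaptation, which is the usual trade-off when the disturbance frequency drifts slowly.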

  14. Lithium and GSK3-β Promoter Gene Variants Influence White Matter Microstructure in Bipolar Disorder

    PubMed Central

    Benedetti, Francesco; Bollettini, Irene; Barberi, Ignazio; Radaelli, Daniele; Poletti, Sara; Locatelli, Clara; Pirovano, Adele; Lorenzi, Cristina; Falini, Andrea; Colombo, Cristina; Smeraldi, Enrico

    2013-01-01

    Lithium is the mainstay for the treatment of bipolar disorder (BD) and inhibits glycogen synthase kinase 3-β (GSK3-β). The less active GSK3-β promoter gene variants have been associated with less detrimental clinical features of BD. GSK3-β gene variants and lithium can influence brain gray matter structure in psychiatric conditions. Diffusion tensor imaging (DTI) measures of white matter (WM) integrity showed widespread disruption of WM structure in BD. In a sample of 70 patients affected by a major depressive episode in the course of BD, we investigated the effect of ongoing long-term lithium treatment and GSK3-β promoter rs334558 polymorphism on WM microstructure, using DTI and tract-based spatial statistics with threshold-free cluster enhancement. We report that the less active GSK3-β rs334558*C gene-promoter variants, and the long-term administration of the GSK3-β inhibitor lithium, were associated with increases of DTI measures of axial diffusivity (AD) in several WM fiber tracts, including corpus callosum, forceps major, anterior and posterior cingulum bundle (bilaterally including its hippocampal part), left superior and inferior longitudinal fasciculus, left inferior fronto-occipital fasciculus, left posterior thalamic radiation, bilateral superior and posterior corona radiata, and bilateral corticospinal tract. AD reflects the integrity of axons and myelin sheaths. We suggest that GSK3-β inhibition and lithium could counteract the detrimental influences of BD on WM structure, with specific benefits resulting from effects on specific WM tracts contributing to the functional integrity of the brain and involving interhemispheric, limbic, and large frontal, parietal, and fronto-occipital connections. PMID:22990942

  15. GWAS-identified risk variants for major depressive disorder: Preliminary support for an association with late-life depressive symptoms and brain structural alterations.

    PubMed

    Ryan, Joanne; Artero, Sylvaine; Carrière, Isabelle; Maller, Jerome J; Meslin, Chantal; Ritchie, Karen; Ancelin, Marie-Laure

    2016-01-01

    A number of genome-wide association studies (GWAS) have investigated risk factors for major depressive disorder (MDD); however, there has been little attempt to replicate these findings in population-based studies of depressive symptoms. Variants within three genes, BICC1, PCLO and GRM7, were selected for replication in our study based on the following criteria: they were identified in a prior MDD GWAS study; a subsequent study found evidence that they influenced depression risk; and there is a solid biological basis for a role in depression. We first investigated whether these variants were associated with depressive symptoms in our population-based cohort of 929 elderly individuals (238 with clinical depressive symptoms and 691 controls), and then investigated associations with structural brain alterations. A number of nominally significant associations were identified, but none reached Bonferroni-corrected significance levels. Common SNPs in BICC1 and PCLO were associated with a 50% and 30% decreased risk of depression, respectively. PCLO rs2522833 was also associated with the volume of grey matter (p=1.6×10⁻³), and to a lesser extent with hippocampal volume and white matter lesions. Among depressed individuals rs9870680 (GRM7) was associated with the volume of grey and white matter (p=10⁻⁴ and 8.3×10⁻³, respectively). Our results provide some support for the involvement of BICC1 and PCLO in late-life depressive disorders and preliminary evidence that these genetic variants may also influence brain structural volumes. However, effect sizes remain modest and associations did not reach corrected significance levels. Further large imaging studies are needed to confirm our findings. Copyright © 2015 Elsevier B.V. and ECNP. All rights reserved.

  16. Towards practical multiscale approach for analysis of reinforced concrete structures

    NASA Astrophysics Data System (ADS)

    Moyeda, Arturo; Fish, Jacob

    2017-12-01

    We present a novel multiscale approach for analysis of reinforced concrete structural elements that overcomes two major hurdles in utilization of multiscale technologies in practice: (1) coupling between material and structural scales due to consideration of large representative volume elements (RVE), and (2) computational complexity of solving complex nonlinear multiscale problems. The former is accomplished using a variant of computational continua framework that accounts for sizeable reinforced concrete RVEs by adjusting the location of quadrature points. The latter is accomplished by means of reduced order homogenization customized for structural elements. The proposed multiscale approach has been verified against direct numerical simulations and validated against experimental results.

  17. Three-dimensional spatial analysis of missense variants in RTEL1 identifies pathogenic variants in patients with Familial Interstitial Pneumonia.

    PubMed

    Sivley, R Michael; Sheehan, Jonathan H; Kropski, Jonathan A; Cogan, Joy; Blackwell, Timothy S; Phillips, John A; Bush, William S; Meiler, Jens; Capra, John A

    2018-01-23

    Next-generation sequencing of individuals with genetic diseases often detects candidate rare variants in numerous genes, but determining which are causal remains challenging. We hypothesized that the spatial distribution of missense variants in protein structures contains information about function and pathogenicity that can help prioritize variants of unknown significance (VUS) and elucidate the structural mechanisms leading to disease. To illustrate this approach in a clinical application, we analyzed 13 candidate missense variants in regulator of telomere elongation helicase 1 (RTEL1) identified in patients with Familial Interstitial Pneumonia (FIP). We curated pathogenic and neutral RTEL1 variants from the literature and public databases. We then used homology modeling to construct a 3D structural model of RTEL1 and mapped known variants into this structure. We next developed a pathogenicity prediction algorithm based on proximity to known disease causing and neutral variants and evaluated its performance with leave-one-out cross-validation. We further validated our predictions with segregation analyses, telomere lengths, and mutagenesis data from the homologous XPD protein. Our algorithm for classifying RTEL1 VUS based on spatial proximity to pathogenic and neutral variation accurately distinguished 7 known pathogenic from 29 neutral variants (ROC AUC = 0.85) in the N-terminal domains of RTEL1. Pathogenic proximity scores were also significantly correlated with effects on ATPase activity (Pearson r = -0.65, p = 0.0004) in XPD, a related helicase. Applying the algorithm to 13 VUS identified from sequencing of RTEL1 from patients predicted five out of six disease-segregating VUS to be pathogenic. We provide structural hypotheses regarding how these mutations may disrupt RTEL1 ATPase and helicase function. Spatial analysis of missense variation accurately classified candidate VUS in RTEL1 and suggests how such variants cause disease. Incorporating spatial proximity analyses into other pathogenicity prediction tools may improve accuracy for other genes and genetic diseases.
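
    The general shape of a proximity-based classifier with leave-one-out evaluation, as described in the abstract above, might look like the sketch below. It is not the authors' algorithm: the exponential distance weighting, the `scale` parameter, the function names and the toy coordinates are all assumptions made for illustration, and no RTEL1 data are used.

```python
import math


def proximity_score(query, pathogenic, neutral, scale=10.0):
    """Hypothetical score: mean closeness to pathogenic sites minus neutral.

    Closeness is exp(-distance / scale), with coordinates in the same unit
    as `scale`. Positive scores mean the query residue sits nearer to the
    known pathogenic sites than to the known neutral ones.
    """
    def mean_closeness(sites):
        if not sites:
            return 0.0
        return sum(math.exp(-math.dist(query, s) / scale) for s in sites) / len(sites)

    return mean_closeness(pathogenic) - mean_closeness(neutral)


def leave_one_out_scores(pathogenic, neutral, scale=10.0):
    """Score every labelled site with that site held out of its own class."""
    scored = []
    for i, site in enumerate(pathogenic):
        rest = pathogenic[:i] + pathogenic[i + 1:]
        scored.append((proximity_score(site, rest, neutral, scale), 1))
    for i, site in enumerate(neutral):
        rest = neutral[:i] + neutral[i + 1:]
        scored.append((proximity_score(site, pathogenic, rest, scale), 0))
    return scored


def roc_auc(scored):
    """ROC AUC as the fraction of (positive, negative) pairs ranked correctly."""
    pos = [s for s, label in scored if label == 1]
    neg = [s for s, label in scored if label == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Toy coordinates (made up): two well-separated clusters of residues.
toy_pathogenic = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 1.0, 0.0)]
toy_neutral = [(30.0, 0.0, 0.0), (31.0, 0.0, 0.0), (30.0, 1.0, 0.0)]
print(roc_auc(leave_one_out_scores(toy_pathogenic, toy_neutral)))
```

    Holding each site out of its own class before scoring it keeps a variant from contributing to its own prediction, which is the point of the leave-one-out step.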

  18. CD and MCD studies of the effects of component B variant binding on the biferrous active site of methane monooxygenase.

    PubMed

    Mitić, Natasa; Schwartz, Jennifer K; Brazeau, Brian J; Lipscomb, John D; Solomon, Edward I

    2008-08-12

    The multicomponent soluble form of methane monooxygenase (sMMO) catalyzes the oxidation of methane through the activation of O2 at a nonheme biferrous center in the hydroxylase component, MMOH. Reactivity is limited without binding of the sMMO effector protein, MMOB. Past studies show that mutations of specific MMOB surface residues cause large changes in the rates of individual steps in the MMOH reaction cycle. To define the structural and mechanistic bases for these observations, CD, MCD, and VTVH MCD spectroscopies coupled with ligand-field (LF) calculations are used to elucidate changes occurring near and at the MMOH biferrous cluster upon binding of MMOB and the MMOB variants. Perturbations to both the CD and MCD are observed upon binding wild-type MMOB and the MMOB variant that similarly increases O2 reactivity. MMOB variants that do not greatly increase O2 reactivity fail to cause one or both of these changes. LF calculations indicate that reorientation of the terminal glutamate on Fe2 reproduces the spectral perturbations in MCD. Although this structural change allows O2 to bridge the diiron site and shifts the redox active orbitals for good overlap, it is not sufficient for enhanced O2 reactivity of the enzyme. Binding of the T111Y-MMOB variant to MMOH induces the MCD, but not CD changes, and causes only a small increase in reactivity. Thus, both the geometric rearrangement at Fe2 (observed in MCD) coupled with a more global conformational change that may control O2 access (probed by CD), induced by MMOB binding, are critical factors in the reactivity of sMMO.

  19. Coagulation factor VII variants resistant to inhibitory antibodies.

    PubMed

    Branchini, Alessio; Baroni, Marcello; Pfeiffer, Caroline; Batorova, Angelika; Giansily-Blaizot, Muriel; Schved, Jean F; Mariani, Guglielmo; Bernardi, Francesco; Pinotti, Mirko

    2014-11-01

    Replacement therapy is currently used to prevent and treat bleeding episodes in coagulation factor deficiencies. However, structural differences between the endogenous and therapeutic proteins might increase the risk for immune complications. This study was aimed at identifying factor (F)VII variants resistant to inhibitory antibodies developed after treatment with recombinant activated factor VII (rFVIIa) in a FVII-deficient patient homozygous for the p.A354V-p.P464Hfs mutation, which predicts trace levels of an elongated FVII variant in plasma. We performed fluorescent bead-based binding, ELISA-based competition as well as fluorogenic functional (activated FX and thrombin generation) assays in plasma and with recombinant proteins. We found that antibodies displayed higher affinity for the active than for the zymogen FVII (half-maximal binding at 0.54 ± 0.04 and 0.78 ± 0.07 BU/ml, respectively), and inhibited the coagulation initiation phase with second-order kinetics. Isotypic analysis showed a polyclonal response with a large predominance of IgG1. We hypothesised that structural differences in the carboxyl-terminus between the inherited FVII and the therapeutic molecules contributed to the immune response. Intriguingly, a naturally-occurring, poorly secreted and 5-residue truncated FVII (FVII-462X) escaped inhibition. Among a series of truncated rFVII molecules, we identified a well-secreted and catalytically competent variant (rFVII-464X) with reduced binding to antibodies (half-maximal binding at 0.198 ± 0.003 BU/ml) as compared to the rFVII-wt (0.032 ± 0.002 BU/ml), which led to a 40-fold reduced inhibition in activated FX generation assays. Taken together, our results provide a paradigmatic example of mutation-related inhibitory antibodies, strongly support the FVII carboxyl-terminus as their main target and identify inhibitor-resistant FVII variants.

  20. VariantBam: filtering and profiling of next-generational sequencing data using region-specific rules.

    PubMed

    Wala, Jeremiah; Zhang, Cheng-Zhong; Meyerson, Matthew; Beroukhim, Rameen

    2016-07-01

    We developed VariantBam, a C++ read filtering and profiling tool for use with BAM, CRAM and SAM sequencing files. VariantBam provides a flexible framework for extracting sequencing reads or read-pairs that satisfy combinations of rules, defined by any number of genomic intervals or variant sites. We have implemented filters based on alignment data, sequence motifs, regional coverage and base quality. For example, VariantBam achieved a median size reduction ratio of 3.1:1 when applied to 10 lung cancer whole genome BAMs by removing large tags and selecting for only high-quality variant-supporting reads and reads matching a large dictionary of sequence motifs. Thus VariantBam enables efficient storage of sequencing data while preserving the most relevant information for downstream analysis. VariantBam and full documentation are available at github.com/jwalabroad/VariantBam. Contact: rameen@broadinstitute.org. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved.
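
    The idea of region-specific rules in the abstract above can be sketched in a few lines. The code below is a conceptual illustration only and does not reproduce VariantBam's rule syntax, options or C++ API: the `Read` and `RegionRule` types, their fields and `filter_reads` are hypothetical, and a read is matched to a region by its start position alone.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Read:
    """Hypothetical, simplified view of an aligned read."""
    chrom: str
    pos: int                  # 0-based leftmost aligned position
    mapq: int                 # mapping quality
    mean_base_quality: float
    sequence: str


@dataclass(frozen=True)
class RegionRule:
    """Hypothetical rule: thresholds that apply inside one genomic interval."""
    chrom: str
    start: int                # 0-based, inclusive
    end: int                  # exclusive
    min_mapq: int = 0
    min_base_quality: float = 0.0
    motif: str = ""           # empty string matches every read

    def covers(self, read):
        # Simplification: only the read's start position is checked.
        return read.chrom == self.chrom and self.start <= read.pos < self.end

    def accepts(self, read):
        return (read.mapq >= self.min_mapq
                and read.mean_base_quality >= self.min_base_quality
                and self.motif in read.sequence)


def filter_reads(reads, rules):
    """Keep a read if at least one rule covering it also accepts it."""
    return [r for r in reads
            if any(rule.covers(r) and rule.accepts(r) for rule in rules)]


# Usage: strict mapping quality on chr1:100-200, a motif requirement on chr2.
rules = [
    RegionRule("chr1", 100, 200, min_mapq=30),
    RegionRule("chr2", 0, 1000, motif="TTAGGG"),
]
reads = [
    Read("chr1", 150, 60, 35.0, "ACGT"),
    Read("chr1", 150, 10, 35.0, "ACGT"),
    Read("chr2", 5, 0, 20.0, "AATTAGGGCC"),
    Read("chr3", 5, 60, 40.0, "ACGT"),
]
kept = filter_reads(reads, rules)
print(len(kept), "of", len(reads), "reads kept")
```

    Reads that fall outside every rule's interval are dropped here; a real tool would need an explicit policy for such reads and for reads spanning interval boundaries.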

  1. Comparative evaluation of Populus variants total sugar release and structural features following pretreatment and digestion by two distinct biological systems

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Thomas, Vanessa A.; Kothari, Ninad; Bhagia, Samarthya

    Populus natural variants have been shown to realize a broad range of sugar yields during saccharification; however, the structural features responsible for higher sugar release from natural variants are not clear. In addition, the sugar release patterns resulting from digestion with two distinct biological systems, fungal enzymes and Clostridium thermocellum, have yet to be evaluated and compared. This study evaluates the effect of structural features of three natural variant Populus lines, including the BESC standard line, with respect to the overall process of sugar release for two different biological systems.

  2. Comparative evaluation of Populus variants total sugar release and structural features following pretreatment and digestion by two distinct biological systems

    DOE PAGES

    Thomas, Vanessa A.; Kothari, Ninad; Bhagia, Samarthya; ...

    2017-11-30

    Populus natural variants have been shown to realize a broad range of sugar yields during saccharification; however, the structural features responsible for higher sugar release from natural variants are not clear. In addition, the sugar release patterns resulting from digestion with two distinct biological systems, fungal enzymes and Clostridium thermocellum, have yet to be evaluated and compared. This study evaluates the effect of structural features of three natural variant Populus lines, including the BESC standard line, with respect to the overall process of sugar release for two different biological systems.

  3. Pooled-DNA Sequencing for Elucidating New Genomic Risk Factors, Rare Variants Underlying Alzheimer's Disease.

    PubMed

    Jin, Sheng Chih; Benitez, Bruno A; Deming, Yuetiva; Cruchaga, Carlos

    2016-01-01

    Analyses of genome-wide association studies (GWAS) for complex disorders usually identify common variants with a relatively small effect size that only explain a small proportion of phenotypic heritability. Several studies have suggested that a significant fraction of heritability may be explained by low-frequency (minor allele frequency (MAF) of 1-5 %) and rare variants that are not contained in the commercial GWAS genotyping arrays (Schork et al., Curr Opin Genet Dev 19:212, 2009). Rare variants can also have relatively large effects on risk for developing human diseases or disease phenotype (Cruchaga et al., PLoS One 7:e31039, 2012). However, it is necessary to perform next-generation sequencing (NGS) studies in a large population (>4,000 samples) to detect a significant rare-variant association. Several NGS methods, such as custom capture sequencing and amplicon-based sequencing, are designed to screen a small proportion of the genome, but most of these methods are limited in the number of samples that can be multiplexed (i.e. most sequencing kits only provide 96 distinct indices). Additionally, the sequencing library preparation for 4,000 samples remains expensive and thus conducting NGS studies with the aforementioned methods is not feasible for most research laboratories. The need for low-cost large scale rare-variant detection makes pooled-DNA sequencing an ideally efficient and cost-effective technique to identify rare variants in target regions by sequencing hundreds to thousands of samples. Our recent work has demonstrated that pooled-DNA sequencing can accurately detect rare variants in targeted regions in multiple DNA samples with high sensitivity and specificity (Jin et al., Alzheimers Res Ther 4:34, 2012). In these studies we used a well-established pooled-DNA sequencing approach and a computational package, SPLINTER (short indel prediction by large deviation inference and nonlinear true frequency estimation by recursion) (Vallania et al., Genome Res 20:1711, 2010), for accurate identification of rare variants in large DNA pools. Given an average sequencing coverage of 30× per haploid genome, SPLINTER can detect rare variants and short indels up to 4 base pairs (bp) with high sensitivity and specificity (up to 1 haploid allele in a pool as large as 500 individuals). Step-by-step instructions on how to conduct pooled-DNA sequencing experiments and data analyses are described in this chapter.
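
    As a rough illustration of the pooling arithmetic in the abstract above: a pool of 500 individuals contains 1,000 haploid genomes if the samples are diploid, so a single variant allele has an expected frequency of 0.1%, and 30× coverage per haploid genome implies a total depth of about 30,000× at each targeted position. The sketch below is not part of SPLINTER. The function name is hypothetical, and it assumes diploid samples pooled in equal amounts, which the abstract does not state.

```python
def pool_design(n_individuals, coverage_per_haploid, ploidy=2):
    """Summarize a pooled-sequencing design (illustrative helper, not a SPLINTER function).

    Assumes every individual contributes the same amount of DNA to the pool.
    """
    haploid_genomes = n_individuals * ploidy
    # Total depth needed at each position to reach the per-haploid coverage
    total_depth = haploid_genomes * coverage_per_haploid
    # Expected frequency of an allele carried on exactly one haploid genome
    singleton_freq = 1 / haploid_genomes
    # Expected number of reads supporting that single allele
    expected_variant_reads = total_depth / haploid_genomes
    return haploid_genomes, total_depth, singleton_freq, expected_variant_reads


genomes, depth, freq, reads = pool_design(500, 30)
print(genomes, depth, freq, reads)
```

    Under these assumptions a singleton allele is supported by roughly as many reads as the per-haploid coverage, which is why the detection limit is tied to coverage per haploid genome rather than to pool size alone.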

  4. Integrating 400 million variants from 80,000 human samples with extensive annotations: towards a knowledge base to analyze disease cohorts.

    PubMed

    Hakenberg, Jörg; Cheng, Wei-Yi; Thomas, Philippe; Wang, Ying-Chih; Uzilov, Andrew V; Chen, Rong

    2016-01-08

    Data from a plethora of high-throughput sequencing studies is readily available to researchers, providing genetic variants detected in a variety of healthy and disease populations. While each individual cohort helps gain insights into polymorphic and disease-associated variants, a joint perspective can be more powerful in identifying polymorphisms, rare variants, disease-associations, genetic burden, somatic variants, and disease mechanisms. We have set up a Reference Variant Store (RVS) containing variants observed in a number of large-scale sequencing efforts, such as 1000 Genomes, ExAC, Scripps Wellderly, UK10K; various genotyping studies; and disease association databases. RVS holds extensive annotations pertaining to affected genes, functional impacts, disease associations, and population frequencies. RVS currently stores 400 million distinct variants observed in more than 80,000 human samples. RVS facilitates cross-study analysis to discover novel genetic risk factors, gene-disease associations, potential disease mechanisms, and actionable variants. Due to its large reference populations, RVS can also be employed for variant filtration and gene prioritization. A web interface to public datasets and annotations in RVS is available at https://rvs.u.hpc.mssm.edu/.

  5. Mutation of Phe413 to Tyr in catalase KatE from Escherichia coli leads to side chain damage and main chain cleavage.

    PubMed

    Jha, Vikash; Donald, Lynda J; Loewen, Peter C

    2012-09-15

    The monofunctional catalase KatE of Escherichia coli exhibits exceptional resistance to heat denaturation and proteolytic degradation. During an investigation of subtle conformation changes in Arg111 and Phe413 on the proximal side of the heme induced by H2O2, variants at position R111, T115 and F413 were constructed. Because the residues are not situated in the distal side heme cavity where catalysis occurs, significant changes in reactivity were not expected and indeed, only small changes in the kinetic characteristics were observed in all of the variants. However, the F413Y variant was found to have undergone main chain cleavage whereas the R111A, T115A, F413E and F413K variants had not. Two sites of cleavage were identified in the crystal structure and by mass spectrometry at residues 111 and 115. In addition to main chain cleavage, modifications to the side chains of Tyr413, Thr115 and Arg111 were suggested by differences in the electron density maps compared to maps of the native and inactive variant H128N/F413Y. Neither the inactive variant H128N/F413Y nor the active variant T115A/F413Y exhibited main chain cleavage, and the R111A/F413Y variant exhibited less cleavage. In addition, the apparent modification of three side chains was largely absent in these variants. It is also significant that all three F413 single variants contained heme b, suggesting that the fidelity of the phenyl group was important for mediating heme b oxidation to heme d. The reactions are attributed to the introduction of a new reactive center possibly involving a transient radical on Tyr413 formed during catalytic turnover. Copyright © 2011 Elsevier Inc. All rights reserved.

  6. Intact Protein Analysis at 21 Tesla and X-Ray Crystallography Define Structural Differences in Single Amino Acid Variants of Human Mitochondrial Branched-Chain Amino Acid Aminotransferase 2 (BCAT2)

    NASA Astrophysics Data System (ADS)

    Anderson, Lissa C.; Håkansson, Maria; Walse, Björn; Nilsson, Carol L.

    2017-09-01

    Structural technologies are an essential component in the design of precision therapeutics. Precision medicine entails the development of therapeutics directed toward a designated target protein, with the goal to deliver the right drug to the right patient at the right time. In the field of oncology, protein structural variants are often associated with oncogenic potential. In a previous proteogenomic screen of patient-derived glioblastoma (GBM) tumor materials, we identified a sequence variant of human mitochondrial branched-chain amino acid aminotransferase 2 as a putative factor of resistance of GBM to standard-of-care treatments. The enzyme generates glutamate, which is neurotoxic. To elucidate structural coordinates that may confer altered substrate binding or activity of the variant BCAT2 T186R, a 45 kDa protein, we applied combined ETD and CID top-down mass spectrometry in a LC-FT-ICR MS at 21 T, and X-Ray crystallography in the study of both the variant and non-variant intact proteins. The combined ETD/CID fragmentation pattern allowed for not only extensive sequence coverage but also confident localization of the amino acid variant to its position in the sequence. The crystallographic experiments confirmed the hypothesis generated by in silico structural homology modeling, that the Lys59 side-chain of BCAT2 may repulse the Arg186 in the variant protein (PDB code: 5MPR), leading to destabilization of the protein dimer and altered enzyme kinetics. Taken together, the MS and novel 3D structural data give us reason to further pursue BCAT2 T186R as a precision drug target in GBM.

  7. Variant Interpretation: Functional Assays to the Rescue.

    PubMed

    Starita, Lea M; Ahituv, Nadav; Dunham, Maitreya J; Kitzman, Jacob O; Roth, Frederick P; Seelig, Georg; Shendure, Jay; Fowler, Douglas M

    2017-09-07

    Classical genetic approaches for interpreting variants, such as case-control or co-segregation studies, require finding many individuals with each variant. Because the overwhelming majority of variants are present in only a few living humans, this strategy has clear limits. Fully realizing the clinical potential of genetics requires that we accurately infer pathogenicity even for rare or private variation. Many computational approaches to predicting variant effects have been developed, but they can identify only a small fraction of pathogenic variants with the high confidence that is required in the clinic. Experimentally measuring a variant's functional consequences can provide clearer guidance, but individual assays performed only after the discovery of the variant are both time and resource intensive. Here, we discuss how multiplex assays of variant effect (MAVEs) can be used to measure the functional consequences of all possible variants in disease-relevant loci for a variety of molecular and cellular phenotypes. The resulting large-scale functional data can be combined with machine learning and clinical knowledge for the development of "lookup tables" of accurate pathogenicity predictions. A coordinated effort to produce, analyze, and disseminate large-scale functional data generated by multiplex assays could be essential to addressing the variant-interpretation crisis. Copyright © 2017 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.

  8. Screening of whole genome sequences identified high-impact variants for stallion fertility.

    PubMed

    Schrimpf, Rahel; Gottschalk, Maren; Metzger, Julia; Martinsson, Gunilla; Sieme, Harald; Distl, Ottmar

    2016-04-14

    Stallion fertility is an economically important trait due to the increase of artificial insemination in horses. The availability of whole genome sequence data facilitates identification of rare high-impact variants contributing to stallion fertility. The aim of our study was to genotype rare high-impact variants retrieved from next-generation sequencing (NGS)-data of 11 horses in order to unravel harmful genetic variants in large samples of stallions. Gene ontology (GO) terms and search results from public databases were used to obtain a comprehensive list of human and mouse genes predicted to participate in the regulation of male reproduction. The corresponding equine orthologous genes were searched in whole genome sequence data of seven stallions and four mares and filtered for high-impact genetic variants using SnpEFF, SIFT and Polyphen 2 software. All genetic variants with the missing homozygous mutant genotype were genotyped on 337 fertile stallions of 19 breeds using KASP genotyping assays or PCR-RFLP. Mixed linear model analysis was employed for an association analysis with de-regressed estimated breeding values of the paternal component of the pregnancy rate per estrus (EBV-PAT). We screened next-generation sequencing data of whole genomes from 11 horses for equine genetic variants in 1194 human and mouse genes involved in male fertility and linked through common gene ontology (GO) with male reproductive processes. Variants were filtered for high impact on protein structure and validated through SIFT and Polyphen 2. Genetic variants were followed up only when the homozygous mutant genotype was missing in the detection sample comprising 11 horses. After this filtering process, 17 single nucleotide polymorphisms (SNPs) were left. These SNPs were genotyped in 337 fertile stallions of 19 breeds using KASP genotyping assays or PCR-RFLP. An association analysis in 216 Hanoverian stallions revealed a significant association of the splice-site disruption variant g.37455302G>A in NOTCH1 with the de-regressed estimated breeding values of the paternal component of the pregnancy rate per estrus (EBV-PAT). For 9 high-impact variants within the genes CFTR, OVGP1, FBXO43, TSSK6, PKD1, FOXP1, TCP11, SPATA31E1 and NOTCH1 (g.37453246G>C), the homozygous mutant genotype was absent from the validation sample of all 337 fertile stallions. Therefore, these variants were considered as potentially deleterious factors for stallion fertility. In conclusion, this study revealed 17 genetic variants with a predicted high damaging effect on protein structure and missing homozygous mutant genotype. The g.37455302G>A NOTCH1 variant was identified as a significant stallion fertility locus in Hanoverian stallions and a further 9 candidate fertility loci with missing homozygous mutant genotypes were validated in a panel including 19 horse breeds. To our knowledge this is the first study in horses using next generation sequencing data to uncover strong candidate factors for stallion fertility.

  9. Double uterus with obstructed hemivagina and ipsilateral renal agenesis: pelvic anatomic variants in 87 cases.

    PubMed

    Fedele, L; Motta, F; Frontino, G; Restelli, E; Bianchi, S

    2013-06-01

    What are the anatomic variants (and their frequencies) of double uterus, obstructed hemivagina and ipsilateral renal agenesis? Most cases examined (72.4%) were of the classic anatomic variant of the Herlyn-Werner-Wunderlich syndrome (with didelphys uterus, obstructed hemivagina and ipsilateral renal agenesis), but 27.6% of cases were of a rare variant of the syndrome (with uterus septum or cervical agenesis), showing relevant clinical and surgical implications. The extreme variability of anatomic structures involved in this syndrome (uterine, cervico-vaginal and renal anomalies) is well known, even if a complete and uniform analysis of all its heterogeneous presentations in a large series is lacking. This is a retrospective study of 87 patients referred to our third-level referral center between 1981 and 2011. We analyzed the laparoscopic and chart records of 87 women who were referred to our institute with double uterus, unilateral cervico-vaginal obstruction and ipsilateral renal anomalies. Sixty-three of 87 patients had the more classic variant of didelphys uterus with obstructed hemivagina; 10/87 patients had septate bicollis uterus with obstructed hemivagina; 9/87 patients had bicornuate bicollis uterus with obstructed hemivagina; 4/87 patients had didelphys uterus with unilateral cervical atresia; 1/87 patients had bicornuate uterus with one septate cervix and unilateral obstructed hemivagina. This is a retrospective study with a long enrollment period (30 years). The study provides new insights into the anatomic variants of this rare syndrome and their relevant surgical implications.

  10. Interspecific diversity reduces and functionally substitutes for intraspecific variation in biofilm communities

    PubMed Central

    Kelvin Lee, Kai Wei; Hoong Yam, Joey Kuok; Mukherjee, Manisha; Periasamy, Saravanan; Steinberg, Peter D; Kjelleberg, Staffan; Rice, Scott A

    2016-01-01

    Diversity has a key role in the dynamics and resilience of communities and both interspecific (species) and intraspecific (genotypic) diversity can have important effects on community structure and function. However, a critical and unresolved question for understanding the ecology of a community is to what extent these two levels of diversity are functionally substitutable? Here we show, for a mixed-species biofilm community composed of Pseudomonas aeruginosa, P. protegens and Klebsiella pneumoniae, that increased interspecific diversity reduces and functionally substitutes for intraspecific diversity in mediating tolerance to stress. Biofilm populations generated high percentages of genotypic variants, which were largely absent in biofilm communities. Biofilms with either high intra- or interspecific diversity were more tolerant to SDS stress than biofilms with no or low diversity. Unexpectedly, genotypic variants decreased the tolerance of biofilm communities when experimentally introduced into the communities. For example, substituting P. protegens wild type with its genotypic variant within biofilm communities decreased SDS tolerance by twofold, apparently due to perturbation of interspecific interactions. A decrease in variant frequency was also observed when biofilm populations were exposed to cell-free effluents from another species, suggesting that extracellular factors have a role in selection against the appearance of intraspecific variants. This work demonstrates the functional substitution of inter- and intraspecific diversity for an emergent property of biofilms. It also provides a potential explanation for a long-standing paradox in microbiology, in which morphotypic variants are common in laboratory grown biofilm populations, but are rare in diverse, environmental biofilm communities. PMID:26405829

  11. The Conformational Variability of FimH: Which Conformation Represents the Therapeutic Target?

    PubMed

    Eris, Deniz; Preston, Roland C; Scharenberg, Meike; Hulliger, Fabian; Abgottspon, Daniela; Pang, Lijuan; Jiang, Xiaohua; Schwardt, Oliver; Ernst, Beat

    2016-06-02

    FimH is a bacterial lectin found at the tips of type 1 pili of uropathogenic Escherichia coli (UPEC). It mediates shear-enhanced adhesion to mannosylated surfaces. Binding of UPEC to urothelial cells initiates the infection cycle leading to urinary tract infections (UTIs). Antiadhesive glycomimetics based on α-d-mannopyranose offer an attractive alternative to the conventional antibiotic treatment because they do not induce a selection pressure and are therefore expected to have a reduced resistance potential. Genetic variation of the fimH gene in clinically isolated UPEC has been associated with distinct mannose binding phenotypes. For this reason, we investigated the mannose binding characteristics of four FimH variants with mannose-based ligands under static and hydrodynamic conditions. The selected FimH variants showed individually different binding behavior under both sets of conditions as a result of the conformational variability of FimH. Clinically relevant FimH variants typically exist in a dynamic conformational equilibrium. Additionally, we evaluated inhibitory potencies of four FimH antagonists representing different structural classes. Inhibitory potencies of three of the tested antagonists were dependent on the binding phenotype and hence on the conformational equilibrium of the FimH variant. However, the squarate derivative was the notable exception and inhibited FimH variants irrespective of their binding phenotype. Information on antagonist affinities towards various FimH variants has remained largely unconsidered despite being essential for successful antiadhesion therapy. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  12. Biocatalytic Conversion of Avermectin to 4″-Oxo-Avermectin: Improvement of Cytochrome P450 Monooxygenase Specificity by Directed Evolution

    PubMed Central

    Trefzer, Axel; Jungmann, Volker; Molnár, István; Botejue, Ajit; Buckel, Dagmar; Frey, Gerhard; Hill, D. Steven; Jörg, Mario; Ligon, James M.; Mason, Dylan; Moore, David; Pachlatko, J. Paul; Richardson, Toby H.; Spangenberg, Petra; Wall, Mark A.; Zirkle, Ross; Stege, Justin T.

    2007-01-01

    Discovery of the CYP107Z subfamily of cytochrome P450 oxidases (CYPs) led to an alternative biocatalytic synthesis of 4″-oxo-avermectin, a key intermediate for the commercial production of the semisynthetic insecticide emamectin. However, under industrial process conditions, these wild-type CYPs showed lower yields due to side product formation. Molecular evolution employing GeneReassembly was used to improve the regiospecificity of these enzymes by a combination of random mutagenesis, protein structure-guided site-directed mutagenesis, and recombination of multiple natural and synthetic CYP107Z gene fragments. To assess the specificity of CYP mutants, a miniaturized, whole-cell biocatalytic reaction system that allowed high-throughput screening of large numbers of variants was developed. In an iterative process consisting of four successive rounds of GeneReassembly evolution, enzyme variants with significantly improved specificity for the production of 4″-oxo-avermectin were identified; these variants could be employed for a more economical industrial biocatalytic process to manufacture emamectin. PMID:17483257

  13. Sequencing Structural Variants in Cancer for Precision Therapeutics.

    PubMed

    Macintyre, Geoff; Ylstra, Bauke; Brenton, James D

    2016-09-01

    The identification of mutations that guide therapy selection for patients with cancer is now routine in many clinical centres. The majority of assays used for solid tumour profiling use DNA sequencing to interrogate somatic point mutations because they are relatively easy to identify and interpret. Many cancers, however, including high-grade serous ovarian, oesophageal, and small-cell lung cancer, are driven by somatic structural variants that are not measured by these assays. Therefore, there is currently an unmet need for clinical assays that can cheaply and rapidly profile structural variants in solid tumours. In this review we survey the landscape of 'actionable' structural variants in cancer and identify promising detection strategies based on massively-parallel sequencing. Copyright © 2016 Elsevier Ltd. All rights reserved.

  14. Enhancing the Predictive Power of Mutations in the C-Terminus of the KCNQ1-Encoded Kv7.1 Voltage-Gated Potassium Channel.

    PubMed

    Kapplinger, Jamie D; Tseng, Andrew S; Salisbury, Benjamin A; Tester, David J; Callis, Thomas E; Alders, Marielle; Wilde, Arthur A M; Ackerman, Michael J

    2015-04-01

    Despite the overrepresentation of Kv7.1 mutations among patients with a robust diagnosis of long QT syndrome (LQTS), a background rate of innocuous Kv7.1 missense variants observed in healthy controls creates ambiguity in the interpretation of LQTS genetic test results. A recent study showed that the probability of pathogenicity for rare missense mutations depends in part on the topological location of the variant in Kv7.1's various structure-function domains. Since the Kv7.1's C-terminus accounts for nearly 50 % of the overall protein and nearly 50 % of the overall background rate of rare variants falls within the C-terminus, further enhancement in mutation calling may provide guidance in distinguishing pathogenic long QT syndrome type 1 (LQT1)-causing mutations from rare non-disease-causing variants in the Kv7.1's C-terminus. Therefore, we have used conservation analysis and a large case-control study to generate topology-based estimative predictive values to aid in interpretation, identifying three regions of high conservation within the Kv7.1's C-terminus which have a high probability of LQT1 pathogenicity.

  15. Joint genetic analysis of hippocampal size in mouse and human identifies a novel gene linked to neurodegenerative disease.

    PubMed

    Ashbrook, David G; Williams, Robert W; Lu, Lu; Stein, Jason L; Hibar, Derrek P; Nichols, Thomas E; Medland, Sarah E; Thompson, Paul M; Hager, Reinmar

    2014-10-03

    Variation in hippocampal volume has been linked to significant differences in memory, behavior, and cognition among individuals. To identify genetic variants underlying such differences and associated disease phenotypes, multinational consortia such as ENIGMA have used large magnetic resonance imaging (MRI) data sets in human GWAS studies. In addition, mapping studies in mouse model systems have identified genetic variants for brain structure variation with great power. A key challenge is to understand how genetically based differences in brain structure lead to the propensity to develop specific neurological disorders. We combine the largest human GWAS of brain structure with the largest mammalian model system, the BXD recombinant inbred mouse population, to identify novel genetic targets influencing brain structure variation that are linked to increased risk for neurological disorders. We first use a novel cross-species, comparative analysis using mouse and human genetic data to identify a candidate gene, MGST3, associated with adult hippocampus size in both systems. We then establish the coregulation and function of this gene in a comprehensive systems-analysis. We find that MGST3 is associated with hippocampus size and is linked to a group of neurodegenerative disorders, such as Alzheimer's.

  16. Rare coding variants in Phospholipase D3 (PLD3) confer risk for Alzheimer's disease

    PubMed Central

    Cruchaga, Carlos; Benitez, Bruno A.; Cai, Yefei; Guerreiro, Rita; Harari, Oscar; Norton, Joanne; Budde, John; Bertelsen, Sarah; Jeng, Amanda T.; Cooper, Breanna; Skorupa, Tara; Carrell, David; Levitch, Denise; Hsu, Simon; Choi, Jiyoon; Ryten, Mina; Sassi, Celeste; Bras, Jose; Gibbs, Raphael J.; Hernandez, Dena G.; Lupton, Michelle K.; Powell, John; Forabosco, Paola; Ridge, Perry G.; Corcoran, Christopher D.; Tschanz, JoAnn T.; Norton, Maria C.; Munger, Ronald G.; Schmutz, Cameron; Leary, Maegan; Demirci, F. Yesim; Bamne, Mikhil N.; Wang, Xingbin; Lopez, Oscar L.; Ganguli, Mary; Medway, Christopher; Turton, James; Lord, Jenny; Braae, Anne; Barber, Imelda; Brown, Kristelle; Pastor, Pau; Lorenzo-Betancor, Oswaldo; Brkanac, Zoran; Scott, Erick; Topol, Eric; Morgan, Kevin; Rogaeva, Ekaterina; Singleton, Andy; Hardy, John; Kamboh, M. Ilyas; George-Hyslop, Peter St; Cairns, Nigel; Morris, John C.; Kauwe, John S.K.; Goate, Alison M.

    2014-01-01

    Genome-wide association studies (GWAS) have identified several risk variants for late-onset Alzheimer's disease (LOAD)1,2. These common variants have replicable but small effects on LOAD risk and generally do not have obvious functional effects. Low-frequency coding variants, not detected by GWAS, are predicted to include functional variants with larger effects on risk. To identify low-frequency coding variants with large effects on LOAD risk, we performed whole-exome sequencing (WES) in 14 large LOAD families and follow-up analyses of the candidate variants in several large case-control datasets. A rare variant in PLD3 (phospholipase-D family, member 3, rs145999145; V232M) segregated with disease status in two independent families and doubled risk for AD in seven independent case-control series (V232M meta-analysis; OR = 2.10, CI = 1.47-2.99; p = 2.93×10^-5, 11,354 cases and controls of European descent). Gene-based burden analyses in 4,387 cases and controls of European descent and 302 African American cases and controls, with complete sequence data for PLD3, indicate that several variants in this gene increase risk for AD in both populations (EA: OR = 2.75, CI = 2.05-3.68; p = 1.44×10^-11, AA: OR = 5.48, CI = 1.77-16.92; p = 1.40×10^-3). PLD3 is highly expressed in brain regions vulnerable to AD pathology, including hippocampus and cortex, and is expressed at lower levels in neurons from AD brains compared to control brains (p = 8.10×10^-10). Over-expression of PLD3 leads to a significant decrease in intracellular APP and extracellular Aβ42 and Aβ40, while knock-down of PLD3 leads to a significant increase in extracellular Aβ42 and Aβ40. Together, our genetic and functional data indicate that carriers of PLD3 coding variants have a two-fold increased risk for LOAD and that PLD3 influences APP processing. This study provides an example of how densely affected families may be used to identify rare variants with large effects on risk for disease or other complex traits. PMID:24336208
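
    A confidence interval for an odds ratio is conventionally symmetric on the log scale, so a reported interval can be checked against its point estimate. The sketch below applies that check to the V232M meta-analysis figures quoted above. It assumes a Wald-type 95% interval (z = 1.96), which the abstract does not state; the function name is hypothetical, and the recovered standard error is only an approximation.

```python
import math


def log_scale_summary(ci_low, ci_high, z=1.96):
    """Recover the implied odds ratio and log-scale standard error from a CI.

    Assumes the interval was built as exp(log(OR) +/- z * SE), an assumption
    made for illustration rather than a detail taken from the study.
    """
    log_low, log_high = math.log(ci_low), math.log(ci_high)
    # Midpoint on the log scale, i.e. the geometric mean of the two bounds
    implied_or = math.exp((log_low + log_high) / 2)
    # Half-width of the interval on the log scale, divided by z
    se_log_or = (log_high - log_low) / (2 * z)
    return implied_or, se_log_or


# Values quoted in the abstract: OR = 2.10, CI = 1.47-2.99
implied_or, se = log_scale_summary(1.47, 2.99)
print(round(implied_or, 2), round(se, 3))
```

    The geometric mean of the two bounds lands close to the reported odds ratio of 2.10, which is what one expects from an interval constructed on the log scale.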

  17. Synthesis and Thermoelectric Properties in the 2D Ti1−xNbxS3 Trichalcogenides

    PubMed Central

    Misse, Patrick R. N.; Berthebaud, David; Lebedev, Oleg I.; Maignan, Antoine; Guilmeau, Emmanuel

    2015-01-01

    A solid solution of Ti1−xNbxS3 composition (x = 0, 0.05, 0.07, 0.10) was synthesized by solid-liquid-vapor reaction followed by spark plasma sintering. The obtained compounds crystallize in the monoclinic ZrSe3 structure type. For the x = 0.07 sample, a mixture of both A and B variants of the MX3 structure is evidenced by transmission electron microscopy. This result contrasts with that for pristine TiS3, prepared under the same conditions, which crystallizes predominantly as the A variant. Thermoelectric properties were investigated in the temperature range 323 to 523 K. A decrease in the electrical resistivity and absolute value of the Seebeck coefficient is observed when increasing x due to electron doping. The lattice component of the thermal conductivity is effectively reduced by the Nb for Ti substitution through a mass fluctuation effect and/or a disorder effect created by the mixture of both A and B variants. Due to the low carrier concentration and the semiconductor character of the doped compounds, the power factor values are too low, leading to ZT values that remain smaller by a factor of 50 than those of the TiS2 layered compound.

  18. Polygenic determinants in extremes of high-density lipoprotein cholesterol

    PubMed Central

    Dron, Jacqueline S.; Wang, Jian; Low-Kam, Cécile; Khetarpal, Sumeet A.; Robinson, John F.; McIntyre, Adam D.; Ban, Matthew R.; Cao, Henian; Rhainds, David; Dubé, Marie-Pierre; Rader, Daniel J.; Lettre, Guillaume; Tardif, Jean-Claude

    2017-01-01

    HDL cholesterol (HDL-C) remains a superior biochemical predictor of CVD risk, but its genetic basis is incompletely defined. In patients with extreme HDL-C concentrations, we concurrently evaluated the contributions of multiple large- and small-effect genetic variants. In a discovery cohort of 255 unrelated lipid clinic patients with extreme HDL-C levels, we used a targeted next-generation sequencing panel to evaluate rare variants in known HDL metabolism genes, simultaneously with common variants bundled into a polygenic trait score. Two additional cohorts were used for validation and included 1,746 individuals from the Montréal Heart Institute Biobank and 1,048 individuals from the University of Pennsylvania. Findings were consistent between cohorts: we found rare heterozygous large-effect variants in 18.7% and 10.9% of low- and high-HDL-C patients, respectively. We also found common variant accumulation, indicated by extreme polygenic trait scores, in an additional 12.8% and 19.3% of overall cases of low- and high-HDL-C extremes, respectively. Thus, the genetic basis of extreme HDL-C concentrations encountered clinically is frequently polygenic, with contributions from both rare large-effect and common small-effect variants. Multiple types of genetic variants should be considered as contributing factors in patients with extreme dyslipidemia. PMID:28870971

  19. Polygenic determinants in extremes of high-density lipoprotein cholesterol.

    PubMed

    Dron, Jacqueline S; Wang, Jian; Low-Kam, Cécile; Khetarpal, Sumeet A; Robinson, John F; McIntyre, Adam D; Ban, Matthew R; Cao, Henian; Rhainds, David; Dubé, Marie-Pierre; Rader, Daniel J; Lettre, Guillaume; Tardif, Jean-Claude; Hegele, Robert A

    2017-11-01

    HDL cholesterol (HDL-C) remains a superior biochemical predictor of CVD risk, but its genetic basis is incompletely defined. In patients with extreme HDL-C concentrations, we concurrently evaluated the contributions of multiple large- and small-effect genetic variants. In a discovery cohort of 255 unrelated lipid clinic patients with extreme HDL-C levels, we used a targeted next-generation sequencing panel to evaluate rare variants in known HDL metabolism genes, simultaneously with common variants bundled into a polygenic trait score. Two additional cohorts were used for validation and included 1,746 individuals from the Montréal Heart Institute Biobank and 1,048 individuals from the University of Pennsylvania. Findings were consistent between cohorts: we found rare heterozygous large-effect variants in 18.7% and 10.9% of low- and high-HDL-C patients, respectively. We also found common variant accumulation, indicated by extreme polygenic trait scores, in an additional 12.8% and 19.3% of overall cases of low- and high-HDL-C extremes, respectively. Thus, the genetic basis of extreme HDL-C concentrations encountered clinically is frequently polygenic, with contributions from both rare large-effect and common small-effect variants. Multiple types of genetic variants should be considered as contributing factors in patients with extreme dyslipidemia. Copyright © 2017 by the American Society for Biochemistry and Molecular Biology, Inc.

  20. Community structure in traffic zones based on travel demand

    NASA Astrophysics Data System (ADS)

    Sun, Li; Ling, Ximan; He, Kun; Tan, Qian

    2016-09-01

    Large-scale structure in complex networks can be studied by dividing the network into communities or modules. The urban traffic system is one of the most critical infrastructures, and it can be abstracted into a complex network composed of tightly connected groups. Here, we analyze community structure in urban traffic zones using community detection methods from network science. A spectral algorithm based on the eigenvectors of matrices is employed. Our empirical results indicate that the traffic communities vary with the travel demand distribution, since in the morning the majority of passengers travel from home to work and in the evening they travel in the opposite direction. Meanwhile, the origin-destination pairs with a large number of trips play a significant role in the community division of the urban traffic network. The layout of traffic communities in a city also depends on the residents' trajectories.
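
    The abstract names a spectral algorithm but not the matrix it uses. The sketch below assumes the modularity matrix and a two-way split by the sign of its leading eigenvector, which is one standard spectral method and not necessarily the authors' exact procedure. The toy graph (two 4-node cliques joined by one link) is hypothetical; in a traffic setting the entries could instead be trip counts between zones.

```python
from itertools import combinations
from typing import List, Tuple

Matrix = List[List[float]]


def adjacency_from_edges(n: int, edges: List[Tuple[int, int]]) -> Matrix:
    """Build a symmetric, unweighted adjacency matrix from an edge list."""
    adj = [[0.0] * n for _ in range(n)]
    for u, v in edges:
        adj[u][v] = adj[v][u] = 1.0
    return adj


def modularity_matrix(adj: Matrix) -> Matrix:
    """B[i][j] = A[i][j] - k_i * k_j / (2m), with k the node degrees."""
    n = len(adj)
    degrees = [sum(row) for row in adj]
    two_m = sum(degrees)
    return [[adj[i][j] - degrees[i] * degrees[j] / two_m for j in range(n)]
            for i in range(n)]


def leading_eigenvector(mat: Matrix, iterations: int = 500) -> List[float]:
    """Power iteration on (mat + shift * I).

    The shift (largest absolute row sum) makes every eigenvalue non-negative,
    so the iteration converges to the eigenvector of the most positive
    eigenvalue of `mat` rather than the one of largest magnitude.
    """
    n = len(mat)
    shift = max(sum(abs(x) for x in row) for row in mat)
    vec = [1.0 + 0.1 * i for i in range(n)]  # deterministic, non-uniform start
    for _ in range(iterations):
        nxt = [sum(mat[i][j] * vec[j] for j in range(n)) + shift * vec[i]
               for i in range(n)]
        norm = sum(x * x for x in nxt) ** 0.5
        vec = [x / norm for x in nxt]
    return vec


def bisect_communities(adj: Matrix) -> List[int]:
    """Assign each node to one of two communities by eigenvector sign."""
    vec = leading_eigenvector(modularity_matrix(adj))
    return [0 if x >= 0 else 1 for x in vec]


# Toy network: nodes 0-3 and 4-7 form two cliques linked by the edge (3, 4).
edges = (list(combinations(range(4), 2))
         + list(combinations(range(4, 8), 2))
         + [(3, 4)])
labels = bisect_communities(adjacency_from_edges(8, edges))
```

    On this toy graph the split separates the two cliques. Which clique receives label 0 depends on the arbitrary sign of the eigenvector, so only the grouping is meaningful.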

  1. A large genome-wide association study of age-related macular degeneration highlights contributions of rare and common variants

    PubMed Central

    Fritsche, Lars G.; Igl, Wilmar; Cooke Bailey, Jessica N.; Grassmann, Felix; Sengupta, Sebanti; Bragg-Gresham, Jennifer L.; Burdon, Kathryn P.; Hebbring, Scott J.; Wen, Cindy; Gorski, Mathias; Kim, Ivana K.; Cho, David; Zack, Donald; Souied, Eric; Scholl, Hendrik P. N.; Bala, Elisa; Lee, Kristine E.; Hunter, David J.; Sardell, Rebecca J.; Mitchell, Paul; Merriam, Joanna E.; Cipriani, Valentina; Hoffman, Joshua D.; Schick, Tina; Lechanteur, Yara T. E.; Guymer, Robyn H.; Johnson, Matthew P.; Jiang, Yingda; Stanton, Chloe M.; Buitendijk, Gabriëlle H. S.; Zhan, Xiaowei; Kwong, Alan M.; Boleda, Alexis; Brooks, Matthew; Gieser, Linn; Ratnapriya, Rinki; Branham, Kari E.; Foerster, Johanna R.; Heckenlively, John R.; Othman, Mohammad I.; Vote, Brendan J.; Liang, Helena Hai; Souzeau, Emmanuelle; McAllister, Ian L.; Isaacs, Timothy; Hall, Janette; Lake, Stewart; Mackey, David A.; Constable, Ian J.; Craig, Jamie E.; Kitchner, Terrie E.; Yang, Zhenglin; Su, Zhiguang; Luo, Hongrong; Chen, Daniel; Ouyang, Hong; Flagg, Ken; Lin, Danni; Mao, Guanping; Ferreyra, Henry; Stark, Klaus; von Strachwitz, Claudia N.; Wolf, Armin; Brandl, Caroline; Rudolph, Guenther; Olden, Matthias; Morrison, Margaux A.; Morgan, Denise J.; Schu, Matthew; Ahn, Jeeyun; Silvestri, Giuliana; Tsironi, Evangelia E.; Park, Kyu Hyung; Farrer, Lindsay A.; Orlin, Anton; Brucker, Alexander; Li, Mingyao; Curcio, Christine; Mohand-Saïd, Saddek; Sahel, José-Alain; Audo, Isabelle; Benchaboune, Mustapha; Cree, Angela J.; Rennie, Christina A.; Goverdhan, Srinivas V.; Grunin, Michelle; Hagbi-Levi, Shira; Campochiaro, Peter; Katsanis, Nicholas; Holz, Frank G.; Blond, Frédéric; Blanché, Hélène; Deleuze, Jean-François; Igo, Robert P.; Truitt, Barbara; Peachey, Neal S.; Meuer, Stacy M.; Myers, Chelsea E.; Moore, Emily L.; Klein, Ronald; Hauser, Michael A.; Postel, Eric A.; Courtenay, Monique D.; Schwartz, Stephen G.; Kovach, Jaclyn L.; Scott, William K.; Liew, Gerald; Tan, Ava G.; Gopinath, Bamini; Merriam, John C.; Smith, 
R. Theodore; Khan, Jane C.; Shahid, Humma; Moore, Anthony T.; McGrath, J. Allie; Laux, Reneé; Brantley, Milam A.; Agarwal, Anita; Ersoy, Lebriz; Caramoy, Albert; Langmann, Thomas; Saksens, Nicole T. M.; de Jong, Eiko K.; Hoyng, Carel B.; Cain, Melinda S.; Richardson, Andrea J.; Martin, Tammy M.; Blangero, John; Weeks, Daniel E.; Dhillon, Bal; van Duijn, Cornelia M.; Doheny, Kimberly F.; Romm, Jane; Klaver, Caroline C. W.; Hayward, Caroline; Gorin, Michael B.; Klein, Michael L.; Baird, Paul N.; den Hollander, Anneke I.; Fauser, Sascha; Yates, John R. W.; Allikmets, Rando; Wang, Jie Jin; Schaumberg, Debra A.; Klein, Barbara E. K.; Hagstrom, Stephanie A.; Chowers, Itay; Lotery, Andrew J.; Léveillard, Thierry; Zhang, Kang; Brilliant, Murray H.; Hewitt, Alex W.; Swaroop, Anand; Chew, Emily Y.; Pericak-Vance, Margaret A.; DeAngelis, Margaret; Stambolian, Dwight; Haines, Jonathan L.; Iyengar, Sudha K.; Weber, Bernhard H. F.; Abecasis, Gonçalo R.; Heid, Iris M.

    2016-01-01

    Advanced age-related macular degeneration (AMD) is the leading cause of blindness in the elderly, with limited therapeutic options. Here, we report on a study of >12 million variants, including 163,714 directly genotyped, mostly rare, protein-altering variants. Analyzing 16,144 patients and 17,832 controls, we identify 52 independently associated common and rare variants (P < 5 × 10(-8)) distributed across 34 loci. While wet and dry AMD subtypes exhibit predominantly shared genetics, we identify the first signal specific to wet AMD, near MMP9 (difference-P = 4.1 × 10(-10)). Very rare coding variants (frequency < 0.1%) in CFH, CFI, and TIMP3 suggest causal roles for these genes, as does a splice variant in SLC16A8. Our results support the hypothesis that rare coding variants can pinpoint causal genes within known genetic loci and illustrate that applying the approach systematically to detect new loci requires extremely large sample sizes. PMID:26691988

  2. Ecotype-specific and chromosome-specific expansion of variant centromeric satellites in Arabidopsis thaliana.

    PubMed

    Ito, Hidetaka; Miura, Asuka; Takashima, Kazuya; Kakutani, Tetsuji

    2007-01-01

    Despite the conserved roles and conserved protein machineries of centromeres, their nucleotide sequences can be highly diverse even among related species. The diversity reflects rapid evolution, but the underlying mechanism is largely unknown. One approach to monitoring rapid evolution is to examine intra-specific variation. Here we report variant centromeric satellites of Arabidopsis thaliana found through a survey of 103 natural accessions (ecotypes). Among them, a cluster of variant centromeric satellites was detected in one ecotype, Cape Verde Islands (Cvi). Recombinant inbred mapping revealed that the variant satellites are distributed in the centromeric region of chromosome 5 (CEN5) of this ecotype. This apparently recent variant accumulation is associated with a large deletion of a pericentromeric region and expansion of the satellite region. The variant satellite was bound to HTR12 (centromeric variant histone H3), although expansion of the satellite was not associated with a comparable increase in HTR12 binding. The results suggest that variant satellites with centromere function can rapidly accumulate in one centromere, supporting the model that the satellite repeats in the array are homogenized by occasional unequal crossing-over, which has the potential to generate an expansion of local sequence variants within a centromere cluster.

  3. Common genetic variants influence human subcortical brain structures.

    PubMed

    Hibar, Derrek P; Stein, Jason L; Renteria, Miguel E; Arias-Vasquez, Alejandro; Desrivières, Sylvane; Jahanshad, Neda; Toro, Roberto; Wittfeld, Katharina; Abramovic, Lucija; Andersson, Micael; Aribisala, Benjamin S; Armstrong, Nicola J; Bernard, Manon; Bohlken, Marc M; Boks, Marco P; Bralten, Janita; Brown, Andrew A; Chakravarty, M Mallar; Chen, Qiang; Ching, Christopher R K; Cuellar-Partida, Gabriel; den Braber, Anouk; Giddaluru, Sudheer; Goldman, Aaron L; Grimm, Oliver; Guadalupe, Tulio; Hass, Johanna; Woldehawariat, Girma; Holmes, Avram J; Hoogman, Martine; Janowitz, Deborah; Jia, Tianye; Kim, Sungeun; Klein, Marieke; Kraemer, Bernd; Lee, Phil H; Olde Loohuis, Loes M; Luciano, Michelle; Macare, Christine; Mather, Karen A; Mattheisen, Manuel; Milaneschi, Yuri; Nho, Kwangsik; Papmeyer, Martina; Ramasamy, Adaikalavan; Risacher, Shannon L; Roiz-Santiañez, Roberto; Rose, Emma J; Salami, Alireza; Sämann, Philipp G; Schmaal, Lianne; Schork, Andrew J; Shin, Jean; Strike, Lachlan T; Teumer, Alexander; van Donkelaar, Marjolein M J; van Eijk, Kristel R; Walters, Raymond K; Westlye, Lars T; Whelan, Christopher D; Winkler, Anderson M; Zwiers, Marcel P; Alhusaini, Saud; Athanasiu, Lavinia; Ehrlich, Stefan; Hakobjan, Marina M H; Hartberg, Cecilie B; Haukvik, Unn K; Heister, Angelien J G A M; Hoehn, David; Kasperaviciute, Dalia; Liewald, David C M; Lopez, Lorna M; Makkinje, Remco R R; Matarin, Mar; Naber, Marlies A M; McKay, D Reese; Needham, Margaret; Nugent, Allison C; Pütz, Benno; Royle, Natalie A; Shen, Li; Sprooten, Emma; Trabzuni, Daniah; van der Marel, Saskia S L; van Hulzen, Kimm J E; Walton, Esther; Wolf, Christiane; Almasy, Laura; Ames, David; Arepalli, Sampath; Assareh, Amelia A; Bastin, Mark E; Brodaty, Henry; Bulayeva, Kazima B; Carless, Melanie A; Cichon, Sven; Corvin, Aiden; Curran, Joanne E; Czisch, Michael; de Zubicaray, Greig I; Dillman, Allissa; Duggirala, Ravi; Dyer, Thomas D; Erk, Susanne; Fedko, Iryna O; Ferrucci, Luigi; Foroud, Tatiana M; Fox, Peter T; 
Fukunaga, Masaki; Gibbs, J Raphael; Göring, Harald H H; Green, Robert C; Guelfi, Sebastian; Hansell, Narelle K; Hartman, Catharina A; Hegenscheid, Katrin; Heinz, Andreas; Hernandez, Dena G; Heslenfeld, Dirk J; Hoekstra, Pieter J; Holsboer, Florian; Homuth, Georg; Hottenga, Jouke-Jan; Ikeda, Masashi; Jack, Clifford R; Jenkinson, Mark; Johnson, Robert; Kanai, Ryota; Keil, Maria; Kent, Jack W; Kochunov, Peter; Kwok, John B; Lawrie, Stephen M; Liu, Xinmin; Longo, Dan L; McMahon, Katie L; Meisenzahl, Eva; Melle, Ingrid; Mohnke, Sebastian; Montgomery, Grant W; Mostert, Jeanette C; Mühleisen, Thomas W; Nalls, Michael A; Nichols, Thomas E; Nilsson, Lars G; Nöthen, Markus M; Ohi, Kazutaka; Olvera, Rene L; Perez-Iglesias, Rocio; Pike, G Bruce; Potkin, Steven G; Reinvang, Ivar; Reppermund, Simone; Rietschel, Marcella; Romanczuk-Seiferth, Nina; Rosen, Glenn D; Rujescu, Dan; Schnell, Knut; Schofield, Peter R; Smith, Colin; Steen, Vidar M; Sussmann, Jessika E; Thalamuthu, Anbupalam; Toga, Arthur W; Traynor, Bryan J; Troncoso, Juan; Turner, Jessica A; Valdés Hernández, Maria C; van 't Ent, Dennis; van der Brug, Marcel; van der Wee, Nic J A; van Tol, Marie-Jose; Veltman, Dick J; Wassink, Thomas H; Westman, Eric; Zielke, Ronald H; Zonderman, Alan B; Ashbrook, David G; Hager, Reinmar; Lu, Lu; McMahon, Francis J; Morris, Derek W; Williams, Robert W; Brunner, Han G; Buckner, Randy L; Buitelaar, Jan K; Cahn, Wiepke; Calhoun, Vince D; Cavalleri, Gianpiero L; Crespo-Facorro, Benedicto; Dale, Anders M; Davies, Gareth E; Delanty, Norman; Depondt, Chantal; Djurovic, Srdjan; Drevets, Wayne C; Espeseth, Thomas; Gollub, Randy L; Ho, Beng-Choon; Hoffmann, Wolfgang; Hosten, Norbert; Kahn, René S; Le Hellard, Stephanie; Meyer-Lindenberg, Andreas; Müller-Myhsok, Bertram; Nauck, Matthias; Nyberg, Lars; Pandolfo, Massimo; Penninx, Brenda W J H; Roffman, Joshua L; Sisodiya, Sanjay M; Smoller, Jordan W; van Bokhoven, Hans; van Haren, Neeltje E M; Völzke, Henry; Walter, Henrik; Weiner, Michael W; Wen, 
Wei; White, Tonya; Agartz, Ingrid; Andreassen, Ole A; Blangero, John; Boomsma, Dorret I; Brouwer, Rachel M; Cannon, Dara M; Cookson, Mark R; de Geus, Eco J C; Deary, Ian J; Donohoe, Gary; Fernández, Guillén; Fisher, Simon E; Francks, Clyde; Glahn, David C; Grabe, Hans J; Gruber, Oliver; Hardy, John; Hashimoto, Ryota; Hulshoff Pol, Hilleke E; Jönsson, Erik G; Kloszewska, Iwona; Lovestone, Simon; Mattay, Venkata S; Mecocci, Patrizia; McDonald, Colm; McIntosh, Andrew M; Ophoff, Roel A; Paus, Tomas; Pausova, Zdenka; Ryten, Mina; Sachdev, Perminder S; Saykin, Andrew J; Simmons, Andy; Singleton, Andrew; Soininen, Hilkka; Wardlaw, Joanna M; Weale, Michael E; Weinberger, Daniel R; Adams, Hieab H H; Launer, Lenore J; Seiler, Stephan; Schmidt, Reinhold; Chauhan, Ganesh; Satizabal, Claudia L; Becker, James T; Yanek, Lisa; van der Lee, Sven J; Ebling, Maritza; Fischl, Bruce; Longstreth, W T; Greve, Douglas; Schmidt, Helena; Nyquist, Paul; Vinke, Louis N; van Duijn, Cornelia M; Xue, Luting; Mazoyer, Bernard; Bis, Joshua C; Gudnason, Vilmundur; Seshadri, Sudha; Ikram, M Arfan; Martin, Nicholas G; Wright, Margaret J; Schumann, Gunter; Franke, Barbara; Thompson, Paul M; Medland, Sarah E

    2015-04-09

    The highly complex structure of the human brain is strongly shaped by genetic influences. Subcortical brain regions form circuits with cortical areas to coordinate movement, learning, memory and motivation, and altered circuits can lead to abnormal behaviour and disease. To investigate how common genetic variants affect the structure of these brain regions, here we conduct genome-wide association studies of the volumes of seven subcortical regions and the intracranial volume derived from magnetic resonance images of 30,717 individuals from 50 cohorts. We identify five novel genetic variants influencing the volumes of the putamen and caudate nucleus. We also find stronger evidence for three loci with previously established influences on hippocampal volume and intracranial volume. These variants show specific volumetric effects on brain structures rather than global effects across structures. The strongest effects were found for the putamen, where a novel intergenic locus with replicable influence on volume (rs945270; P = 1.08 × 10(-33); 0.52% variance explained) showed evidence of altering the expression of the KTN1 gene in both brain and blood tissue. Variants influencing putamen volume clustered near developmental genes that regulate apoptosis, axon guidance and vesicle transport. Identification of these genetic variants provides insight into the causes of variability in human brain development, and may help to determine mechanisms of neuropsychiatric dysfunction.

  4. Novel GREM1 Variations in Sub-Saharan African Patients With Cleft Lip and/or Cleft Palate.

    PubMed

    Gowans, Lord Jephthah Joojo; Oseni, Ganiyu; Mossey, Peter A; Adeyemo, Wasiu Lanre; Eshete, Mekonen A; Busch, Tamara D; Donkor, Peter; Obiri-Yeboah, Solomon; Plange-Rhule, Gyikua; Oti, Alexander A; Owais, Arwa; Olaitan, Peter B; Aregbesola, Babatunde S; Oginni, Fadekemi O; Bello, Seidu A; Audu, Rosemary; Onwuamah, Chika; Agbenorku, Pius; Ogunlewe, Mobolanle O; Abdur-Rahman, Lukman O; Marazita, Mary L; Adeyemo, A A; Murray, Jeffrey C; Butali, Azeez

    2018-05-01

    Cleft lip and/or cleft palate (CL/P) are congenital anomalies of the face and have multifactorial etiology, with both environmental and genetic risk factors playing crucial roles. Though at least 40 loci have attained genome-wide significant association with nonsyndromic CL/P, these loci largely reside in noncoding regions of the human genome, and subsequent resequencing studies of neighboring candidate genes have revealed only a limited number of etiologic coding variants. The present study was conducted to identify etiologic coding variants in GREM1, a locus that has been shown to be largely associated with cleft of both lip and soft palate. We resequenced DNA from 397 sub-Saharan Africans with CL/P and 192 controls using Sanger sequencing. Following analyses of the sequence data, we observed 2 novel coding variants in GREM1. These variants were not found in the 192 African controls and have never been previously reported in any public genetic variant database that includes more than 5000 combined African and African American controls or in the CL/P literature. The novel variants include p.Pro164Ser in an individual with soft palate cleft only and p.Gly61Asp in an individual with bilateral cleft lip and palate. The proband with the p.Gly61Asp GREM1 variant is a van der Woude syndrome (VWS) case who also has an etiologic variant in the IRF6 gene. Our study demonstrated that there is a low number of etiologic coding variants in GREM1, confirming earlier suggestions that variants in regulatory elements may largely account for the association between this locus and CL/P.

  5. Acquiring Structural Information on Virus Particles with Charge Detection Mass Spectrometry

    NASA Astrophysics Data System (ADS)

    Keifer, David Z.; Motwani, Tina; Teschke, Carolyn M.; Jarrold, Martin F.

    2016-06-01

    Charge detection mass spectrometry (CDMS) is a single-molecule technique particularly well-suited to measuring the mass and charge distributions of heterogeneous, MDa-sized ions. In this work, CDMS has been used to analyze the assembly products of two coat protein variants of bacteriophage P22. The assembly products show broad mass distributions extending from 5 to 15 MDa for A285Y and 5 to 25 MDa for A285T coat protein variants. Because the charge of large ions generated by electrospray ionization depends on their size, the charge can be used to distinguish hollow shells from more compact structures. A285T was found to form T = 4 and T = 7 procapsids, and A285Y makes a small number of T = 3 and T = 4 procapsids. Owing to the decreased stability of the A285Y and A285T particles, chemical cross-linking was required to stabilize them for electrospray CDMS.

  6. Transient structural variations have strong effects on quantitative traits and reproductive isolation in fission yeast

    PubMed Central

    Jeffares, Daniel C.; Jolly, Clemency; Hoti, Mimoza; Speed, Doug; Shaw, Liam; Rallis, Charalampos; Balloux, Francois; Dessimoz, Christophe; Bähler, Jürg; Sedlazeck, Fritz J.

    2017-01-01

    Large structural variations (SVs) within genomes are more challenging to identify than smaller genetic variants but may substantially contribute to phenotypic diversity and evolution. We analyse the effects of SVs on gene expression, quantitative traits and intrinsic reproductive isolation in the yeast Schizosaccharomyces pombe. We establish a high-quality curated catalogue of SVs in the genomes of a worldwide library of S. pombe strains, including duplications, deletions, inversions and translocations. We show that copy number variants (CNVs) show a variety of genetic signals consistent with rapid turnover. These transient CNVs produce stoichiometric effects on gene expression both within and outside the duplicated regions. CNVs make substantial contributions to quantitative traits, most notably intracellular amino acid concentrations, growth under stress and sugar utilization in winemaking, whereas rearrangements are strongly associated with reproductive isolation. Collectively, these findings have broad implications for evolution and for our understanding of quantitative traits including complex human diseases. PMID:28117401

  7. Use of Single-Cysteine Variants for Trapping Transient States in DNA Mismatch Repair.

    PubMed

    Friedhoff, Peter; Manelyte, Laura; Giron-Monzon, Luis; Winkler, Ines; Groothuizen, Flora S; Sixma, Titia K

    2017-01-01

    DNA mismatch repair (MMR) is necessary to prevent incorporation of polymerase errors into the newly synthesized DNA strand, as they would be mutagenic. In humans, errors in MMR cause a predisposition to cancer, called Lynch syndrome. The MMR process is performed by a set of ATPases that transmit, validate, and couple information to identify which DNA strand requires repair. To understand the individual steps in the repair process, it is useful to be able to study these large molecular machines structurally and functionally. However, the steps and states are highly transient; therefore, the methods to capture and enrich them are essential. Here, we describe how single-cysteine variants can be used for specific cross-linking and labeling approaches that allow trapping of relevant transient states. Analysis of these defined states in functional and structural studies is instrumental to elucidate the molecular mechanism of this important DNA MMR process. © 2017 Elsevier Inc. All rights reserved.

  8. Growth of L1₀-ordered crystal in FePt and FePd thin films on MgO(001) substrate

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Futamoto, Masaaki; Nakamura, Masahiro; Ohtake, Mitsuru

    2016-08-15

    Formation of the L1₀-ordered structure from the disordered A1 phase has been investigated for FePt and FePd films on MgO(001) substrates employing a two-step method consisting of low-temperature deposition at 200 °C followed by high-temperature annealing at 600 °C. The L1₀-(001) variant crystal, with the c-axis perpendicular to the substrate, grows preferentially in FePd films, whereas L1₀-(100) and (010) variants tend to be mixed with the L1₀-(001) variant in FePt films. Structure analysis by X-ray diffraction indicates that a difference in A1 lattice strain is the influential factor that determines the resulting L1₀-variant structure in ordered thin films. Misfit dislocations and anti-phase boundaries are observed in high-resolution transmission electron micrographs of a 10 nm-thick Fe(Pt, Pd) film consisting of L1₀-(001) variants, which are formed through atomic diffusion at 600 °C in a laterally strained FePt/FePd epitaxial thin film. Based on the experimental results, a nucleation and growth model for explaining L1₀-variant formation is proposed, which suggests the possibility of tailoring the L1₀ variant structure in ordered magnetic thin films by controlling the alloy composition, the layer structure, and the substrate material.

  9. Wild yeast harbor a variety of distinct amyloid structures with strong prion-inducing capabilities

    PubMed Central

    Westergard, Laura; True, Heather L.

    2014-01-01

    Variation in amyloid structures profoundly influences a wide array of pathological phenotypes in mammalian protein conformation disorders and dominantly inherited phenotypes in yeast. Here, we describe, for the first time, naturally occurring, self-propagating, structural variants of a prion protein isolated from wild strains of the yeast Saccharomyces cerevisiae. Variants of the [RNQ+] prion propagating in a variety of wild yeast differ biochemically, in their intracellular distributions, and in their ability to promote formation of the [PSI+] prion. [PSI+] is an epigenetic regulator of cellular phenotype and adaptability. Strikingly, we find that most natural [RNQ+] variants induced [PSI+] at high frequencies and the majority of [PSI+] variants elicited strong cellular phenotypes. We hypothesize that the presence of an efficient [RNQ+] template primes the cell for [PSI+] formation in order to induce [PSI+] in conditions where it would be advantageous. These studies utilize naturally occurring structural variants to expand our understanding of the consequences of diverse prion conformations on cellular phenotypes. PMID:24673812

  10. Comprehensive Rare Variant Analysis via Whole-Genome Sequencing to Determine the Molecular Pathology of Inherited Retinal Disease.

    PubMed

    Carss, Keren J; Arno, Gavin; Erwood, Marie; Stephens, Jonathan; Sanchis-Juan, Alba; Hull, Sarah; Megy, Karyn; Grozeva, Detelina; Dewhurst, Eleanor; Malka, Samantha; Plagnol, Vincent; Penkett, Christopher; Stirrups, Kathleen; Rizzo, Roberta; Wright, Genevieve; Josifova, Dragana; Bitner-Glindzicz, Maria; Scott, Richard H; Clement, Emma; Allen, Louise; Armstrong, Ruth; Brady, Angela F; Carmichael, Jenny; Chitre, Manali; Henderson, Robert H H; Hurst, Jane; MacLaren, Robert E; Murphy, Elaine; Paterson, Joan; Rosser, Elisabeth; Thompson, Dorothy A; Wakeling, Emma; Ouwehand, Willem H; Michaelides, Michel; Moore, Anthony T; Webster, Andrew R; Raymond, F Lucy

    2017-01-05

    Inherited retinal disease is a common cause of visual impairment and represents a highly heterogeneous group of conditions. Here, we present findings from a cohort of 722 individuals with inherited retinal disease, who have had whole-genome sequencing (n = 605), whole-exome sequencing (n = 72), or both (n = 45) performed, as part of the NIHR-BioResource Rare Diseases research study. We identified pathogenic variants (single-nucleotide variants, indels, or structural variants) for 404/722 (56%) individuals. Whole-genome sequencing gives unprecedented power to detect three categories of pathogenic variants in particular: structural variants, variants in GC-rich regions, which have significantly improved coverage compared to whole-exome sequencing, and variants in non-coding regulatory regions. In addition to previously reported pathogenic regulatory variants, we have identified a previously unreported pathogenic intronic variant in CHM in two males with choroideremia. We have also identified 19 genes not previously known to be associated with inherited retinal disease, which harbor biallelic predicted protein-truncating variants in unsolved cases. Whole-genome sequencing is an increasingly important comprehensive method with which to investigate the genetic causes of inherited retinal disease. Copyright © 2017. Published by Elsevier Inc.

  11. Rare genetic variants in the endocannabinoid system genes CNR1 and DAGLA are associated with neurological phenotypes in humans.

    PubMed

    Smith, Douglas R; Stanley, Christine M; Foss, Theodore; Boles, Richard G; McKernan, Kevin

    2017-01-01

    Rare genetic variants in the core endocannabinoid system genes CNR1, CNR2, DAGLA, MGLL and FAAH were identified in molecular testing data from 6,032 patients with a broad spectrum of neurological disorders. The variants were evaluated for association with phenotypes similar to those observed in the orthologous gene knockouts in mice. Heterozygous rare coding variants in CNR1, which encodes the type 1 cannabinoid receptor (CB1), were found to be significantly associated with pain sensitivity (especially migraine) and with sleep and memory disorders, alone or in combination with anxiety, compared to a set of controls without such CNR1 variants. Similarly, heterozygous rare variants in DAGLA, which encodes diacylglycerol lipase alpha, were found to be significantly associated with seizures and neurodevelopmental disorders, including autism and abnormalities of brain morphology, compared to controls. Rare variants in MGLL, FAAH and CNR2 were not associated with any neurological phenotypes in the patients tested. Diacylglycerol lipase alpha synthesizes the endocannabinoid 2-AG in the brain, which interacts with CB1 receptors. The phenotypes associated with rare CNR1 variants are reminiscent of those implicated in the theory of clinical endocannabinoid deficiency syndrome. The severe phenotypes associated with rare DAGLA variants underscore the critical role of rapid 2-AG synthesis and the endocannabinoid system in regulating neurological function and development. Mapping of the variants to the 3D structure of the type 1 cannabinoid receptor, or to the primary structure of diacylglycerol lipase alpha, reveals clustering of variants in certain structural regions and is consistent with impacts on function.

  12. Mapping the Conformation Space of Wildtype and Mutant H-Ras with a Memetic, Cellular, and Multiscale Evolutionary Algorithm

    PubMed Central

    Clausen, Rudy; Ma, Buyong; Nussinov, Ruth; Shehu, Amarda

    2015-01-01

    An important goal in molecular biology is to understand functional changes upon single-point mutations in proteins. Doing so through a detailed characterization of structure spaces and underlying energy landscapes is desirable but continues to challenge methods based on Molecular Dynamics. In this paper we propose a novel algorithm, SIfTER, which is based instead on stochastic optimization to circumvent the computational challenge of exploring the breadth of a protein’s structure space. SIfTER is a data-driven evolutionary algorithm, leveraging experimentally-available structures of wildtype and variant sequences of a protein to define a reduced search space from where to efficiently draw samples corresponding to novel structures not directly observed in the wet laboratory. The main advantage of SIfTER is its ability to rapidly generate conformational ensembles, thus allowing mapping and juxtaposing landscapes of variant sequences and relating observed differences to functional changes. We apply SIfTER to variant sequences of the H-Ras catalytic domain, due to the prominent role of the Ras protein in signaling pathways that control cell proliferation, its well-studied conformational switching, and abundance of documented mutations in several human tumors. Many Ras mutations are oncogenic, but detailed energy landscapes have not been reported until now. Analysis of SIfTER-computed energy landscapes for the wildtype and two oncogenic variants, G12V and Q61L, suggests that these mutations cause constitutive activation through two different mechanisms. G12V directly affects binding specificity while leaving the energy landscape largely unchanged, whereas Q61L has pronounced, starker effects on the landscape. An implementation of SIfTER is made available at http://www.cs.gmu.edu/~ashehu/?q=OurTools. 
We believe SIfTER is useful to the community to answer the question of how sequence mutations affect the function of a protein, when there is an abundance of experimental structures that can be exploited to reconstruct an energy landscape that would be computationally impractical to do via Molecular Dynamics. PMID:26325505
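
    For readers unfamiliar with the term, a memetic evolutionary algorithm pairs population-based search with a local improvement step applied to each offspring. The sketch below shows that general pattern on a hypothetical toy energy function. It is not SIfTER: the data-driven reduced search space and multiscale protein representation described in the abstract are not modeled, and all names and parameter values are illustrative assumptions.

```python
import random
from typing import Callable, List, Tuple

Vector = List[float]
Energy = Callable[[Vector], float]


def toy_energy(x: Vector) -> float:
    """Hypothetical stand-in for an energy function: a bowl centered at 1."""
    return sum((xi - 1.0) ** 2 for xi in x)


def local_search(x: Vector, energy: Energy, rng: random.Random,
                 steps: int = 20, step_size: float = 0.05) -> Vector:
    """Greedy local improvement: the 'memetic' part of the algorithm."""
    best, best_e = x, energy(x)
    for _ in range(steps):
        cand = [xi + rng.gauss(0.0, step_size) for xi in best]
        cand_e = energy(cand)
        if cand_e < best_e:
            best, best_e = cand, cand_e
    return best


def memetic_search(energy: Energy, dim: int = 3, pop_size: int = 10,
                   generations: int = 30, seed: int = 0
                   ) -> Tuple[Vector, List[float]]:
    """Return the best sample found and the best energy after each generation."""
    rng = random.Random(seed)
    population = [[rng.uniform(-5.0, 5.0) for _ in range(dim)]
                  for _ in range(pop_size)]
    history = [min(energy(x) for x in population)]
    for _ in range(generations):
        children = []
        for parent in population:
            mutated = [xi + rng.gauss(0.0, 0.5) for xi in parent]
            children.append(local_search(mutated, energy, rng))
        # Elitist selection: parents compete with children, so the best
        # energy in the population can never get worse.
        population = sorted(population + children, key=energy)[:pop_size]
        history.append(energy(population[0]))
    return population[0], history


best, history = memetic_search(toy_energy)
```

    Keeping the whole final population rather than only the single best sample would be closer in spirit to the conformational ensembles the abstract describes, since the goal there is to map a landscape, not just to find one minimum.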

  13. A Comparison Study of Multivariate Fixed Models and Gene Association with Multiple Traits (GAMuT) for Next-Generation Sequencing

    PubMed Central

    Chiu, Chi-yang; Jung, Jeesun; Wang, Yifan; Weeks, Daniel E.; Wilson, Alexander F.; Bailey-Wilson, Joan E.; Amos, Christopher I.; Mills, James L.; Boehnke, Michael; Xiong, Momiao; Fan, Ruzong

    2016-01-01

    In this paper, extensive simulations are performed to compare two statistical methods to analyze multiple correlated quantitative phenotypes: (1) approximate F-distributed tests of multivariate functional linear models (MFLM) and additive models of multivariate analysis of variance (MANOVA), and (2) Gene Association with Multiple Traits (GAMuT) for association testing of high-dimensional genotype data. It is shown that approximate F-distributed tests of MFLM and MANOVA have higher power and are more appropriate for major gene association analysis (i.e., scenarios in which some genetic variants have relatively large effects on the phenotypes); GAMuT has higher power and is more appropriate for analyzing polygenic effects (i.e., effects from a large number of genetic variants each of which contributes a small amount to the phenotypes). MFLM and MANOVA are very flexible and can be used to perform association analysis for: (i) rare variants, (ii) common variants, and (iii) a combination of rare and common variants. Although GAMuT was designed to analyze rare variants, it can be applied to analyze a combination of rare and common variants and it performs well when (1) the number of genetic variants is large and (2) each variant contributes a small amount to the phenotypes (i.e., polygenes). MFLM and MANOVA are fixed effect models which perform well for major gene association analysis. GAMuT can be viewed as an extension of sequence kernel association tests (SKAT). Both GAMuT and SKAT are more appropriate for analyzing polygenic effects and they perform well not only in the rare variant case, but also in the case of a combination of rare and common variants. Data analyses of European cohorts and the Trinity Students Study are presented to compare the performance of the two methods. PMID:27917525

  14. HiView: an integrative genome browser to leverage Hi-C results for the interpretation of GWAS variants.

    PubMed

    Xu, Zheng; Zhang, Guosheng; Duan, Qing; Chai, Shengjie; Zhang, Baqun; Wu, Cong; Jin, Fulai; Yue, Feng; Li, Yun; Hu, Ming

    2016-03-11

    Genome-wide association studies (GWAS) have identified thousands of genetic variants associated with complex traits and diseases. However, most of them are located in the non-protein coding regions, and therefore it is challenging to hypothesize the functions of these non-coding GWAS variants. Recent large efforts such as the ENCODE and Roadmap Epigenomics projects have predicted a large number of regulatory elements. However, the target genes of these regulatory elements remain largely unknown. Chromatin conformation capture based technologies such as Hi-C can directly measure the chromatin interactions and have generated an increasingly comprehensive catalog of the interactome between the distal regulatory elements and their potential target genes. Leveraging such information revealed by Hi-C holds the promise of elucidating the functions of genetic variants in human diseases. In this work, we present HiView, the first integrative genome browser to leverage Hi-C results for the interpretation of GWAS variants. HiView is able to display Hi-C data and statistical evidence for chromatin interactions in genomic regions surrounding any given GWAS variant, enabling straightforward visualization and interpretation. We believe that as the first GWAS variant-centered Hi-C genome browser, HiView is a useful tool guiding post-GWAS functional genomics studies. HiView is freely accessible at: http://www.unc.edu/~yunmli/HiView.

  15. Hierarchical hybrid control of manipulators: Artificial intelligence in large scale integrated circuits

    NASA Technical Reports Server (NTRS)

    Greene, P. H.

    1972-01-01

    Both in practical engineering and in control of muscular systems, low level subsystems automatically provide crude approximations to the proper response. Through low level tuning of these approximations, the proper response variant can emerge from standardized high level commands. Such systems are expressly suited to emerging large scale integrated circuit technology. A computer, using symbolic descriptions of subsystem responses, can select and shape responses of low level digital or analog microcircuits. A mathematical theory that reveals significant informational units in this style of control and software for realizing such information structures are formulated.

  16. Occurrence of the Cys311 DRD2 variant in a pedigree multiply affected with panic disorder

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Crawford, F.; Hoyne, J.; Diaz, P.

    1995-08-14

    Following the detection of the rare DRD2 codon 311 variant (Ser→Cys) in an affected member from a large, multiply affected panic disorder family, we investigated the occurrence of this variant in other family members. The variant occurred in both affected and unaffected individuals. Further screening in panic disorder sib pairs unrelated to this family failed to detect the Cys311 variant. Our data suggest that this variant has no pathogenic role in panic disorder. 18 refs., 1 fig.

  17. Genetic variants of human serum cholinesterase influence metabolism of the muscle relaxant succinylcholine.

    PubMed

    Lockridge, O

    1990-01-01

    People with genetic variants of cholinesterase respond abnormally to succinylcholine, experiencing substantial prolongation of muscle paralysis with apnea rather than the usual 2-6 min. The structure of usual cholinesterase has been determined including the complete amino acid and nucleotide sequence. This has allowed identification of altered amino acids and nucleotides. The variant most frequently found in patients who respond abnormally to succinylcholine is atypical cholinesterase, which occurs in homozygous form in 1 out of 3500 Caucasians. Atypical cholinesterase has a single substitution at nucleotide 209 which changes aspartic acid 70 to glycine. This suggests that Asp 70 is part of the anionic site, and that the absence of this negatively charged amino acid explains the reduced affinity of atypical cholinesterase for positively charged substrates and inhibitors. The clinical consequence of reduced affinity for succinylcholine is that none of the succinylcholine is hydrolyzed in blood and a large overdose reaches the nerve-muscle junction where it causes prolonged muscle paralysis. Silent cholinesterase has a frame shift mutation at glycine 117 which prematurely terminates protein synthesis and yields no active enzyme. The K variant, named in honor of W. Kalow, has threonine in place of alanine 539. The K variant is associated with 33% lower activity. All variants arise from a single locus as there is only one gene for human cholinesterase (EC 3.1.1.8). Comparison of amino acid sequences of esterases and proteases shows that cholinesterase belongs to a new family of serine esterases which is different from the serine proteases.
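
    The homozygote frequency quoted above (1 in 3500) can be turned into an approximate allele and carrier frequency. The sketch below assumes Hardy-Weinberg equilibrium, which the abstract does not state; the function name is hypothetical and the numbers are illustrative only.

```python
import math


def hardy_weinberg_from_homozygote_rate(homozygote_rate: float) -> dict:
    """Estimate allele and genotype frequencies from a homozygote frequency.

    Assumes Hardy-Weinberg equilibrium (an assumption, not stated in the
    abstract): homozygote frequency = q**2, carrier frequency = 2*p*q.
    """
    q = math.sqrt(homozygote_rate)  # variant allele frequency
    p = 1.0 - q                     # usual allele frequency
    return {
        "variant_allele_freq": q,
        "carrier_freq": 2.0 * p * q,
        "homozygote_freq": q * q,
    }


# Homozygous atypical cholinesterase: 1 in 3500, as quoted in the abstract.
freqs = hardy_weinberg_from_homozygote_rate(1 / 3500)
print(f"variant allele frequency: {freqs['variant_allele_freq']:.4f}")
print(f"heterozygous carrier frequency: {freqs['carrier_freq']:.4f}")
```

    Under that assumption the atypical allele frequency is about 1.7%, and roughly 1 person in 30 would be a heterozygous carrier.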

  18. Integrated analysis of germline and somatic variants in ovarian cancer.

    PubMed

    Kanchi, Krishna L; Johnson, Kimberly J; Lu, Charles; McLellan, Michael D; Leiserson, Mark D M; Wendl, Michael C; Zhang, Qunyuan; Koboldt, Daniel C; Xie, Mingchao; Kandoth, Cyriac; McMichael, Joshua F; Wyczalkowski, Matthew A; Larson, David E; Schmidt, Heather K; Miller, Christopher A; Fulton, Robert S; Spellman, Paul T; Mardis, Elaine R; Druley, Todd E; Graubert, Timothy A; Goodfellow, Paul J; Raphael, Benjamin J; Wilson, Richard K; Ding, Li

    2014-01-01

    We report the first large-scale exome-wide analysis of the combined germline-somatic landscape in ovarian cancer. Here we analyse germline and somatic alterations in 429 ovarian carcinoma cases and 557 controls. We identify 3,635 high confidence, rare truncation and 22,953 missense variants with predicted functional impact. We find germline truncation variants and large deletions across Fanconi pathway genes in 20% of cases. Enrichment of rare truncations is shown in BRCA1, BRCA2 and PALB2. In addition, we observe germline truncation variants in genes not previously associated with ovarian cancer susceptibility (NF1, MAP3K4, CDKN2B and MLL3). Evidence for loss of heterozygosity was found in 100 and 76% of cases with germline BRCA1 and BRCA2 truncations, respectively. Germline-somatic interaction analysis combined with extensive bioinformatics annotation identifies 222 candidate functional germline truncation and missense variants, including two pathogenic BRCA1 variants and one deleterious TP53 variant. Finally, integrated analyses of germline and somatic variants identify significantly altered pathways, including the Fanconi, MAPK and MLL pathways.

  19. Investigation of common, low-frequency and rare genome-wide variation in anorexia nervosa

    PubMed Central

    Huckins, L M; Hatzikotoulas, K; Southam, L; Thornton, L M; Steinberg, J; Aguilera-McKay, F; Treasure, J; Schmidt, U; Gunasinghe, C; Romero, A; Curtis, C; Rhodes, D; Moens, J; Kalsi, G; Dempster, D; Leung, R; Keohane, A; Burghardt, R; Ehrlich, S; Hebebrand, J; Hinney, A; Ludolph, A; Walton, E; Deloukas, P; Hofman, A; Palotie, A; Palta, P; van Rooij, F J A; Stirrups, K; Adan, R; Boni, C; Cone, R; Dedoussis, G; van Furth, E; Gonidakis, F; Gorwood, P; Hudson, J; Kaprio, J; Kas, M; Keski-Rahonen, A; Kiezebrink, K; Knudsen, G-P; Slof-Op 't Landt, M C T; Maj, M; Monteleone, A M; Monteleone, P; Raevuori, A H; Reichborn-Kjennerud, T; Tozzi, F; Tsitsika, A; van Elburg, A; Adan, R A H; Alfredsson, L; Ando, T; Andreassen, O A; Aschauer, H; Baker, J H; Barrett, J C; Bencko, V; Bergen, A W; Berrettini, W H; Birgegard, A; Boni, C; Boraska Perica, V; Brandt, H; Breen, G; Bulik, C M; Carlberg, L; Cassina, M; Cichon, S; Clementi, M; Cohen-Woods, S; Coleman, J; Cone, R D; Courtet, P; Crawford, S; Crow, S; Crowley, J; Danner, U N; Davis, O S P; de Zwaan, M; Dedoussis, G; Degortes, D; DeSocio, J E; Dick, D M; Dikeos, D; Dina, C; Ding, B; Dmitrzak-Weglarz, M; Docampo, E; Duncan, L; Egberts, K; Ehrlich, S; Escaramís, G; Esko, T; Espeseth, T; Estivill, X; Favaro, A; Fernández-Aranda, F; Fichter, M M; Finan, C; Fischer, K; Floyd, J A B; Foretova, L; Forzan, M; Franklin, C S; Gallinger, S; Gambaro, G; Gaspar, H A; Giegling, I; Gonidakis, F; Gorwood, P; Gratacos, M; Guillaume, S; Guo, Y; Hakonarson, H; Halmi, K A; Hatzikotoulas, K; Hauser, J; Hebebrand, J; Helder, S; Herms, S; Herpertz-Dahlmann, B; Herzog, W; Hilliard, C E; Hinney, A; Hübel, C; Huckins, L M; Hudson, J I; Huemer, J; Inoko, H; Janout, V; Jiménez-Murcia, S; Johnson, C; Julià, A; Juréus, A; Kalsi, G; Kaminska, D; Kaplan, A S; Kaprio, J; Karhunen, L; Karwautz, A; Kas, M J H; Kaye, W; Kennedy, J L; Keski-Rahkonen, A; Kiezebrink, K; Klareskog, L; Klump, K L; Knudsen, G P S; Koeleman, B P C; Koubek, D; La Via, M C; Landén, M; Le 
Hellard, S; Levitan, R D; Li, D; Lichtenstein, P; Lilenfeld, L; Lissowska, J; Lundervold, A; Magistretti, P; Maj, M; Mannik, K; Marsal, S; Martin, N; Mattingsdal, M; McDevitt, S; McGuffin, P; Merl, E; Metspalu, A; Meulenbelt, I; Micali, N; Mitchell, J; Mitchell, K; Monteleone, P; Monteleone, A M; Mortensen, P; Munn-Chernoff, M A; Navratilova, M; Nilsson, I; Norring, C; Ntalla, I; Ophoff, R A; O'Toole, J K; Palotie, A; Pante, J; Papezova, H; Pinto, D; Rabionet, R; Raevuori, A; Rajewski, A; Ramoz, N; Rayner, N W; Reichborn-Kjennerud, T; Ripatti, S; Roberts, M; Rotondo, A; Rujescu, D; Rybakowski, F; Santonastaso, P; Scherag, A; Scherer, S W; Schmidt, U; Schork, N J; Schosser, A; Slachtova, L; Sladek, R; Slagboom, P E; Slof-Op 't Landt, M C T; Slopien, A; Soranzo, N; Southam, L; Steen, V M; Strengman, E; Strober, M; Sullivan, P F; Szatkiewicz, J P; Szeszenia-Dabrowska, N; Tachmazidou, I; Tenconi, E; Thornton, L M; Tortorella, A; Tozzi, F; Treasure, J; Tsitsika, A; Tziouvas, K; van Elburg, A A; van Furth, E F; Wagner, G; Walton, E; Watson, H; Wichmann, H-E; Widen, E; Woodside, D B; Yanovski, J; Yao, S; Yilmaz, Z; Zeggini, E; Zerwas, S; Zipfel, S; Collier, D A; Sullivan, P F; Breen, G; Bulik, C M; Zeggini, E

    2018-01-01

    Anorexia nervosa (AN) is a complex neuropsychiatric disorder presenting with dangerously low body weight, and a deep and persistent fear of gaining weight. To date, only one genome-wide significant locus associated with AN has been identified. We performed an exome-chip based genome-wide association study (GWAS) in 2158 cases from nine populations of European origin and 15 485 ancestrally matched controls. Unlike previous studies, this GWAS also probed association in low-frequency and rare variants. Sixteen independent variants were taken forward for in silico and de novo replication (11 common and 5 rare). No findings reached genome-wide significance. Two notable common variants were identified: rs10791286, an intronic variant in OPCML (P=9.89 × 10⁻⁶), and rs7700147, an intergenic variant (P=2.93 × 10⁻⁵). No low-frequency variant associations were identified at genome-wide significance, although the study was well-powered to detect low-frequency variants with large effect sizes, suggesting that there may be no AN loci in this genomic search space with large effect sizes. PMID:29155802
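
    To illustrate why the two reported variants are described as notable rather than significant, the sketch below compares their P values with the conventional genome-wide threshold of 5 × 10⁻⁸. That threshold is an assumption here; the abstract does not say which cut-off the study applied, and the helper name is hypothetical.

```python
# Conventional genome-wide significance threshold; assumed, since the
# abstract does not state the cut-off actually used in the study.
GENOME_WIDE_ALPHA = 5e-8

# P values as reported in the abstract.
reported_p_values = {
    "rs10791286": 9.89e-6,
    "rs7700147": 2.93e-5,
}


def is_genome_wide_significant(p_value: float, alpha: float = GENOME_WIDE_ALPHA) -> bool:
    """Return True when the P value falls below the chosen threshold."""
    return p_value < alpha


results = {
    rsid: is_genome_wide_significant(p) for rsid, p in reported_p_values.items()
}
for rsid, significant in results.items():
    print(rsid, "significant" if significant else "not significant")
```

    Both variants fall short of the assumed threshold by more than two orders of magnitude, consistent with the statement that no findings reached genome-wide significance.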

  20. Investigation of common, low-frequency and rare genome-wide variation in anorexia nervosa.

    PubMed

    Huckins, L M; Hatzikotoulas, K; Southam, L; Thornton, L M; Steinberg, J; Aguilera-McKay, F; Treasure, J; Schmidt, U; Gunasinghe, C; Romero, A; Curtis, C; Rhodes, D; Moens, J; Kalsi, G; Dempster, D; Leung, R; Keohane, A; Burghardt, R; Ehrlich, S; Hebebrand, J; Hinney, A; Ludolph, A; Walton, E; Deloukas, P; Hofman, A; Palotie, A; Palta, P; van Rooij, F J A; Stirrups, K; Adan, R; Boni, C; Cone, R; Dedoussis, G; van Furth, E; Gonidakis, F; Gorwood, P; Hudson, J; Kaprio, J; Kas, M; Keski-Rahonen, A; Kiezebrink, K; Knudsen, G-P; Slof-Op 't Landt, M C T; Maj, M; Monteleone, A M; Monteleone, P; Raevuori, A H; Reichborn-Kjennerud, T; Tozzi, F; Tsitsika, A; van Elburg, A; Collier, D A; Sullivan, P F; Breen, G; Bulik, C M; Zeggini, E

    2018-05-01

    Anorexia nervosa (AN) is a complex neuropsychiatric disorder presenting with dangerously low body weight, and a deep and persistent fear of gaining weight. To date, only one genome-wide significant locus associated with AN has been identified. We performed an exome-chip based genome-wide association study (GWAS) in 2158 cases from nine populations of European origin and 15 485 ancestrally matched controls. Unlike previous studies, this GWAS also probed association in low-frequency and rare variants. Sixteen independent variants were taken forward for in silico and de novo replication (11 common and 5 rare). No findings reached genome-wide significance. Two notable common variants were identified: rs10791286, an intronic variant in OPCML (P=9.89 × 10⁻⁶), and rs7700147, an intergenic variant (P=2.93 × 10⁻⁵). No low-frequency variant associations were identified at genome-wide significance, although the study was well-powered to detect low-frequency variants with large effect sizes, suggesting that there may be no AN loci in this genomic search space with large effect sizes.

  1. Rare and low-frequency coding variants alter human adult height

    PubMed Central

    Marouli, Eirini; Graff, Mariaelisa; Medina-Gomez, Carolina; Lo, Ken Sin; Wood, Andrew R; Kjaer, Troels R; Fine, Rebecca S; Lu, Yingchang; Schurmann, Claudia; Highland, Heather M; Rüeger, Sina; Thorleifsson, Gudmar; Justice, Anne E; Lamparter, David; Stirrups, Kathleen E; Turcot, Valérie; Young, Kristin L; Winkler, Thomas W; Esko, Tõnu; Karaderi, Tugce; Locke, Adam E; Masca, Nicholas GD; Ng, Maggie CY; Mudgal, Poorva; Rivas, Manuel A; Vedantam, Sailaja; Mahajan, Anubha; Guo, Xiuqing; Abecasis, Goncalo; Aben, Katja K; Adair, Linda S; Alam, Dewan S; Albrecht, Eva; Allin, Kristine H; Allison, Matthew; Amouyel, Philippe; Appel, Emil V; Arveiler, Dominique; Asselbergs, Folkert W; Auer, Paul L; Balkau, Beverley; Banas, Bernhard; Bang, Lia E; Benn, Marianne; Bergmann, Sven; Bielak, Lawrence F; Blüher, Matthias; Boeing, Heiner; Boerwinkle, Eric; Böger, Carsten A; Bonnycastle, Lori L; Bork-Jensen, Jette; Bots, Michiel L; Bottinger, Erwin P; Bowden, Donald W; Brandslund, Ivan; Breen, Gerome; Brilliant, Murray H; Broer, Linda; Burt, Amber A; Butterworth, Adam S; Carey, David J; Caulfield, Mark J; Chambers, John C; Chasman, Daniel I; Chen, Yii-Der Ida; Chowdhury, Rajiv; Christensen, Cramer; Chu, Audrey Y; Cocca, Massimiliano; Collins, Francis S; Cook, James P; Corley, Janie; Galbany, Jordi Corominas; Cox, Amanda J; Cuellar-Partida, Gabriel; Danesh, John; Davies, Gail; de Bakker, Paul IW; de Borst, Gert J.; de Denus, Simon; de Groot, Mark CH; de Mutsert, Renée; Deary, Ian J; Dedoussis, George; Demerath, Ellen W; den Hollander, Anneke I; Dennis, Joe G; Di Angelantonio, Emanuele; Drenos, Fotios; Du, Mengmeng; Dunning, Alison M; Easton, Douglas F; Ebeling, Tapani; Edwards, Todd L; Ellinor, Patrick T; Elliott, Paul; Evangelou, Evangelos; Farmaki, Aliki-Eleni; Faul, Jessica D; Feitosa, Mary F; Feng, Shuang; Ferrannini, Ele; Ferrario, Marco M; Ferrieres, Jean; Florez, Jose C; Ford, Ian; Fornage, Myriam; Franks, Paul W; Frikke-Schmidt, Ruth; Galesloot, Tessel E; Gan, Wei; Gandin, 
Ilaria; Gasparini, Paolo; Giedraitis, Vilmantas; Giri, Ayush; Girotto, Giorgia; Gordon, Scott D; Gordon-Larsen, Penny; Gorski, Mathias; Grarup, Niels; Grove, Megan L.; Gudnason, Vilmundur; Gustafsson, Stefan; Hansen, Torben; Harris, Kathleen Mullan; Harris, Tamara B; Hattersley, Andrew T; Hayward, Caroline; He, Liang; Heid, Iris M; Heikkilä, Kauko; Helgeland, Øyvind; Hernesniemi, Jussi; Hewitt, Alex W; Hocking, Lynne J; Hollensted, Mette; Holmen, Oddgeir L; Hovingh, G. Kees; Howson, Joanna MM; Hoyng, Carel B; Huang, Paul L; Hveem, Kristian; Ikram, M. Arfan; Ingelsson, Erik; Jackson, Anne U; Jansson, Jan-Håkan; Jarvik, Gail P; Jensen, Gorm B; Jhun, Min A; Jia, Yucheng; Jiang, Xuejuan; Johansson, Stefan; Jørgensen, Marit E; Jørgensen, Torben; Jousilahti, Pekka; Jukema, J Wouter; Kahali, Bratati; Kahn, René S; Kähönen, Mika; Kamstrup, Pia R; Kanoni, Stavroula; Kaprio, Jaakko; Karaleftheri, Maria; Kardia, Sharon LR; Karpe, Fredrik; Kee, Frank; Keeman, Renske; Kiemeney, Lambertus A; Kitajima, Hidetoshi; Kluivers, Kirsten B; Kocher, Thomas; Komulainen, Pirjo; Kontto, Jukka; Kooner, Jaspal S; Kooperberg, Charles; Kovacs, Peter; Kriebel, Jennifer; Kuivaniemi, Helena; Küry, Sébastien; Kuusisto, Johanna; La Bianca, Martina; Laakso, Markku; Lakka, Timo A; Lange, Ethan M; Lange, Leslie A; Langefeld, Carl D; Langenberg, Claudia; Larson, Eric B; Lee, I-Te; Lehtimäki, Terho; Lewis, Cora E; Li, Huaixing; Li, Jin; Li-Gao, Ruifang; Lin, Honghuang; Lin, Li-An; Lin, Xu; Lind, Lars; Lindström, Jaana; Linneberg, Allan; Liu, Yeheng; Liu, Yongmei; Lophatananon, Artitaya; Luan, Jian'an; Lubitz, Steven A; Lyytikäinen, Leo-Pekka; Mackey, David A; Madden, Pamela AF; Manning, Alisa K; Männistö, Satu; Marenne, Gaëlle; Marten, Jonathan; Martin, Nicholas G; Mazul, Angela L; Meidtner, Karina; Metspalu, Andres; Mitchell, Paul; Mohlke, Karen L; Mook-Kanamori, Dennis O; Morgan, Anna; Morris, Andrew D; Morris, Andrew P; Müller-Nurasyid, Martina; Munroe, Patricia B; Nalls, Mike A; Nauck, Matthias; 
Nelson, Christopher P; Neville, Matt; Nielsen, Sune F; Nikus, Kjell; Njølstad, Pål R; Nordestgaard, Børge G; Ntalla, Ioanna; O'Connel, Jeffrey R; Oksa, Heikki; Loohuis, Loes M Olde; Ophoff, Roel A; Owen, Katharine R; Packard, Chris J; Padmanabhan, Sandosh; Palmer, Colin NA; Pasterkamp, Gerard; Patel, Aniruddh P; Pattie, Alison; Pedersen, Oluf; Peissig, Peggy L; Peloso, Gina M; Pennell, Craig E; Perola, Markus; Perry, James A; Perry, John R.B.; Person, Thomas N; Pirie, Ailith; Polasek, Ozren; Posthuma, Danielle; Raitakari, Olli T; Rasheed, Asif; Rauramaa, Rainer; Reilly, Dermot F; Reiner, Alex P; Renström, Frida; Ridker, Paul M; Rioux, John D; Robertson, Neil; Robino, Antonietta; Rolandsson, Olov; Rudan, Igor; Ruth, Katherine S; Saleheen, Danish; Salomaa, Veikko; Samani, Nilesh J; Sandow, Kevin; Sapkota, Yadav; Sattar, Naveed; Schmidt, Marjanka K; Schreiner, Pamela J; Schulze, Matthias B; Scott, Robert A; Segura-Lepe, Marcelo P; Shah, Svati; Sim, Xueling; Sivapalaratnam, Suthesh; Small, Kerrin S; Smith, Albert Vernon; Smith, Jennifer A; Southam, Lorraine; Spector, Timothy D; Speliotes, Elizabeth K; Starr, John M; Steinthorsdottir, Valgerdur; Stringham, Heather M; Stumvoll, Michael; Surendran, Praveen; Hart, Leen M ‘t; Tansey, Katherine E; Tardif, Jean-Claude; Taylor, Kent D; Teumer, Alexander; Thompson, Deborah J; Thorsteinsdottir, Unnur; Thuesen, Betina H; Tönjes, Anke; Tromp, Gerard; Trompet, Stella; Tsafantakis, Emmanouil; Tuomilehto, Jaakko; Tybjaerg-Hansen, Anne; Tyrer, Jonathan P; Uher, Rudolf; Uitterlinden, André G; Ulivi, Sheila; van der Laan, Sander W; Van Der Leij, Andries R; van Duijn, Cornelia M; van Schoor, Natasja M; van Setten, Jessica; Varbo, Anette; Varga, Tibor V; Varma, Rohit; Edwards, Digna R Velez; Vermeulen, Sita H; Vestergaard, Henrik; Vitart, Veronique; Vogt, Thomas F; Vozzi, Diego; Walker, Mark; Wang, Feijie; Wang, Carol A; Wang, Shuai; Wang, Yiqin; Wareham, Nicholas J; Warren, Helen R; Wessel, Jennifer; Willems, Sara M; Wilson, James G; 
Witte, Daniel R; Woods, Michael O; Wu, Ying; Yaghootkar, Hanieh; Yao, Jie; Yao, Pang; Yerges-Armstrong, Laura M; Young, Robin; Zeggini, Eleftheria; Zhan, Xiaowei; Zhang, Weihua; Zhao, Jing Hua; Zhao, Wei; Zhao, Wei; Zheng, He; Zhou, Wei; Rotter, Jerome I; Boehnke, Michael; Kathiresan, Sekar; McCarthy, Mark I; Willer, Cristen J; Stefansson, Kari; Borecki, Ingrid B; Liu, Dajiang J; North, Kari E; Heard-Costa, Nancy L; Pers, Tune H; Lindgren, Cecilia M; Oxvig, Claus; Kutalik, Zoltán; Rivadeneira, Fernando; Loos, Ruth JF; Frayling, Timothy M; Hirschhorn, Joel N; Deloukas, Panos; Lettre, Guillaume

    2016-01-01

    Height is a highly heritable, classic polygenic trait with ∼700 common associated variants identified so far through genome-wide association studies. Here, we report 83 height-associated coding variants with lower minor allele frequencies (range of 0.1-4.8%) and effects of up to 2 cm/allele (e.g. in IHH, STC2, AR and CRISPLD2), >10 times the average effect of common variants. In functional follow-up studies, rare height-increasing alleles of STC2 (+1-2 cm/allele) compromised proteolytic inhibition of PAPP-A and increased cleavage of IGFBP-4 in vitro, resulting in higher bioavailability of insulin-like growth factors. These 83 height-associated variants overlap genes mutated in monogenic growth disorders and highlight new biological candidates (e.g. ADAMTS3, IL11RA, NOX4) and pathways (e.g. proteoglycan/glycosaminoglycan synthesis) involved in growth. Our results demonstrate that sufficiently large sample sizes can uncover rare and low-frequency variants of moderate to large effect associated with polygenic human phenotypes, and that these variants implicate relevant genes and pathways. PMID:28146470

  2. Common genetic variants influence human subcortical brain structures

    PubMed Central

    Hibar, Derrek P.; Stein, Jason L.; Renteria, Miguel E.; Arias-Vasquez, Alejandro; Desrivières, Sylvane; Jahanshad, Neda; Toro, Roberto; Wittfeld, Katharina; Abramovic, Lucija; Andersson, Micael; Aribisala, Benjamin S.; Armstrong, Nicola J.; Bernard, Manon; Bohlken, Marc M.; Boks, Marco P.; Bralten, Janita; Brown, Andrew A.; Chakravarty, M. Mallar; Chen, Qiang; Ching, Christopher R. K.; Cuellar-Partida, Gabriel; den Braber, Anouk; Giddaluru, Sudheer; Goldman, Aaron L.; Grimm, Oliver; Guadalupe, Tulio; Hass, Johanna; Woldehawariat, Girma; Holmes, Avram J.; Hoogman, Martine; Janowitz, Deborah; Jia, Tianye; Kim, Sungeun; Klein, Marieke; Kraemer, Bernd; Lee, Phil H.; Olde Loohuis, Loes M.; Luciano, Michelle; Macare, Christine; Mather, Karen A.; Mattheisen, Manuel; Milaneschi, Yuri; Nho, Kwangsik; Papmeyer, Martina; Ramasamy, Adaikalavan; Risacher, Shannon L.; Roiz-Santiañez, Roberto; Rose, Emma J.; Salami, Alireza; Sämann, Philipp G.; Schmaal, Lianne; Schork, Andrew J.; Shin, Jean; Strike, Lachlan T.; Teumer, Alexander; van Donkelaar, Marjolein M. J.; van Eijk, Kristel R.; Walters, Raymond K.; Westlye, Lars T.; Whelan, Christopher D.; Winkler, Anderson M.; Zwiers, Marcel P.; Alhusaini, Saud; Athanasiu, Lavinia; Ehrlich, Stefan; Hakobjan, Marina M. H.; Hartberg, Cecilie B.; Haukvik, Unn K.; Heister, Angelien J. G. A. M.; Hoehn, David; Kasperaviciute, Dalia; Liewald, David C. M.; Lopez, Lorna M.; Makkinje, Remco R. R.; Matarin, Mar; Naber, Marlies A. M.; McKay, D. Reese; Needham, Margaret; Nugent, Allison C.; Pütz, Benno; Royle, Natalie A.; Shen, Li; Sprooten, Emma; Trabzuni, Daniah; van der Marel, Saskia S. L.; van Hulzen, Kimm J. 
E.; Walton, Esther; Wolf, Christiane; Almasy, Laura; Ames, David; Arepalli, Sampath; Assareh, Amelia A.; Bastin, Mark E.; Brodaty, Henry; Bulayeva, Kazima B.; Carless, Melanie A.; Cichon, Sven; Corvin, Aiden; Curran, Joanne E.; Czisch, Michael; de Zubicaray, Greig I.; Dillman, Allissa; Duggirala, Ravi; Dyer, Thomas D.; Erk, Susanne; Fedko, Iryna O.; Ferrucci, Luigi; Foroud, Tatiana M.; Fox, Peter T.; Fukunaga, Masaki; Gibbs, J. Raphael; Göring, Harald H. H.; Green, Robert C.; Guelfi, Sebastian; Hansell, Narelle K.; Hartman, Catharina A.; Hegenscheid, Katrin; Heinz, Andreas; Hernandez, Dena G.; Heslenfeld, Dirk J.; Hoekstra, Pieter J.; Holsboer, Florian; Homuth, Georg; Hottenga, Jouke-Jan; Ikeda, Masashi; Jack, Clifford R.; Jenkinson, Mark; Johnson, Robert; Kanai, Ryota; Keil, Maria; Kent, Jack W.; Kochunov, Peter; Kwok, John B.; Lawrie, Stephen M.; Liu, Xinmin; Longo, Dan L.; McMahon, Katie L.; Meisenzahl, Eva; Melle, Ingrid; Mohnke, Sebastian; Montgomery, Grant W.; Mostert, Jeanette C.; Mühleisen, Thomas W.; Nalls, Michael A.; Nichols, Thomas E.; Nilsson, Lars G.; Nöthen, Markus M.; Ohi, Kazutaka; Olvera, Rene L.; Perez-Iglesias, Rocio; Pike, G. Bruce; Potkin, Steven G.; Reinvang, Ivar; Reppermund, Simone; Rietschel, Marcella; Romanczuk-Seiferth, Nina; Rosen, Glenn D.; Rujescu, Dan; Schnell, Knut; Schofield, Peter R.; Smith, Colin; Steen, Vidar M.; Sussmann, Jessika E.; Thalamuthu, Anbupalam; Toga, Arthur W.; Traynor, Bryan J.; Troncoso, Juan; Turner, Jessica A.; Valdés Hernández, Maria C.; van ’t Ent, Dennis; van der Brug, Marcel; van der Wee, Nic J. 
A.; van Tol, Marie-Jose; Veltman, Dick J.; Wassink, Thomas H.; Westman, Eric; Zielke, Ronald H.; Zonderman, Alan B.; Ashbrook, David G.; Hager, Reinmar; Lu, Lu; McMahon, Francis J.; Morris, Derek W.; Williams, Robert W.; Brunner, Han G.; Buckner, Randy L.; Buitelaar, Jan K.; Cahn, Wiepke; Calhoun, Vince D.; Cavalleri, Gianpiero L.; Crespo-Facorro, Benedicto; Dale, Anders M.; Davies, Gareth E.; Delanty, Norman; Depondt, Chantal; Djurovic, Srdjan; Drevets, Wayne C.; Espeseth, Thomas; Gollub, Randy L.; Ho, Beng-Choon; Hoffmann, Wolfgang; Hosten, Norbert; Kahn, René S.; Le Hellard, Stephanie; Meyer-Lindenberg, Andreas; Müller-Myhsok, Bertram; Nauck, Matthias; Nyberg, Lars; Pandolfo, Massimo; Penninx, Brenda W. J. H.; Roffman, Joshua L.; Sisodiya, Sanjay M.; Smoller, Jordan W.; van Bokhoven, Hans; van Haren, Neeltje E. M.; Völzke, Henry; Walter, Henrik; Weiner, Michael W.; Wen, Wei; White, Tonya; Agartz, Ingrid; Andreassen, Ole A.; Blangero, John; Boomsma, Dorret I.; Brouwer, Rachel M.; Cannon, Dara M.; Cookson, Mark R.; de Geus, Eco J. C.; Deary, Ian J.; Donohoe, Gary; Fernández, Guillén; Fisher, Simon E.; Francks, Clyde; Glahn, David C.; Grabe, Hans J.; Gruber, Oliver; Hardy, John; Hashimoto, Ryota; Hulshoff Pol, Hilleke E.; Jönsson, Erik G.; Kloszewska, Iwona; Lovestone, Simon; Mattay, Venkata S.; Mecocci, Patrizia; McDonald, Colm; McIntosh, Andrew M.; Ophoff, Roel A.; Paus, Tomas; Pausova, Zdenka; Ryten, Mina; Sachdev, Perminder S.; Saykin, Andrew J.; Simmons, Andy; Singleton, Andrew; Soininen, Hilkka; Wardlaw, Joanna M.; Weale, Michael E.; Weinberger, Daniel R.; Adams, Hieab H. H.; Launer, Lenore J.; Seiler, Stephan; Schmidt, Reinhold; Chauhan, Ganesh; Satizabal, Claudia L.; Becker, James T.; Yanek, Lisa; van der Lee, Sven J.; Ebling, Maritza; Fischl, Bruce; Longstreth, W. T.; Greve, Douglas; Schmidt, Helena; Nyquist, Paul; Vinke, Louis N.; van Duijn, Cornelia M.; Xue, Luting; Mazoyer, Bernard; Bis, Joshua C.; Gudnason, Vilmundur; Seshadri, Sudha; Ikram, M. 
Arfan; Martin, Nicholas G.; Wright, Margaret J.; Schumann, Gunter; Franke, Barbara; Thompson, Paul M.; Medland, Sarah E.

    2015-01-01

    The highly complex structure of the human brain is strongly shaped by genetic influences [1]. Subcortical brain regions form circuits with cortical areas to coordinate movement [2], learning, memory [3] and motivation [4], and altered circuits can lead to abnormal behaviour and disease [2]. To investigate how common genetic variants affect the structure of these brain regions, here we conduct genome-wide association studies of the volumes of seven subcortical regions and the intracranial volume derived from magnetic resonance images of 30,717 individuals from 50 cohorts. We identify five novel genetic variants influencing the volumes of the putamen and caudate nucleus. We also find stronger evidence for three loci with previously established influences on hippocampal volume [5] and intracranial volume [6]. These variants show specific volumetric effects on brain structures rather than global effects across structures. The strongest effects were found for the putamen, where a novel intergenic locus with replicable influence on volume (rs945270; P = 1.08 × 10⁻³³; 0.52% variance explained) showed evidence of altering the expression of the KTN1 gene in both brain and blood tissue. Variants influencing putamen volume clustered near developmental genes that regulate apoptosis, axon guidance and vesicle transport. Identification of these genetic variants provides insight into the causes of variability in human brain development, and may help to determine mechanisms of neuropsychiatric dysfunction. PMID:25607358

  3. Scripps Genome ADVISER: Annotation and Distributed Variant Interpretation SERver

    PubMed Central

    Pham, Phillip H.; Shipman, William J.; Erikson, Galina A.; Schork, Nicholas J.; Torkamani, Ali

    2015-01-01

    Interpretation of human genomes is a major challenge. We present the Scripps Genome ADVISER (SG-ADVISER) suite, which aims to fill the gap between data generation and genome interpretation by performing holistic, in-depth, annotations and functional predictions on all variant types and effects. The SG-ADVISER suite includes a de-identification tool, a variant annotation web-server, and a user interface for inheritance and annotation-based filtration. SG-ADVISER allows users with no bioinformatics expertise to manipulate large volumes of variant data with ease – without the need to download large reference databases, install software, or use a command line interface. SG-ADVISER is freely available at genomics.scripps.edu/ADVISER. PMID:25706643

  4. Pre- and Post-Conditions Expressed in Variants of the Modal µ-Calculus

    NASA Astrophysics Data System (ADS)

    Tanabe, Yoshinori; Sekizawa, Toshifusa; Yuasa, Yoshifumi; Takahashi, Koichi

    Properties of Kripke structures can be expressed by formulas of the modal µ-calculus. Despite its strong expressive power, the validity problem of the modal µ-calculus is decidable, and so are some of its variants enriched by inverse programs, graded modalities, and nominals. In this study, we show that the pre- and post-conditions of transformations of Kripke structures, such as addition/deletion of states and edges, can be expressed using variants of the modal µ-calculus. Combined with decision procedures we have developed for those variants, the properties of sequences of transformations on Kripke structures can be deduced. We show that these techniques can be used to verify the properties of pointer-manipulating programs.
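
    As a small illustration of how a µ-calculus formula is evaluated on a finite Kripke structure, and how an edge-adding transformation changes the result, the sketch below computes the least fixed point µX.(p ∨ ◇X) by naive iteration. This is ordinary model checking on a hypothetical four-state structure, not the authors' decision procedure for validity; all state names are made up.

```python
from typing import Dict, Set

# A Kripke structure as a successor map: state -> set of successor states.
Kripke = Dict[str, Set[str]]


def diamond(edges: Kripke, target: Set[str]) -> Set[str]:
    """States with at least one successor inside `target` (the <> modality)."""
    return {state for state, succs in edges.items() if succs & target}


def can_reach(edges: Kripke, p_states: Set[str]) -> Set[str]:
    """Evaluate mu X. (p or <>X): states from which some p-state is reachable.

    Iterates X -> p_states | diamond(X) from the empty set until it stabilises,
    which yields the least fixed point on a finite structure.
    """
    current: Set[str] = set()
    while True:
        updated = set(p_states) | diamond(edges, current)
        if updated == current:
            return current
        current = updated


# Hypothetical structure: s0 -> s1, s3 -> s3; the proposition p holds only in s2.
edges: Kripke = {"s0": {"s1"}, "s1": set(), "s2": set(), "s3": {"s3"}}
p_states = {"s2"}
before = can_reach(edges, p_states)

# Transformation: add the edge s1 -> s2, then re-evaluate the same formula.
edges_after: Kripke = {state: set(succs) for state, succs in edges.items()}
edges_after["s1"].add("s2")
after = can_reach(edges_after, p_states)

print(sorted(before))
print(sorted(after))
```

    Before the edge is added only s2 satisfies the formula; afterwards s0, s1 and s2 do, which is the kind of post-condition the abstract describes deducing symbolically rather than by re-running a check.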

  5. Fine-mapping of prostate cancer susceptibility loci in a large meta-analysis identifies candidate causal variants.

    PubMed

    Dadaev, Tokhir; Saunders, Edward J; Newcombe, Paul J; Anokian, Ezequiel; Leongamornlert, Daniel A; Brook, Mark N; Cieza-Borrella, Clara; Mijuskovic, Martina; Wakerell, Sarah; Olama, Ali Amin Al; Schumacher, Fredrick R; Berndt, Sonja I; Benlloch, Sara; Ahmed, Mahbubl; Goh, Chee; Sheng, Xin; Zhang, Zhuo; Muir, Kenneth; Govindasami, Koveela; Lophatananon, Artitaya; Stevens, Victoria L; Gapstur, Susan M; Carter, Brian D; Tangen, Catherine M; Goodman, Phyllis; Thompson, Ian M; Batra, Jyotsna; Chambers, Suzanne; Moya, Leire; Clements, Judith; Horvath, Lisa; Tilley, Wayne; Risbridger, Gail; Gronberg, Henrik; Aly, Markus; Nordström, Tobias; Pharoah, Paul; Pashayan, Nora; Schleutker, Johanna; Tammela, Teuvo L J; Sipeky, Csilla; Auvinen, Anssi; Albanes, Demetrius; Weinstein, Stephanie; Wolk, Alicja; Hakansson, Niclas; West, Catharine; Dunning, Alison M; Burnet, Neil; Mucci, Lorelei; Giovannucci, Edward; Andriole, Gerald; Cussenot, Olivier; Cancel-Tassin, Géraldine; Koutros, Stella; Freeman, Laura E Beane; Sorensen, Karina Dalsgaard; Orntoft, Torben Falck; Borre, Michael; Maehle, Lovise; Grindedal, Eli Marie; Neal, David E; Donovan, Jenny L; Hamdy, Freddie C; Martin, Richard M; Travis, Ruth C; Key, Tim J; Hamilton, Robert J; Fleshner, Neil E; Finelli, Antonio; Ingles, Sue Ann; Stern, Mariana C; Rosenstein, Barry; Kerns, Sarah; Ostrer, Harry; Lu, Yong-Jie; Zhang, Hong-Wei; Feng, Ninghan; Mao, Xueying; Guo, Xin; Wang, Guomin; Sun, Zan; Giles, Graham G; Southey, Melissa C; MacInnis, Robert J; FitzGerald, Liesel M; Kibel, Adam S; Drake, Bettina F; Vega, Ana; Gómez-Caamaño, Antonio; Fachal, Laura; Szulkin, Robert; Eklund, Martin; Kogevinas, Manolis; Llorca, Javier; Castaño-Vinyals, Gemma; Penney, Kathryn L; Stampfer, Meir; Park, Jong Y; Sellers, Thomas A; Lin, Hui-Yi; Stanford, Janet L; Cybulski, Cezary; Wokolorczyk, Dominika; Lubinski, Jan; Ostrander, Elaine A; Geybels, Milan S; Nordestgaard, Børge G; Nielsen, Sune F; Weisher, Maren; Bisbjerg, Rasmus; Røder, Martin Andreas; 
Iversen, Peter; Brenner, Hermann; Cuk, Katarina; Holleczek, Bernd; Maier, Christiane; Luedeke, Manuel; Schnoeller, Thomas; Kim, Jeri; Logothetis, Christopher J; John, Esther M; Teixeira, Manuel R; Paulo, Paula; Cardoso, Marta; Neuhausen, Susan L; Steele, Linda; Ding, Yuan Chun; De Ruyck, Kim; De Meerleer, Gert; Ost, Piet; Razack, Azad; Lim, Jasmine; Teo, Soo-Hwang; Lin, Daniel W; Newcomb, Lisa F; Lessel, Davor; Gamulin, Marija; Kulis, Tomislav; Kaneva, Radka; Usmani, Nawaid; Slavov, Chavdar; Mitev, Vanio; Parliament, Matthew; Singhal, Sandeep; Claessens, Frank; Joniau, Steven; Van den Broeck, Thomas; Larkin, Samantha; Townsend, Paul A; Aukim-Hastie, Claire; Gago-Dominguez, Manuela; Castelao, Jose Esteban; Martinez, Maria Elena; Roobol, Monique J; Jenster, Guido; van Schaik, Ron H N; Menegaux, Florence; Truong, Thérèse; Koudou, Yves Akoli; Xu, Jianfeng; Khaw, Kay-Tee; Cannon-Albright, Lisa; Pandha, Hardev; Michael, Agnieszka; Kierzek, Andrzej; Thibodeau, Stephen N; McDonnell, Shannon K; Schaid, Daniel J; Lindstrom, Sara; Turman, Constance; Ma, Jing; Hunter, David J; Riboli, Elio; Siddiq, Afshan; Canzian, Federico; Kolonel, Laurence N; Le Marchand, Loic; Hoover, Robert N; Machiela, Mitchell J; Kraft, Peter; Freedman, Matthew; Wiklund, Fredrik; Chanock, Stephen; Henderson, Brian E; Easton, Douglas F; Haiman, Christopher A; Eeles, Rosalind A; Conti, David V; Kote-Jarai, Zsofia

    2018-06-11

    Prostate cancer is a polygenic disease with a large heritable component. A number of common, low-penetrance prostate cancer risk loci have been identified through GWAS. Here we apply the Bayesian multivariate variable selection algorithm JAM to fine-map 84 prostate cancer susceptibility loci, using summary data from a large European ancestry meta-analysis. We observe evidence for multiple independent signals at 12 regions and 99 risk signals overall. Only 15 original GWAS tag SNPs remain among the catalogue of candidate variants identified; the remainder are replaced by more likely candidates. Biological annotation of our credible set of variants indicates significant enrichment within promoter and enhancer elements, and transcription factor-binding sites, including AR, ERG and FOXA1. In 40 regions at least one variant is colocalised with an eQTL in prostate cancer tissue. The refined set of candidate variants substantially increases the proportion of familial relative risk explained by these known susceptibility regions, which highlights the importance of fine-mapping studies and has implications for clinical risk profiling.

  6. Three New Alpha1-Antitrypsin Deficiency Variants Help to Define a C-Terminal Region Regulating Conformational Change and Polymerization

    PubMed Central

    Fra, Anna M.; Gooptu, Bibek; Ferrarotti, Ilaria; Miranda, Elena; Scabini, Roberta; Ronzoni, Riccardo; Benini, Federica; Corda, Luciano; Medicina, Daniela; Luisetti, Maurizio; Schiaffonati, Luisa

    2012-01-01

    Alpha1-antitrypsin (AAT) deficiency is a hereditary disorder associated with reduced AAT plasma levels, predisposing adults to pulmonary emphysema. The most common genetic AAT variants found in patients are the mildly deficient S and the severely deficient Z alleles, but several other pathogenic rare alleles have been reported. While the plasma AAT deficiency is a common trait of the disease, only a few AAT variants, including the prototypic Z AAT and some rare variants, form cytotoxic polymers in the endoplasmic reticulum of hepatocytes and predispose to liver disease. Here we report the identification of three new rare AAT variants associated with reduced plasma levels and characterize their molecular behaviour in cellular models. The variants, called Mpisa (Lys259Ile), Etaurisano (Lys368Glu) and Yorzinuovi (Pro391His), showed reduced secretion compared to control M AAT, and accumulated to different extents in the cells as ordered polymeric structures resembling those formed by the Z variant. Structural analysis of the mutations showed that they may facilitate polymerization both by loosening ‘latch’ interactions constraining the AAT reactive loop and through effects on core packing. In conclusion, the new AAT deficiency variants, besides increasing the risk of lung disease, may predispose to liver disease, particularly if associated with the common Z variant. The new mutations cluster structurally, thus defining a region of the AAT molecule critical for regulating its conformational state. PMID:22723858

  7. Neurodegenerative disease mutations in TREM2 reveal a functional surface and distinct loss-of-function mechanisms

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kober, Daniel L.; Alexander-Brett, Jennifer M.; Karch, Celeste M.

    Genetic variations in the myeloid immune receptor TREM2 are linked to several neurodegenerative diseases. To determine how TREM2 variants contribute to these diseases, we performed structural and functional studies of wild-type and variant proteins. Our 3.1 Å TREM2 crystal structure revealed that mutations found in Nasu-Hakola disease are buried whereas Alzheimer’s disease risk variants are found on the surface, suggesting that these mutations have distinct effects on TREM2 function. Biophysical and cellular methods indicate that Nasu-Hakola mutations impact protein stability and decrease folded TREM2 surface expression, whereas Alzheimer’s risk variants impact binding to a TREM2 ligand. Additionally, the Alzheimer’s risk variants appear to epitope map a functional surface on TREM2 that is unique within the larger TREM family. These findings provide a guide to structural and functional differences among genetic variants of TREM2, indicating that therapies targeting the TREM2 pathway should be tailored to these genetic and functional differences with patient-specific medicine approaches for neurodegenerative disorders.

  8. Study designs for identification of rare disease variants in complex diseases: the utility of family-based designs.

    PubMed

    Ionita-Laza, Iuliana; Ottman, Ruth

    2011-11-01

    The recent progress in sequencing technologies makes possible large-scale medical sequencing efforts to assess the importance of rare variants in complex diseases. The results of such efforts depend heavily on the use of efficient study designs and analytical methods. We introduce here a unified framework for association testing of rare variants in family-based designs or designs based on unselected affected individuals. This framework allows us to quantify the enrichment in rare disease variants in families containing multiple affected individuals and to investigate the optimal design of studies aiming to identify rare disease variants in complex traits. We show that for many complex diseases with small values for the overall sibling recurrence risk ratio, such as Alzheimer's disease and most cancers, sequencing affected individuals with a positive family history of the disease can be extremely advantageous for identifying rare disease variants. In contrast, for complex diseases with large values of the sibling recurrence risk ratio, sequencing unselected affected individuals may be preferable.

  9. Crystal Structure of an Activated Variant of Small Heat Shock Protein Hsp16.5

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Mchaourab, Hassane S.; Lin, Yi-Lun; Spiller, Benjamin W.

    How does the sequence of a single small heat shock protein (sHSP) assemble into oligomers of different sizes? To gain insight into the underlying structural mechanism, we determined the crystal structure of an engineered variant of Methanocaldococcus jannaschii Hsp16.5 wherein a 14 amino acid peptide from human heat shock protein 27 (Hsp27) was inserted at the junction of the N-terminal region and the α-crystallin domain. In response to this insertion, the oligomer shell expands from 24 to 48 subunits while maintaining octahedral symmetry. Oligomer rearrangement does not alter the fold of the conserved α-crystallin domain nor does it disturb the interface holding the dimeric building block together. Rather, the flexible C-terminal tail of Hsp16.5 changes its orientation relative to the α-crystallin domain which enables alternative packing of dimers. This change in orientation preserves a peptide-in-groove interaction of the C-terminal tail with an adjacent β-sandwich, thereby holding the assembly together. The interior of the expanded oligomer, where substrates presumably bind, retains its predominantly nonpolar character relative to the outside surface. New large windows in the outer shell provide increased access to these substrate-binding regions, thus accounting for the higher affinity of this variant to substrates. Oligomer polydispersity regulates sHSPs chaperone activity in vitro and has been implicated in their physiological roles. The structural mechanism of Hsp16.5 oligomer flexibility revealed here, which is likely to be highly conserved across the sHSP superfamily, explains the relationship between oligomer expansion observed in disease-linked mutants and changes in chaperone activity.

  10. Isosteric And Non-Isosteric Base Pairs In RNA Motifs: Molecular Dynamics And Bioinformatics Study Of The Sarcin-Ricin Internal Loop

    PubMed Central

    Havrila, Marek; Réblová, Kamila; Zirbel, Craig L.; Leontis, Neocles B.; Šponer, Jiří

    2013-01-01

    The Sarcin-Ricin RNA motif (SR motif) is one of the most prominent recurrent RNA building blocks that occurs in many different RNA contexts and folds autonomously, i.e., in a context-independent manner. In this study, we combined bioinformatics analysis with explicit-solvent molecular dynamics (MD) simulations to better understand the relation between the RNA sequence and the evolutionary patterns of the SR motif. A SHAPE probing experiment was also performed to confirm the fidelity of the MD simulations. We identified 57 instances of the SR motif in a non-redundant subset of the RNA X-ray structure database and analyzed their basepairing, base-phosphate, and backbone-backbone interactions. We extracted sequences aligned to these instances from large ribosomal RNA alignments to determine the frequency of occurrence for different sequence variants. We then used a simple scoring scheme based on isostericity to suggest 10 sequence variants with highly variable expected degree of compatibility with the SR motif 3D structure. We carried out MD simulations of SR motifs with these base substitutions. Non-isosteric base substitutions led to unstable structures, but so did isosteric substitutions which were unable to make key base-phosphate interactions. The MD technique explains why some potentially isosteric SR motifs are not realized during evolution. We also found that the inability to form a stable cWW geometry is an important factor in the case of the first base pair of the flexible region of the SR motif. Comparison of structural, bioinformatics, SHAPE probing and MD simulation data reveals that explicit solvent MD simulations neatly reflect the viability of different sequence variants of the SR motif. Thus, MD simulations can efficiently complement bioinformatics tools in studies of conservation patterns of RNA motifs and provide atomistic insight into the role of their different signature interactions. PMID:24144333

  11. Production and characterization of genetically modified human IL-11 variants.

    PubMed

    Sano, Emiko; Takei, Toshiaki; Ueda, Takuya; Tsumoto, Kouhei

    2017-02-01

    Interleukin-11 (IL-11) is a candidate drug for severe thrombocytopenia caused by myelosuppressive chemotherapy, whereas IL-11 inhibitors are anticipated as a treatment for IL-11-related cancer progression. Here, we demonstrate the creation of various kinds of genetically modified hIL-11s. Modified vectors were constructed by introducing an N- or O-glycosylation site in a region of hIL-11 that does not belong to the core α-helical motif, based on the predicted secondary structure. The N-terminal (N: between 22 to 23 aa), the first loop (M1:70 to 71 aa), the second loop (M2:114-115 aa), the third loop (M3:160-161 aa) and the C-terminal (C: 200- aa) were selected for modification. A large-scale production system was established and the characteristics of the modified hIL-11s were evaluated. The structure was analyzed by amino acid sequence and composition analysis and CD spectra. Glycan was assessed by monosaccharide composition analysis. Growth-promoting activity and biological stability were analyzed by proliferation of T1165 cells. N-terminal modified proteins were well glycosylated and produced. The growth activity of 3NN, with the NASNASNAS sequence on the N-terminal, was about tenfold higher than that of wild type (WT). The structural and biological stabilities of 3NN were also better than those of WT, and its residence time in mouse blood was longer. M1 variants lacked growth activity even though they were well glycosylated and their secondary structure was very stable. Both 3NN and OM1 (with AAATPAPG on M1) associated strongly with hIL-11R. These results indicate that N-terminal and M1 variants are promising for practical use as potent agonists or antagonists of hIL-11. Copyright © 2016 Elsevier B.V. All rights reserved.

  12. Crystal structure of an activated variant of small heat shock protein Hsp16.5.

    PubMed

    McHaourab, Hassane S; Lin, Yi-Lun; Spiller, Benjamin W

    2012-06-26

    How does the sequence of a single small heat shock protein (sHSP) assemble into oligomers of different sizes? To gain insight into the underlying structural mechanism, we determined the crystal structure of an engineered variant of Methanocaldococcus jannaschii Hsp16.5 wherein a 14 amino acid peptide from human heat shock protein 27 (Hsp27) was inserted at the junction of the N-terminal region and the α-crystallin domain. In response to this insertion, the oligomer shell expands from 24 to 48 subunits while maintaining octahedral symmetry. Oligomer rearrangement does not alter the fold of the conserved α-crystallin domain nor does it disturb the interface holding the dimeric building block together. Rather, the flexible C-terminal tail of Hsp16.5 changes its orientation relative to the α-crystallin domain which enables alternative packing of dimers. This change in orientation preserves a peptide-in-groove interaction of the C-terminal tail with an adjacent β-sandwich, thereby holding the assembly together. The interior of the expanded oligomer, where substrates presumably bind, retains its predominantly nonpolar character relative to the outside surface. New large windows in the outer shell provide increased access to these substrate-binding regions, thus accounting for the higher affinity of this variant to substrates. Oligomer polydispersity regulates sHSPs chaperone activity in vitro and has been implicated in their physiological roles. The structural mechanism of Hsp16.5 oligomer flexibility revealed here, which is likely to be highly conserved across the sHSP superfamily, explains the relationship between oligomer expansion observed in disease-linked mutants and changes in chaperone activity.

  13. Mutations of Profilin-1 Associated with Amyotrophic Lateral Sclerosis Promote Aggregation Due to Structural Changes of Its Native State.

    PubMed

    Del Poggetto, Edoardo; Bemporad, Francesco; Tatini, Francesca; Chiti, Fabrizio

    2015-11-20

    The PFN1 gene, coding for profilin-1, has recently been associated with familial amyotrophic lateral sclerosis (fALS), as three mutations, namely C71G, M114T, and G118V, have been found in patients with familial forms of the disease and another, E117G, has been proposed to be a moderate risk factor for disease onset. In this work, we have purified the four profilin-1 variants along with the wild-type protein. The resulting aggregates appear to be fibrillar, to have a weak binding to ThT, and to possess a significant amount of intermolecular β-sheet structure. Using ThT fluorescence assays, far-UV circular dichroism, and dynamic light scattering, we found that all four variants have an aggregation propensity higher than that of the wild-type counterpart. In particular, the C71G mutation was found to induce the most dramatic change in aggregation, followed by the G118V and M114T substitutions and then the E117G mutation. Such a propensity was found not to strictly correlate with the conformational stability in this group of profilin-1 variants, determined using both urea-induced denaturation at equilibrium and folding/unfolding kinetics. However, it correlated with structural changes of the folded states, as monitored with far-UV circular dichroism, intrinsic fluorescence spectroscopy, ANS binding, acrylamide quenching, and dynamic light scattering. Overall, the results suggest that all four mutations increase the tendency of profilin-1 to aggregate and that such aggregation behavior is largely determined by the mutation-induced structural changes occurring in the folded state of the protein.

  14. Molecular dynamics simulations revealed structural differences among WRKY domain-DNA interaction in barley (Hordeum vulgare).

    PubMed

    Pandey, Bharati; Grover, Abhinav; Sharma, Pradeep

    2018-02-12

    The WRKY transcription factors are a class of DNA-binding proteins that are involved in diverse plant processes and play critical roles in response to abiotic and biotic stresses. Genome-wide divergence analysis of the WRKY gene family in Hordeum vulgare provided a framework for molecular evolution and functional roles. So far, the crystal structure of WRKY from barley has not been resolved; moreover, knowledge of the three-dimensional structure of the WRKY domain is a prerequisite for exploring the protein-DNA recognition mechanisms. A homology modelling based approach was used to generate structures for the WRKY DNA binding domain (DBD) and its variants using AtWRKY1 as a template. Finally, the stability and conformational changes of the generated model in unbound and bound form were examined through atomistic molecular dynamics (MD) simulations for a 100 ns time period. In this study, we investigated the comparative binding pattern of the WRKY domain and its variants with the W-box cis-regulatory element using molecular docking and molecular dynamics (MD) simulation assays. The atomic insight into the WRKY domain exhibited significant variation in the intermolecular hydrogen bonding pattern, leading to structural anomalies in the variant type and differences in the DNA-binding specificities. Based on the MD analysis, residual contribution and interaction contour, wild-type WRKY (HvWRKY46) was found to interact with DNA through the highly conserved heptapeptide in the pre- and post-MD simulated complexes, whereas heptapeptide interaction with DNA was missing in variants (I and II) in post-MD complexes. Consequently, through principal component analysis, wild-type WRKY was also found to be more stable, occupying a smaller conformational space than variant I (HvWRKY34). Lastly, high binding free energy for wild-type and variant II allowed us to conclude that the wild-type WRKY-DNA complex was more stable relative to variant I. The results of our study revealed complete dynamic and structural information about WRKY domain-DNA interactions. However, no structure-based information has been reported to date for WRKY variants and their mechanism of interaction with DNA. Our findings highlighted the importance of selecting a sequence to generate newer transgenic plants that would be increasingly tolerant to stress conditions.

  15. Do Structural Missense Variants in the ATM Gene Found in Women With Breast Cancer Cause Breast Cancer in Knock-in Mouse Strains?

    DTIC Science & Technology

    2006-04-01

    W81XWH-05-1-0282. ...human cohort-specific missense mutations will develop breast cancer with dominant inheritance in a subset of animals. It also is hypothesized that

  16. Ensemble variant interpretation methods to predict enzyme activity and assign pathogenicity in the CAGI4 NAGLU (Human N-acetyl-glucosaminidase) and UBE2I (Human SUMO-ligase) challenges.

    PubMed

    Yin, Yizhou; Kundu, Kunal; Pal, Lipika R; Moult, John

    2017-09-01

    CAGI (Critical Assessment of Genome Interpretation) conducts community experiments to determine the state of the art in relating genotype to phenotype. Here, we report results obtained using newly developed ensemble methods to address two CAGI4 challenges: enzyme activity for population missense variants found in NAGLU (Human N-acetyl-glucosaminidase) and random missense mutations in Human UBE2I (Human SUMO E2 ligase), assayed in a high-throughput competitive yeast complementation procedure. The ensemble methods are effective, ranked second for SUMO-ligase and third for NAGLU, according to the CAGI independent assessors. However, in common with other methods used in CAGI, there are large discrepancies between predicted and experimental activities for a subset of variants. Analysis of the structural context provides some insight into these. Post-challenge analysis shows that the ensemble methods are also effective at assigning pathogenicity for the NAGLU variants. In the clinic, providing an estimate of the reliability of pathogenic assignments is the key. We have also used the NAGLU dataset to show that ensemble methods have considerable potential for this task, and are already reliable enough for use with a subset of mutations. © 2017 Wiley Periodicals, Inc.

  17. Role of protein surface charge in monellin sweetness.

    PubMed

    Xue, Wei-Feng; Szczepankiewicz, Olga; Thulin, Eva; Linse, Sara; Carey, Jannette

    2009-03-01

    A small number of proteins have the unusual property of tasting intensely sweet. Despite many studies aimed at identifying their sweet taste determinants, the molecular basis of protein sweetness is not fully understood. Recent mutational studies of monellin have implicated positively charged residues in sweetness. In the present work, the effect of overall net charge was investigated using the complementary approach of negative charge alterations. Multiple substitutions of Asp/Asn and Glu/Gln residues radically altered the surface charge of single-chain monellin by removing six negative charges or adding four negative charges. Biophysical characterization using circular dichroism, fluorescence, and two-dimensional NMR demonstrates that the native fold of monellin is preserved in the variant proteins under physiological solution conditions although their stability toward chemical denaturation is altered. A human taste test was employed to determine the sweetness detection threshold of the variants. Removal of negative charges preserves monellin sweetness, whereas added negative charge has a large negative impact on sweetness. Meta-analysis of published charge variants of monellin and other sweet proteins reveals a general trend toward increasing sweetness with increasing positive net charge. Structural mapping of monellin variants identifies a hydrophobic surface predicted to face the receptor where introduced positive or negative charge reduces sweetness, and a polar surface where charges modulate long-range electrostatic complementarity.

  18. Saturation scanning of ubiquitin variants reveals a common hot spot for binding to USP2 and USP21.

    PubMed

    Leung, Isabel; Dekel, Ayelet; Shifman, Julia M; Sidhu, Sachdev S

    2016-08-02

    A detailed understanding of the molecular mechanisms whereby ubiquitin (Ub) recognizes enzymes in the Ub proteasome system is crucial for understanding the biological function of Ub. Many structures of Ub complexes have been solved and, in most cases, reveal a large structural epitope on a common face of the Ub molecule. However, owing to the generally weak nature of these interactions, it has been difficult to map in detail the functional contributions of individual Ub side chains to affinity and specificity. Here we took advantage of Ub variants (Ubvs) that bind tightly to particular Ub-specific proteases (USPs) and used phage display and saturation scanning mutagenesis to comprehensively map functional epitopes within the structural epitopes. We found that Ubvs that bind to USP2 or USP21 contain a remarkably similar core functional epitope, or "hot spot," consisting mainly of positions that are conserved as the wild type sequence, but also some positions that prefer mutant sequences. The Ubv core functional epitope contacts residues that are conserved in the human USP family, and thus it is likely important for the interactions of Ub across many family members.

  19. Amelotin Gene Structure and Expression during Enamel Formation in the Opossum Monodelphis domestica

    PubMed Central

    Gasse, Barbara; Liu, Xi; Corre, Erwan; Sire, Jean-Yves

    2015-01-01

    Amelotin (AMTN) is an ameloblast-secreted protein that belongs to the secretory calcium-binding phosphoprotein family, which also includes the enamel matrix proteins amelogenin, ameloblastin and enamelin. Although AMTN is supposed to play an important role in enamel formation, data were long limited to the rodents, in which it is expressed during the maturation stage. Recent comparative studies in sauropsids and amphibians revealed that (i) AMTN was expressed earlier, i.e. as soon as ameloblasts are depositing the enamel matrix, and (ii) AMTN structure was different, a change which mostly resulted from an intraexonic splicing in the large exon 8 of an ancestral mammal. The present study was performed to determine whether the differences in AMTN structure and expression in rodents compared to non-mammalian tetrapods dated back to an early ancestral mammal or were acquired later in mammalian evolution. We sequenced, assembled and screened the jaw transcriptome of a neonate opossum Monodelphis domestica, a marsupial. We found two AMTN transcripts. Variant 1, representing 70.8% of AMTN transcripts, displayed the structure known in rodents, whereas variant 2 (29.2%) exhibited the nonmammalian tetrapod structure. Then, we studied AMTN expression during amelogenesis in a neonate specimen. We obtained data similar to those reported in rodents. These findings indicate that more than 180 million years ago, before the divergence of marsupials and placentals, changes occurred in AMTN function and structure. The spatiotemporal expression was delayed to the maturation stage of amelogenesis and the intraexonic splicing gave rise to isoform 1, encoded by variant 1 and lacking the RGD motif. The ancestral isoform 2, housing the RGD, was initially conserved, as demonstrated here in a marsupial, then secondarily lost in the placental lineages. These findings bring new elements towards our understanding of the non-prismatic to prismatic enamel transition that occurred at the onset of mammals. PMID:26186457

  20. Sustainability and durability analysis of reinforced concrete structures

    NASA Astrophysics Data System (ADS)

    Horáková, A.; Broukalová, I.; Kohoutková, A.; Vašková, J.

    2017-09-01

    The article describes an assessment of reinforced concrete structures in terms of durability and sustainable development. There is a short summary of findings from the literature on evaluation methods for environmental impacts and also about corrosive influences acting on the reinforced concrete structure, about factors influencing the durability of these structures and mathematical models describing the corrosion impacts. Variant design of reinforced concrete structure and assessment of these variants in terms of durability and sustainability was performed. The analysed structure was a concrete ceiling structure of a parking house for cars. The variants differ in strength class of concrete and thickness of concrete slab. It was found that in terms of durability and sustainable development it is significantly preferable to use higher class of concrete. There are significant differences in results of concrete structures durability for different mathematical models of corrosive influences.

  1. Healthy brain connectivity predicts atrophy progression in non-fluent variant of primary progressive aphasia.

    PubMed

    Mandelli, Maria Luisa; Vilaplana, Eduard; Brown, Jesse A; Hubbard, H Isabel; Binney, Richard J; Attygalle, Suneth; Santos-Santos, Miguel A; Miller, Zachary A; Pakvasa, Mikhail; Henry, Maya L; Rosen, Howard J; Henry, Roland G; Rabinovici, Gil D; Miller, Bruce L; Seeley, William W; Gorno-Tempini, Maria Luisa

    2016-10-01

    Neurodegeneration has been hypothesized to follow predetermined large-scale networks through the trans-synaptic spread of toxic proteins from a syndrome-specific epicentre. To date, no longitudinal neuroimaging study has tested this hypothesis in vivo in frontotemporal dementia spectrum disorders. The aim of this study was to demonstrate that longitudinal progression of atrophy in non-fluent/agrammatic variant primary progressive aphasia spreads over time from a syndrome-specific epicentre to additional regions, based on their connectivity to the epicentre in healthy control subjects. The syndrome-specific epicentre of the non-fluent/agrammatic variant of primary progressive aphasia was derived in a group of 10 mildly affected patients (clinical dementia rating equal to 0) using voxel-based morphometry. From this region, the inferior frontal gyrus (pars opercularis), we derived functional and structural connectivity maps in healthy controls (n = 30) using functional magnetic resonance imaging at rest and diffusion-weighted imaging tractography. Graph theory analysis was applied to derive functional network features. Atrophy progression was calculated using voxel-based morphometry longitudinal analysis on 34 non-fluent/agrammatic patients. Correlation analyses were performed to compare volume changes in patients with connectivity measures of the healthy functional and structural speech/language network. The default mode network was used as a control network. From the epicentre, the healthy functional connectivity network included the left supplementary motor area and the prefrontal, inferior parietal and temporal regions, which were connected through the aslant, superior longitudinal and arcuate fasciculi. Longitudinal grey and white matter changes were found in the left language-related regions and in the right inferior frontal gyrus. Functional connectivity strength in the healthy speech/language network, but not in the default network, correlated with longitudinal grey matter changes in the non-fluent/agrammatic variant of primary progressive aphasia. Graph theoretical analysis of the speech/language network showed that regions with shorter functional paths to the epicentre exhibited greater longitudinal atrophy. The network contained three modules, including a left inferior frontal gyrus/supplementary motor area, which was most strongly connected with the epicentre. The aslant tract was the white matter pathway connecting these two regions and showed the most significant correlation between fractional anisotropy and white matter longitudinal atrophy changes. This study showed that the pattern of longitudinal atrophy progression in the non-fluent/agrammatic variant of primary progressive aphasia relates to the strength of connectivity in pre-determined functional and structural large-scale speech production networks. These findings support the hypothesis that the spread of neurodegeneration occurs by following specific anatomical and functional neuronal network architectures. © The Author (2016). Published by Oxford University Press on behalf of the Guarantors of Brain. All rights reserved.

  2. Molecular Basis for Necitumumab Inhibition of EGFR Variants Associated with Acquired Cetuximab Resistance.

    PubMed

    Bagchi, Atrish; Haidar, Jaafar N; Eastman, Scott W; Vieth, Michal; Topper, Michael; Iacolina, Michelle D; Walker, Jason M; Forest, Amelie; Shen, Yang; Novosiadly, Ruslan D; Ferguson, Kathryn M

    2018-02-01

    Acquired resistance to cetuximab, an antibody that targets the EGFR, impacts clinical benefit in head and neck, and colorectal cancers. One of the mechanisms of resistance to cetuximab is the acquisition of mutations that map to the cetuximab epitope on EGFR and prevent drug binding. We find that necitumumab, another FDA-approved EGFR antibody, can bind to EGFR that harbors the most common cetuximab-resistant substitution, S468R (or S492R, depending on the amino acid numbering system). We determined an X-ray crystal structure to 2.8 Å resolution of the necitumumab Fab bound to an S468R variant of EGFR domain III. The arginine is accommodated in a large, preexisting cavity in the necitumumab paratope. We predict that this paratope shape will be permissive to other epitope substitutions, and show that necitumumab binds to most cetuximab- and panitumumab-resistant EGFR variants. We find that a simple computational approach can predict with high success which EGFR epitope substitutions abrogate antibody binding. This computational method will be valuable to determine whether necitumumab will bind to EGFR as new epitope resistance variants are identified. This method could also be useful for rapid evaluation of the effect on binding of alterations in other antibody/antigen interfaces. Together, these data suggest that necitumumab may be active in patients who are resistant to cetuximab or panitumumab through EGFR epitope mutation. Furthermore, our analysis leads us to speculate that antibodies with large paratope cavities may be less susceptible to resistance due to mutations mapping to the antigen epitope. Mol Cancer Ther; 17(2); 521-31. ©2017 AACR.

  3. Spatial distributions of Pseudomonas fluorescens colony variants in mixed-culture biofilms.

    PubMed

    Workentine, Matthew L; Wang, Siyuan; Ceri, Howard; Turner, Raymond J

    2013-07-28

    The emergence of colony morphology variants in structured environments is being recognized as important to both niche specialization and stress tolerance. Pseudomonas fluorescens demonstrates diversity in both its natural environment, the rhizosphere, and in laboratory grown biofilms. Sub-populations of these variants within a biofilm have been suggested as important contributors to antimicrobial stress tolerance given their altered susceptibility to various agents. As such it is of interest to determine how these variants might be distributed in the biofilm environment. Here we present an analysis of the spatial distribution of Pseudomonas fluorescens colony morphology variants in mixed-culture biofilms with the wildtype phenotype. These findings reveal that two variant colony morphotypes demonstrate a significant growth advantage over the wildtype morphotype in the biofilm environment. The two variant morphotypes out-grew the wildtype across the entire biofilm and this occurred within 24 h and was maintained through to 96 h. This competitive advantage was not observed in homogeneous broth culture. The significant advantage that the variants demonstrate in biofilm colonization over the wildtype denotes the importance of this phenotype in structured environments.

  4. Regulating the chromatin landscape: structural and mechanistic perspectives.

    PubMed

    Bartholomew, Blaine

    2014-01-01

    A large family of chromatin remodelers that noncovalently modify chromatin is crucial in cell development and differentiation. They are often the targets of cancer, neurological disorders, and other human diseases. These complexes alter nucleosome positioning, higher-order chromatin structure, and nuclear organization. They also assemble chromatin, exchange out histone variants, and disassemble chromatin at defined locations. We review aspects of the structural organization of these complexes, the functional properties of their protein domains, and variation between complexes. We also address the mechanistic details of these complexes in mobilizing nucleosomes and altering chromatin structure. A better understanding of these issues will be vital for further analyses of subunits of these chromatin remodelers, which are being identified as targets in human diseases by NGS (next-generation sequencing).

  5. Real-world clinical applicability of pathogenicity predictors assessed on SERPINA1 mutations in alpha-1-antitrypsin deficiency.

    PubMed

    Giacopuzzi, Edoardo; Laffranchi, Mattia; Berardelli, Romina; Ravasio, Viola; Ferrarotti, Ilaria; Gooptu, Bibek; Borsani, Giuseppe; Fra, Annamaria

    2018-06-07

    The growth of publicly available data informing upon genetic variations, mechanisms of disease and disease sub-phenotypes offers great potential for personalised medicine. Computational approaches are likely required to assess large numbers of novel genetic variants. However, the integration of genetic, structural and pathophysiological data still represents a challenge for computational predictions and their clinical use. We addressed these issues for alpha-1-antitrypsin deficiency, a disease mediated by mutations in the SERPINA1 gene encoding alpha-1-antitrypsin. We compiled a comprehensive database of SERPINA1 coding mutations and assigned them apparent pathological relevance based upon available data. 'Benign' and 'Pathogenic' mutations were used to assess performance of 31 pathogenicity predictors. Well-performing algorithms clustered the subset of variants known to be severely pathogenic with high scores. Eight new mutations identified in the ExAC database and achieving high scores were selected for characterisation in cell models and showed secretory deficiency and polymer formation, supporting the predictive power of our computational approach. The behaviour of the pathogenic new variants and consistent outliers were rationalised by considering the protein structural context and residue conservation. These findings highlight the potential of computational methods to provide meaningful predictions of the pathogenic significance of novel mutations and identify areas for further investigation. This article is protected by copyright. All rights reserved.
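
    The abstract does not say which metric was used to assess the 31 predictors against the 'Benign' and 'Pathogenic' labels. One common choice for such a comparison is the area under the ROC curve (AUC), sketched below as the fraction of pathogenic/benign pairs that a predictor ranks correctly. The function name and the toy scores are hypothetical and are not taken from the study.

```python
def pairwise_auc(pathogenic_scores, benign_scores):
    """AUC as the share of (pathogenic, benign) pairs ranked correctly; ties count as half."""
    wins = 0.0
    for p in pathogenic_scores:
        for b in benign_scores:
            if p > b:
                wins += 1.0
            elif p == b:
                wins += 0.5
    return wins / (len(pathogenic_scores) * len(benign_scores))


# Hypothetical predictor scores (higher = predicted more damaging).
toy_pathogenic = [0.9, 0.8, 0.4]
toy_benign = [0.5, 0.2, 0.1]

auc = pairwise_auc(toy_pathogenic, toy_benign)
print(f"AUC = {auc:.3f}")
```

    An AUC of 1.0 means every pathogenic variant outscores every benign one, while 0.5 is what random scoring would give.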

  6. Identification of common variants associated with human hippocampal and intracranial volumes.

    PubMed

    Stein, Jason L; Medland, Sarah E; Vasquez, Alejandro Arias; Hibar, Derrek P; Senstad, Rudy E; Winkler, Anderson M; Toro, Roberto; Appel, Katja; Bartecek, Richard; Bergmann, Ørjan; Bernard, Manon; Brown, Andrew A; Cannon, Dara M; Chakravarty, M Mallar; Christoforou, Andrea; Domin, Martin; Grimm, Oliver; Hollinshead, Marisa; Holmes, Avram J; Homuth, Georg; Hottenga, Jouke-Jan; Langan, Camilla; Lopez, Lorna M; Hansell, Narelle K; Hwang, Kristy S; Kim, Sungeun; Laje, Gonzalo; Lee, Phil H; Liu, Xinmin; Loth, Eva; Lourdusamy, Anbarasu; Mattingsdal, Morten; Mohnke, Sebastian; Maniega, Susana Muñoz; Nho, Kwangsik; Nugent, Allison C; O'Brien, Carol; Papmeyer, Martina; Pütz, Benno; Ramasamy, Adaikalavan; Rasmussen, Jerod; Rijpkema, Mark; Risacher, Shannon L; Roddey, J Cooper; Rose, Emma J; Ryten, Mina; Shen, Li; Sprooten, Emma; Strengman, Eric; Teumer, Alexander; Trabzuni, Daniah; Turner, Jessica; van Eijk, Kristel; van Erp, Theo G M; van Tol, Marie-Jose; Wittfeld, Katharina; Wolf, Christiane; Woudstra, Saskia; Aleman, Andre; Alhusaini, Saud; Almasy, Laura; Binder, Elisabeth B; Brohawn, David G; Cantor, Rita M; Carless, Melanie A; Corvin, Aiden; Czisch, Michael; Curran, Joanne E; Davies, Gail; de Almeida, Marcio A A; Delanty, Norman; Depondt, Chantal; Duggirala, Ravi; Dyer, Thomas D; Erk, Susanne; Fagerness, Jesen; Fox, Peter T; Freimer, Nelson B; Gill, Michael; Göring, Harald H H; Hagler, Donald J; Hoehn, David; Holsboer, Florian; Hoogman, Martine; Hosten, Norbert; Jahanshad, Neda; Johnson, Matthew P; Kasperaviciute, Dalia; Kent, Jack W; Kochunov, Peter; Lancaster, Jack L; Lawrie, Stephen M; Liewald, David C; Mandl, René; Matarin, Mar; Mattheisen, Manuel; Meisenzahl, Eva; Melle, Ingrid; Moses, Eric K; Mühleisen, Thomas W; Nauck, Matthias; Nöthen, Markus M; Olvera, Rene L; Pandolfo, Massimo; Pike, G Bruce; Puls, Ralf; Reinvang, Ivar; Rentería, Miguel E; Rietschel, Marcella; Roffman, Joshua L; Royle, Natalie A; Rujescu, Dan; Savitz, Jonathan; Schnack, Hugo G; Schnell, 
Knut; Seiferth, Nina; Smith, Colin; Steen, Vidar M; Valdés Hernández, Maria C; Van den Heuvel, Martijn; van der Wee, Nic J; Van Haren, Neeltje E M; Veltman, Joris A; Völzke, Henry; Walker, Robert; Westlye, Lars T; Whelan, Christopher D; Agartz, Ingrid; Boomsma, Dorret I; Cavalleri, Gianpiero L; Dale, Anders M; Djurovic, Srdjan; Drevets, Wayne C; Hagoort, Peter; Hall, Jeremy; Heinz, Andreas; Jack, Clifford R; Foroud, Tatiana M; Le Hellard, Stephanie; Macciardi, Fabio; Montgomery, Grant W; Poline, Jean Baptiste; Porteous, David J; Sisodiya, Sanjay M; Starr, John M; Sussmann, Jessika; Toga, Arthur W; Veltman, Dick J; Walter, Henrik; Weiner, Michael W; Bis, Joshua C; Ikram, M Arfan; Smith, Albert V; Gudnason, Vilmundur; Tzourio, Christophe; Vernooij, Meike W; Launer, Lenore J; DeCarli, Charles; Seshadri, Sudha; Andreassen, Ole A; Apostolova, Liana G; Bastin, Mark E; Blangero, John; Brunner, Han G; Buckner, Randy L; Cichon, Sven; Coppola, Giovanni; de Zubicaray, Greig I; Deary, Ian J; Donohoe, Gary; de Geus, Eco J C; Espeseth, Thomas; Fernández, Guillén; Glahn, David C; Grabe, Hans J; Hardy, John; Hulshoff Pol, Hilleke E; Jenkinson, Mark; Kahn, René S; McDonald, Colm; McIntosh, Andrew M; McMahon, Francis J; McMahon, Katie L; Meyer-Lindenberg, Andreas; Morris, Derek W; Müller-Myhsok, Bertram; Nichols, Thomas E; Ophoff, Roel A; Paus, Tomas; Pausova, Zdenka; Penninx, Brenda W; Potkin, Steven G; Sämann, Philipp G; Saykin, Andrew J; Schumann, Gunter; Smoller, Jordan W; Wardlaw, Joanna M; Weale, Michael E; Martin, Nicholas G; Franke, Barbara; Wright, Margaret J; Thompson, Paul M

    2012-04-15

    Identifying genetic variants influencing human brain structures may reveal new biological mechanisms underlying cognition and neuropsychiatric illness. The volume of the hippocampus is a biomarker of incipient Alzheimer's disease and is reduced in schizophrenia, major depression and mesial temporal lobe epilepsy. Whereas many brain imaging phenotypes are highly heritable, identifying and replicating genetic influences has been difficult, as small effects and the high costs of magnetic resonance imaging (MRI) have led to underpowered studies. Here we report genome-wide association meta-analyses and replication for mean bilateral hippocampal, total brain and intracranial volumes from a large multinational consortium. The intergenic variant rs7294919 was associated with hippocampal volume (12q24.22; N = 21,151; P = 6.70 × 10(-16)) and the expression levels of the positional candidate gene TESC in brain tissue. Additionally, rs10784502, located within HMGA2, was associated with intracranial volume (12q14.3; N = 15,782; P = 1.12 × 10(-12)). We also identified a suggestive association with total brain volume at rs10494373 within DDR2 (1q23.3; N = 6,500; P = 5.81 × 10(-7)).
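
    The difference between the two associations reported outright and the one described as "suggestive" can be illustrated with a short sketch. It assumes the conventional genome-wide significance threshold of P < 5 × 10(-8), which the abstract does not state explicitly; the function and variable names are hypothetical.

```python
# Conventional genome-wide significance threshold (an assumption; the
# abstract does not state which threshold was applied).
GENOME_WIDE_ALPHA = 5e-8

# P-values as reported in the abstract, keyed by variant.
reported = {
    "rs7294919": 6.70e-16,   # hippocampal volume
    "rs10784502": 1.12e-12,  # intracranial volume
    "rs10494373": 5.81e-7,   # total brain volume
}


def classify(p_value, alpha=GENOME_WIDE_ALPHA):
    """Label a P-value as genome-wide significant or merely suggestive."""
    return "significant" if p_value < alpha else "suggestive"


labels = {snp: classify(p) for snp, p in reported.items()}
for snp, label in labels.items():
    print(f"{snp}: P = {reported[snp]:.2e} -> {label}")
```

    Under this threshold the first two variants pass and rs10494373 does not, which matches the wording of the abstract.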

  7. Whole-exome SNP array identifies 15 new susceptibility loci for psoriasis

    PubMed Central

    Zuo, Xianbo; Sun, Liangdan; Yin, Xianyong; Gao, Jinping; Sheng, Yujun; Xu, Jinhua; Zhang, Jianzhong; He, Chundi; Qiu, Ying; Wen, Guangdong; Tian, Hongqing; Zheng, Xiaodong; Liu, Shengxiu; Wang, Wenjun; Li, Weiran; Cheng, Yuyan; Liu, Longdan; Chang, Yan; Wang, Zaixing; Li, Zenggang; Li, Longnian; Wu, Jianping; Fang, Ling; Shen, Changbing; Zhou, Fusheng; Liang, Bo; Chen, Gang; Li, Hui; Cui, Yong; Xu, Aie; Yang, Xueqin; Hao, Fei; Xu, Limin; Fan, Xing; Li, Yuzhen; Wu, Rina; Wang, Xiuli; Liu, Xiaoming; Zheng, Min; Song, Shunpeng; Ji, Bihua; Fang, Hong; Yu, Jianbin; Sun, Yongxin; Hui, Yan; Zhang, Furen; Yang, Rongya; Yang, Sen; Zhang, Xuejun

    2015-01-01

    Genome-wide association studies (GWASs) have reproducibly associated ∼40 susceptibility loci with psoriasis. However, the missing heritability is evident and the contributions of coding variants have not yet been systematically evaluated. Here, we present a large-scale whole-exome array analysis for psoriasis consisting of 42,760 individuals. We discover 16 SNPs within 15 new genes/loci associated with psoriasis, including C1orf141, ZNF683, TMC6, AIM2, IL1RL1, CASR, SON, ZFYVE16, MTHFR, CCDC129, ZNF143, AP5B1, SYNE2, IFNGR2 and 3q26.2-q27 (P < 5.00 × 10(-8)). In addition, we also replicate four known susceptibility loci TNIP1, NFKBIA, IL12B and LCE3D–LCE3E. These susceptibility variants identified in the current study collectively account for 1.9% of the psoriasis heritability. The variant within AIM2 is predicted to impact protein structure. Our findings increase the number of genetic risk factors for psoriasis and highlight new and plausible biological pathways in psoriasis. PMID:25854761

  8. Population Structure of Two Rabies Hosts Relative to the Known Distribution of Rabies Virus Variants in Alaska

    PubMed Central

    Goldsmith, Elizabeth W.; Renshaw, Benjamin; Clement, Christopher J.; Himschoot, Elizabeth A.; Hundertmark, Kris J.; Hueffer, Karsten

    2015-01-01

    For pathogens that infect multiple species, the distinction between reservoir hosts and spillover hosts is often difficult. In Alaska, three variants of the arctic rabies virus exist with distinct spatial distributions. We test the hypothesis that rabies virus variant distribution corresponds to the population structure of the primary rabies hosts in Alaska, arctic foxes (Vulpes lagopus) and red foxes (V. vulpes) in order to possibly distinguish reservoir and spillover hosts. We used mitochondrial DNA (mtDNA) sequence and nine microsatellites to assess population structure in those two species. mtDNA structure did not correspond to rabies virus variant structure in either species. Microsatellite analyses gave varying results. Bayesian clustering found 2 groups of arctic foxes in the coastal tundra region, but for red foxes it identified tundra and boreal types. Spatial Bayesian clustering and spatial principal components analysis identified 3 and 4 groups of arctic foxes, respectively, closely matching the distribution of rabies virus variants in the state. Red foxes, conversely, showed eight clusters comprising 2 regions (boreal and tundra) with much admixture. These results run contrary to previous beliefs that arctic fox show no fine-scale spatial population structure. While we cannot rule out that the red fox is part of the maintenance host community for rabies in Alaska, the distribution of virus variants appears to be driven primarily by the arctic fox. Therefore, we show that host population genetics can be utilized to distinguish between maintenance and spillover hosts when used in conjunction with other approaches. PMID:26661691

  9. Population structure of two rabies hosts relative to the known distribution of rabies virus variants in Alaska.

    PubMed

    Goldsmith, Elizabeth W; Renshaw, Benjamin; Clement, Christopher J; Himschoot, Elizabeth A; Hundertmark, Kris J; Hueffer, Karsten

    2016-02-01

    For pathogens that infect multiple species, the distinction between reservoir hosts and spillover hosts is often difficult. In Alaska, three variants of the arctic rabies virus exist with distinct spatial distributions. We tested the hypothesis that rabies virus variant distribution corresponds to the population structure of the primary rabies hosts in Alaska, arctic foxes (Vulpes lagopus) and red foxes (Vulpes vulpes) to possibly distinguish reservoir and spillover hosts. We used mitochondrial DNA (mtDNA) sequence and nine microsatellites to assess population structure in those two species. mtDNA structure did not correspond to rabies virus variant structure in either species. Microsatellite analyses gave varying results. Bayesian clustering found two groups of arctic foxes in the coastal tundra region, but for red foxes it identified tundra and boreal types. Spatial Bayesian clustering and spatial principal components analysis identified 3 and 4 groups of arctic foxes, respectively, closely matching the distribution of rabies virus variants in the state. Red foxes, conversely, showed eight clusters comprising two regions (boreal and tundra) with much admixture. These results run contrary to previous beliefs that arctic fox show no fine-scale spatial population structure. While we cannot rule out that the red fox is part of the maintenance host community for rabies in Alaska, the distribution of virus variants appears to be driven primarily by the arctic fox. Therefore, we show that host population genetics can be utilized to distinguish between maintenance and spillover hosts when used in conjunction with other approaches. © 2015 John Wiley & Sons Ltd.

  10. The Loss and Gain of Functional Amino Acid Residues Is a Common Mechanism Causing Human Inherited Disease

    PubMed Central

    Lugo-Martinez, Jose; Pejaver, Vikas; Pagel, Kymberleigh A.; Mort, Matthew; Cooper, David N.; Mooney, Sean D.; Radivojac, Predrag

    2016-01-01

    Elucidating the precise molecular events altered by disease-causing genetic variants represents a major challenge in translational bioinformatics. To this end, many studies have investigated the structural and functional impact of amino acid substitutions. Most of these studies were however limited in scope to either individual molecular functions or were concerned with functional effects (e.g. deleterious vs. neutral) without specifically considering possible molecular alterations. The recent growth of structural, molecular and genetic data presents an opportunity for more comprehensive studies to consider the structural environment of a residue of interest, to hypothesize specific molecular effects of sequence variants and to statistically associate these effects with genetic disease. In this study, we analyzed data sets of disease-causing and putatively neutral human variants mapped to protein 3D structures as part of a systematic study of the loss and gain of various types of functional attribute potentially underlying pathogenic molecular alterations. We first propose a formal model to assess probabilistically function-impacting variants. We then develop an array of structure-based functional residue predictors, evaluate their performance, and use them to quantify the impact of disease-causing amino acid substitutions on catalytic activity, metal binding, macromolecular binding, ligand binding, allosteric regulation and post-translational modifications. We show that our methodology generates actionable biological hypotheses for up to 41% of disease-causing genetic variants mapped to protein structures suggesting that it can be reliably used to guide experimental validation. Our results suggest that a significant fraction of disease-causing human variants mapping to protein structures are function-altering both in the presence and absence of stability disruption. PMID:27564311

  11. The Loss and Gain of Functional Amino Acid Residues Is a Common Mechanism Causing Human Inherited Disease.

    PubMed

    Lugo-Martinez, Jose; Pejaver, Vikas; Pagel, Kymberleigh A; Jain, Shantanu; Mort, Matthew; Cooper, David N; Mooney, Sean D; Radivojac, Predrag

    2016-08-01

    Elucidating the precise molecular events altered by disease-causing genetic variants represents a major challenge in translational bioinformatics. To this end, many studies have investigated the structural and functional impact of amino acid substitutions. Most of these studies were however limited in scope to either individual molecular functions or were concerned with functional effects (e.g. deleterious vs. neutral) without specifically considering possible molecular alterations. The recent growth of structural, molecular and genetic data presents an opportunity for more comprehensive studies to consider the structural environment of a residue of interest, to hypothesize specific molecular effects of sequence variants and to statistically associate these effects with genetic disease. In this study, we analyzed data sets of disease-causing and putatively neutral human variants mapped to protein 3D structures as part of a systematic study of the loss and gain of various types of functional attribute potentially underlying pathogenic molecular alterations. We first propose a formal model to assess probabilistically function-impacting variants. We then develop an array of structure-based functional residue predictors, evaluate their performance, and use them to quantify the impact of disease-causing amino acid substitutions on catalytic activity, metal binding, macromolecular binding, ligand binding, allosteric regulation and post-translational modifications. We show that our methodology generates actionable biological hypotheses for up to 41% of disease-causing genetic variants mapped to protein structures suggesting that it can be reliably used to guide experimental validation. Our results suggest that a significant fraction of disease-causing human variants mapping to protein structures are function-altering both in the presence and absence of stability disruption.

  12. Group-based variant calling leveraging next-generation supercomputing for large-scale whole-genome sequencing studies.

    PubMed

    Standish, Kristopher A; Carland, Tristan M; Lockwood, Glenn K; Pfeiffer, Wayne; Tatineni, Mahidhar; Huang, C Chris; Lamberth, Sarah; Cherkas, Yauheniya; Brodmerkel, Carrie; Jaeger, Ed; Smith, Lance; Rajagopal, Gunaretnam; Curran, Mark E; Schork, Nicholas J

    2015-09-22

    Next-generation sequencing (NGS) technologies have become much more efficient, allowing whole human genomes to be sequenced faster and cheaper than ever before. However, processing the raw sequence reads associated with NGS technologies requires care and sophistication in order to draw compelling inferences about phenotypic consequences of variation in human genomes. It has been shown that different approaches to variant calling from NGS data can lead to different conclusions. Ensuring appropriate accuracy and quality in variant calling can come at a computational cost. We describe our experience implementing and evaluating a group-based approach to calling variants on large numbers of whole human genomes. We explore the influence of many factors that may impact the accuracy and efficiency of group-based variant calling, including group size, the biogeographical backgrounds of the individuals who have been sequenced, and the computing environment used. We make efficient use of the Gordon supercomputer cluster at the San Diego Supercomputer Center by incorporating job-packing and parallelization considerations into our workflow while calling variants on 437 whole human genomes generated as part of a large association study. We ultimately find that our workflow resulted in high-quality variant calls in a computationally efficient manner. We argue that studies like ours should motivate further investigations combining hardware-oriented advances in computing systems with algorithmic developments to tackle emerging 'big data' problems in biomedical research brought on by the expansion of NGS technologies.
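
    Group size is one of the factors the abstract says was explored. As a minimal sketch of what varying it involves, the snippet below partitions 437 samples into consecutive groups, each of which would be passed to a variant caller as one job. The group size of 50, the sample identifiers and the function name are hypothetical; the abstract does not report the group sizes used or how groups were formed.

```python
def make_groups(sample_ids, group_size):
    """Split sample IDs into consecutive groups of at most group_size."""
    if group_size < 1:
        raise ValueError("group_size must be positive")
    return [
        sample_ids[start:start + group_size]
        for start in range(0, len(sample_ids), group_size)
    ]


# 437 placeholder identifiers, matching the cohort size in the abstract.
samples = [f"genome_{i:03d}" for i in range(1, 438)]

groups = make_groups(samples, group_size=50)  # 50 is illustrative only
print(len(groups), len(groups[-1]))  # prints "9 37"
```

    Smaller groups yield more, lighter jobs that are easier to pack onto compute nodes, while larger groups let the caller see more samples at once; the abstract describes evaluating this kind of trade-off.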

  13. Whole-genome sequencing identifies EN1 as a determinant of bone density and fracture

    PubMed Central

    Zheng, Hou-Feng; Forgetta, Vincenzo; Hsu, Yi-Hsiang; Estrada, Karol; Rosello-Diez, Alberto; Leo, Paul J; Dahia, Chitra L; Park-Min, Kyung Hyun; Tobias, Jonathan H; Kooperberg, Charles; Kleinman, Aaron; Styrkarsdottir, Unnur; Liu, Ching-Ti; Uggla, Charlotta; Evans, Daniel S; Nielson, Carrie M; Walter, Klaudia; Pettersson-Kymmer, Ulrika; McCarthy, Shane; Eriksson, Joel; Kwan, Tony; Jhamai, Mila; Trajanoska, Katerina; Memari, Yasin; Min, Josine; Huang, Jie; Danecek, Petr; Wilmot, Beth; Li, Rui; Chou, Wen-Chi; Mokry, Lauren E; Moayyeri, Alireza; Claussnitzer, Melina; Cheng, Chia-Ho; Cheung, Warren; Medina-Gómez, Carolina; Ge, Bing; Chen, Shu-Huang; Choi, Kwangbom; Oei, Ling; Fraser, James; Kraaij, Robert; Hibbs, Matthew A; Gregson, Celia L; Paquette, Denis; Hofman, Albert; Wibom, Carl; Tranah, Gregory J; Marshall, Mhairi; Gardiner, Brooke B; Cremin, Katie; Auer, Paul; Hsu, Li; Ring, Sue; Tung, Joyce Y; Thorleifsson, Gudmar; Enneman, Anke W; van Schoor, Natasja M; de Groot, Lisette C.P.G.M.; van der Velde, Nathalie; Melin, Beatrice; Kemp, John P; Christiansen, Claus; Sayers, Adrian; Zhou, Yanhua; Calderari, Sophie; van Rooij, Jeroen; Carlson, Chris; Peters, Ulrike; Berlivet, Soizik; Dostie, Josée; Uitterlinden, Andre G; Williams, Stephen R.; Farber, Charles; Grinberg, Daniel; LaCroix, Andrea Z; Haessler, Jeff; Chasman, Daniel I; Giulianini, Franco; Rose, Lynda M; Ridker, Paul M; Eisman, John A; Nguyen, Tuan V; Center, Jacqueline R; Nogues, Xavier; Garcia-Giralt, Natalia; Launer, Lenore L; Gudnason, Vilmunder; Mellström, Dan; Vandenput, Liesbeth; Karlsson, Magnus K; Ljunggren, Östen; Svensson, Olle; Hallmans, Göran; Rousseau, François; Giroux, Sylvie; Bussière, Johanne; Arp, Pascal P; Koromani, Fjorda; Prince, Richard L; Lewis, Joshua R; Langdahl, Bente L; Hermann, A Pernille; Jensen, Jens-Erik B; Kaptoge, Stephen; Khaw, Kay-Tee; Reeve, Jonathan; Formosa, Melissa M; Xuereb-Anastasi, Angela; Åkesson, Kristina; McGuigan, Fiona E; Garg, Gaurav; Olmos, Jose M; 
Zarrabeitia, Maria T; Riancho, Jose A; Ralston, Stuart H; Alonso, Nerea; Jiang, Xi; Goltzman, David; Pastinen, Tomi; Grundberg, Elin; Gauguier, Dominique; Orwoll, Eric S; Karasik, David; Davey-Smith, George; Smith, Albert V; Siggeirsdottir, Kristin; Harris, Tamara B; Zillikens, M Carola; van Meurs, Joyce BJ; Thorsteinsdottir, Unnur; Maurano, Matthew T; Timpson, Nicholas J; Soranzo, Nicole; Durbin, Richard; Wilson, Scott G; Ntzani, Evangelia E; Brown, Matthew A; Stefansson, Kari; Hinds, David A; Spector, Tim; Cupples, L Adrienne; Ohlsson, Claes; Greenwood, Celia MT; Jackson, Rebecca D; Rowe, David W; Loomis, Cynthia A; Evans, David M; Ackert-Bicknell, Cheryl L; Joyner, Alexandra L; Duncan, Emma L; Kiel, Douglas P; Rivadeneira, Fernando; Richards, J Brent

    2016-01-01

    SUMMARY The extent to which low-frequency (minor allele frequency [MAF] between 1–5%) and rare (MAF ≤ 1%) variants contribute to complex traits and disease in the general population is largely unknown. Bone mineral density (BMD) is highly heritable, is a major predictor of osteoporotic fractures and has been previously associated with common genetic variants (refs. 1–8), and rare, population-specific, coding variants (ref. 9). Here we identify novel non-coding genetic variants with large effects on BMD (n_total = 53,236) and fracture (n_total = 508,253) in individuals of European ancestry from the general population. Associations for BMD were derived from whole-genome sequencing (n = 2,882 from UK10K), whole-exome sequencing (n = 3,549), deep imputation of genotyped samples using a combined UK10K/1000Genomes reference panel (n = 26,534), and de-novo replication genotyping (n = 20,271). We identified a low-frequency non-coding variant near a novel locus, EN1, with an effect size 4-fold larger than the mean of previously reported common variants for lumbar spine BMD (ref. 8) (rs11692564[T], MAF = 1.7%, replication effect size = +0.20 standard deviations [SD], P_meta = 2 × 10(-14)), which was also associated with a decreased risk of fracture (OR = 0.85; P = 2 × 10(-11); n_cases = 98,742 and n_controls = 409,511). Using an En1Cre/flox mouse model, we observed that conditional loss of En1 results in low bone mass, likely as a consequence of high bone turn-over. We also identified a novel low-frequency non-coding variant with large effects on BMD near WNT16 (rs148771817[T], MAF = 1.1%, replication effect size = +0.39 SD, P_meta = 1 × 10(-11)). In general, there was an excess of association signals arising from deleterious coding and conserved non-coding variants. These findings provide evidence that low-frequency non-coding variants have large effects on BMD and fracture, thereby providing rationale for whole-genome sequencing and improved imputation reference panels to study the genetic architecture of complex traits and disease in the general population. PMID:26367794

  14. Structures of the flax-rust effector AvrM reveal insights into the molecular basis of plant-cell entry and effector-triggered immunity.

    PubMed

    Ve, Thomas; Williams, Simon J; Catanzariti, Ann-Maree; Rafiqi, Maryam; Rahman, Motiur; Ellis, Jeffrey G; Hardham, Adrienne R; Jones, David A; Anderson, Peter A; Dodds, Peter N; Kobe, Bostjan

    2013-10-22

    Fungal and oomycete pathogens cause some of the most devastating diseases in crop plants, and facilitate infection by delivering a large number of effector molecules into the plant cell. AvrM is a secreted effector protein from flax rust (Melampsora lini) that can internalize into plant cells in the absence of the pathogen, binds to phosphoinositides (PIPs), and is recognized directly by the resistance protein M in flax (Linum usitatissimum), resulting in effector-triggered immunity. We determined the crystal structures of two naturally occurring variants of AvrM, AvrM-A and avrM, and both reveal an L-shaped fold consisting of a tandem duplicated four-helix motif, which displays similarity to the WY domain core in oomycete effectors. In the crystals, both AvrM variants form a dimer with an unusual nonglobular shape. Our functional analysis of AvrM reveals that a hydrophobic surface patch conserved between both variants is required for internalization into plant cells, whereas the C-terminal coiled-coil domain mediates interaction with M. AvrM binding to PIPs is dependent on positive surface charges, and mutations that abrogate PIP binding have no significant effect on internalization, suggesting that AvrM binding to PIPs is not essential for transport of AvrM across the plant membrane. The structure of AvrM and the identification of functionally important surface regions advance our understanding of the molecular mechanisms underlying how effectors enter plant cells and how they are detected by the plant immune system.

  15. A powerful and efficient set test for genetic markers that handles confounders

    PubMed Central

    Listgarten, Jennifer; Lippert, Christoph; Kang, Eun Yong; Xiang, Jing; Kadie, Carl M.; Heckerman, David

    2013-01-01

    Motivation: Approaches for testing sets of variants, such as a set of rare or common variants within a gene or pathway, for association with complex traits are important. In particular, set tests allow for aggregation of weak signal within a set, can capture interplay among variants and reduce the burden of multiple hypothesis testing. Until now, these approaches did not address confounding by family relatedness and population structure, a problem that is becoming more important as larger datasets are used to increase power. Results: We introduce a new approach for set tests that handles confounders. Our model is based on the linear mixed model and uses two random effects—one to capture the set association signal and one to capture confounders. We also introduce a computational speedup for two random-effects models that makes this approach feasible even for extremely large cohorts. Using this model with both the likelihood ratio test and score test, we find that the former yields more power while controlling type I error. Application of our approach to richly structured Genetic Analysis Workshop 14 data demonstrates that our method successfully corrects for population structure and family relatedness, whereas application of our method to a 15 000 individual Crohn’s disease case–control cohort demonstrates that it additionally recovers genes not recoverable by univariate analysis. Availability: A Python-based library implementing our approach is available at http://mscompbio.codeplex.com. Contact: jennl@microsoft.com or lippert@microsoft.com or heckerma@microsoft.com Supplementary information: Supplementary data are available at Bioinformatics online. PMID:23599503
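
    The two-random-effects model described above can be written out as a worked equation. This is a sketch in assumed notation, not the authors' own: y is the phenotype vector, X holds fixed-effect covariates, K_c is a similarity matrix capturing confounders (family relatedness and population structure), K_s is a similarity matrix built from the variants in the set being tested, and the σ² terms are variance components.

```latex
\begin{aligned}
y &= X\beta + g_{c} + g_{s} + \varepsilon,\\
g_{c} &\sim \mathcal{N}\!\left(0,\ \sigma_{c}^{2} K_{c}\right), \quad
g_{s} \sim \mathcal{N}\!\left(0,\ \sigma_{s}^{2} K_{s}\right), \quad
\varepsilon \sim \mathcal{N}\!\left(0,\ \sigma_{e}^{2} I\right),\\
H_{0} &: \sigma_{s}^{2} = 0
\qquad\text{vs.}\qquad
H_{1} : \sigma_{s}^{2} > 0,\\
\Lambda &= 2\left[\log L\!\left(\hat{\theta}_{1}\right) - \log L\!\left(\hat{\theta}_{0}\right)\right].
\end{aligned}
```

    Here Λ is the likelihood ratio statistic comparing the fitted alternative and null models. Testing whether σ_s² is zero asks whether the variant set explains phenotypic variance beyond what the confounder term already absorbs.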

  16. Structure of a phage display-derived variant of human growth hormone complexed to two copies of the extracellular domain of its receptor: evidence for strong structural coupling between receptor binding sites.

    PubMed

    Schiffer, Celia; Ultsch, Mark; Walsh, Scott; Somers, William; de Vos, Abraham M; Kossiakoff, Anthony

    2002-02-15

    The structure of the ternary complex between the phage display-optimized, high-affinity Site 1 variant of human growth hormone (hGH) and two copies of the extracellular domain (ECD) of the hGH receptor (hGHR) has been determined at 2.6 Å resolution. There are widespread and significant structural differences compared to the wild-type ternary hGH-hGHR complex. The hGH variant (hGH(v)) contains 15 Site 1 mutations and binds >10^2-fold tighter to the hGHR ECD (hGH(R1)) at Site 1. It is biologically active and specific to hGHR. The hGH(v) Site 1 interface is somewhat smaller and 20% more hydrophobic compared to the wild-type (wt) counterpart. Of the ten hormone-receptor H-bonds in the site, only one is the same as in the wt complex. Additionally, several regions of hGH(v) structure move up to 9 Å in forming the interface. The contacts between the C-terminal domains of two receptor ECDs (hGH(R1)-hGH(R2)) are conserved; however, the large changes in Site 1 appear to cause global changes in the domains of hGH(R1) that affect the hGH(v)-hGH(R2) interface indirectly. This coupling is manifested by large changes in the conformation of groups participating in the Site 2 interaction and results in a structure for the site that is reorganized extensively. The hGH(v)-hGH(R2) interface contains seven H-bonds, only one of which is found in the wt complex. Several groups on hGH(v) and hGH(R2) undergo conformational changes of up to 8 Å. Asp116 of hGH(v) plays a central role in the reorganization of Site 2 by forming two new H-bonds to the side-chains of Trp104(R2) and Trp169(R2), which are the key binding determinants of the receptor. The fact that a different binding solution is possible for Site 2, where there were no mutations or binding selection pressures, indicates that the structural elements found in these molecules possess an inherent functional plasticity that enables them to bind to a wide variety of binding surfaces. Copyright 2002 Elsevier Science Ltd.

  17. A comprehensive approach to identification of pathogenic FANCA variants in Fanconi anemia patients and their families.

    PubMed

    Kimble, Danielle C; Lach, Francis P; Gregg, Siobhan Q; Donovan, Frank X; Flynn, Elizabeth K; Kamat, Aparna; Young, Alice; Vemulapalli, Meghana; Thomas, James W; Mullikin, James C; Auerbach, Arleen D; Smogorzewska, Agata; Chandrasekharappa, Settara C

    2018-02-01

    Fanconi anemia (FA) is a rare recessive DNA repair deficiency resulting from mutations in one of at least 22 genes. Two-thirds of FA families harbor mutations in FANCA. To genotype patients in the International Fanconi Anemia Registry (IFAR) we employed multiple methodologies, screening 216 families for FANCA mutations. We describe identification of 57 large deletions and 261 sequence variants, in 159 families. All but seven families harbored distinct combinations of two mutations demonstrating high heterogeneity. Pathogenicity of the 18 novel missense variants was analyzed functionally by determining the ability of the mutant cDNA to improve the survival of a FANCA-null cell line when treated with MMC. Overexpressed pathogenic missense variants were found to reside in the cytoplasm, and nonpathogenic in the nucleus. RNA analysis demonstrated that two variants (c.522G > C and c.1565A > G), predicted to encode missense variants, which were determined to be nonpathogenic by a functional assay, caused skipping of exons 5 and 16, respectively, and are most likely pathogenic. We report 48 novel FANCA sequence variants. Defining both variants in a large patient cohort is a major step toward cataloging all FANCA variants, and permitting studies of genotype-phenotype correlations. © Published 2017. This article is a U.S. Government work and is in the public domain in the USA.

  18. Whole genome sequences of a male and female supercentenarian, ages greater than 114 years.

    PubMed

    Sebastiani, Paola; Riva, Alberto; Montano, Monty; Pham, Phillip; Torkamani, Ali; Scherba, Eugene; Benson, Gary; Milton, Jacqueline N; Baldwin, Clinton T; Andersen, Stacy; Schork, Nicholas J; Steinberg, Martin H; Perls, Thomas T

    2011-01-01

    Supercentenarians (age 110+ years old) generally delay or escape age-related diseases and disability well beyond the age of 100 and this exceptional survival is likely to be influenced by a genetic predisposition that includes both common and rare genetic variants. In this report, we describe the complete genomic sequences of male and female supercentenarians, both age >114 years old. We show that: (1) the sequence variant spectrum of these two individuals' DNA sequences is largely comparable to existing non-supercentenarian genomes; (2) the two individuals do not appear to carry most of the well-established human longevity enabling variants already reported in the literature; (3) they have a comparable number of known disease-associated variants relative to most human genomes sequenced to-date; (4) approximately 1% of the variants these individuals possess are novel and may point to new genes involved in exceptional longevity; and (5) both individuals are enriched for coding variants near longevity-associated variants that we discovered through a large genome-wide association study. These analyses suggest that there are both common and rare longevity-associated variants that may counter the effects of disease-predisposing variants and extend lifespan. The continued analysis of the genomes of these and other rare individuals who have survived to extremely old ages should provide insight into the processes that contribute to the maintenance of health during extreme aging.

  19. Whole Genome Sequences of a Male and Female Supercentenarian, Ages Greater than 114 Years

    PubMed Central

    Sebastiani, Paola; Riva, Alberto; Montano, Monty; Pham, Phillip; Torkamani, Ali; Scherba, Eugene; Benson, Gary; Milton, Jacqueline N.; Baldwin, Clinton T.; Andersen, Stacy; Schork, Nicholas J.; Steinberg, Martin H.; Perls, Thomas T.

    2012-01-01

    Supercentenarians (age 110+ years old) generally delay or escape age-related diseases and disability well beyond the age of 100 and this exceptional survival is likely to be influenced by a genetic predisposition that includes both common and rare genetic variants. In this report, we describe the complete genomic sequences of male and female supercentenarians, both age >114 years old. We show that: (1) the sequence variant spectrum of these two individuals’ DNA sequences is largely comparable to existing non-supercentenarian genomes; (2) the two individuals do not appear to carry most of the well-established human longevity enabling variants already reported in the literature; (3) they have a comparable number of known disease-associated variants relative to most human genomes sequenced to-date; (4) approximately 1% of the variants these individuals possess are novel and may point to new genes involved in exceptional longevity; and (5) both individuals are enriched for coding variants near longevity-associated variants that we discovered through a large genome-wide association study. These analyses suggest that there are both common and rare longevity-associated variants that may counter the effects of disease-predisposing variants and extend lifespan. The continued analysis of the genomes of these and other rare individuals who have survived to extremely old ages should provide insight into the processes that contribute to the maintenance of health during extreme aging. PMID:22303384

  20. Pooled Resequencing of 122 Ulcerative Colitis Genes in a Large Dutch Cohort Suggests Population-Specific Associations of Rare Variants in MUC2.

    PubMed

    Visschedijk, Marijn C; Alberts, Rudi; Mucha, Soren; Deelen, Patrick; de Jong, Dirk J; Pierik, Marieke; Spekhorst, Lieke M; Imhann, Floris; van der Meulen-de Jong, Andrea E; van der Woude, C Janneke; van Bodegraven, Adriaan A; Oldenburg, Bas; Löwenberg, Mark; Dijkstra, Gerard; Ellinghaus, David; Schreiber, Stefan; Wijmenga, Cisca; Rivas, Manuel A; Franke, Andre; van Diemen, Cleo C; Weersma, Rinse K

    2016-01-01

    Genome-wide association studies have revealed several common genetic risk variants for ulcerative colitis (UC). However, little is known about the contribution of rare, large effect genetic variants to UC susceptibility. In this study, we performed a deep targeted re-sequencing of 122 genes in Dutch UC patients in order to investigate the contribution of rare variants to the genetic susceptibility to UC. The selection of genes consists of 111 established human UC susceptibility genes and 11 genes that lead to spontaneous colitis when knocked-out in mice. In addition, we sequenced the promoter regions of 45 genes where known variants exert cis-eQTL-effects. Targeted pooled re-sequencing was performed on DNA of 790 Dutch UC cases. The Genome of the Netherlands project provided sequence data of 500 healthy controls. After quality control and prioritization based on allele frequency and pathogenicity probability, follow-up genotyping of 171 rare variants was performed on 1021 Dutch UC cases and 1166 Dutch controls. Single-variant association and gene-based analyses identified an association of rare variants in the MUC2 gene with UC. The associated variants in the Dutch population could not be replicated in a German replication cohort (1026 UC cases, 3532 controls). In conclusion, this study has identified a putative role for MUC2 on UC susceptibility in the Dutch population and suggests a population-specific contribution of rare variants to UC.

  1. Variant pathogenicity evaluation in the community-driven Inherited Neuropathy Variant Browser.

    PubMed

    Saghira, Cima; Bis, Dana M; Stanek, David; Strickland, Alleene; Herrmann, David N; Reilly, Mary M; Scherer, Steven S; Shy, Michael E; Züchner, Stephan

    2018-05-01

    Charcot-Marie-Tooth disease (CMT) is an umbrella term for inherited neuropathies affecting an estimated one in 2,500 people. Over 120 CMT and related genes have been identified and clinical gene panels often contain more than 100 genes. Such a large genomic space will invariably yield variants of uncertain clinical significance (VUS) in nearly any person tested. This rise in the number of VUS creates major challenges for genetic counseling. Additionally, fewer individual variants in known genes are being published as the academic merit is decreasing, and most testing now happens in clinical laboratories, which typically do not correlate their variants with clinical phenotypes. For CMT, we aim to encourage and facilitate the global capture of variant data to gain a large collection of alleles in CMT genes, ideally in conjunction with phenotypic information. The Inherited Neuropathy Variant Browser provides user-friendly open access to currently reported variation in CMT genes. Geneticists, physicians, and genetic counselors can enter variants detected by clinical tests or in research studies in addition to genetic variation gathered from published literature, which are then submitted to ClinVar biannually. Active participation of the broader CMT community will provide an advance over existing resources for interpretation of CMT genetic variation. © 2018 Wiley Periodicals, Inc.

  2. Telomere extension by telomerase and ALT generates variant repeats by mechanistically distinct processes

    PubMed Central

    Lee, Michael; Hills, Mark; Conomos, Dimitri; Stutz, Michael D.; Dagg, Rebecca A.; Lau, Loretta M.S.; Reddel, Roger R.; Pickett, Hilda A.

    2014-01-01

    Telomeres are terminal repetitive DNA sequences on chromosomes, and are considered to comprise almost exclusively hexameric TTAGGG repeats. We have evaluated telomere sequence content in human cells using whole-genome sequencing followed by telomere read extraction in a panel of mortal cell strains and immortal cell lines. We identified a wide range of telomere variant repeats in human cells, and found evidence that variant repeats are generated by mechanistically distinct processes during telomerase- and ALT-mediated telomere lengthening. Telomerase-mediated telomere extension resulted in biased repeat synthesis of variant repeats that differed from the canonical sequence at positions 1 and 3, but not at positions 2, 4, 5 or 6. This indicates that telomerase is most likely an error-prone reverse transcriptase that misincorporates nucleotides at specific positions on the telomerase RNA template. In contrast, cell lines that use the ALT pathway contained a large range of variant repeats that varied greatly between lines. This is consistent with variant repeats spreading from proximal telomeric regions throughout telomeres in a stochastic manner by recombination-mediated templating of DNA synthesis. The presence of unexpectedly large numbers of variant repeats in cells utilizing either telomere maintenance mechanism suggests a conserved role for variant sequences at human telomeres. PMID:24225324
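
    The positional analysis described in this abstract (which positions of the hexamer differ from the canonical TTAGGG) can be illustrated with a minimal sketch. This is not the authors' pipeline: the function name `variant_positions`, the example read, and the assumption that the read is already aligned to the repeat frame are all hypothetical and chosen for illustration only.

```python
from collections import Counter

CANONICAL = "TTAGGG"  # canonical human telomere repeat named in the abstract


def variant_positions(read, canonical=CANONICAL):
    """Tally the 1-based position of the mismatch in every hexamer that
    differs from the canonical repeat at exactly one position.

    Assumption: the read starts in frame with the repeat unit, so it can be
    split into consecutive, non-overlapping hexamers.
    """
    k = len(canonical)
    counts = Counter()
    for start in range(0, len(read) - k + 1, k):
        unit = read[start:start + k]
        mismatches = [i + 1 for i, (a, b) in enumerate(zip(unit, canonical)) if a != b]
        if len(mismatches) == 1:
            counts[mismatches[0]] += 1
    return counts


# Hypothetical read: canonical repeats plus one position-1 and one position-3 variant.
read = "TTAGGG" * 2 + "GTAGGG" + "TTGGGG" + "TTAGGG"
print(dict(variant_positions(read)))
```

    A real analysis would also need to handle reads in either orientation, frame shifts, and sequencing errors, none of which this sketch attempts.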

  3. Detailed genetic characteristics of an international large cohort of patients with Stargardt disease: ProgStar study report 8.

    PubMed

    Fujinami, Kaoru; Strauss, Rupert W; Chiang, John Pei-Wen; Audo, Isabelle S; Bernstein, Paul S; Birch, David G; Bomotti, Samantha M; Cideciyan, Artur V; Ervin, Ann-Margret; Marino, Meghan J; Sahel, José-Alain; Mohand-Said, Saddek; Sunness, Janet S; Traboulsi, Elias I; West, Sheila; Wojciechowski, Robert; Zrenner, Eberhart; Michaelides, Michel; Scholl, Hendrik P N

    2018-06-20

    To describe the genetic characteristics of the cohort enrolled in the international multicentre progression of Stargardt disease 1 (STGD1) studies (ProgStar) and to determine geographic differences based on the allele frequency. 345 participants with a clinical diagnosis of STGD1 and harbouring at least one disease-causing ABCA4 variant were enrolled from 9 centres in the USA and Europe. All variants were reviewed and in silico analysis was performed including allele frequency in public databases and pathogenicity predictions. Participants with multiple likely pathogenic variants were classified into four national subgroups (USA, UK, France, Germany), with subsequent comparison analysis of the allele frequency for each prevalent allele. 211 likely pathogenic variants were identified in the total cohort, including missense (63%), splice site alteration (18%), stop (9%) and others. 50 variants were novel. Exclusively missense variants were detected in 139 (50%) of 279 patients with multiple pathogenic variants. The three most prevalent variants of these patients with multiple pathogenic variants were p.G1961E (15%), p.G863A (7%) and c.5461-10 T>C (5%). Subgroup analysis revealed a statistically significant difference between the four recruiting nations in the allele frequency of nine variants. There is a large spectrum of ABCA4 sequence variants, including 50 novel variants, in a well-characterised cohort thereby further adding to the unique allelic heterogeneity in STGD1. Approximately half of the cohort harbours missense variants only, indicating a relatively mild phenotype of the ProgStar cohort. There are significant differences in allele frequencies between nations, although the three most prevalent variants are shared as frequent variants. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2018. All rights reserved. No commercial use is permitted unless otherwise expressly granted.

  4. Structure of Arabidopsis thaliana FUT1 Reveals a Variant of the GT-B Class Fold and Provides Insight into Xyloglucan Fucosylation

    PubMed Central

    Chazalet, Valérie

    2016-01-01

    The plant cell wall is a complex and dynamic network made mostly of cellulose, hemicelluloses, and pectins. Xyloglucan, the major hemicellulosic component in Arabidopsis thaliana, is biosynthesized in the Golgi apparatus by a series of glycan synthases and glycosyltransferases before export to the wall. A better understanding of the xyloglucan biosynthetic machinery will give clues toward engineering plants with improved wall properties or designing novel xyloglucan-based biomaterials. The xyloglucan-specific α2-fucosyltransferase FUT1 catalyzes the transfer of fucose from GDP-fucose to terminal galactosyl residues on xyloglucan side chains. Here, we present crystal structures of Arabidopsis FUT1 in its apoform and in a ternary complex with GDP and a xylo-oligosaccharide acceptor (named XLLG). Although FUT1 is clearly a member of the large GT-B fold family, like other fucosyltransferases of known structures, it contains a variant of the GT-B fold. In particular, it includes an extra C-terminal region that is part of the acceptor binding site. Our crystal structures support previous findings that FUT1 behaves as a functional dimer. Mutational studies and structure comparison with other fucosyltransferases suggest that FUT1 uses a SN2-like reaction mechanism similar to that of protein-O-fucosyltransferase 2. Thus, our results provide new insights into the mechanism of xyloglucan fucosylation in the Golgi. PMID:27637560

  5. Improving the thermal stability of cellobiohydrolase Cel7A from Hypocrea jecorina by directed evolution.

    PubMed

    Goedegebuur, Frits; Dankmeyer, Lydia; Gualfetti, Peter; Karkehabadi, Saeid; Hansson, Henrik; Jana, Suvamay; Huynh, Vicky; Kelemen, Bradley R; Kruithof, Paulien; Larenas, Edmund A; Teunissen, Pauline J M; Ståhlberg, Jerry; Payne, Christina M; Mitchinson, Colin; Sandgren, Mats

    2017-10-20

    Secreted mixtures of Hypocrea jecorina cellulases are able to efficiently degrade cellulosic biomass to fermentable sugars at large, commercially relevant scales. H. jecorina Cel7A, cellobiohydrolase I, from glycoside hydrolase family 7, is the workhorse enzyme of the process. However, the thermal stability of Cel7A limits its use to processes where temperatures are no higher than 50 °C. Enhanced thermal stability is desirable to enable the use of higher processing temperatures and to improve the economic feasibility of industrial biomass conversion. Here, we enhanced the thermal stability of Cel7A through directed evolution. Sites with increased thermal stability properties were combined, and a Cel7A variant (FCA398) was obtained, which exhibited a 10.4 °C increase in Tm and a 44-fold greater half-life compared with the wild-type enzyme. This Cel7A variant contains 18 mutated sites and is active under application conditions up to at least 75 °C. The X-ray crystal structure of the catalytic domain was determined at 2.1 Å resolution and showed that the effects of the mutations are local and do not introduce major backbone conformational changes. Molecular dynamics simulations revealed that the catalytic domain of wild-type Cel7A and the FCA398 variant exhibit similar behavior at 300 K, whereas at elevated temperature (475 and 525 K), the FCA398 variant fluctuates less and maintains more native contacts over time. Combining the structural and dynamic investigations, rationales were developed for the stabilizing effect at many of the mutated sites. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.

  6. From bird's eye views to molecular communities: two-layered visualization of structure-activity relationships in large compound data sets

    NASA Astrophysics Data System (ADS)

    Kayastha, Shilva; Kunimoto, Ryo; Horvath, Dragos; Varnek, Alexandre; Bajorath, Jürgen

    2017-11-01

    The analysis of structure-activity relationships (SARs) becomes rather challenging when large and heterogeneous compound data sets are studied. In such cases, many different compounds and their activities need to be compared, which quickly goes beyond the capacity of subjective assessments. For a comprehensive large-scale exploration of SARs, computational analysis and visualization methods are required. Herein, we introduce a two-layered SAR visualization scheme specifically designed for increasingly large compound data sets. The approach combines a new compound pair-based variant of generative topographic mapping (GTM), a machine learning approach for nonlinear mapping, with chemical space networks (CSNs). The GTM component provides a global view of the activity landscapes of large compound data sets, in which informative local SAR environments are identified, augmented by a numerical SAR scoring scheme. Prioritized local SAR regions are then projected into CSNs that resolve these regions at the level of individual compounds and their relationships. Analysis of CSNs makes it possible to distinguish between regions having different SAR characteristics and select compound subsets that are rich in SAR information.

  7. Rare and low-frequency coding variants alter human adult height.

    PubMed

    Marouli, Eirini; Graff, Mariaelisa; Medina-Gomez, Carolina; Lo, Ken Sin; Wood, Andrew R; Kjaer, Troels R; Fine, Rebecca S; Lu, Yingchang; Schurmann, Claudia; Highland, Heather M; Rüeger, Sina; Thorleifsson, Gudmar; Justice, Anne E; Lamparter, David; Stirrups, Kathleen E; Turcot, Valérie; Young, Kristin L; Winkler, Thomas W; Esko, Tõnu; Karaderi, Tugce; Locke, Adam E; Masca, Nicholas G D; Ng, Maggie C Y; Mudgal, Poorva; Rivas, Manuel A; Vedantam, Sailaja; Mahajan, Anubha; Guo, Xiuqing; Abecasis, Goncalo; Aben, Katja K; Adair, Linda S; Alam, Dewan S; Albrecht, Eva; Allin, Kristine H; Allison, Matthew; Amouyel, Philippe; Appel, Emil V; Arveiler, Dominique; Asselbergs, Folkert W; Auer, Paul L; Balkau, Beverley; Banas, Bernhard; Bang, Lia E; Benn, Marianne; Bergmann, Sven; Bielak, Lawrence F; Blüher, Matthias; Boeing, Heiner; Boerwinkle, Eric; Böger, Carsten A; Bonnycastle, Lori L; Bork-Jensen, Jette; Bots, Michiel L; Bottinger, Erwin P; Bowden, Donald W; Brandslund, Ivan; Breen, Gerome; Brilliant, Murray H; Broer, Linda; Burt, Amber A; Butterworth, Adam S; Carey, David J; Caulfield, Mark J; Chambers, John C; Chasman, Daniel I; Chen, Yii-Der Ida; Chowdhury, Rajiv; Christensen, Cramer; Chu, Audrey Y; Cocca, Massimiliano; Collins, Francis S; Cook, James P; Corley, Janie; Galbany, Jordi Corominas; Cox, Amanda J; Cuellar-Partida, Gabriel; Danesh, John; Davies, Gail; de Bakker, Paul I W; de Borst, Gert J; de Denus, Simon; de Groot, Mark C H; de Mutsert, Renée; Deary, Ian J; Dedoussis, George; Demerath, Ellen W; den Hollander, Anneke I; Dennis, Joe G; Di Angelantonio, Emanuele; Drenos, Fotios; Du, Mengmeng; Dunning, Alison M; Easton, Douglas F; Ebeling, Tapani; Edwards, Todd L; Ellinor, Patrick T; Elliott, Paul; Evangelou, Evangelos; Farmaki, Aliki-Eleni; Faul, Jessica D; Feitosa, Mary F; Feng, Shuang; Ferrannini, Ele; Ferrario, Marco M; Ferrieres, Jean; Florez, Jose C; Ford, Ian; Fornage, Myriam; Franks, Paul W; Frikke-Schmidt, Ruth; Galesloot, Tessel E; Gan, Wei; Gandin, Ilaria; Gasparini, Paolo; Giedraitis, Vilmantas; Giri, Ayush; Girotto, Giorgia; Gordon, Scott D; Gordon-Larsen, Penny; Gorski, Mathias; Grarup, Niels; Grove, Megan L; Gudnason, Vilmundur; Gustafsson, Stefan; Hansen, Torben; Harris, Kathleen Mullan; Harris, Tamara B; Hattersley, Andrew T; Hayward, Caroline; He, Liang; Heid, Iris M; Heikkilä, Kauko; Helgeland, Øyvind; Hernesniemi, Jussi; Hewitt, Alex W; Hocking, Lynne J; Hollensted, Mette; Holmen, Oddgeir L; Hovingh, G Kees; Howson, Joanna M M; Hoyng, Carel B; Huang, Paul L; Hveem, Kristian; Ikram, M Arfan; Ingelsson, Erik; Jackson, Anne U; Jansson, Jan-Håkan; Jarvik, Gail P; Jensen, Gorm B; Jhun, Min A; Jia, Yucheng; Jiang, Xuejuan; Johansson, Stefan; Jørgensen, Marit E; Jørgensen, Torben; Jousilahti, Pekka; Jukema, J Wouter; Kahali, Bratati; Kahn, René S; Kähönen, Mika; Kamstrup, Pia R; Kanoni, Stavroula; Kaprio, Jaakko; Karaleftheri, Maria; Kardia, Sharon L R; Karpe, Fredrik; Kee, Frank; Keeman, Renske; Kiemeney, Lambertus A; Kitajima, Hidetoshi; Kluivers, Kirsten B; Kocher, Thomas; Komulainen, Pirjo; Kontto, Jukka; Kooner, Jaspal S; Kooperberg, Charles; Kovacs, Peter; Kriebel, Jennifer; Kuivaniemi, Helena; Küry, Sébastien; Kuusisto, Johanna; La Bianca, Martina; Laakso, Markku; Lakka, Timo A; Lange, Ethan M; Lange, Leslie A; Langefeld, Carl D; Langenberg, Claudia; Larson, Eric B; Lee, I-Te; Lehtimäki, Terho; Lewis, Cora E; Li, Huaixing; Li, Jin; Li-Gao, Ruifang; Lin, Honghuang; Lin, Li-An; Lin, Xu; Lind, Lars; Lindström, Jaana; Linneberg, Allan; Liu, Yeheng; Liu, Yongmei; Lophatananon, Artitaya; Luan, Jian'an; Lubitz, Steven A; Lyytikäinen, Leo-Pekka; Mackey, David A; Madden, Pamela A F; Manning, Alisa K; Männistö, Satu; Marenne, Gaëlle; Marten, Jonathan; Martin, Nicholas G; Mazul, Angela L; Meidtner, Karina; Metspalu, Andres; Mitchell, Paul; Mohlke, Karen L; Mook-Kanamori, Dennis O; Morgan, Anna; Morris, Andrew D; Morris, Andrew P; Müller-Nurasyid, Martina; Munroe, Patricia B; Nalls, Mike A; Nauck, Matthias; Nelson, Christopher P; Neville, Matt; Nielsen, Sune F; Nikus, Kjell; Njølstad, Pål R; Nordestgaard, Børge G; Ntalla, Ioanna; O'Connel, Jeffrey R; Oksa, Heikki; Loohuis, Loes M Olde; Ophoff, Roel A; Owen, Katharine R; Packard, Chris J; Padmanabhan, Sandosh; Palmer, Colin N A; Pasterkamp, Gerard; Patel, Aniruddh P; Pattie, Alison; Pedersen, Oluf; Peissig, Peggy L; Peloso, Gina M; Pennell, Craig E; Perola, Markus; Perry, James A; Perry, John R B; Person, Thomas N; Pirie, Ailith; Polasek, Ozren; Posthuma, Danielle; Raitakari, Olli T; Rasheed, Asif; Rauramaa, Rainer; Reilly, Dermot F; Reiner, Alex P; Renström, Frida; Ridker, Paul M; Rioux, John D; Robertson, Neil; Robino, Antonietta; Rolandsson, Olov; Rudan, Igor; Ruth, Katherine S; Saleheen, Danish; Salomaa, Veikko; Samani, Nilesh J; Sandow, Kevin; Sapkota, Yadav; Sattar, Naveed; Schmidt, Marjanka K; Schreiner, Pamela J; Schulze, Matthias B; Scott, Robert A; Segura-Lepe, Marcelo P; Shah, Svati; Sim, Xueling; Sivapalaratnam, Suthesh; Small, Kerrin S; Smith, Albert Vernon; Smith, Jennifer A; Southam, Lorraine; Spector, Timothy D; Speliotes, Elizabeth K; Starr, John M; Steinthorsdottir, Valgerdur; Stringham, Heather M; Stumvoll, Michael; Surendran, Praveen; 't Hart, Leen M; Tansey, Katherine E; Tardif, Jean-Claude; Taylor, Kent D; Teumer, Alexander; Thompson, Deborah J; Thorsteinsdottir, Unnur; Thuesen, Betina H; Tönjes, Anke; Tromp, Gerard; Trompet, Stella; Tsafantakis, Emmanouil; Tuomilehto, Jaakko; Tybjaerg-Hansen, Anne; Tyrer, Jonathan P; Uher, Rudolf; Uitterlinden, André G; Ulivi, Sheila; van der Laan, Sander W; Van Der Leij, Andries R; van Duijn, Cornelia M; van Schoor, Natasja M; van Setten, Jessica; Varbo, Anette; Varga, Tibor V; Varma, Rohit; Edwards, Digna R Velez; Vermeulen, Sita H; Vestergaard, Henrik; Vitart, Veronique; Vogt, Thomas F; Vozzi, Diego; Walker, Mark; Wang, Feijie; Wang, Carol A; Wang, Shuai; Wang, Yiqin; Wareham, Nicholas J; Warren, Helen R; Wessel, Jennifer; Willems, Sara M; Wilson, James G; Witte, Daniel R; Woods, Michael O; Wu, Ying; Yaghootkar, Hanieh; Yao, Jie; Yao, Pang; Yerges-Armstrong, Laura M; Young, Robin; Zeggini, Eleftheria; Zhan, Xiaowei; Zhang, Weihua; Zhao, Jing Hua; Zhao, Wei; Zhao, Wei; Zheng, He; Zhou, Wei; Rotter, Jerome I; Boehnke, Michael; Kathiresan, Sekar; McCarthy, Mark I; Willer, Cristen J; Stefansson, Kari; Borecki, Ingrid B; Liu, Dajiang J; North, Kari E; Heard-Costa, Nancy L; Pers, Tune H; Lindgren, Cecilia M; Oxvig, Claus; Kutalik, Zoltán; Rivadeneira, Fernando; Loos, Ruth J F; Frayling, Timothy M; Hirschhorn, Joel N; Deloukas, Panos; Lettre, Guillaume

    2017-02-09

    Height is a highly heritable, classic polygenic trait with approximately 700 common associated variants identified through genome-wide association studies so far. Here, we report 83 height-associated coding variants with lower minor-allele frequencies (in the range of 0.1-4.8%) and effects of up to 2 centimetres per allele (such as those in IHH, STC2, AR and CRISPLD2), greater than ten times the average effect of common variants. In functional follow-up studies, rare height-increasing alleles of STC2 (giving an increase of 1-2 centimetres per allele) compromised proteolytic inhibition of PAPP-A and increased cleavage of IGFBP-4 in vitro, resulting in higher bioavailability of insulin-like growth factors. These 83 height-associated variants overlap genes that are mutated in monogenic growth disorders and highlight new biological candidates (such as ADAMTS3, IL11RA and NOX4) and pathways (such as proteoglycan and glycosaminoglycan synthesis) involved in growth. Our results demonstrate that sufficiently large sample sizes can uncover rare and low-frequency variants of moderate-to-large effect associated with polygenic human phenotypes, and that these variants implicate relevant genes and pathways.

  8. Variants of windmill nystagmus.

    PubMed

    Choi, Kwang-Dong; Shin, Hae Kyung; Kim, Ji-Soo; Kim, Sung-Hee; Choi, Jae-Hwan; Kim, Hyo-Jung; Zee, David S

    2016-07-01

    Windmill nystagmus is characterized by a clock-like rotation of the beating direction of a jerk nystagmus suggesting separate horizontal and vertical oscillators, usually 90° out of phase. We report oculographic characteristics in three patients with variants of windmill nystagmus in whom the common denominator was profound visual loss due to retinal diseases. Two patients showed a clock-like pattern, while in the third, the nystagmus was largely diagonal (in phase or 180° out of phase) but also periodically changed direction by 180°. We hypothesize that windmill nystagmus is a unique manifestation of "eye movements of the blind." It emerges when the central structures, including the cerebellum, that normally keep eye movements calibrated and gaze steady can no longer perform their task, because they are deprived of the retinal image motion that signals a need for adaptive recalibration.

  9. Structural and functional analysis of the finished genome of the recently isolated toxic Anabaena sp. WA102.

    PubMed

    Brown, Nathan M; Mueller, Ryan S; Shepardson, Jonathan W; Landry, Zachary C; Morré, Jeffrey T; Maier, Claudia S; Hardy, F Joan; Dreher, Theo W

    2016-06-13

    Very few closed genomes of the cyanobacteria that commonly produce toxic blooms in lakes and reservoirs are available, limiting our understanding of the properties of these organisms. A new anatoxin-a-producing member of the Nostocaceae, Anabaena sp. WA102, was isolated from a freshwater lake in Washington State, USA, in 2013 and maintained in non-axenic culture. The Anabaena sp. WA102 5.7 Mbp genome assembly has been closed with long-read, single-molecule sequencing and separately a draft genome assembly has been produced with short-read sequencing technology. The closed and draft genome assemblies are compared, showing a correlation between long repeats in the genome and the many gaps in the short-read assembly. Anabaena sp. WA102 encodes anatoxin-a biosynthetic genes, as does its close relative Anabaena sp. AL93 (also introduced in this study). These strains are distinguished by differences in the genes for light-harvesting phycobilins, with Anabaena sp. AL93 possessing a phycoerythrocyanin operon. Biologically relevant structural variants in the Anabaena sp. WA102 genome were detected only by long-read sequencing: a tandem triplication of the anaBCD promoter region in the anatoxin-a synthase gene cluster (not triplicated in Anabaena sp. AL93) and a 5-kbp deletion variant present in two-thirds of the population. The genome has a large number of mobile elements (160). Strikingly, there was no synteny with the genome of its nearest fully assembled relative, Anabaena sp. 90. Structural and functional genome analyses indicate that Anabaena sp. WA102 has a flexible genome. Genome closure, which can be readily achieved with long-read sequencing, reveals large scale (e.g., gene order) and local structural features that should be considered in understanding genome evolution and function.

  10. Protein engineering of subtilisins to improve stability in detergent formulations.

    PubMed

    von der Osten, C; Branner, S; Hastrup, S; Hedegaard, L; Rasmussen, M D; Bisgård-Frantzen, H; Carlsen, S; Mikkelsen, J M

    1993-03-01

    Microbial proteases are used extensively in a large number of industrial processes, most importantly in detergent formulations, where they facilitate the removal of proteinaceous stains. Site-directed mutagenesis has been employed in the construction of subtilisin variants with improved storage and oxidation stabilities. It is shown that, in spite of significant structural homology between the subtilisins subjected to protein engineering, the effects of specific mutations can be quite different. Mutations that stabilize one subtilisin may destabilize another.

  11. Prediction of conformational changes by single mutation in the hepatitis B virus surface antigen (HBsAg) identified in HBsAg-negative blood donors.

    PubMed

    Ie, Susan I; Thedja, Meta D; Roni, Martono; Muljono, David H

    2010-11-18

    Selection of hepatitis B virus (HBV) by host immunity has been suggested to give rise to variants with amino acid substitutions at or around the 'a' determinant of the surface antigen (HBsAg), the main target of antibody neutralization and diagnostic assays. However, there have never been successful attempts to provide evidence for this hypothesis, partly because the 3D structure of HBsAg molecules has not been determined. Tertiary structure prediction of HBsAg solely from its primary amino acid sequence may reveal the molecular energetics of the mutated proteins. We carried out this preliminary study to analyze the predicted HBsAg conformation changes of HBV variants isolated from Indonesian blood donors undetectable by HBsAg assays and its significance, compared to other previously-reported variants that were associated with diagnostic failure. Three HBV variants (T123A, M133L and T143M) and a wild type sequence were analyzed together with frequently emerged variants T123N, M133I, M133T, M133V, and T143L. Based on the Jameson-Wolf algorithm for calculating antigenic index, the first two amino acid substitutions resulted in slight changes in the antigenicity of the 'a' determinant, while all four of the comparative variants showed relatively more significant changes. In the pattern T143M, changes in antigenic index were more significant, both in its coverage and magnitude, even when compared to variant T143L. These data were also partially supported by the tertiary structure prediction, in which the pattern T143M showed larger shift in the HBsAg second loop structure compared to the others. Single amino acid substitutions within or near the 'a' determinant of HBsAg may alter antigenicity properties of variant HBsAg, which can be shown by both its antigenic index and predicted 3D conformation. Findings in this study emphasize the significance of variant T143M, the prevalent isolate with highest degree of antigenicity changes found in Indonesian blood donors.
This highlights the importance of evaluating the effects of protein structure alterations on the sensitivity of screening methods being used in detection of ongoing HBV infection, as well as the use of vaccines and immunoglobulin therapy in contributing to the selection of HBV variants.

  12. Whole Genome Sequencing Increases Molecular Diagnostic Yield Compared with Current Diagnostic Testing for Inherited Retinal Disease.

    PubMed

    Ellingford, Jamie M; Barton, Stephanie; Bhaskar, Sanjeev; Williams, Simon G; Sergouniotis, Panagiotis I; O'Sullivan, James; Lamb, Janine A; Perveen, Rahat; Hall, Georgina; Newman, William G; Bishop, Paul N; Roberts, Stephen A; Leach, Rick; Tearle, Rick; Bayliss, Stuart; Ramsden, Simon C; Nemeth, Andrea H; Black, Graeme C M

    2016-05-01

    To compare the efficacy of whole genome sequencing (WGS) with targeted next-generation sequencing (NGS) in the diagnosis of inherited retinal disease (IRD). Case series. A total of 562 patients diagnosed with IRD. We performed a direct comparative analysis of current molecular diagnostics with WGS. We retrospectively reviewed the findings from a diagnostic NGS DNA test for 562 patients with IRD. A subset of 46 of 562 patients (encompassing potential clinical outcomes of diagnostic analysis) also underwent WGS, and we compared mutation detection rates and molecular diagnostic yields. In addition, we compared the sensitivity and specificity of the 2 techniques to identify known single nucleotide variants (SNVs) using 6 control samples with publicly available genotype data. Diagnostic yield of genomic testing. Across known disease-causing genes, targeted NGS and WGS achieved similar levels of sensitivity and specificity for SNV detection. However, WGS also identified 14 clinically relevant genetic variants that had not been identified by NGS diagnostic testing for the 46 individuals with IRD. These variants included large deletions and variants in noncoding regions of the genome. Identification of these variants confirmed a molecular diagnosis of IRD for 11 of the 33 individuals referred for WGS who had not obtained a molecular diagnosis through targeted NGS testing. Weighted estimates, accounting for population structure, suggest that WGS methods could result in an overall 29% (95% confidence interval, 15-45) uplift in diagnostic yield. We show that WGS methods can detect disease-causing genetic variants missed by current NGS diagnostic methodologies for IRD and thereby demonstrate the clinical utility and additional value of WGS. Copyright © 2016 American Academy of Ophthalmology. Published by Elsevier Inc. All rights reserved.

  13. GTRAC: fast retrieval from compressed collections of genomic variants

    PubMed Central

    Tatwawadi, Kedar; Hernaez, Mikel; Ochoa, Idoia; Weissman, Tsachy

    2016-01-01

    Motivation: The dramatic decrease in the cost of sequencing has resulted in the generation of huge amounts of genomic data, as evidenced by projects such as the UK10K and the Million Veteran Project, with the number of sequenced genomes ranging in the order of 10 K to 1 M. Due to the large redundancies among genomic sequences of individuals from the same species, most of the medical research deals with the variants in the sequences as compared with a reference sequence, rather than with the complete genomic sequences. Consequently, millions of genomes represented as variants are stored in databases. These databases are constantly updated and queried to extract information such as the common variants among individuals or groups of individuals. Previous algorithms for compression of this type of databases lack efficient random access capabilities, rendering querying the database for particular variants and/or individuals extremely inefficient, to the point where compression is often relinquished altogether. Results: We present a new algorithm for this task, called GTRAC, that achieves significant compression ratios while allowing fast random access over the compressed database. For example, GTRAC is able to compress a Homo sapiens dataset containing 1092 samples in 1.1 GB (compression ratio of 160), while allowing for decompression of specific samples in less than a second and decompression of specific variants in 17 ms. GTRAC uses and adapts techniques from information theory, such as a specialized Lempel-Ziv compressor, and tailored succinct data structures. Availability and Implementation: The GTRAC algorithm is available for download at: https://github.com/kedartatwawadi/GTRAC Contact: kedart@stanford.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27587665
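
    The size figures quoted in this abstract can be related with a little arithmetic. The sketch below is not part of GTRAC; it assumes that "compression ratio" means uncompressed size divided by compressed size, that 1 GB = 1000 MB, and that the quoted values are rounded, so the results are only approximate.

```python
# Rounded figures quoted in the abstract.
COMPRESSED_GB = 1.1
COMPRESSION_RATIO = 160
N_SAMPLES = 1092


def implied_uncompressed_gb(compressed_gb: float, ratio: float) -> float:
    """Uncompressed size, assuming ratio = uncompressed / compressed."""
    return compressed_gb * ratio


def compressed_mb_per_sample(compressed_gb: float, n_samples: int) -> float:
    """Average compressed footprint per sample, assuming 1 GB = 1000 MB."""
    return compressed_gb * 1000 / n_samples


if __name__ == "__main__":
    total = implied_uncompressed_gb(COMPRESSED_GB, COMPRESSION_RATIO)
    per_sample = compressed_mb_per_sample(COMPRESSED_GB, N_SAMPLES)
    print(f"implied uncompressed size: about {total:.0f} GB")
    print(f"compressed size per sample: about {per_sample:.2f} MB")
```

    Under these assumptions the dataset would occupy roughly 176 GB before compression, or about 1 MB per sample once compressed.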

  14. GTRAC: fast retrieval from compressed collections of genomic variants.

    PubMed

    Tatwawadi, Kedar; Hernaez, Mikel; Ochoa, Idoia; Weissman, Tsachy

    2016-09-01

    The dramatic decrease in the cost of sequencing has resulted in the generation of huge amounts of genomic data, as evidenced by projects such as the UK10K and the Million Veteran Project, with the number of sequenced genomes ranging in the order of 10 K to 1 M. Due to the large redundancies among genomic sequences of individuals from the same species, most of the medical research deals with the variants in the sequences as compared with a reference sequence, rather than with the complete genomic sequences. Consequently, millions of genomes represented as variants are stored in databases. These databases are constantly updated and queried to extract information such as the common variants among individuals or groups of individuals. Previous algorithms for compression of this type of databases lack efficient random access capabilities, rendering querying the database for particular variants and/or individuals extremely inefficient, to the point where compression is often relinquished altogether. We present a new algorithm for this task, called GTRAC, that achieves significant compression ratios while allowing fast random access over the compressed database. For example, GTRAC is able to compress a Homo sapiens dataset containing 1092 samples in 1.1 GB (compression ratio of 160), while allowing for decompression of specific samples in less than a second and decompression of specific variants in 17 ms. GTRAC uses and adapts techniques from information theory, such as a specialized Lempel-Ziv compressor, and tailored succinct data structures. The GTRAC algorithm is available for download at: https://github.com/kedartatwawadi/GTRAC Contact: kedart@stanford.edu Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  15. Accounting for Population Structure in Gene-by-Environment Interactions in Genome-Wide Association Studies Using Mixed Models.

    PubMed

    Sul, Jae Hoon; Bilow, Michael; Yang, Wen-Yun; Kostem, Emrah; Furlotte, Nick; He, Dan; Eskin, Eleazar

    2016-03-01

    Although genome-wide association studies (GWASs) have discovered numerous novel genetic variants associated with many complex traits and diseases, those genetic variants typically explain only a small fraction of phenotypic variance. Factors that account for phenotypic variance include environmental factors and gene-by-environment interactions (GEIs). Recently, several studies have conducted genome-wide gene-by-environment association analyses and demonstrated important roles of GEIs in complex traits. One of the main challenges in these association studies is to control effects of population structure that may cause spurious associations. Many studies have analyzed how population structure influences statistics of genetic variants and developed several statistical approaches to correct for population structure. However, the impact of population structure on GEI statistics in GWASs has not been extensively studied, nor have methods been designed to correct for population structure on GEI statistics. In this paper, we show both analytically and empirically that population structure may cause spurious GEIs and use both simulation and two GWAS datasets to support our finding. We propose a statistical approach based on mixed models to account for population structure on GEI statistics. We find that our approach effectively controls population structure on statistics for GEIs as well as for genetic variants.

  16. A powerful tool for genome analysis in maize: development and evaluation of the high density 600 k SNP genotyping array.

    PubMed

    Unterseer, Sandra; Bauer, Eva; Haberer, Georg; Seidel, Michael; Knaak, Carsten; Ouzunova, Milena; Meitinger, Thomas; Strom, Tim M; Fries, Ruedi; Pausch, Hubert; Bertani, Christofer; Davassi, Alessandro; Mayer, Klaus Fx; Schön, Chris-Carolin

    2014-09-29

    High density genotyping data are indispensable for genomic analyses of complex traits in animal and crop species. Maize is one of the most important crop plants worldwide, however a high density SNP genotyping array for analysis of its large and highly dynamic genome was not available so far. We developed a high density maize SNP array composed of 616,201 variants (SNPs and small indels). Initially, 57 M variants were discovered by sequencing 30 representative temperate maize lines and then stringently filtered for sequence quality scores and predicted conversion performance on the array resulting in the selection of 1.2 M polymorphic variants assayed on two screening arrays. To identify high-confidence variants, 285 DNA samples from a broad genetic diversity panel of worldwide maize lines including the samples used for sequencing, important founder lines for European maize breeding, hybrids, and proprietary samples with European, US, semi-tropical, and tropical origin were used for experimental validation. We selected 616 k variants according to their performance during validation, support of genotype calls through sequencing data, and physical distribution for further analysis and for the design of the commercially available Affymetrix® Axiom® Maize Genotyping Array. This array is composed of 609,442 SNPs and 6,759 indels. Among these are 116,224 variants in coding regions and 45,655 SNPs of the Illumina® MaizeSNP50 BeadChip for study comparison. In a subset of 45,974 variants, apart from the target SNP additional off-target variants are detected, which show only a minor bias towards intermediate allele frequencies. We performed principal coordinate and admixture analyses to determine the ability of the array to detect and resolve population structure and investigated the extent of LD within a worldwide validation panel. 
The high density Affymetrix® Axiom® Maize Genotyping Array is optimized for European and American temperate maize and was developed based on a diverse sample panel by applying stringent quality filter criteria to ensure its suitability for a broad range of applications. With 600 k variants it is the largest currently publicly available genotyping array in crop species.

  17. Assessment of large copy number variants in patients with apparently isolated congenital left-sided cardiac lesions reveals clinically relevant genomic events.

    PubMed

    Hanchard, Neil A; Umana, Luis A; D'Alessandro, Lisa; Azamian, Mahshid; Poopola, Mojisola; Morris, Shaine A; Fernbach, Susan; Lalani, Seema R; Towbin, Jeffrey A; Zender, Gloria A; Fitzgerald-Butt, Sara; Garg, Vidu; Bowman, Jessica; Zapata, Gladys; Hernandez, Patricia; Arrington, Cammon B; Furthner, Dieter; Prakash, Siddharth K; Bowles, Neil E; McBride, Kim L; Belmont, John W

    2017-08-01

    Congenital left-sided cardiac lesions (LSLs) are a significant contributor to the mortality and morbidity of congenital heart disease (CHD). Structural copy number variants (CNVs) have been implicated in LSL without extra-cardiac features; however, non-penetrance and variable expressivity have created uncertainty over the use of CNV analyses in such patients. High-density SNP microarray genotyping data were used to infer large, likely-pathogenic, autosomal CNVs in a cohort of 1,139 probands with LSL and their families. CNVs were molecularly confirmed and the medical records of individual carriers reviewed. The gene content of novel CNVs was then compared with public CNV data from CHD patients. Large CNVs (>1 MB) were observed in 33 probands (∼3%). Six of these were de novo and 14 were not observed in the only available parent sample. Associated cardiac phenotypes spanned a broad spectrum without clear predilection. Candidate CNVs were largely non-recurrent, associated with heterozygous loss of copy number, and overlapped known CHD genomic regions. Novel CNV regions were enriched for cardiac development genes, including seven that have not been previously associated with human CHD. CNV analysis can be a clinically useful and molecularly informative tool in LSLs without obvious extra-cardiac defects, and may identify a clinically relevant genomic disorder in a small but important proportion of these individuals. © 2017 Wiley Periodicals, Inc.

  18. The Structures of the C185S and C185A Mutants of Sulfite Oxidase Reveal Rearrangement of the Active Site

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Qiu, James A.; Wilson, Heather L.; Pushie, M. Jake

    Sulfite oxidase (SO) catalyzes the physiologically critical conversion of sulfite to sulfate. Enzymatic activity is dependent on the presence of the metal molybdenum complexed with a pyranopterin-dithiolene cofactor termed molybdopterin. Comparison of the amino acid sequences of SOs from a variety of sources has identified a single conserved Cys residue essential for catalytic activity. The crystal structure of chicken liver sulfite oxidase indicated that this residue, Cys185 in chicken SO, coordinates the Mo atom in the active site. To improve our understanding of the role of this residue in the catalytic mechanism of sulfite oxidase, serine and alanine variants at position 185 of recombinant chicken SO were generated. Spectroscopic and kinetic studies indicate that neither variant is capable of sulfite oxidation. The crystal structure of the C185S variant was determined to 1.9 Å resolution and to 2.4 Å resolution in the presence of sulfite, and the C185A variant to 2.8 Å resolution. The structures of the C185S and C185A variants revealed that neither the Ser nor the Ala side chain appeared to closely interact with the Mo atom and that a third oxo group replaced the usual cysteine sulfur ligand at the Mo center, confirming earlier extended X-ray absorption fine structure spectroscopy (EXAFS) work on the human C207S mutant. An unexpected result was that in the C185S variant, in the absence of sulfite, the active site residue Tyr322 became disordered as did the loop region flanking it. In the C185S variant crystallized in the presence of sulfite, the Tyr322 residue relocalized to the active site. The C185A variant structure also indicated the presence of a third oxygen ligand; however, Tyr322 remained in the active site. EXAFS studies of the Mo coordination environment indicate the Mo atom is in the oxidized Mo(VI) state in both the C185S and C185A variants of chicken SO and show the expected trioxodithiolene active site.
Density functional theory calculations of the trioxo form of the cofactor reasonably reproduced the Mo=O distances of the complex; however, the calculated Mo-S distances were slightly longer than either crystallographic or EXAFS measurements. Taken together, these results indicate that the active sites of the C185S and C185A variants are essentially catalytically inactive, the crystal structures of C185S and C185A variants contain a fully oxidized, trioxo form of the cofactor, and Tyr322 can undergo a conformational change that is relevant to the reaction mechanism. Additional DFT calculations demonstrated that such methods can reasonably reproduce the geometry and bond lengths of the active site.

  19. Assessing the 5S ribosomal RNA heterogeneity in Arabidopsis thaliana using short RNA next generation sequencing data.

    PubMed

    Szymanski, Maciej; Karlowski, Wojciech M

    2016-01-01

    In eukaryotes, ribosomal 5S rRNAs are products of multigene families organized within clusters of tandemly repeated units. Accumulation of genomic data obtained from a variety of organisms demonstrated that the potential 5S rRNA coding sequences show a large number of variants, often incompatible with folding into a correct secondary structure. Here, we present results of an analysis of a large set of short RNA sequences generated by the next generation sequencing techniques, to address the problem of heterogeneity of the 5S rRNA transcripts in Arabidopsis and identification of potentially functional rRNA-derived fragments.

  20. An efficient and scalable analysis framework for variant extraction and refinement from population-scale DNA sequence data.

    PubMed

    Jun, Goo; Wing, Mary Kate; Abecasis, Gonçalo R; Kang, Hyun Min

    2015-06-01

    The analysis of next-generation sequencing data is computationally and statistically challenging because of the massive volume of data and imperfect data quality. We present GotCloud, a pipeline for efficiently detecting and genotyping high-quality variants from large-scale sequencing data. GotCloud automates sequence alignment, sample-level quality control, variant calling, filtering of likely artifacts using machine-learning techniques, and genotype refinement using haplotype information. The pipeline can process thousands of samples in parallel and requires less computational resources than current alternatives. Experiments with whole-genome and exome-targeted sequence data generated by the 1000 Genomes Project show that the pipeline provides effective filtering against false positive variants and high power to detect true variants. Our pipeline has already contributed to variant detection and genotyping in several large-scale sequencing projects, including the 1000 Genomes Project and the NHLBI Exome Sequencing Project. We hope it will now prove useful to many medical sequencing studies. © 2015 Jun et al.; Published by Cold Spring Harbor Laboratory Press.

  1. Exploring the feasibility of using copy number variants as genetic markers through large-scale whole genome sequencing experiments

    USDA-ARS's Scientific Manuscript database

    Copy number variants (CNV) are large scale duplications or deletions of genomic sequence that are caused by a diverse set of molecular phenomena that are distinct from single nucleotide polymorphism (SNP) formation. Due to their different mechanisms of formation, CNVs are often difficult to track us...

  2. Identifying genetic variants that affect viability in large cohorts

    PubMed Central

    Berisa, Tomaz; Day, Felix R.; Perry, John R. B.

    2017-01-01

    A number of open questions in human evolutionary genetics would become tractable if we were able to directly measure evolutionary fitness. As a step towards this goal, we developed a method to examine whether individual genetic variants, or sets of genetic variants, currently influence viability. The approach consists in testing whether the frequency of an allele varies across ages, accounting for variation in ancestry. We applied it to the Genetic Epidemiology Research on Adult Health and Aging (GERA) cohort and to the parents of participants in the UK Biobank. Across the genome, we found only a few common variants with large effects on age-specific mortality: tagging the APOE ε4 allele and near CHRNA3. These results suggest that when large, even late-onset effects are kept at low frequency by purifying selection. Testing viability effects of sets of genetic variants that jointly influence 1 of 42 traits, we detected a number of strong signals. In participants of the UK Biobank of British ancestry, we found that variants that delay puberty timing are associated with a longer parental life span (P~6.2 × 10⁻⁶ for fathers and P~2.0 × 10⁻³ for mothers), consistent with epidemiological studies. Similarly, variants associated with later age at first birth are associated with a longer maternal life span (P~1.4 × 10⁻³). Signals are also observed for variants influencing cholesterol levels, risk of coronary artery disease (CAD), body mass index, as well as risk of asthma. These signals exhibit consistent effects in the GERA cohort and among participants of the UK Biobank of non-British ancestry. We also found marked differences between males and females, most notably at the CHRNA3 locus, and variants associated with risk of CAD and cholesterol levels. Beyond our findings, the analysis serves as a proof of principle for how upcoming biomedical data sets can be used to learn about selection effects in contemporary humans. PMID:28873088

  3. A 2-step penalized regression method for family-based next-generation sequencing association studies.

    PubMed

    Ding, Xiuhua; Su, Shaoyong; Nandakumar, Kannabiran; Wang, Xiaoling; Fardo, David W

    2014-01-01

    Large-scale genetic studies are often composed of related participants, and utilizing familial relationships can be cumbersome and computationally challenging. We present an approach to efficiently handle sequencing data from complex pedigrees that incorporates information from rare variants as well as common variants. Our method employs a 2-step procedure that sequentially regresses out correlation from familial relatedness and then uses the resulting phenotypic residuals in a penalized regression framework to test for associations with variants within genetic units. The operating characteristics of this approach are detailed using simulation data based on a large, multigenerational cohort.

  4. The fibrous form of intracellular inclusion bodies in recombinant variant fibrinogen-producing cells is specific to the hepatic fibrinogen storage disease-inducible variant fibrinogen.

    PubMed

    Arai, Shinpei; Ogiwara, Naoko; Mukai, Saki; Takezawa, Yuka; Sugano, Mitsutoshi; Honda, Takayuki; Okumura, Nobuo

    2017-06-01

    Fibrinogen storage disease (FSD) is a rare disorder that is characterized by the accumulation of fibrinogen in hepatocytes and induces liver injury. Six mutations in the γC domain (γG284R, γT314P, γD316N, the deletion of γG346-Q350, γG366S, and γR375W) have been identified for FSD. Our group previously established γ375W fibrinogen-producing Chinese hamster ovary (CHO) cells and observed aberrant large granular and fibrous forms of intracellular inclusion bodies. The aim of this study was to investigate whether fibrous intracellular inclusion bodies are specific to FSD-inducible variant fibrinogen. Thirteen expression vectors encoding the variant γ-chain were stably or transiently transfected into CHO cells expressing normal fibrinogen Aα- and Bβ-chains or HuH-7 cells, which were then immunofluorescently stained. Six CHO and HuH-7 cell lines that transiently produced FSD-inducible variant fibrinogen presented the fibrous (3.2-22.7 and 2.1-24.5%, respectively) and large granular (5.4-25.5 and 7.7-23.9%) forms of intracellular inclusion bodies. Seven CHO and HuH-7 cell lines that transiently produced FSD-non-inducible variant fibrinogen exhibited only the large granular form. These results demonstrate that transiently transfected variant fibrinogen-producing CHO cells and inclusion bodies of the fibrous form may be useful in non-invasive screening for FSD risk factors before disease onset.

  5. Intra-variant substructure in Ni–Mn–Ga martensite: Conjugation boundaries

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Muntifering, B.; Pond, R. C.; Kovarik, L.

    2014-06-01

    The microstructure of a Ni–Mn–Ga alloy in the martensitic phase was investigated using transmission electron microscopy. Inter-variant twin boundaries were observed separating non-modulated tetragonal martensite variants. In addition, intra-variant boundary structures, referred to here as “conjugation boundaries”, were also observed. We propose that conjugation boundaries originate at the transformation interface between austenite and a nascent martensite variant. In the alloy studied, deformation twinning was observed, consistent with being the mode of lattice-invariant deformation, and this can occur on either of two crystallographically equivalent conjugate {101}⟨101̄⟩ twinning systems: conjugation boundaries separate regions within a single variant in which the active modes were distinct. The defect structure of conjugation boundaries and the low-angle of misorientation across them are revealed in detail using high-resolution microscopy. Finally, we anticipate that the mobility of such boundaries is lower than that of inter-variant boundaries, and is therefore likely to significantly affect the kinetics of deformation in the martensitic phase.

  6. The Prevalence and Role of Hemoglobin Variants in Biometric Screening of a Multiethnic Population: One Large Health System's Experience.

    PubMed

    Wilburn, Clayton R; Bernard, David W; Zieske, Arthur W; Andrieni, Julia; Miller, Tara; Wang, Ping

    2017-06-01

    To characterize and quantitate hemoglobin (Hb) variants discovered during biometric hemoglobin A1c (HbA1c) analyses in a large multiethnic population with a focus on the effect of variants on testing method and results. In total, 13,913 individuals had their HbA1c measured via ion-exchange high-performance liquid chromatography. Samples that had a variant Hb detected or HbF fraction more than 25% underwent variant Hb characterization and confirmation by gel electrophoresis. RBC indices were also evaluated for possible concomitant thalassemia. Of the 13,913 individuals evaluated, 524 (3.77%) had an Hb variant. The prevalence of each variant was as follows: HbS trait (n = 396, 2.85%), HbSS disease (n = 4, 0.03%), HbC trait (n = 85, 0.61%), HbCC disease (n = 2, 0.01%), HbSC disease (n = 5, 0.04%), HbE trait (n = 18, 0.13%), HbD or G trait (n = 9, 0.06%), HbS β-thalassemia + disease (n = 1, 0.01%), hereditary persistence of HbF (n = 2, 0.01%), and HbMontgomery trait (n = 1, 0.01%). Concomitant α-thalassemia was detected in 20 (3.82%) of the 524 individuals with an Hb variant. This study represents one of the largest epidemiologic investigations into the prevalence of Hb variants in a North American metropolitan, multiethnic workforce and their dependents and reinforces the importance of method selection in populations with Hb variants. © American Society for Clinical Pathology, 2017. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com
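
    The percentages reported in this abstract can be re-derived from the counts it gives. The sketch below is only an illustrative check, not part of the study: the dictionary keys are shorthand labels chosen here, and every percentage is assumed to be a count divided by the 13,913 screened individuals, rounded to two decimals.

```python
TOTAL_SCREENED = 13_913
TOTAL_WITH_VARIANT = 524

# label -> (count, percentage as printed in the abstract)
REPORTED = {
    "HbS trait": (396, 2.85),
    "HbSS disease": (4, 0.03),
    "HbC trait": (85, 0.61),
    "HbCC disease": (2, 0.01),
    "HbSC disease": (5, 0.04),
    "HbE trait": (18, 0.13),
    "HbD or G trait": (9, 0.06),
    "HbS beta-thalassemia disease": (1, 0.01),
    "Hereditary persistence of HbF": (2, 0.01),
    "HbMontgomery trait": (1, 0.01),
}


def percent(count: int, total: int) -> float:
    """Percentage of `total`, rounded to two decimals as in the abstract."""
    return round(100 * count / total, 2)


def mismatches(reported: dict, total: int) -> list:
    """Labels whose printed percentage differs from the recomputed one."""
    return [
        label
        for label, (count, printed) in reported.items()
        if percent(count, total) != printed
    ]


if __name__ == "__main__":
    print("overall:", percent(TOTAL_WITH_VARIANT, TOTAL_SCREENED), "%")
    print("mismatching categories:", mismatches(REPORTED, TOTAL_SCREENED))
    print("sum of listed counts:", sum(n for n, _ in REPORTED.values()))
```

    Each printed percentage agrees with its count under this assumption. The listed categories add up to 523, one fewer than the 524 carriers reported; the abstract does not explain the difference.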

  7. The effect of rare variants on inflation of the test statistics in case-control analyses.

    PubMed

    Pirie, Ailith; Wood, Angela; Lush, Michael; Tyrer, Jonathan; Pharoah, Paul D P

    2015-02-20

    The detection of bias due to cryptic population structure is an important step in the evaluation of findings of genetic association studies. The standard method of measuring this bias in a genetic association study is to compare the observed median association test statistic to the expected median test statistic. This ratio is inflated in the presence of cryptic population structure. However, inflation may also be caused by the properties of the association test itself particularly in the analysis of rare variants. We compared the properties of the three most commonly used association tests: the likelihood ratio test, the Wald test and the score test when testing rare variants for association using simulated data. We found evidence of inflation in the median test statistics of the likelihood ratio and score tests for tests of variants with less than 20 heterozygotes across the sample, regardless of the total sample size. The test statistics for the Wald test were under-inflated at the median for variants below the same minor allele frequency. In a genetic association study, if a substantial proportion of the genetic variants tested have rare minor allele frequencies, the properties of the association test may mask the presence or absence of bias due to population structure. The use of either the likelihood ratio test or the score test is likely to lead to inflation in the median test statistic in the absence of population structure. In contrast, the use of the Wald test is likely to result in under-inflation of the median test statistic which may mask the presence of population structure.
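
    The ratio described in this abstract (observed median test statistic over expected median) is commonly called the genomic inflation factor. A minimal sketch of the calculation follows. It assumes each association test statistic follows a chi-square distribution with 1 degree of freedom under the null; the function names are illustrative and do not come from the paper.

```python
from statistics import NormalDist, median


def expected_median_chi2_1df() -> float:
    """Median of a chi-square(1 df) variable.

    A chi-square(1) variable is the square of a standard normal, so its
    median is the squared 75th percentile of the normal (about 0.455).
    """
    return NormalDist().inv_cdf(0.75) ** 2


def inflation_factor(test_statistics) -> float:
    """Observed median test statistic divided by the expected null median."""
    return median(test_statistics) / expected_median_chi2_1df()


if __name__ == "__main__":
    import random

    random.seed(0)
    # Simulated null statistics: squared standard normal draws.
    null_stats = [random.gauss(0, 1) ** 2 for _ in range(100_000)]
    print(f"lambda under the null: {inflation_factor(null_stats):.3f}")
```

    A value near 1 suggests no inflation. As the abstract notes, values above or below 1 may reflect properties of the test itself for rare variants and not only population structure.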

  8. Searching for missing heritability: Designing rare variant association studies

    PubMed Central

    Zuk, Or; Schaffner, Stephen F.; Samocha, Kaitlin; Do, Ron; Hechter, Eliana; Kathiresan, Sekar; Daly, Mark J.; Neale, Benjamin M.; Sunyaev, Shamil R.; Lander, Eric S.

    2014-01-01

    Genetic studies have revealed thousands of loci predisposing to hundreds of human diseases and traits, revealing important biological pathways and defining novel therapeutic hypotheses. However, the genes discovered to date typically explain less than half of the apparent heritability. Because efforts have largely focused on common genetic variants, one hypothesis is that much of the missing heritability is due to rare genetic variants. Studies of common variants are typically referred to as genomewide association studies, whereas studies of rare variants are often simply called sequencing studies. Because they are actually closely related, we use the terms common variant association study (CVAS) and rare variant association study (RVAS). In this paper, we outline the similarities and differences between RVAS and CVAS and describe a conceptual framework for the design of RVAS. We apply the framework to address key questions about the sample sizes needed to detect association, the relative merits of testing disruptive alleles vs. missense alleles, frequency thresholds for filtering alleles, the value of predictors of the functional impact of missense alleles, the potential utility of isolated populations, the value of gene-set analysis, and the utility of de novo mutations. The optimal design depends critically on the selection coefficient against deleterious alleles and thus varies across genes. The analysis shows that common variant and rare variant studies require similarly large sample collections. In particular, a well-powered RVAS should involve discovery sets with at least 25,000 cases, together with a substantial replication set. PMID:24443550

  9. Performance of genotype imputation for low frequency and rare variants from the 1000 genomes.

    PubMed

    Zheng, Hou-Feng; Rong, Jing-Jing; Liu, Ming; Han, Fang; Zhang, Xing-Wei; Richards, J Brent; Wang, Li

    2015-01-01

    Genotype imputation is now routinely applied in genome-wide association studies (GWAS) and meta-analyses. However, most imputations have been run using HapMap samples as the reference, and imputation of low frequency and rare variants (minor allele frequency (MAF) < 5%) has not been systematically assessed. With the emergence of next-generation sequencing, large reference panels (such as the 1000 Genomes panel) are available to facilitate imputation of these variants. Therefore, to estimate the performance of imputation for low frequency and rare variants, we imputed 153 individuals, each of whom had data from 3 different genotype arrays (317k, 610k and 1 million SNPs), to three different reference panels: the 1000 Genomes pilot March 2010 release (1KGpilot), the 1000 Genomes interim August 2010 release (1KGinterim), and the 1000 Genomes phase1 November 2010 and May 2011 release (1KGphase1), using IMPUTE version 2. The differences between these three releases of the 1000 Genomes data are the sample size, ancestry diversity, number of variants and their frequency spectrum. We found that both reference panel and GWAS chip density affect the imputation of low frequency and rare variants. 1KGphase1 outperformed the other 2 panels, with a higher concordance rate, a higher proportion of well-imputed variants (info>0.4) and a higher mean info score in each MAF bin. Similarly, the 1M chip array outperformed the 610K and 317K arrays. However, for very rare variants (MAF ≤ 0.3%), only 0-1% of the variants were well imputed. We conclude that the imputation of low frequency and rare variants improves with larger reference panels and higher density of genome-wide genotyping arrays. Yet, despite a large reference panel size and dense genotyping density, very rare variants remain difficult to impute.
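
    The per-bin summary reported in this abstract (proportion of well-imputed variants and mean info score in each MAF bin) could be computed along the following lines. The info > 0.4 cutoff and the 0.3% and 5% boundaries come from the abstract; the intermediate 1% edge, the bin labels, and the function name are assumptions made for illustration.

```python
from collections import defaultdict

INFO_THRESHOLD = 0.4  # "well imputed" cutoff quoted in the abstract (info > 0.4)

# Hypothetical MAF bins as (label, lower, upper); each interval is (lower, upper].
MAF_BINS = [
    ("<=0.3%", 0.0, 0.003),
    ("0.3-1%", 0.003, 0.01),
    ("1-5%", 0.01, 0.05),
]


def summarize(variants):
    """Summarize imputation quality per MAF bin.

    variants: iterable of (maf, info) pairs.
    Returns {label: (count, fraction_well_imputed, mean_info)}.
    Variants with MAF outside every bin are ignored.
    """
    grouped = defaultdict(list)
    for maf, info in variants:
        for label, lower, upper in MAF_BINS:
            if lower < maf <= upper:
                grouped[label].append(info)
                break
    summary = {}
    for label, infos in grouped.items():
        well = sum(1 for value in infos if value > INFO_THRESHOLD)
        summary[label] = (len(infos), well / len(infos), sum(infos) / len(infos))
    return summary
```

    Running the same summary once per reference panel and per array density would give the kind of comparison the abstract describes.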

  10. In vivo study of the surgical anatomy of the axilla.

    PubMed

    Khan, A; Chakravorty, A; Gui, G P H

    2012-06-01

    Classical anatomical descriptions fail to describe variants often observed in the axilla as they are based on studies that looked at individual structures in isolation or textbooks of cadaveric dissections. The presence of variant anatomy heightens the risk of iatrogenic injury. The aim of this study was to document the nature and frequency of these anatomical variations based on in vivo peroperative surgical observations. Detailed anatomical relationships were documented prospectively during consecutive axillary dissections. Relationships between the thoracodorsal pedicle, course of the lateral thoracic vein, presence of latissimus dorsi muscle slips, variations in axillary and angular vein anatomy, and origins and branching of the intercostobrachial nerve were recorded. Among a total of 73 axillary dissections, 43 (59 per cent) revealed at least one anatomical variant. Most notable variants included aberrant courses of the thoracodorsal nerve in ten patients (14 per cent)--three variants; lateral thoracic vein in 12 patients (16 per cent)--four variants; bifid axillary veins in ten patients (14 per cent); latissimus dorsi muscle slips in four patients (5 per cent); and variants in intercostobrachial nerve origins and branching in 26 patients (36 per cent). The angular vein, a subscapular vein tributary, was found to be a constant axillary structure. Variations in axillary anatomical structures are common. Poor understanding of these variants can affect the adequacy of oncological clearance, lead to vascular injury, compromise planned microvascular procedures and result in chronic pain or numbness from nerve injury. Surgeons should be aware of the common anatomical variants to facilitate efficient and safe axillary surgery. Copyright © 2012 British Journal of Surgery Society Ltd. Published by John Wiley & Sons, Ltd.

  11. Functional analysis of a large set of BRCA2 exon 7 variants highlights the predictive value of hexamer scores in detecting alterations of exonic splicing regulatory elements.

    PubMed

    Di Giacomo, Daniela; Gaildrat, Pascaline; Abuli, Anna; Abdat, Julie; Frébourg, Thierry; Tosi, Mario; Martins, Alexandra

    2013-11-01

    Exonic variants can alter pre-mRNA splicing either by changing splice sites or by modifying splicing regulatory elements. Often these effects are difficult to predict and are only detected by performing RNA analyses. Here, we analyzed, in a minigene assay, 26 variants identified in exon 7 of BRCA2, a cancer predisposition gene. Our results revealed eight new exon skipping mutations in this exon: one directly altering the 5' splice site and seven affecting potential regulatory elements. This brings the number of splicing regulatory mutations detected in BRCA2 exon 7 to a total of 11, a remarkably high number considering the total number of variants reported in this exon (n = 36), all tested in our minigene assay. We then exploited this large set of splicing data to test the predictive value of the splicing regulator hexamer scores recently established by Ke et al. Comparisons of hexamer-based predictions with our experimental data revealed high sensitivity in detecting variants that increased exon skipping, an important feature for prescreening variants before RNA analysis. In conclusion, hexamer scores represent a promising tool for predicting the biological consequences of exonic variants and may have important applications for the interpretation of variants detected by high-throughput sequencing. © 2013 WILEY PERIODICALS, INC.
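
    One plausible way to turn per-hexamer scores into a variant-level prediction is to sum the scores of every hexamer overlapping the substituted position, for the reference and the variant sequence, and take the difference. The sketch below assumes that scheme; the score table is a placeholder supplied by the caller, the function names are hypothetical, and the abstract does not state that the authors computed their predictions exactly this way.

```python
def hexamer_total(seq, scores, k=6):
    """Sum the scores of all overlapping k-mers in seq.

    k-mers missing from the score table contribute 0.
    """
    seq = seq.upper()
    return sum(scores.get(seq[i:i + k], 0.0) for i in range(len(seq) - k + 1))


def delta_score(seq, pos, alt, scores, k=6):
    """Score change caused by substituting `alt` at 0-based position `pos`.

    Only the k-mers that overlap the substituted position are compared,
    since all other k-mers are identical in both sequences.
    """
    if not 0 <= pos < len(seq):
        raise IndexError("pos is outside the sequence")
    mutant = seq[:pos] + alt + seq[pos + 1:]
    start = max(0, pos - k + 1)
    end = min(len(seq), pos + k)
    return (hexamer_total(mutant[start:end], scores, k)
            - hexamer_total(seq[start:end], scores, k))
```

    Under the assumption that positive scores mark enhancer-like hexamers and negative scores mark silencer-like ones, a negative difference would flag a variant as a candidate for increased exon skipping, to be confirmed by RNA analysis.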

  12. Analysis of potential protein-modifying variants in 9000 endometriosis patients and 150000 controls of European ancestry.

    PubMed

    Sapkota, Yadav; Vivo, Immaculata De; Steinthorsdottir, Valgerdur; Fassbender, Amelie; Bowdler, Lisa; Buring, Julie E; Edwards, Todd L; Jones, Sarah; O, Dorien; Peterse, Daniëlle; Rexrode, Kathryn M; Ridker, Paul M; Schork, Andrew J; Thorleifsson, Gudmar; Wallace, Leanne M; Kraft, Peter; Morris, Andrew P; Nyholt, Dale R; Edwards, Digna R Velez; Nyegaard, Mette; D'Hooghe, Thomas; Chasman, Daniel I; Stefansson, Kari; Missmer, Stacey A; Montgomery, Grant W

    2017-09-12

    Genome-wide association (GWA) studies have identified 19 independent common risk loci for endometriosis. Most of the GWA variants are non-coding and the genes responsible for the association signals have not been identified. Herein, we aimed to assess the potential role of protein-modifying variants in endometriosis using exome-array genotyping in 7164 cases and 21005 controls, and a replication set of 1840 cases and 129016 controls of European ancestry. Results in the discovery sample identified significant evidence for association with coding variants in single-variant (rs1801232-CUBN) and gene-level (CIITA and PARP4) meta-analyses, but these did not survive replication. In the combined analysis, there was genome-wide significant evidence for rs13394619 (P = 2.3 × 10^-9) in GREB1 at 2p25.1 - a locus previously identified in a GWA meta-analysis of European and Japanese samples. Despite sufficient power, our results did not identify any protein-modifying variants (MAF > 0.01) with moderate or large effect sizes in endometriosis, although these variants may exist in non-European populations or in high-risk families. The results suggest continued discovery efforts should focus on genotyping large numbers of surgically-confirmed endometriosis cases and controls, and/or sequencing high-risk families to identify novel rare variants to provide greater insights into the molecular pathogenesis of the disease.

  13. Deciphering Variability of PKD1 and PKD2 in an Italian Cohort of 643 Patients with Autosomal Dominant Polycystic Kidney Disease (ADPKD)

    PubMed Central

    Carrera, Paola; Calzavara, Silvia; Magistroni, Riccardo; den Dunnen, Johan T.; Rigo, Francesca; Stenirri, Stefania; Testa, Francesca; Messa, Piergiorgio; Cerutti, Roberta; Scolari, Francesco; Izzi, Claudia; Edefonti, Alberto; Negrisolo, Susanna; Benetti, Elisa; Alibrandi, Maria Teresa Sciarrone; Manunta, Paolo; Boletta, Alessandra; Ferrari, Maurizio

    2016-01-01

    Autosomal Dominant Polycystic Kidney Disease (ADPKD) is the most common hereditary kidney disease. We analysed PKD1 and PKD2 in a large cohort of 440 unrelated Italian patients with ADPKD and 203 relatives by direct sequencing and MLPA. Molecular and detailed phenotypic data have been collected and submitted to the PKD1/PKD2 LOVD database. This is the first large retrospective study in Italian patients, describing 701 variants, 249 (35.5%) already associated with ADPKD and 452 (64.5%) novel. According to the criteria adopted, the overall detection rate was 80% (352/440). Novel variants with uncertain significance were found in 14% of patients. Among patients with pathogenic variants, in 301 (85.5%) the disease is associated with PKD1, 196 (55.7%) truncating, 81 (23%) non truncating, 24 (6.8%) IF indels, and in 51 (14.5%) with PKD2. Our results outline the high allelic heterogeneity of variants, complicated by the presence of variants of uncertain significance as well as of multiple variants in the same subject. Classification of novel variants may be particularly cumbersome, with an important impact on genetic counselling. Our study confirms the importance of improving the assessment of variant pathogenicity for ADPKD; to this end, databasing of both clinical and molecular data is crucial. PMID:27499327
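
    The percentages quoted in this abstract can be checked against the counts it gives. The short sketch below does that arithmetic; it assumes, as the numbers suggest but the abstract does not state outright, that the 352 patients with pathogenic variants are the denominator for the PKD1 and PKD2 breakdown. The helper name is hypothetical.

```python
def pct(part, whole):
    """Percentage of `whole` represented by `part`, rounded to one decimal."""
    return round(100 * part / whole, 1)


# Counts taken from the abstract.
PATIENTS = 440
DETECTED = 352          # patients with pathogenic variants
VARIANTS = 701
KNOWN, NOVEL = 249, 452
PKD1, PKD2 = 301, 51
PKD1_TRUNCATING, PKD1_NON_TRUNCATING, PKD1_IF_INDELS = 196, 81, 24

if __name__ == "__main__":
    print("detection rate:", pct(DETECTED, PATIENTS))
    print("known / novel variants:", pct(KNOWN, VARIANTS), pct(NOVEL, VARIANTS))
    print("PKD1 / PKD2:", pct(PKD1, DETECTED), pct(PKD2, DETECTED))
    print("PKD1 subtypes:",
          pct(PKD1_TRUNCATING, DETECTED),
          pct(PKD1_NON_TRUNCATING, DETECTED),
          pct(PKD1_IF_INDELS, DETECTED))
```

    The three PKD1 subtype counts sum to 301, and the PKD1 and PKD2 counts sum to 352, which is consistent with the stated detection rate.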

  14. Variability extraction and modeling for product variants.

    PubMed

    Linsbauer, Lukas; Lopez-Herrejon, Roberto Erick; Egyed, Alexander

    2017-01-01

    Fast-changing hardware and software technologies in addition to larger and more specialized customer bases demand software tailored to meet very diverse requirements. Software development approaches that aim at capturing this diversity on a single consolidated platform often require large upfront investments, e.g., time or budget. Alternatively, companies resort to developing one variant of a software product at a time by reusing as much as possible from already-existing product variants. However, identifying and extracting the parts to reuse is an error-prone and inefficient task compounded by the typically large number of product variants. Hence, more disciplined and systematic approaches are needed to cope with the complexity of developing and maintaining sets of product variants. Such approaches require detailed information about the product variants, the features they provide and their relations. In this paper, we present an approach to extract such variability information from product variants. It identifies traces from features and feature interactions to their implementation artifacts, and computes their dependencies. This work can be useful in many scenarios ranging from ad hoc development approaches such as clone-and-own to systematic reuse approaches such as software product lines. We applied our variability extraction approach to six case studies and provide a detailed evaluation. The results show that the extracted variability information is consistent with the variability in our six case study systems given by their variability models and available product variants.
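
    The idea of tracing features to implementation artifacts across product variants can be illustrated with a deliberately simplified set-based heuristic: an artifact is attributed to a feature if it appears in every variant that has the feature and in no variant that lacks it. This is an assumption made for illustration, not the authors' algorithm; in particular it ignores feature interactions and dependencies, and all names and file labels are hypothetical.

```python
def extract_traces(variants):
    """Map each feature to the artifacts that co-occur exactly with it.

    variants: list of (features, artifacts) pairs, each an iterable of strings.
    Returns {feature: set of artifacts found in every variant that has the
    feature and in no variant that lacks it}.
    """
    variants = [(set(features), set(artifacts)) for features, artifacts in variants]
    all_features = set().union(*(features for features, _ in variants))
    traces = {}
    for feature in sorted(all_features):
        with_feature = [arts for feats, arts in variants if feature in feats]
        without_feature = [arts for feats, arts in variants if feature not in feats]
        common = set.intersection(*with_feature)
        seen_elsewhere = set().union(*without_feature)
        traces[feature] = common - seen_elsewhere
    return traces


if __name__ == "__main__":
    # Three hypothetical product variants of a small editor.
    products = [
        ({"base", "undo"}, {"core.c", "undo.c"}),
        ({"base"}, {"core.c"}),
        ({"base", "undo", "print"}, {"core.c", "undo.c", "print.c"}),
    ]
    for feature, artifacts in extract_traces(products).items():
        print(feature, sorted(artifacts))
```

    With more variants the heuristic separates features more reliably; features that always occur together cannot be told apart by it.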

  15. Unraveling the effects of amino acid substitutions enhancing lipase resistance to an ionic liquid: a molecular dynamics study.

    PubMed

    Zhao, Jing; Frauenkron-Machedjou, Victorine Josiane; Fulton, Alexander; Zhu, Leilei; Davari, Mehdi D; Jaeger, Karl-Erich; Schwaneberg, Ulrich; Bocola, Marco

    2018-04-04

    Understanding of the structural and dynamic properties of enzymes in non-aqueous media (e.g., ionic liquids, ILs) is highly attractive for protein engineers and synthetic biochemists. Despite a growing number of molecular dynamics (MD) simulation studies on the influence of different ILs on wild-type enzymes, the effects of various amino acid substitutions on the stability and activity of enzymes in ILs remain to be unraveled at the molecular level. Herein, we selected fifty previously reported Bacillus subtilis lipase A (BSLA) variants with increased resistance towards an IL (15 vol% 1-butyl-3-methylimidazolium trifluoromethanesulfonate; [Bmim][TfO]), and also ten non-resistant BSLA variants for a MD simulation study to identify the underlying molecular principles. Some important properties differentiating resistant and non-resistant BSLA variants from wild-type were elucidated. Results show that, in 15 vol% [Bmim][TfO] aqueous solution, 40% and 60% of non-resistant variants have lower and equal probabilities to form a catalytically important hydrogen bond between S77 and H156 compared to wild-type, whereas 36% and 56% of resistant variants show increased and equal probabilities, respectively. Introducing positively charged amino acids close to the substrate-binding cleft for instance I12R is beneficial for the BSLA resistance towards 15 vol% [Bmim][TfO], likely due to the reduced probability of [Bmim]+ cations clustering near the cleft. In contrast, substitution with a large hydrophobic residue like I12F can block the cleft through hydrophobic interaction with a neighboring nonpolar loop 134-137 or/and an attractive π-π interaction with [Bmim]+ cations. In addition, the resistant variants having polar substitutions on the surface show higher ability to stabilize the surface water molecule network in comparison to non-resistant variants. 
This study can guide experimentalists to rationally design promising IL-resistant enzymes, and contribute to a deeper understanding of protein-IL interactions at the molecular level.

  16. Clinical implications of SCN1A missense and truncation variants in a large Japanese cohort with Dravet syndrome.

    PubMed

    Ishii, Atsushi; Watkins, Joseph C; Chen, Debbie; Hirose, Shinichi; Hammer, Michael F

    2017-02-01

    Two major classes of SCN1A variants are associated with Dravet syndrome (DS): those that result in haploinsufficiency (truncating) and those that result in an amino acid substitution (missense). The aim of this retrospective study was to describe the first large cohort of Japanese patients with SCN1A mutation-positive DS (n = 285), and investigate the relationship between variant (type and position) and clinical expression and response to treatment. We sequenced all exons and intron-exon boundaries of SCN1A in our cohort, investigated differences in the distribution of truncating and missense variants, tested for associations between variant type and phenotype, and compared these patterns with those of cohorts with milder epilepsy and healthy individuals. Unlike truncation variants, missense variants are found at higher density in the S4 voltage sensor and pore loops and at lower density in the domain I-II and II-III linkers and the first three segments of domain II. Relative to healthy individuals, there is an increased frequency of truncating (but not missense) variants in the noncoding C-terminus. The rate of cognitive decline is more rapid for patients with truncation variants regardless of age at seizure onset, whereas age at onset is a predictor of the rate of cognitive decline for patients with missense variants. We found significant differences in the distribution of truncating and missense variants across the SCN1A sequence among healthy individuals, patients with DS, and those with milder forms of SCN1A-variant positive epilepsy. Testing for associations with phenotype revealed that variant type can be predictive of rate of cognitive decline. Analysis of descriptive medication data suggests that in addition to conventional drug therapy in DS, bromide, clonazepam and topiramate may reduce seizure frequency. Wiley Periodicals, Inc. © 2016 International League Against Epilepsy.

  17. Crystal structure of p44, a constitutively active splice variant of visual arrestin.

    PubMed

    Granzin, Joachim; Cousin, Anneliese; Weirauch, Moritz; Schlesinger, Ramona; Büldt, Georg; Batra-Safferling, Renu

    2012-03-09

    Visual arrestin specifically binds to photoactivated and phosphorylated rhodopsin and inactivates phototransduction. In contrast, the p44 splice variant can terminate phototransduction by binding to nonphosphorylated light-activated rhodopsin. Here we report the crystal structure of bovine p44 at a resolution of 1.85 Å. Compared to native arrestin, the p44 structure reveals significant differences in regions crucial for receptor binding, namely flexible loop V-VI and polar core regions. Additionally, electrostatic potential is remarkably positive on the N-domain and the C-domain. The p44 structure represents an active conformation that serves as a model to explain the 'constitutive activity' found in arrestin variants. Copyright © 2012 Elsevier Ltd. All rights reserved.

  18. Methodological Considerations in Estimation of Phenotype Heritability Using Genome-Wide SNP Data, Illustrated by an Analysis of the Heritability of Height in a Large Sample of African Ancestry Adults

    PubMed Central

    Chen, Fang; He, Jing; Zhang, Jianqi; Chen, Gary K.; Thomas, Venetta; Ambrosone, Christine B.; Bandera, Elisa V.; Berndt, Sonja I.; Bernstein, Leslie; Blot, William J.; Cai, Qiuyin; Carpten, John; Casey, Graham; Chanock, Stephen J.; Cheng, Iona; Chu, Lisa; Deming, Sandra L.; Driver, W. Ryan; Goodman, Phyllis; Hayes, Richard B.; Hennis, Anselm J. M.; Hsing, Ann W.; Hu, Jennifer J.; Ingles, Sue A.; John, Esther M.; Kittles, Rick A.; Kolb, Suzanne; Leske, M. Cristina; Monroe, Kristine R.; Murphy, Adam; Nemesure, Barbara; Neslund-Dudas, Christine; Nyante, Sarah; Ostrander, Elaine A; Press, Michael F.; Rodriguez-Gil, Jorge L.; Rybicki, Ben A.; Schumacher, Fredrick; Stanford, Janet L.; Signorello, Lisa B.; Strom, Sara S.; Stevens, Victoria; Van Den Berg, David; Wang, Zhaoming; Witte, John S.; Wu, Suh-Yuh; Yamamura, Yuko; Zheng, Wei; Ziegler, Regina G.; Stram, Alexander H.; Kolonel, Laurence N.; Marchand, Loïc Le; Henderson, Brian E.; Haiman, Christopher A.; Stram, Daniel O.

    2015-01-01

    Height has an extremely polygenic pattern of inheritance. Genome-wide association studies (GWAS) have revealed hundreds of common variants that are associated with human height at genome-wide levels of significance. However, only a small fraction of phenotypic variation can be explained by the aggregate of these common variants. In a large study of African-American men and women (n = 14,419), we genotyped and analyzed 966,578 autosomal SNPs across the entire genome using a linear mixed model variance components approach implemented in the program GCTA (Yang et al Nat Genet 2010), and estimated an additive heritability of 44.7% (se: 3.7%) for this phenotype in a sample of evidently unrelated individuals. While this estimated value is similar to that given by Yang et al in their analyses, we remain concerned about two related issues: (1) whether in the complete absence of hidden relatedness, variance components methods have adequate power to estimate heritability when a very large number of SNPs are used in the analysis; and (2) whether estimation of heritability may be biased, in real studies, by low levels of residual hidden relatedness. We addressed the first question in a semi-analytic fashion by directly simulating the distribution of the score statistic for a test of zero heritability with and without low levels of relatedness. The second question was addressed by a very careful comparison of the behavior of estimated heritability for both observed (self-reported) height and simulated phenotypes compared to imputation R2 as a function of the number of SNPs used in the analysis. 
These simulations help to address the important question about whether today's GWAS SNPs will remain useful for imputing causal variants that are discovered using very large sample sizes in future studies of height, or whether the causal variants themselves will need to be genotyped de novo in order to build a prediction model that ultimately captures a large fraction of the variability of height, and by implication other complex phenotypes. Our overall conclusions are that when study sizes are quite large (5,000 or so) the additive heritability estimate for height is not apparently biased upwards using the linear mixed model; however there is evidence in our simulation that a very large number of causal variants (many thousands) each with very small effect on phenotypic variance will need to be discovered to fill the gap between the heritability explained by known versus unknown causal variants. We conclude that today's GWAS data will remain useful in the future for causal variant prediction, but that finding the causal variants that need to be predicted may be extremely laborious. PMID:26125186

  19. Methodological Considerations in Estimation of Phenotype Heritability Using Genome-Wide SNP Data, Illustrated by an Analysis of the Heritability of Height in a Large Sample of African Ancestry Adults.

    PubMed

    Chen, Fang; He, Jing; Zhang, Jianqi; Chen, Gary K; Thomas, Venetta; Ambrosone, Christine B; Bandera, Elisa V; Berndt, Sonja I; Bernstein, Leslie; Blot, William J; Cai, Qiuyin; Carpten, John; Casey, Graham; Chanock, Stephen J; Cheng, Iona; Chu, Lisa; Deming, Sandra L; Driver, W Ryan; Goodman, Phyllis; Hayes, Richard B; Hennis, Anselm J M; Hsing, Ann W; Hu, Jennifer J; Ingles, Sue A; John, Esther M; Kittles, Rick A; Kolb, Suzanne; Leske, M Cristina; Millikan, Robert C; Monroe, Kristine R; Murphy, Adam; Nemesure, Barbara; Neslund-Dudas, Christine; Nyante, Sarah; Ostrander, Elaine A; Press, Michael F; Rodriguez-Gil, Jorge L; Rybicki, Ben A; Schumacher, Fredrick; Stanford, Janet L; Signorello, Lisa B; Strom, Sara S; Stevens, Victoria; Van Den Berg, David; Wang, Zhaoming; Witte, John S; Wu, Suh-Yuh; Yamamura, Yuko; Zheng, Wei; Ziegler, Regina G; Stram, Alexander H; Kolonel, Laurence N; Le Marchand, Loïc; Henderson, Brian E; Haiman, Christopher A; Stram, Daniel O

    2015-01-01

    Height has an extremely polygenic pattern of inheritance. Genome-wide association studies (GWAS) have revealed hundreds of common variants that are associated with human height at genome-wide levels of significance. However, only a small fraction of phenotypic variation can be explained by the aggregate of these common variants. In a large study of African-American men and women (n = 14,419), we genotyped and analyzed 966,578 autosomal SNPs across the entire genome using a linear mixed model variance components approach implemented in the program GCTA (Yang et al Nat Genet 2010), and estimated an additive heritability of 44.7% (se: 3.7%) for this phenotype in a sample of evidently unrelated individuals. While this estimated value is similar to that given by Yang et al in their analyses, we remain concerned about two related issues: (1) whether in the complete absence of hidden relatedness, variance components methods have adequate power to estimate heritability when a very large number of SNPs are used in the analysis; and (2) whether estimation of heritability may be biased, in real studies, by low levels of residual hidden relatedness. We addressed the first question in a semi-analytic fashion by directly simulating the distribution of the score statistic for a test of zero heritability with and without low levels of relatedness. The second question was addressed by a very careful comparison of the behavior of estimated heritability for both observed (self-reported) height and simulated phenotypes compared to imputation R2 as a function of the number of SNPs used in the analysis. 
These simulations help to address the important question about whether today's GWAS SNPs will remain useful for imputing causal variants that are discovered using very large sample sizes in future studies of height, or whether the causal variants themselves will need to be genotyped de novo in order to build a prediction model that ultimately captures a large fraction of the variability of height, and by implication other complex phenotypes. Our overall conclusions are that when study sizes are quite large (5,000 or so) the additive heritability estimate for height is not apparently biased upwards using the linear mixed model; however there is evidence in our simulation that a very large number of causal variants (many thousands) each with very small effect on phenotypic variance will need to be discovered to fill the gap between the heritability explained by known versus unknown causal variants. We conclude that today's GWAS data will remain useful in the future for causal variant prediction, but that finding the causal variants that need to be predicted may be extremely laborious.

  20. Characterization of coarse bainite transformation in low carbon steel during simulated welding thermal cycles

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Lan, Liangyun, E-mail: lanly@me.neu.edu.cn; State Key Laboratory of Rolling Technology and Automation, Northeastern University, Shenyang 110819; Kong, Xiangwei

    2015-07-15

    Coarse austenite to bainite transformation in low carbon steel under simulated welding thermal cycles was morphologically and crystallographically characterized by means of optical microscope, transmission electron microscope and electron backscattered diffraction technology. The results showed that the main microstructure changes from a mixture of lath martensite and bainitic ferrite to granular bainite with the increase in cooling time. The width of bainitic laths also increases gradually with the cooling time. For a welding thermal cycle with relatively short cooling time (e.g. t8/5 is 30 s), the main mode of variant grouping at the scale of individual prior austenite grains changes from Bain grouping to close-packed plane grouping with the progress of phase transformation, which results in inhomogeneous distribution of high angle boundaries. As the cooling time is increased, the Bain grouping of variants becomes the predominant mode, which enlarges the effective grain size of the product phase. Highlights: • Main microstructure changes and the width of lath structure increases with cooling time. • Variant grouping changes from Bain zone to close-packed plane grouping with the transformation. • The change of variant grouping results in uneven distribution of high angle grain boundary. • Bain grouping is main mode for large heat input, which lowers the density of high angle boundary.

  1. Next generation sequencing gives an insight into the characteristics of highly selected breeds versus non-breed horses in the course of domestication.

    PubMed

    Metzger, Julia; Tonda, Raul; Beltran, Sergi; Agueda, Lídia; Gut, Marta; Distl, Ottmar

    2014-07-04

    Domestication has shaped the horse and led to a group of many different types. Some have been under strong human selection while others developed in close relationship with nature. The aim of our study was to perform next generation sequencing of breed and non-breed horses to provide an insight into genetic influences on selective forces. Whole genome sequencing of five horses of four different populations revealed 10,193,421 single nucleotide polymorphisms (SNPs) and 1,361,948 insertion/deletion polymorphisms (indels). In comparison to horse variant databases and previous reports, we were able to identify 3,394,883 novel SNPs and 868,525 novel indels. We analyzed the distribution of individual variants and found significant enrichment of private mutations in coding regions of genes involved in primary metabolic processes, anatomical structures, morphogenesis and cellular components in non-breed horses and, in contrast, of private mutations in genes affecting cell communication, lipid metabolic process, neurological system process, muscle contraction, ion transport, developmental processes of the nervous system and ectoderm in breed horses. Our next generation sequencing data constitute an important first step for the characterization of non-breed in comparison to breed horses and provide a large number of novel variants for future analyses. Functional annotations suggest specific variants that could play a role for the characterization of breed or non-breed horses.

  2. Evolution in vitro: analysis of a lineage of ribozymes

    NASA Technical Reports Server (NTRS)

    Lehman, N.; Joyce, G. F.

    1993-01-01

    Background: Catalytic RNAs, or ribozymes, possessing both a genotype and a phenotype, are ideal molecules for evolution experiments in vitro. A large, heterogeneous pool of RNAs can be subjected to multiple rounds of selection, amplification and mutation, leading to the development of variants that have some desired phenotype. Such experiments allow the investigator to correlate specific genetic changes with quantifiable alterations of the catalytic properties of the RNA. In addition, patterns of evolutionary change can be discerned through a detailed examination of the genotypic composition of the evolving RNA population. Results: Beginning with a pool of 10^13 variants of the Tetrahymena ribozyme, we carried out in vitro evolution experiments that led to the generation of ribozymes with the ability to cleave an RNA substrate in the presence of Ca2+ ions, an activity that does not exist for the wild-type molecule. Over the course of 12 generations, a seven-error variant emerged that has substantial Ca2+-dependent RNA-cleavage activity. Advantageous mutations increased in frequency in the population according to three distinct dynamics--logarithmic, linear and transient. Through a comparative analysis of 31 individual variants, we infer how certain mutations influence the catalytic properties of the ribozyme. Conclusions: In vitro evolution experiments make it possible to elucidate important aspects of both evolutionary biology and structural biochemistry on a reasonably short time scale.

  3. Nuclear-Localized and Deregulated Calcium- and Calmodulin-Dependent Protein Kinase Activates Rhizobial and Mycorrhizal Responses in Lotus japonicus

    PubMed Central

    Takeda, Naoya; Maekawa, Takaki; Hayashi, Makoto

    2012-01-01

    The common symbiosis pathway is at the core of symbiosis signaling between plants and soil microbes. In this pathway, calcium- and calmodulin-dependent protein kinase (CCaMK) plays a crucial role in integrating the signals both in arbuscular mycorrhizal symbiosis (AMS) and in root nodule symbiosis (RNS). However, the molecular mechanism by which CCaMK coordinates AMS and RNS is largely unknown. Here, we report that the gain-of-function (GOF) variants of CCaMK without the regulatory domains activate both AMS and RNS signaling pathways in the absence of symbiotic partners. This activation requires nuclear localization of CCaMK. Enforced nuclear localization of the GOF-CCaMK variants by fusion with a canonical nuclear localization signal enhances signaling activity of AMS and RNS. The GOF-CCaMK variant triggers formation of a structure similar to the prepenetration apparatus, which guides infection of arbuscular mycorrhizal fungi to host root cells. In addition, the GOF-CCaMK variants without the regulatory domains partly restore AMS but fail to support rhizobial infection in ccamk mutants. These data indicate that AMS, the more ancient type of symbiosis, can be mainly regulated by the kinase activity of CCaMK, whereas RNS, which evolved more recently, requires complex regulation performed by the regulatory domains of CCaMK. PMID:22337918

  4. The cytoplasmic expression of MUC1 in papillary thyroid carcinoma of different histological variants and its correlation with cyclin D1 overexpression.

    PubMed

    Abrosimov, Alexander; Saenko, Vladimir; Meirmanov, Serik; Nakashima, Masahiro; Rogounovitch, Tatiana; Shkurko, Olesya; Lushnikov, Eugeny; Mitsutake, Norisato; Namba, Hiroyuki; Yamashita, Shunichi

    2007-01-01

    This study addressed the immunohistochemical expression of MUC1 in papillary thyroid carcinoma (PTC) of different histotypes, sizes, and morphological features of aggressiveness, and its correlation with the overexpression of cyclin D1, a target molecule of the Wnt pathway. MUC1 expression was examined in a total of 209 PTCs. Cytoplasmic MUC1 expression was elevated in the tall, columnar cell and oncocytic variants (100%), Warthin-like (78%), and conventional PTCs (61%), and in papillary microcarcinoma (PMC) with the conventional growth pattern (52%). On the contrary, it was low in the follicular variant (27%) of PTC and PMCs with follicular architecture (13%). Cytoplasmic MUC1 accumulation did not associate with any clinicopathological features except peritumoral lymphoid infiltration in PTCs and in PMCs with the conventional growth pattern. MUC1 staining correlated with cyclin D1 overexpression in conventional PTCs and PMCs and PMCs with follicular architecture. The results demonstrate that MUC1 expression varies broadly in different histological variants of PTC, being the lowest in tumors with follicular structure. In general, it does not prove to be a prognosticator of PTC aggressiveness. A high correlation between MUC1 and cyclin D1 implies MUC1 involvement in the Wnt cascade functioning in a large subset of human PTCs and PMCs.

  5. Large-scale gene-centric analysis identifies novel variants for coronary artery disease.

    PubMed

    2011-09-01

    Coronary artery disease (CAD) has a significant genetic contribution that is incompletely characterized. To complement genome-wide association (GWA) studies, we conducted a large and systematic candidate gene study of CAD susceptibility, including analysis of many uncommon and functional variants. We examined 49,094 genetic variants in ∼2,100 genes of cardiovascular relevance, using a customised gene array in 15,596 CAD cases and 34,992 controls (11,202 cases and 30,733 controls of European descent; 4,394 cases and 4,259 controls of South Asian origin). We attempted to replicate putative novel associations in an additional 17,121 CAD cases and 40,473 controls. Potential mechanisms through which the novel variants could affect CAD risk were explored through association tests with vascular risk factors and gene expression. We confirmed associations of several previously known CAD susceptibility loci (eg, 9p21.3:p<10(-33); LPA:p<10(-19); 1p13.3:p<10(-17)) as well as three recently discovered loci (COL4A1/COL4A2, ZC3HC1, CYP17A1:p<5×10(-7)). However, we found essentially null results for most previously suggested CAD candidate genes. In our replication study of 24 promising common variants, we identified novel associations of variants in or near LIPA, IL5, TRIB1, and ABCG5/ABCG8, with per-allele odds ratios for CAD risk with each of the novel variants ranging from 1.06-1.09. Associations with variants at LIPA, TRIB1, and ABCG5/ABCG8 were supported by gene expression data or effects on lipid levels. Apart from the previously reported variants in LPA, none of the other ∼4,500 low frequency and functional variants showed a strong effect. Associations in South Asians did not differ appreciably from those in Europeans, except for 9p21.3 (per-allele odds ratio: 1.14 versus 1.27 respectively; P for heterogeneity = 0.003). This large-scale gene-centric analysis has identified several novel genes for CAD that relate to diverse biochemical and cellular functions and clarified the literature with regard to many previously suggested genes.

  6. Controlling coherence using the internal structure of hard pi pulses.

    PubMed

    Dong, Yanqun; Ramos, R G; Li, Dale; Barrett, S E

    2008-06-20

    The tiny difference between hard pi pulses and their delta-function approximation can be exploited to control coherence. Variants on the magic echo that work despite a large spread in resonance offsets are demonstrated using the zeroth- and first-order average Hamiltonian terms, for 13C NMR in C60. The 29Si NMR linewidth of silicon has been reduced by a factor of about 70,000 using this approach, which also has potential applications in magnetic resonance microscopy and imaging of solids.

  7. Inexpensive and Highly Reproducible Cloud-Based Variant Calling of 2,535 Human Genomes

    PubMed Central

    Shringarpure, Suyash S.; Carroll, Andrew; De La Vega, Francisco M.; Bustamante, Carlos D.

    2015-01-01

    Population scale sequencing of whole human genomes is becoming economically feasible; however, data management and analysis remains a formidable challenge for many research groups. Large sequencing studies, like the 1000 Genomes Project, have improved our understanding of human demography and the effect of rare genetic variation in disease. Variant calling on datasets of hundreds or thousands of genomes is time-consuming, expensive, and not easily reproducible given the myriad components of a variant calling pipeline. Here, we describe a cloud-based pipeline for joint variant calling in large samples using the Real Time Genomics population caller. We deployed the population caller on the Amazon cloud with the DNAnexus platform in order to achieve low-cost variant calling. Using our pipeline, we were able to identify 68.3 million variants in 2,535 samples from Phase 3 of the 1000 Genomes Project. By performing the variant calling in a parallel manner, the data was processed within 5 days at a compute cost of $7.33 per sample (a total cost of $18,590 for completed jobs and $21,805 for all jobs). Analysis of cost dependence and running time on the data size suggests that, given near linear scalability, cloud computing can be a cheap and efficient platform for analyzing even larger sequencing studies in the future. PMID:26110529
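
    The cost figures quoted in this abstract can be checked with a few lines of arithmetic. The sketch below uses only the numbers stated above (2,535 samples, $18,590 for completed jobs, $21,805 for all jobs); the function and variable names are illustrative and do not come from the authors' pipeline.

```python
# Figures quoted in the abstract; names below are illustrative only.
SAMPLES = 2535
COST_COMPLETED_USD = 18_590   # total for completed jobs
COST_ALL_JOBS_USD = 21_805    # total including jobs that did not complete


def cost_per_sample(total_usd: float, n_samples: int) -> float:
    """Average compute cost per sample in US dollars."""
    return total_usd / n_samples


completed = cost_per_sample(COST_COMPLETED_USD, SAMPLES)
all_jobs = cost_per_sample(COST_ALL_JOBS_USD, SAMPLES)

# Share of the total spend that went to jobs other than the completed ones.
overhead_fraction = (COST_ALL_JOBS_USD - COST_COMPLETED_USD) / COST_ALL_JOBS_USD

print(f"completed jobs: ${completed:.2f} per sample")
print(f"all jobs:       ${all_jobs:.2f} per sample")
print(f"overhead share: {overhead_fraction:.1%}")
```

    The completed-jobs figure reproduces the $7.33 per sample reported in the abstract; the per-sample figure for all jobs and the overhead share are derived here and are not stated by the authors.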

  8. Regression and Data Mining Methods for Analyses of Multiple Rare Variants in the Genetic Analysis Workshop 17 Mini-Exome Data

    PubMed Central

    Bailey-Wilson, Joan E.; Brennan, Jennifer S.; Bull, Shelley B; Culverhouse, Robert; Kim, Yoonhee; Jiang, Yuan; Jung, Jeesun; Li, Qing; Lamina, Claudia; Liu, Ying; Mägi, Reedik; Niu, Yue S.; Simpson, Claire L.; Wang, Libo; Yilmaz, Yildiz E.; Zhang, Heping; Zhang, Zhaogong

    2012-01-01

    Group 14 of Genetic Analysis Workshop 17 examined several issues related to analysis of complex traits using DNA sequence data. These issues included novel methods for analyzing rare genetic variants in an aggregated manner (often termed collapsing rare variants), evaluation of various study designs to increase power to detect effects of rare variants, and the use of machine learning approaches to model highly complex heterogeneous traits. Various published and novel methods for analyzing traits with extreme locus and allelic heterogeneity were applied to the simulated quantitative and disease phenotypes. Overall, we conclude that power is (as expected) dependent on locus-specific heritability or contribution to disease risk, large samples will be required to detect rare causal variants with small effect sizes, extreme phenotype sampling designs may increase power for smaller laboratory costs, methods that allow joint analysis of multiple variants per gene or pathway are more powerful in general than analyses of individual rare variants, population-specific analyses can be optimal when different subpopulations harbor private causal mutations, and machine learning methods may be useful for selecting subsets of predictors for follow-up in the presence of extreme locus heterogeneity and large numbers of potential predictors. PMID:22128066
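
    The idea of collapsing rare variants mentioned in this abstract can be illustrated with a minimal sketch. It is a generic burden score under stated assumptions, not one of the specific methods evaluated by Group 14: genotypes are coded as minor-allele counts (0, 1 or 2), the rarity threshold of 0.1 is arbitrary, and the toy data and all names are hypothetical.

```python
from typing import List

Genotypes = List[List[int]]  # rows: individuals, columns: variants (0/1/2 minor alleles)


def minor_allele_frequencies(genotypes: Genotypes) -> List[float]:
    """Minor-allele frequency of each variant across all individuals."""
    n_individuals = len(genotypes)
    n_variants = len(genotypes[0])
    return [
        sum(row[j] for row in genotypes) / (2 * n_individuals)
        for j in range(n_variants)
    ]


def burden_scores(genotypes: Genotypes, maf_threshold: float = 0.1) -> List[int]:
    """Collapse rare variants in one gene into a per-individual allele count."""
    mafs = minor_allele_frequencies(genotypes)
    # Keep variants that are observed at least once but stay below the threshold.
    rare = [j for j, f in enumerate(mafs) if 0 < f < maf_threshold]
    return [sum(row[j] for j in rare) for row in genotypes]


# Hypothetical toy gene: variant 0 is common, variants 1 and 2 are rare.
toy = [
    [1, 1, 0], [2, 0, 0], [1, 0, 1], [0, 0, 0], [1, 0, 0],
    [2, 0, 0], [0, 0, 0], [1, 0, 0], [0, 0, 0], [1, 0, 0],
]
scores = burden_scores(toy)
print(scores)
```

    The resulting per-individual score can then be used as a single predictor in a regression on the trait, which is what makes joint analysis of several rare variants per gene possible when each variant alone is too rare to test.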

  9. HapFABIA: Identification of very short segments of identity by descent characterized by rare variants in large sequencing data

    PubMed Central

    Hochreiter, Sepp

    2013-01-01

    Identity by descent (IBD) can be reliably detected for long shared DNA segments, which are found in related individuals. However, many studies contain cohorts of unrelated individuals that share only short IBD segments. New sequencing technologies facilitate identification of short IBD segments through rare variants, which convey more information on IBD than common variants. Current IBD detection methods, however, are not designed to use rare variants for the detection of short IBD segments. Short IBD segments reveal genetic structures at high resolution. Therefore, they can help to improve imputation and phasing, to increase genotyping accuracy for low-coverage sequencing and to increase the power of association studies. Since short IBD segments are further assumed to be old, they can shed light on the evolutionary history of humans. We propose HapFABIA, a computational method that applies biclustering to identify very short IBD segments characterized by rare variants. HapFABIA is designed to detect short IBD segments in genotype data that were obtained from next-generation sequencing, but can also be applied to DNA microarray data. Especially in next-generation sequencing data, HapFABIA exploits rare variants for IBD detection. HapFABIA significantly outperformed competing algorithms at detecting short IBD segments on artificial and simulated data with rare variants. HapFABIA identified 160 588 different short IBD segments characterized by rare variants with a median length of 23 kb (mean 24 kb) in data for chromosome 1 of the 1000 Genomes Project. These short IBD segments contain 752 000 single nucleotide variants (SNVs), which account for 39% of the rare variants and 23.5% of all variants. The vast majority, 152 000 IBD segments, are shared by Africans, while only 19 000 and 11 000 are shared by Europeans and Asians, respectively. IBD segments that match the Denisova or the Neandertal genome are found significantly more often in Asians and Europeans but also, in some cases exclusively, in Africans. The lengths of IBD segments and their sharing between continental populations indicate that many short IBD segments from chromosome 1 existed before humans migrated out of Africa. Thus, rare variants that tag these short IBD segments predate human migration from Africa. The software package HapFABIA is available from Bioconductor. All data sets, result files and programs for data simulation, preprocessing and evaluation are supplied at http://www.bioinf.jku.at/research/short-IBD. PMID:24174545

  10. Mapping genetic variations to three-dimensional protein structures to enhance variant interpretation: a proposed framework.

    PubMed

    Glusman, Gustavo; Rose, Peter W; Prlić, Andreas; Dougherty, Jennifer; Duarte, José M; Hoffman, Andrew S; Barton, Geoffrey J; Bendixen, Emøke; Bergquist, Timothy; Bock, Christian; Brunk, Elizabeth; Buljan, Marija; Burley, Stephen K; Cai, Binghuang; Carter, Hannah; Gao, JianJiong; Godzik, Adam; Heuer, Michael; Hicks, Michael; Hrabe, Thomas; Karchin, Rachel; Leman, Julia Koehler; Lane, Lydie; Masica, David L; Mooney, Sean D; Moult, John; Omenn, Gilbert S; Pearl, Frances; Pejaver, Vikas; Reynolds, Sheila M; Rokem, Ariel; Schwede, Torsten; Song, Sicheng; Tilgner, Hagen; Valasatava, Yana; Zhang, Yang; Deutsch, Eric W

    2017-12-18

    The translation of personal genomics to precision medicine depends on the accurate interpretation of the multitude of genetic variants observed for each individual. However, even when genetic variants are predicted to modify a protein, their functional implications may be unclear. Many diseases are caused by genetic variants affecting important protein features, such as enzyme active sites or interaction interfaces. The scientific community has catalogued millions of genetic variants in genomic databases and thousands of protein structures in the Protein Data Bank. Mapping mutations onto three-dimensional (3D) structures enables atomic-level analyses of protein positions that may be important for the stability or formation of interactions; these may explain the effect of mutations and in some cases even open a path for targeted drug development. To accelerate progress in the integration of these data types, we held a two-day Gene Variation to 3D (GVto3D) workshop to report on the latest advances and to discuss unmet needs. The overarching goal of the workshop was to address the question: what can be done together as a community to advance the integration of genetic variants and 3D protein structures that could not be done by a single investigator or laboratory? Here we describe the workshop outcomes, review the state of the field, and propose the development of a framework with which to promote progress in this arena. The framework will include a set of standard formats, common ontologies, a common application programming interface to enable interoperation of the resources, and a Tool Registry to make it easy to find and apply the tools to specific analysis problems. Interoperability will enable integration of diverse data sources and tools and collaborative development of variant effect prediction methods.

  11. Secreted histidyl-tRNA synthetase splice variants elaborate major epitopes for autoantibodies in inflammatory myositis.

    PubMed

    Zhou, Jie J; Wang, Feng; Xu, Zhiwen; Lo, Wing-Sze; Lau, Ching-Fun; Chiang, Kyle P; Nangle, Leslie A; Ashlock, Melissa A; Mendlein, John D; Yang, Xiang-Lei; Zhang, Mingjie; Schimmel, Paul

    2014-07-11

    Inflammatory and debilitating myositis and interstitial lung disease are commonly associated with autoantibodies (anti-Jo-1 antibodies) to cytoplasmic histidyl-tRNA synthetase (HisRS). Anti-Jo-1 antibodies from different disease-afflicted patients react mostly with spatially separated epitopes in the three-dimensional structure of human HisRS. We noted that two HisRS splice variants (SVs) include these spatially separated regions, but each SV lacks the HisRS catalytic domain. Despite the large deletions, the two SVs cross-react with a substantial population of anti-Jo-1 antibodies from myositis patients. Moreover, expression of at least one of the SVs is up-regulated in dermatomyositis patients, and cell-based experiments show that both SVs and HisRS can be secreted. We suggest that, in patients with inflammatory myositis, anti-Jo-1 antibodies may have extracellular activity. © 2014 by The American Society for Biochemistry and Molecular Biology, Inc.

  12. Spatiotemporal multivariate mixture models for Bayesian model selection in disease mapping.

    PubMed

    Lawson, A B; Carroll, R; Faes, C; Kirby, R S; Aregay, M; Watjou, K

    2017-12-01

    It is often the case that researchers wish to simultaneously explore the behavior of and estimate overall risk for multiple, related diseases with varying rarity while accounting for potential spatial and/or temporal correlation. In this paper, we propose a flexible class of multivariate spatio-temporal mixture models to fill this role. Further, these models offer flexibility with the potential for model selection as well as the ability to accommodate lifestyle, socio-economic, and physical environmental variables with spatial, temporal, or both structures. Here, we explore the capability of this approach via a large-scale simulation study and examine a motivating data example involving three cancers in South Carolina. The results, which are focused on four model variants, suggest that all models possess the ability to recover simulation ground truth and display improved model fit over two baseline Knorr-Held spatio-temporal interaction model variants in a real data application.

  13. Rare high-impact disease variants: properties and identifications.

    PubMed

    Park, Leeyoung; Kim, Ju Han

    2016-03-21

    Although many genome-wide association studies have been performed, the identification of disease polymorphisms remains important. It is now suspected that many rare disease variants induce the association signal of common variants in linkage disequilibrium (LD). Based on recent development of genetic models, the current study provides explanations of the existence of rare variants with high impacts and common variants with low impacts. Disease variants are neither necessary nor sufficient due to gene-gene or gene-environment interactions. A new method was developed based on theoretical aspects to identify both rare and common disease variants by their genotypes. Common disease variants were identified with relatively small odds ratios and relatively small sample sizes, except for specific situations in which the disease variants were in strong LD with a variant with a higher frequency. Rare disease variants with small impacts were difficult to identify without increasing sample sizes; however, the method was reasonably accurate for rare disease variants with high impacts. For rare variants, dominant variants generally showed better Type II error rates than recessive variants; however, the trend was reversed for common variants. Type II error rates increased in gene regions containing more than two disease variants because the more common variant, rather than both disease variants, was usually identified. The proposed method would be useful for identifying common disease variants with small impacts and rare disease variants with large impacts when disease variants have the same effects on disease presentation.

  14. Rare and Coding Region Genetic Variants Associated With Risk of Ischemic Stroke: The NHLBI Exome Sequence Project.

    PubMed

    Auer, Paul L; Nalls, Mike; Meschia, James F; Worrall, Bradford B; Longstreth, W T; Seshadri, Sudha; Kooperberg, Charles; Burger, Kathleen M; Carlson, Christopher S; Carty, Cara L; Chen, Wei-Min; Cupples, L Adrienne; DeStefano, Anita L; Fornage, Myriam; Hardy, John; Hsu, Li; Jackson, Rebecca D; Jarvik, Gail P; Kim, Daniel S; Lakshminarayan, Kamakshi; Lange, Leslie A; Manichaikul, Ani; Quinlan, Aaron R; Singleton, Andrew B; Thornton, Timothy A; Nickerson, Deborah A; Peters, Ulrike; Rich, Stephen S

    2015-07-01

    Stroke is the second leading cause of death and the third leading cause of years of life lost. Genetic factors contribute to stroke prevalence, and candidate gene and genome-wide association studies (GWAS) have identified variants associated with ischemic stroke risk. These variants often have small effects without obvious biological significance. Exome sequencing may discover predicted protein-altering variants with a potentially large effect on ischemic stroke risk. The objective was to investigate the contribution of rare and common genetic variants to ischemic stroke risk by targeting the protein-coding regions of the human genome. The National Heart, Lung, and Blood Institute (NHLBI) Exome Sequencing Project (ESP) analyzed approximately 6000 participants from numerous cohorts of European and African ancestry. For discovery, 365 cases of ischemic stroke (small-vessel and large-vessel subtypes) and 809 European ancestry controls were sequenced; for replication, 47 affected sibpairs concordant for stroke subtype and an African American case-control series were sequenced, with 1672 cases and 4509 European ancestry controls genotyped. The ESP's exome sequencing and genotyping started on January 1, 2010, and continued through June 30, 2012. Analyses were conducted on the full data set between July 12, 2012, and July 13, 2013. The main outcomes were the discovery of new variants or genes contributing to ischemic stroke risk and subtype (primary analysis) and the determination of support for protein-coding variants contributing to risk in previously published candidate genes (secondary analysis). We identified 2 novel genes associated with an increased risk of ischemic stroke: a protein-coding variant in PDE4DIP (rs1778155; odds ratio, 2.15; P = 2.63 × 10(-8)) with an intracellular signal transduction mechanism and in ACOT4 (rs35724886; odds ratio, 2.04; P = 1.24 × 10(-7)) with a fatty acid metabolism; confirmation of PDE4DIP was observed in affected sibpair families with large-vessel stroke subtype and in African Americans. Replication of protein-coding variants in candidate genes was observed for 2 previously reported GWAS associations: ZFHX3 (cardioembolic stroke) and ABCA1 (large-vessel stroke). Exome sequencing discovered 2 novel genes and mechanisms, PDE4DIP and ACOT4, associated with increased risk for ischemic stroke. In addition, ZFHX3 and ABCA1 were discovered to have protein-coding variants associated with ischemic stroke. These results suggest that genetic variation in novel pathways contributes to ischemic stroke risk and serves as a target for prediction, prevention, and therapy.

  15. Three-State Ferroelastic Switching and Large Electromechanical Responses in PbTiO3 Thin Films.

    PubMed

    Damodaran, Anoop R; Pandya, Shishir; Agar, Josh C; Cao, Ye; Vasudevan, Rama K; Xu, Ruijuan; Saremi, Sahar; Li, Qian; Kim, Jieun; McCarter, Margaret R; Dedon, Liv R; Angsten, Tom; Balke, Nina; Jesse, Stephen; Asta, Mark; Kalinin, Sergei V; Martin, Lane W

    2017-10-01

    Leveraging competition between energetically degenerate states to achieve large field-driven responses is a hallmark of functional materials, but routes to such competition are limited. Here, a new route to such effects involving domain-structure competition is demonstrated, which arises from strain-induced spontaneous partitioning of PbTiO3 thin films into nearly energetically degenerate, hierarchical domain architectures of coexisting c/a and a1/a2 domain structures. Using band-excitation piezoresponse force microscopy, this study manipulates and acoustically detects a facile interconversion of different ferroelastic variants via a two-step, three-state ferroelastic switching process (out-of-plane polarized c+ → in-plane polarized a → out-of-plane polarized c- state), which is concomitant with large nonvolatile electromechanical strains (≈1.25%) and tunability of the local piezoresponse and elastic modulus (>23%). It is further demonstrated that deterministic, nonvolatile writing/erasure of large-area patterns of this electromechanical response is possible, thus showing a new pathway to improved function and properties. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  16. Three-State Ferroelastic Switching and Large Electromechanical Responses in PbTiO3 Thin Films

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Damodaran, Anoop R.; Pandya, Shishir; Agar, Josh C.

    Leveraging competition between energetically degenerate states to achieve large field-driven responses is a hallmark of functional materials, but routes to such competition are limited. Here, a new route to such effects involving domain-structure competition is demonstrated, which arises from strain-induced spontaneous partitioning of PbTiO3 thin films into nearly energetically degenerate, hierarchical domain architectures of coexisting c/a and a1/a2 domain structures. Using band-excitation piezoresponse force microscopy, this study manipulates and acoustically detects a facile interconversion of different ferroelastic variants via a two-step, three-state ferroelastic switching process (out-of-plane polarized c+ → in-plane polarized a → out-of-plane polarized c- state), which is concomitant with large nonvolatile electromechanical strains (≈1.25%) and tunability of the local piezoresponse and elastic modulus (>23%). It is further demonstrated that deterministic, nonvolatile writing/erasure of large-area patterns of this electromechanical response is possible, thus showing a new pathway to improved function and properties.

  17. Dynamic HypA zinc site is essential for acid viability and proper urease maturation in Helicobacter pylori.

    PubMed

    Johnson, Ryan C; Hu, Heidi Q; Merrell, D Scott; Maroney, Michael J

    2015-04-01

    Helicobacter pylori requires urease activity in order to survive in the acid environment of the human stomach. Urease is regulated in part by nickelation, a process that requires the HypA protein, which is a putative nickel metallochaperone that is generally associated with hydrogenase maturation. However, in H. pylori, HypA plays a dual role. In addition to an N-terminal nickel binding site, HypA proteins also contain a structural zinc site that is coordinated by two rigorously conserved CXXC sequences, which in H. pylori are flanked by His residues. These structural Zn sites are known to be dynamic, converting from Zn(Cys)4 centers at pH 7.2 to Zn(Cys)2(His)2 centers at pH 6.3 in the presence of Ni(II) ions. In this study, mutant strains of H. pylori that express zinc site variants of the HypA protein are used to show that the structural changes in the zinc site are important for the acid viability of the bacterium, and that a reduction in acid viability in these variants can be traced in large measure to deficient urease activity. This in turn leads to a model that connects the Zn(Cys)4 coordination to urease maturation.

  18. Structural and Functional Characterization of a New Double Variant Haemoglobin (HbG-Philadelphia/Duarte α(2)β(2)).

    PubMed

    Fais, Antonella; Casu, Mariano; Ruggerone, Paolo; Ceccarelli, Matteo; Porcu, Simona; Era, Benedetta; Anedda, Roberto; Sollaino, Maria Carla; Galanello, Renzo; Corda, Marcella

    2011-01-01

    We report the first case of cosegregation of two haemoglobins (Hbs): HbG-Philadelphia [α68(E17)Asn → Lys] and HbDuarte [β62(E6)Ala → Pro]. The proband is a young patient who is also heterozygous for β°-thalassaemia. We detected exclusively two haemoglobin variants: HbDuarte and HbG-Philadelphia/Duarte. Functional study of the new double variant HbG-Philadelphia/Duarte showed an increase in oxygen affinity, with a slight decrease in cooperativity and Bohr effect. This functional behaviour is attributed to the β62Ala → Pro substitution rather than the α68Asn → Lys substitution. Indeed, HbG-Philadelphia isolated in our laboratory from the blood cells of a donor carrying this variant shows no functional modification, whereas purified HbDuarte showed functional properties very similar to those of the double variant. NMR and MD simulation studies confirmed that the presence of Pro instead of Ala at the β62 position produces displacement of the E helix and modifications of the tertiary structure. The substitution α68(E17)Asn → Lys does not cause significant structural and dynamical modifications of the protein. A possible structure-based rationale for the substitution effects is suggested.

  19. Structural analysis of an HLA-B27 functional variant, B27d detected in American blacks

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Rojo, S.; Aparicio, P.; Hansen, J.A.

    1987-11-15

    The structure of a new functional variant, B27d, has been established by comparative peptide mapping and radiochemical sequencing. This analysis completes the structural characterization of the six known histocompatibility leukocyte antigen (HLA)-B27 subtypes. The only detected amino acid change between the main HLA-B27.1 subtype and B27d is that of Tyr59 to His59. Position 59 has not been previously found to vary among class I HLA or H-2 antigens. Such substitution accounts for the reported isoelectric focusing pattern of this variant. HLA-B27d is the only B27 variant found to differ from other subtypes by a single amino acid replacement. The nature of the change is compatible with its origin by a point mutation from HLA-B27.1. Because B27d was found only in American blacks and in no other ethnic groups, it is suggested that this variant originated as a result of a mutation of the B27.1 gene that occurred within the black population. Structural analysis of B27d was done by comparative mapping. Radiochemical sequencing was carried out with 14C-labeled and 3H-labeled amino acids.

  20. Mutations of Glucose-6-Phosphate Dehydrogenase Durham, Santa-Maria and A+ Variants Are Associated with Loss Functional and Structural Stability of the Protein

    PubMed Central

    Gómez-Manzo, Saúl; Marcial-Quino, Jaime; Vanoye-Carlo, America; Enríquez-Flores, Sergio; De la Mora-De la Mora, Ignacio; González-Valdez, Abigail; García-Torres, Itzhel; Martínez-Rosas, Víctor; Sierra-Palacios, Edgar; Lazcano-Pérez, Fernando; Rodríguez-Bustamante, Eduardo; Arreguin-Espinosa, Roberto

    2015-01-01

    Glucose-6-phosphate dehydrogenase (G6PD) deficiency is the most common enzymopathy in the world. More than 160 mutations causing the disease have been identified, but only 10% of these variants have been studied at biochemical and biophysical levels. In this study we report on the functional and structural characterization of three naturally occurring variants corresponding to different classes of disease severity: Class I G6PD Durham, Class II G6PD Santa Maria, and Class III G6PD A+. The results showed that the G6PD Durham (severe deficiency), and the G6PD Santa Maria and A+ (less severe deficiency) (Class I, II and III, respectively) affect the catalytic efficiency of these enzymes, are more sensitive to temperature denaturing, and affect the stability of the overall protein when compared to the wild type WT-G6PD. In the variants, the exposure of more and buried hydrophobic pockets was induced and monitored with 8-Anilinonaphthalene-1-sulfonic acid (ANS) fluorescence, directly affecting the compaction of structure at different levels and probably reducing the stability of the protein. The degree of functional and structural perturbation by each variant correlates with the clinical severity reported in different patients. PMID:26633385

  1. Increased Sensitivity of Diagnostic Mutation Detection by Re-analysis Incorporating Local Reassembly of Sequence Reads.

    PubMed

    Watson, Christopher M; Camm, Nick; Crinnion, Laura A; Clokie, Samuel; Robinson, Rachel L; Adlard, Julian; Charlton, Ruth; Markham, Alexander F; Carr, Ian M; Bonthron, David T

    2017-12-01

    Diagnostic genetic testing programmes based on next-generation DNA sequencing have resulted in the accrual of large datasets of targeted raw sequence data. Most diagnostic laboratories process these data through an automated variant-calling pipeline. Validation of the chosen analytical methods typically depends on confirming the detection of known sequence variants. Despite improvements in short-read alignment methods, current pipelines are known to be comparatively poor at detecting large insertion/deletion mutations. We performed clinical validation of a local reassembly tool, ABRA (assembly-based realigner), through retrospective reanalysis of a cohort of more than 2000 hereditary cancer cases. ABRA enabled detection of a 96-bp deletion, 4-bp insertion mutation in PMS2 that had been initially identified using a comparative read-depth approach. We applied an updated pipeline incorporating ABRA to the entire cohort of 2000 cases and identified one previously undetected pathogenic variant, a 23-bp duplication in PTEN. We demonstrate the effect of read length on the ability to detect insertion/deletion variants by comparing HiSeq2500 (2 × 101-bp) and NextSeq500 (2 × 151-bp) sequence data for a range of variants and thereby show that the limitations of shorter read lengths can be mitigated using appropriate informatics tools. This work highlights the need for ongoing development of diagnostic pipelines to maximize test sensitivity. We also draw attention to the large differences in computational infrastructure required to perform day-to-day versus large-scale reprocessing tasks.

  2. Ferroelasticity and domain physics in two-dimensional transition metal dichalcogenide monolayers.

    PubMed

    Li, Wenbin; Li, Ju

    2016-02-24

    Monolayers of transition metal dichalcogenides can exist in several structural polymorphs, including 2H, 1T and 1T'. The low-symmetry 1T' phase has three orientation variants, resulting from the three equivalent directions of Peierls distortion in the parental 1T phase. Using first-principles calculations, we predict that mechanical strain can switch the relative thermodynamic stability between the orientation variants of the 1T' phase. We find that such strain-induced variant switching only requires a few percent elastic strain, which is eminently achievable experimentally with transition metal dichalcogenide monolayers. Calculations indicate that the transformation barrier associated with such variant switching is small (<0.2 eV per chemical formula unit), suggesting that strain-induced variant switching can happen under laboratory conditions. Monolayers of transition metal dichalcogenides with 1T' structure therefore have the potential to be ferroelastic and shape memory materials with interesting domain physics.

  3. Ferroelasticity and domain physics in two-dimensional transition metal dichalcogenide monolayers

    DOE PAGES

    Li, Wenbin; Li, Ju

    2016-02-24

    Monolayers of transition metal dichalcogenides can exist in several structural polymorphs, including 2H, 1T and 1T'. The low-symmetry 1T' phase has three orientation variants, resulting from the three equivalent directions of Peierls distortion in the parental 1T phase. Using first-principles calculations, we predict that mechanical strain can switch the relative thermodynamic stability between the orientation variants of the 1T' phase. We find that such strain-induced variant switching only requires a few percent elastic strain, which is eminently achievable experimentally with transition metal dichalcogenide monolayers. Calculations indicate that the transformation barrier associated with such variant switching is small (<0.2 eV per chemical formula unit), suggesting that strain-induced variant switching can happen under laboratory conditions. Furthermore, monolayers of transition metal dichalcogenides with 1T' structure therefore have the potential to be ferroelastic and shape memory materials with interesting domain physics.

  4. Short communication: Validation of 4 candidate causative trait variants in 2 cattle breeds using targeted sequence imputation.

    PubMed

    Pausch, Hubert; Wurmser, Christine; Reinhardt, Friedrich; Emmerling, Reiner; Fries, Ruedi

    2015-06-01

    Most association studies for pinpointing trait-associated variants are performed within breed. The availability of sequence data from key ancestors of several cattle breeds now enables immediate assessment of the frequency of trait-associated variants in populations different from the mapping population and their imputation into large validation populations. The objective of this study was to validate the effects of 4 putatively causative variants on milk production traits, male fertility, and stature in German Fleckvieh and Holstein-Friesian animals using targeted sequence imputation. We used whole-genome sequence data of 456 animals to impute 4 missense mutations in DGAT1, GHR, PRLR, and PROP1 into 10,363 Fleckvieh and 8,812 Holstein animals. The accuracy of the imputed genotypes exceeded 95% for all variants. Association testing with imputed variants revealed consistent antagonistic effects of the DGAT1 p.A232K and GHR p.F279Y variants on milk yield and protein and fat contents, respectively, in both breeds. The allele frequency of both polymorphisms has changed considerably in the past 20 yr, indicating that they were targets of recent selection for milk production traits. The PRLR p.S18N variant was associated with yield traits in Fleckvieh but not in Holstein, suggesting that it may be in linkage disequilibrium with a mutation affecting yield traits rather than being causal. The reported effects of the PROP1 p.H173R variant on milk production, male fertility, and stature could not be confirmed. Our results demonstrate that population-wide imputation of candidate causal variants from sequence data is feasible, enabling their rapid validation in large independent populations. Copyright © 2015 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.

  5. High-throughput interpretation of gene structure changes in human and nonhuman resequencing data, using ACE

    PubMed Central

    Majoros, William H.; Campbell, Michael S.; Holt, Carson; DeNardo, Erin K.; Ware, Doreen; Allen, Andrew S.; Yandell, Mark; Reddy, Timothy E.

    2017-01-01

    Motivation: The accurate interpretation of genetic variants is critical for characterizing genotype–phenotype associations. Because the effects of genetic variants can depend strongly on their local genomic context, accurate genome annotations are essential. Furthermore, as some variants have the potential to disrupt or alter gene structure, variant interpretation efforts stand to gain from the use of individualized annotations that account for differences in gene structure between individuals or strains. Results: We describe a suite of software tools for identifying possible functional changes in gene structure that may result from sequence variants. ACE (‘Assessing Changes to Exons’) converts phased genotype calls to a collection of explicit haplotype sequences, maps transcript annotations onto them, detects gene-structure changes and their possible repercussions, and identifies several classes of possible loss of function. Novel transcripts predicted by ACE are commonly supported by spliced RNA-seq reads, and can be used to improve read alignment and transcript quantification when an individual-specific genome sequence is available. Using publicly available RNA-seq data, we show that ACE predictions confirm earlier results regarding the quantitative effects of nonsense-mediated decay, and we show that predicted loss-of-function events are highly concordant with patterns of intolerance to mutations across the human population. ACE can be readily applied to diverse species including animals and plants, making it a broadly useful tool for use in eukaryotic population-based resequencing projects, particularly for assessing the joint impact of all variants at a locus. Availability and Implementation: ACE is written in open-source C++ and Perl and is available from geneprediction.org/ACE. Contact: myandell@genetics.utah.edu or tim.reddy@duke.edu. Supplementary information: Supplementary information is available at Bioinformatics online. PMID:28011790
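
    The first step described in the abstract above, turning phased genotype calls into explicit haplotype sequences, can be illustrated with a minimal sketch. This is not ACE's implementation (ACE itself is written in C++ and Perl); the record layout, the function name build_haplotypes and the toy data are all assumptions made for illustration only.

```python
from typing import List, Tuple

# Hypothetical record layout (an assumption, not ACE's input format):
# (0-based position, reference allele, alternate allele, phased genotype "a|b")
Variant = Tuple[int, str, str, str]


def build_haplotypes(reference: str, variants: List[Variant]) -> Tuple[str, str]:
    """Apply phased variants to a reference and return two haplotype sequences."""
    haplotypes = []
    for hap_index in (0, 1):
        pieces = []
        cursor = 0  # next reference position not yet copied
        for pos, ref, alt, genotype in sorted(variants):
            alleles = genotype.split("|")
            if alleles[hap_index] != "1":
                continue  # this haplotype carries the reference allele here
            if pos < cursor:
                raise ValueError("overlapping variants on one haplotype")
            if reference[pos:pos + len(ref)] != ref:
                raise ValueError(f"reference allele mismatch at position {pos}")
            pieces.append(reference[cursor:pos])  # unchanged stretch
            pieces.append(alt)                    # substituted allele
            cursor = pos + len(ref)
        pieces.append(reference[cursor:])
        haplotypes.append("".join(pieces))
    return haplotypes[0], haplotypes[1]


if __name__ == "__main__":
    toy_reference = "ACGTACGTAC"
    toy_variants = [
        (2, "G", "T", "1|0"),    # SNV carried by the first haplotype only
        (5, "CGT", "C", "0|1"),  # 2-bp deletion carried by the second haplotype only
    ]
    print(build_haplotypes(toy_reference, toy_variants))
```

    Building each haplotype independently keeps insertions and deletions on one copy from shifting coordinates on the other, which is the reason explicit per-haplotype sequences are convenient before annotations are mapped onto them.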

  6. High-throughput interpretation of gene structure changes in human and nonhuman resequencing data, using ACE.

    PubMed

    Majoros, William H; Campbell, Michael S; Holt, Carson; DeNardo, Erin K; Ware, Doreen; Allen, Andrew S; Yandell, Mark; Reddy, Timothy E

    2017-05-15

    The accurate interpretation of genetic variants is critical for characterizing genotype-phenotype associations. Because the effects of genetic variants can depend strongly on their local genomic context, accurate genome annotations are essential. Furthermore, as some variants have the potential to disrupt or alter gene structure, variant interpretation efforts stand to gain from the use of individualized annotations that account for differences in gene structure between individuals or strains. We describe a suite of software tools for identifying possible functional changes in gene structure that may result from sequence variants. ACE ('Assessing Changes to Exons') converts phased genotype calls to a collection of explicit haplotype sequences, maps transcript annotations onto them, detects gene-structure changes and their possible repercussions, and identifies several classes of possible loss of function. Novel transcripts predicted by ACE are commonly supported by spliced RNA-seq reads, and can be used to improve read alignment and transcript quantification when an individual-specific genome sequence is available. Using publicly available RNA-seq data, we show that ACE predictions confirm earlier results regarding the quantitative effects of nonsense-mediated decay, and we show that predicted loss-of-function events are highly concordant with patterns of intolerance to mutations across the human population. ACE can be readily applied to diverse species including animals and plants, making it a broadly useful tool for use in eukaryotic population-based resequencing projects, particularly for assessing the joint impact of all variants at a locus. ACE is written in open-source C++ and Perl and is available from geneprediction.org/ACE. Contact: myandell@genetics.utah.edu or tim.reddy@duke.edu. Supplementary information is available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved.

  7. Identification of common variants associated with human hippocampal and intracranial volumes

    PubMed Central

    Stein, Jason L; Medland, Sarah E; Vasquez, Alejandro Arias; Hibar, Derrek P; Senstad, Rudy E; Winkler, Anderson M; Toro, Roberto; Appel, Katja; Bartecek, Richard; Bergmann, Ørjan; Bernard, Manon; Brown, Andrew A; Cannon, Dara M; Chakravarty, M Mallar; Christoforou, Andrea; Domin, Martin; Grimm, Oliver; Hollinshead, Marisa; Holmes, Avram J; Homuth, Georg; Hottenga, Jouke-Jan; Langan, Camilla; Lopez, Lorna M; Hansell, Narelle K; Hwang, Kristy S; Kim, Sungeun; Laje, Gonzalo; Lee, Phil H; Liu, Xinmin; Loth, Eva; Lourdusamy, Anbarasu; Mattingsdal, Morten; Mohnke, Sebastian; Maniega, Susana Muñoz; Nho, Kwangsik; Nugent, Allison C; O’Brien, Carol; Papmeyer, Martina; Pütz, Benno; Ramasamy, Adaikalavan; Rasmussen, Jerod; Rijpkema, Mark; Risacher, Shannon L; Roddey, J Cooper; Rose, Emma J; Ryten, Mina; Shen, Li; Sprooten, Emma; Strengman, Eric; Teumer, Alexander; Trabzuni, Daniah; Turner, Jessica; van Eijk, Kristel; van Erp, Theo G M; van Tol, Marie-Jose; Wittfeld, Katharina; Wolf, Christiane; Woudstra, Saskia; Aleman, Andre; Alhusaini, Saud; Almasy, Laura; Binder, Elisabeth B; Brohawn, David G; Cantor, Rita M; Carless, Melanie A; Corvin, Aiden; Czisch, Michael; Curran, Joanne E; Davies, Gail; de Almeida, Marcio A A; Delanty, Norman; Depondt, Chantal; Duggirala, Ravi; Dyer, Thomas D; Erk, Susanne; Fagerness, Jesen; Fox, Peter T; Freimer, Nelson B; Gill, Michael; Göring, Harald H H; Hagler, Donald J; Hoehn, David; Holsboer, Florian; Hoogman, Martine; Hosten, Norbert; Jahanshad, Neda; Johnson, Matthew P; Kasperaviciute, Dalia; Kent, Jack W; Kochunov, Peter; Lancaster, Jack L; Lawrie, Stephen M; Liewald, David C; Mandl, René; Matarin, Mar; Mattheisen, Manuel; Meisenzahl, Eva; Melle, Ingrid; Moses, Eric K; Mühleisen, Thomas W; Nauck, Matthias; Nöthen, Markus M; Olvera, Rene L; Pandolfo, Massimo; Pike, G Bruce; Puls, Ralf; Reinvang, Ivar; Rentería, Miguel E; Rietschel, Marcella; Roffman, Joshua L; Royle, Natalie A; Rujescu, Dan; Savitz, Jonathan; Schnack, Hugo G; Schnell, 
Knut; Seiferth, Nina; Smith, Colin; Steen, Vidar M; Valdés Hernández, Maria C; Van den Heuvel, Martijn; van der Wee, Nic J; Van Haren, Neeltje E M; Veltman, Joris A; Völzke, Henry; Walker, Robert; Westlye, Lars T; Whelan, Christopher D; Agartz, Ingrid; Boomsma, Dorret I; Cavalleri, Gianpiero L; Dale, Anders M; Djurovic, Srdjan; Drevets, Wayne C; Hagoort, Peter; Hall, Jeremy; Heinz, Andreas; Jack, Clifford R; Foroud, Tatiana M; Le Hellard, Stephanie; Macciardi, Fabio; Montgomery, Grant W; Poline, Jean Baptiste; Porteous, David J; Sisodiya, Sanjay M; Starr, John M; Sussmann, Jessika; Toga, Arthur W; Veltman, Dick J; Walter, Henrik; Weiner, Michael W; Bis, Joshua C; Ikram, M Arfan; Smith, Albert V; Gudnason, Vilmundur; Tzourio, Christophe; Vernooij, Meike W; Launer, Lenore J; DeCarli, Charles; Seshadri, Sudha; Andreassen, Ole A; Apostolova, Liana G; Bastin, Mark E; Blangero, John; Brunner, Han G; Buckner, Randy L; Cichon, Sven; Coppola, Giovanni; de Zubicaray, Greig I; Deary, Ian J; Donohoe, Gary; de Geus, Eco J C; Espeseth, Thomas; Fernández, Guillén; Glahn, David C; Grabe, Hans J; Hardy, John; Hulshoff Pol, Hilleke E; Jenkinson, Mark; Kahn, René S; McDonald, Colm; McIntosh, Andrew M; McMahon, Francis J; McMahon, Katie L; Meyer-Lindenberg, Andreas; Morris, Derek W; Müller-Myhsok, Bertram; Nichols, Thomas E; Ophoff, Roel A; Paus, Tomas; Pausova, Zdenka; Penninx, Brenda W; Potkin, Steven G; Sämann, Philipp G; Saykin, Andrew J; Schumann, Gunter; Smoller, Jordan W; Wardlaw, Joanna M; Weale, Michael E; Martin, Nicholas G; Franke, Barbara; Wright, Margaret J; Thompson, Paul M

    2013-01-01

    Identifying genetic variants influencing human brain structures may reveal new biological mechanisms underlying cognition and neuropsychiatric illness. The volume of the hippocampus is a biomarker of incipient Alzheimer’s disease [1,2] and is reduced in schizophrenia [3], major depression [4] and mesial temporal lobe epilepsy [5]. Whereas many brain imaging phenotypes are highly heritable [6,7], identifying and replicating genetic influences has been difficult, as small effects and the high costs of magnetic resonance imaging (MRI) have led to underpowered studies. Here we report genome-wide association meta-analyses and replication for mean bilateral hippocampal, total brain and intracranial volumes from a large multinational consortium. The intergenic variant rs7294919 was associated with hippocampal volume (12q24.22; N = 21,151; P = 6.70 × 10^−16) and the expression levels of the positional candidate gene TESC in brain tissue. Additionally, rs10784502, located within HMGA2, was associated with intracranial volume (12q14.3; N = 15,782; P = 1.12 × 10^−12). We also identified a suggestive association with total brain volume at rs10494373 within DDR2 (1q23.3; N = 6,500; P = 5.81 × 10^−7). PMID:22504417

  8. Minireview: Toward the Establishment of a Link between Melatonin and Glucose Homeostasis: Association of Melatonin MT2 Receptor Variants with Type 2 Diabetes

    PubMed Central

    Karamitri, Angeliki; Renault, Nicolas; Clement, Nathalie; Guillaume, Jean-Luc

    2013-01-01

    The existence of interindividual variations in G protein-coupled receptor sequences has been recognized early on. Recent advances in large-scale exon sequencing techniques are expected to dramatically increase the number of variants identified in G protein-coupled receptors, giving rise to new challenges regarding their functional characterization. The current minireview will illustrate these challenges based on the MTNR1B gene, which encodes the melatonin MT2 receptor, for which exon sequencing revealed 40 rare nonsynonymous variants in the general population and in type 2 diabetes (T2D) cohorts. Functional characterization of these MT2 mutants revealed 14 mutants with loss of Gi protein activation that associate with increased risk of T2D development. This repertoire of disease-associated mutants is a rich source for structure-activity studies and will help to define the still poorly understood role of melatonin in glucose homeostasis and T2D development in humans. Defining the functional defects in carriers of rare MT2 mutations will help to provide personalized therapies to these patients in the future. PMID:23798576

  9. A Phylogenetic Analysis of 34 Chloroplast Genomes Elucidates the Relationships between Wild and Domestic Species within the Genus Citrus

    PubMed Central

    Carbonell-Caballero, Jose; Alonso, Roberto; Ibañez, Victoria; Terol, Javier; Talon, Manuel; Dopazo, Joaquin

    2015-01-01

    Citrus genus includes some of the most important cultivated fruit trees worldwide. Despite being extensively studied because of its commercial relevance, the origin of cultivated citrus species and the history of its domestication still remain an open question. Here, we present a phylogenetic analysis of the chloroplast genomes of 34 citrus genotypes which constitutes the most comprehensive and detailed study to date on the evolution and variability of the genus Citrus. A statistical model was used to estimate divergence times between the major citrus groups. Additionally, a complete map of the variability across the genome of different citrus species was produced, including single nucleotide variants, heteroplasmic positions, indels (insertions and deletions), and large structural variants. The distribution of all these variants provided further independent support to the phylogeny obtained. An unexpected finding was the high level of heteroplasmy found in several of the analyzed genomes. The use of the complete chloroplast DNA not only paves the way for a better understanding of the phylogenetic relationships within the Citrus genus but also provides original insights into other elusive evolutionary processes, such as chloroplast inheritance, heteroplasmy, and gene selection. PMID:25873589

  10. Left-dominant arrhythmogenic cardiomyopathy in a large family: associated desmosomal or nondesmosomal genotype?

    PubMed

    Groeneweg, Judith A; van der Zwaag, Paul A; Jongbloed, Jan D H; Cox, Moniek G P J; Vreeker, Arnold; de Boer, Rudolf A; van der Heijden, Jeroen F; van Veen, Toon A B; McKenna, William J; van Tintelen, J Peter; Dooijes, Dennis; Hauer, Richard N W

    2013-04-01

    Arrhythmogenic cardiomyopathy (AC) is considered a predominantly right ventricular (RV) desmosomal disease. However, left-dominant forms due to desmosomal gene mutations, including PKP2 variant c.419C>T, have been described. Recently, a nondesmosomal phospholamban (PLN) mutation (c.40_42delAGA) has been identified, causing dilated cardiomyopathy and arrhythmias. The objective of this study was to gain more insight into the pathogenicity of the PKP2 variant c.419C>T by cosegregation analysis of the PKP2 variant c.419C>T vs the PLN mutation c.40_42delAGA. A Dutch family (13 family members, median age 49 years, range 34-71 years) with ventricular tachycardia underwent (1) meticulous phenotypic characterization and (2) screening of 5 desmosomal genes (PKP2, DSC2, DSG2, DSP, JUP) and PLN. Six family members fulfilled 2010 AC Task Force Criteria. Seven had signs of left ventricular (LV) involvement (inverted T waves in leads V4-V6, LV wall motion abnormalities and late enhancement, and reduced LV ejection fraction), including 6 family members with proven AC. The PKP2 variant c.419C>T was found as a single variant in 3 family members, combined with the PLN mutation c.40_42delAGA in 3 others. The PLN mutation was found in 9 family members, including the 6 with AC and all 7 with LV involvement. The PLN mutation c.40_42delAGA was found as a single mutation in 6, combined with the PKP2 variant c.419C>T in 3 others. A low-voltage electrocardiogram was seen in 4 of 9 PLN mutation-positive subjects. None of the family members with the single PKP2 variant showed any sign of RV or LV involvement. The PLN mutation c.40_42delAGA cosegregates with AC and with electrocardiographic and structural LV abnormalities. In this family, there was no evidence of disease-causing contribution of the PKP2 variant c.419C>T. Copyright © 2013 Heart Rhythm Society. Published by Elsevier Inc. All rights reserved.

  11. GWASeq: targeted re-sequencing follow up to GWAS.

    PubMed

    Salomon, Matthew P; Li, Wai Lok Sibon; Edlund, Christopher K; Morrison, John; Fortini, Barbara K; Win, Aung Ko; Conti, David V; Thomas, Duncan C; Duggan, David; Buchanan, Daniel D; Jenkins, Mark A; Hopper, John L; Gallinger, Steven; Le Marchand, Loïc; Newcomb, Polly A; Casey, Graham; Marjoram, Paul

    2016-03-03

    For the last decade the conceptual framework of the Genome-Wide Association Study (GWAS) has dominated the investigation of human disease and other complex traits. While GWAS have been successful in identifying a large number of variants associated with various phenotypes, the overall amount of heritability explained by these variants remains small. This raises the question of how best to follow up on a GWAS, localize causal variants accounting for GWAS hits, and as a consequence explain more of the so-called "missing" heritability. Advances in high throughput sequencing technologies now allow for the efficient and cost-effective collection of vast amounts of fine-scale genomic data to complement GWAS. We investigate these issues using a colon cancer dataset. After QC, our data consisted of 1993 cases, 899 controls. Using marginal tests of associations, we identify 10 variants distributed among six targeted regions that are significantly associated with colorectal cancer, with eight of the variants being novel to this study. Additionally, we perform so-called 'SNP-set' tests of association and identify two sets of variants that implicate both common and rare variants in the etiology of colorectal cancer. Here we present a large-scale targeted re-sequencing resource focusing on genomic regions implicated in colorectal cancer susceptibility previously identified in several GWAS, which aims to 1) provide fine-scale targeted sequencing data for fine-mapping and 2) provide data resources to address methodological questions regarding the design of sequencing-based follow-up studies to GWAS. Additionally, we show that this strategy successfully identifies novel variants associated with colorectal cancer susceptibility and can implicate both common and rare variants.

  12. Crystal Structure of Serine Racemase that Produces Neurotransmitter d-Serine for Stimulation of the NMDA Receptor

    NASA Astrophysics Data System (ADS)

    Goto, Masaru

    d-Serine is an endogenous coagonist for the N-methyl-d-aspartate receptor and is involved in excitatory neurotransmission in the brain. Mammalian pyridoxal 5’-phosphate-dependent serine racemase, which is localized in the mammalian brain, catalyzes the racemization of l-serine to yield d-serine and vice versa. We have determined the structures of three forms of the mammalian enzyme homolog from Schizosaccharomyces pombe. Lys57 and Ser82 located on the protein and solvent sides, respectively, with respect to the cofactor plane, are acid-base catalysts that shuttle protons to the substrate. The modified enzyme, which has a unique lysino-d-alanyl residue at the active site, also binds the substrate serine in the active site, suggesting that the lysino-d-alanyl residue acts as a catalytic base in the same manner as Lys57 of the wild type enzyme.

  13. Hemoglobin Variants: Biochemical Properties and Clinical Correlates

    PubMed Central

    Thom, Christopher S.; Dickson, Claire F.; Gell, David A.; Weiss, Mitchell J.

    2013-01-01

    Diseases affecting hemoglobin synthesis and function are extremely common worldwide. More than 1000 naturally occurring human hemoglobin variants with single amino acid substitutions throughout the molecule have been discovered, mainly through their clinical and/or laboratory manifestations. These variants alter hemoglobin structure and biochemical properties with physiological effects ranging from insignificant to severe. Studies of these mutations in patients and in the laboratory have produced a wealth of information on hemoglobin biochemistry and biology with significant implications for hematology practice. More generally, landmark studies of hemoglobin performed over the past 60 years have established important paradigms for the disciplines of structural biology, genetics, biochemistry, and medicine. Here we review the major classes of hemoglobin variants, emphasizing general concepts and illustrative examples. PMID:23388674

  14. Selection of an HLA-C*03:04-Restricted HIV-1 p24 Gag Sequence Variant Is Associated with Viral Escape from KIR2DL3+ Natural Killer Cells: Data from an Observational Cohort in South Africa.

    PubMed

    Hölzemer, Angelique; Thobakgale, Christina F; Jimenez Cruz, Camilo A; Garcia-Beltran, Wilfredo F; Carlson, Jonathan M; van Teijlingen, Nienke H; Mann, Jaclyn K; Jaggernath, Manjeetha; Kang, Seung-gu; Körner, Christian; Chung, Amy W; Schafer, Jamie L; Evans, David T; Alter, Galit; Walker, Bruce D; Goulder, Philip J; Carrington, Mary; Hartmann, Pia; Pertel, Thomas; Zhou, Ruhong; Ndung'u, Thumbi; Altfeld, Marcus

    2015-11-01

    Viruses can evade immune surveillance, but the underlying mechanisms are insufficiently understood. Here, we sought to understand the mechanisms by which natural killer (NK) cells recognize HIV-1-infected cells and how this virus can evade NK-cell-mediated immune pressure. Two sequence mutations in p24 Gag associated with the presence of specific KIR/HLA combined genotypes were identified in HIV-1 clade C viruses from a large cohort of infected, untreated individuals in South Africa (n = 392), suggesting viral escape from KIR+ NK cells through sequence variations within HLA class I-presented epitopes. One sequence polymorphism at position 303 of p24 Gag (TGag303V), selected for in infected individuals with both KIR2DL3 and HLA-C*03:04, enabled significantly better binding of the inhibitory KIR2DL3 receptor to HLA-C*03:04-expressing cells presenting this variant epitope compared to the wild-type epitope (wild-type mean 18.01 ± 10.45 standard deviation [SD] and variant mean 44.67 ± 14.42 SD, p = 0.002). Furthermore, activation of primary KIR2DL3+ NK cells from healthy donors in response to HLA-C*03:04+ target cells presenting the variant epitope was significantly reduced in comparison to cells presenting the wild-type sequence (wild-type mean 0.78 ± 0.07 standard error of the mean [SEM] and variant mean 0.63 ± 0.07 SEM, p = 0.012). Structural modeling and surface plasmon resonance of KIR/peptide/HLA interactions in the context of the different viral sequence variants studied supported these results. Future studies will be needed to assess processing and antigen presentation of the investigated HIV-1 epitope in natural infection, and the consequences for viral control. These data provide novel insights into how viruses can evade NK cell immunity through the selection of mutations in HLA-presented epitopes that enhance binding to inhibitory NK cell receptors. 
Better understanding of the mechanisms by which HIV-1 evades NK-cell-mediated immune pressure and the functional validation of a structural modeling approach will facilitate the development of novel targeted immune interventions to harness the antiviral activities of NK cells.

  15. New Genes and New Insights from Old Genes: Update on Alzheimer Disease

    PubMed Central

    Ringman, John M.; Coppola, Giovanni

    2013-01-01

    Purpose of Review: This article discusses the current status of knowledge regarding the genetic basis of Alzheimer disease (AD) with a focus on clinically relevant aspects. Recent Findings: The genetic architecture of AD is complex, as it includes multiple susceptibility genes and likely nongenetic factors. Rare but highly penetrant autosomal dominant mutations explain a small minority of the cases but have allowed tremendous advances in understanding disease pathogenesis. The identification of a strong genetic risk factor, APOE, reshaped the field and introduced the notion of genetic risk for AD. More recently, large-scale genome-wide association studies are adding to the picture a number of common variants with very small effect sizes. Large-scale resequencing studies are expected to identify additional risk factors, including rare susceptibility variants and structural variation. Summary: Genetic assessment is currently of limited utility in clinical practice because of the low frequency (Mendelian mutations) or small effect size (common risk factors) of the currently known susceptibility genes. However, genetic studies are identifying with confidence a number of novel risk genes, and this will further our understanding of disease biology and possibly the identification of therapeutic targets. PMID:23558482

  16. Random Plant Viral Variants Attain Temporal Advantages During Systemic Infections and in Turn Resist other Variants of the Same Virus.

    PubMed

    Zhang, Xiao-Feng; Guo, Jiangbo; Zhang, Xiuchun; Meulia, Tea; Paul, Pierce; Madden, Laurence V; Li, Dawei; Qu, Feng

    2015-10-20

    Infection of plants with viruses containing multiple variants frequently leads to dominance by a few random variants in the systemically infected leaves (SLs), for which a plausible explanation is lacking. We show here that SL dominance by a given viral variant is adequately explained by its fortuitous lead in systemic spread, coupled with its resistance to superinfection by other variants. We analyzed the fate of a multi-variant turnip crinkle virus (TCV) population in Arabidopsis and N. benthamiana plants. Both wild-type and RNA silencing-defective plants displayed a similar pattern of random dominance by a few variant genotypes, thus discounting a prominent role for RNA silencing. When introduced to plants sequentially as two subpopulations, a twelve-hour head-start was sufficient for the first set to dominate. Finally, SLs of TCV-infected plants became highly resistant to secondary invasions of another TCV variant. We propose that random distribution of variant foci on inoculated leaves allows different variants to lead systemic movement in different plants. The leading variants then colonize large areas of SLs, and resist the superinfection of lagging variants in the same areas. In conclusion, superinfection resistance is the primary driver of random enrichment of viral variants in systemically infected plants.

  17. Disruption of N terminus long range non covalent interactions shifted temp.opt 25°C to cold: Evolution of point mutant Bacillus lipase by error prone PCR.

    PubMed

    Goomber, Shelly; Kumar, Arbind; Kaur, Jagdeep

    2016-01-15

    Cold-adapted enzymes have applications in detergent, textile, food, bioremediation and biotechnology processes. Bacillus lipases are 'generally recognized as safe' (GRAS) and hence are industrially attractive. Bacillus lipases of subfamily 1.4 have the lowest molecular weight and unfold reversibly owing to the absence of disulphide bonds. They are therefore widely used to study the energetics of protein stability, that is, the unfolding of the native protein to the fully unfolded state. In the present study, the metagenomically isolated Bacillus lipase LipJ was evolved in the laboratory for cold adaptation by error-prone PCR. A library of variants was screened for high relative activity at a low temperature of 10°C compared with the native protein LipJ. A point mutant, sequenced as Phe19→Leu, was found to be active in the cold and was selected for extensive biochemical and biophysical characterization. Variant F19L showed its maximum activity at 10°C, where the parent protein LipJ had 20% relative activity. The psychrophilic nature of F19L was established by its roughly 50% relative activity at 5°C, where the native protein was unable to act. Variant F19L showed no activity at 40°C and above, establishing its thermolabile nature. Thermostability studies showed the mutant to be unstable above 20°C, with a threefold decrease in its half-life at 30°C compared with the native protein. Far-UV CD and intrinsic fluorescence studies demonstrated an unstable tertiary structure of point variant F19L, leading to its unfolding at a temperature as low as 20°C. Cold adaptation of mutant F19L is accompanied by increased specific activity. The mutant was catalytically more efficient, with a 1.3-fold increase in kcat. Homology structure modelling predicted disruption of the intersecondary hydrophobic core formed by the aromatic ring of Phe19 with nonpolar residues located at β3, β4, β5, β6 and αF. The increased local flexibility of variant F19L explains the molecular basis of its psychrophilic nature. Copyright © 2015 Elsevier B.V. All rights reserved.

  18. Large transcription units unify copy number variants and common fragile sites arising under replication stress.

    PubMed

    Wilson, Thomas E; Arlt, Martin F; Park, So Hae; Rajendran, Sountharia; Paulsen, Michelle; Ljungman, Mats; Glover, Thomas W

    2015-02-01

    Copy number variants (CNVs) resulting from genomic deletions and duplications and common fragile sites (CFSs) seen as breaks on metaphase chromosomes are distinct forms of structural chromosome instability precipitated by replication inhibition. Although they share a common induction mechanism, it is not known how CNVs and CFSs are related or why some genomic loci are much more prone to their occurrence. Here we compare large sets of de novo CNVs and CFSs in several experimental cell systems to each other and to overlapping genomic features. We first show that CNV hotspots and CFSs occurred at the same human loci within a given cultured cell line. Bru-seq nascent RNA sequencing further demonstrated that although genomic regions with low CNV frequencies were enriched in transcribed genes, the CNV hotspots that matched CFSs specifically corresponded to the largest active transcription units in both human and mouse cells. Consistently, active transcription units >1 Mb were robust cell-type-specific predictors of induced CNV hotspots and CFS loci. Unlike most transcribed genes, these very large transcription units replicated late and organized deletion and duplication CNVs into their transcribed and flanking regions, respectively, supporting a role for transcription in replication-dependent lesion formation. These results indicate that active large transcription units drive extreme locus- and cell-type-specific genomic instability under replication stress, resulting in both CNVs and CFSs as different manifestations of perturbed replication dynamics. © 2015 Wilson et al.; Published by Cold Spring Harbor Laboratory Press.
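
    The abstract's observation that active transcription units longer than 1 Mb predict CNV hotspot loci can be pictured as a simple interval comparison. The sketch below is illustrative only and is not the authors' analysis: the Interval type, the function names and the coordinates are hypothetical, and the abstract does not state how overlap was actually computed.

```python
from typing import List, NamedTuple


class Interval(NamedTuple):
    chrom: str
    start: int  # 0-based, inclusive
    end: int    # exclusive
    name: str


ONE_MB = 1_000_000  # length threshold quoted in the abstract (>1 Mb)


def large_units(units: List[Interval], min_length: int = ONE_MB) -> List[Interval]:
    """Keep transcription units strictly longer than min_length."""
    return [u for u in units if u.end - u.start > min_length]


def overlaps(a: Interval, b: Interval) -> bool:
    """True when two half-open intervals on the same chromosome share a base."""
    return a.chrom == b.chrom and a.start < b.end and b.start < a.end


def hotspots_in_large_units(units: List[Interval], hotspots: List[Interval]) -> List[str]:
    """Names of hotspots that overlap at least one large transcription unit."""
    big = large_units(units)
    return [h.name for h in hotspots if any(overlaps(h, u) for u in big)]


if __name__ == "__main__":
    # Made-up coordinates, for illustration only.
    toy_units = [
        Interval("chr1", 0, 1_500_000, "TU_A"),          # 1.5 Mb, large
        Interval("chr1", 2_000_000, 2_300_000, "TU_B"),  # 0.3 Mb, not large
        Interval("chr2", 0, 1_200_000, "TU_C"),          # 1.2 Mb, large
    ]
    toy_hotspots = [
        Interval("chr1", 1_400_000, 1_450_000, "H1"),
        Interval("chr1", 2_100_000, 2_150_000, "H2"),
        Interval("chr2", 1_300_000, 1_350_000, "H3"),
    ]
    print(hotspots_in_large_units(toy_units, toy_hotspots))
```

    A real comparison would also need to restrict the units to those actively transcribed in the cell type of interest, since the abstract stresses that the prediction is cell-type-specific.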

  19. Large transcription units unify copy number variants and common fragile sites arising under replication stress

    PubMed Central

    Park, So Hae; Rajendran, Sountharia; Paulsen, Michelle; Ljungman, Mats; Glover, Thomas W.

    2015-01-01

    Copy number variants (CNVs) resulting from genomic deletions and duplications and common fragile sites (CFSs) seen as breaks on metaphase chromosomes are distinct forms of structural chromosome instability precipitated by replication inhibition. Although they share a common induction mechanism, it is not known how CNVs and CFSs are related or why some genomic loci are much more prone to their occurrence. Here we compare large sets of de novo CNVs and CFSs in several experimental cell systems to each other and to overlapping genomic features. We first show that CNV hotspots and CFSs occurred at the same human loci within a given cultured cell line. Bru-seq nascent RNA sequencing further demonstrated that although genomic regions with low CNV frequencies were enriched in transcribed genes, the CNV hotspots that matched CFSs specifically corresponded to the largest active transcription units in both human and mouse cells. Consistently, active transcription units >1 Mb were robust cell-type-specific predictors of induced CNV hotspots and CFS loci. Unlike most transcribed genes, these very large transcription units replicated late and organized deletion and duplication CNVs into their transcribed and flanking regions, respectively, supporting a role for transcription in replication-dependent lesion formation. These results indicate that active large transcription units drive extreme locus- and cell-type-specific genomic instability under replication stress, resulting in both CNVs and CFSs as different manifestations of perturbed replication dynamics. PMID:25373142

  20. Breast and Prostate Cancer and Hormone-Related Gene Variant Study

    Cancer.gov

    The Breast and Prostate Cancer and Hormone-Related Gene Variant Study allows large-scale analyses of breast and prostate cancer risk in relation to genetic polymorphisms and gene-environment interactions that affect hormone metabolism.

  1. Effects of myosin variants on interacting-heads motif explain distinct hypertrophic and dilated cardiomyopathy phenotypes

    PubMed Central

    Alamo, Lorenzo; Ware, James S; Pinto, Antonio; Gillilan, Richard E; Seidman, Jonathan G; Seidman, Christine E; Padrón, Raúl

    2017-01-01

    Cardiac β-myosin variants cause hypertrophic (HCM) or dilated (DCM) cardiomyopathy by disrupting sarcomere contraction and relaxation. The locations of variants on isolated myosin head structures predict contractility effects but not the prominent relaxation and energetic deficits that characterize HCM. During relaxation, pairs of myosins form interacting-heads motif (IHM) structures that with other sarcomere proteins establish an energy-saving, super-relaxed (SRX) state. Using a human β-cardiac myosin IHM quasi-atomic model, we defined interaction sites between adjacent myosin heads and associated protein partners, and then analyzed rare variants from 6112 HCM and 1315 DCM patients and 33,370 ExAC controls. HCM variants, 72% of which changed electrostatic charges, disproportionately altered IHM interaction residues (expected 23%; HCM 54%, p=2.6×10^-19; DCM 26%, p=0.66; controls 20%, p=0.23). HCM variant locations predict impaired IHM formation and stability, and attenuation of the SRX state - accounting for altered contractility, reduced diastolic relaxation, and increased energy consumption, that fully characterizes HCM pathogenesis. DOI: http://dx.doi.org/10.7554/eLife.24634.001 PMID:28606303

  2. PB1-F2 Influenza A Virus Protein Adopts a β-Sheet Conformation and Forms Amyloid Fibers in Membrane Environments

    PubMed Central

    Chevalier, Christophe; Al Bazzal, Ali; Vidic, Jasmina; Février, Vincent; Bourdieu, Christiane; Bouguyon, Edwige; Le Goffic, Ronan; Vautherot, Jean-François; Bernard, Julie; Moudjou, Mohammed; Noinville, Sylvie; Chich, Jean-François; Da Costa, Bruno; Rezaei, Human; Delmas, Bernard

    2010-01-01

    The influenza A virus PB1-F2 protein, encoded by an alternative reading frame in the PB1 polymerase gene, displays a high sequence polymorphism and is reported to contribute to viral pathogenesis in a sequence-specific manner. To gain insights into the functions of PB1-F2, the molecular structure of several PB1-F2 variants produced in Escherichia coli was investigated in different environments. Circular dichroism spectroscopy shows that all variants have a random coil secondary structure in aqueous solution. When incubated in trifluoroethanol polar solvent, all PB1-F2 variants adopt an α-helix-rich structure, whereas incubated in acetonitrile, a solvent of medium polarity mimicking the membrane environment, they display β-sheet secondary structures. Incubated with asolectin liposomes and SDS micelles, PB1-F2 variants also acquire a β-sheet structure. Dynamic light scattering revealed that the presence of β-sheets is correlated with an oligomerization/aggregation of PB1-F2. Electron microscopy showed that PB1-F2 forms amorphous aggregates in acetonitrile. In contrast, at low concentrations of SDS, PB1-F2 variants exhibited various abilities to form fibers that were evidenced as amyloid fibers in a thioflavin T assay. Using a recombinant virus and its PB1-F2 knock-out mutant, we show that PB1-F2 also forms amyloid structures in infected cells. Functional membrane permeabilization assays revealed that the PB1-F2 variants can perforate membranes at nanomolar concentrations but with activities found to be sequence-dependent and not obviously correlated with their differential ability to form amyloid fibers. All of these observations suggest that PB1-F2 could be involved in physiological processes through different pathways, permeabilization of cellular membranes, and amyloid fiber formation. PMID:20172856

  3. PB1-F2 influenza A virus protein adopts a beta-sheet conformation and forms amyloid fibers in membrane environments.

    PubMed

    Chevalier, Christophe; Al Bazzal, Ali; Vidic, Jasmina; Février, Vincent; Bourdieu, Christiane; Bouguyon, Edwige; Le Goffic, Ronan; Vautherot, Jean-François; Bernard, Julie; Moudjou, Mohammed; Noinville, Sylvie; Chich, Jean-François; Da Costa, Bruno; Rezaei, Human; Delmas, Bernard

    2010-04-23

    The influenza A virus PB1-F2 protein, encoded by an alternative reading frame in the PB1 polymerase gene, displays a high sequence polymorphism and is reported to contribute to viral pathogenesis in a sequence-specific manner. To gain insights into the functions of PB1-F2, the molecular structure of several PB1-F2 variants produced in Escherichia coli was investigated in different environments. Circular dichroism spectroscopy shows that all variants have a random coil secondary structure in aqueous solution. When incubated in trifluoroethanol polar solvent, all PB1-F2 variants adopt an alpha-helix-rich structure, whereas incubated in acetonitrile, a solvent of medium polarity mimicking the membrane environment, they display beta-sheet secondary structures. Incubated with asolectin liposomes and SDS micelles, PB1-F2 variants also acquire a beta-sheet structure. Dynamic light scattering revealed that the presence of beta-sheets is correlated with an oligomerization/aggregation of PB1-F2. Electron microscopy showed that PB1-F2 forms amorphous aggregates in acetonitrile. In contrast, at low concentrations of SDS, PB1-F2 variants exhibited various abilities to form fibers that were evidenced as amyloid fibers in a thioflavin T assay. Using a recombinant virus and its PB1-F2 knock-out mutant, we show that PB1-F2 also forms amyloid structures in infected cells. Functional membrane permeabilization assays revealed that the PB1-F2 variants can perforate membranes at nanomolar concentrations but with activities found to be sequence-dependent and not obviously correlated with their differential ability to form amyloid fibers. All of these observations suggest that PB1-F2 could be involved in physiological processes through different pathways, permeabilization of cellular membranes, and amyloid fiber formation.

  4. Genetics and Genomics of Single-Gene Cardiovascular Diseases: Common Hereditary Cardiomyopathies as Prototypes of Single-Gene Disorders

    PubMed Central

    Marian, Ali J.; van Rooij, Eva; Roberts, Robert

    2016-01-01

    This is the first of 2 review papers on genetics and genomics appearing as part of the series on “omics.” Genomics pertains to all components of an organism’s genes, whereas genetics involves analysis of a specific gene(s) in the context of heredity. The paper provides introductory comments, describes the basis of human genetic diversity, and addresses the phenotypic consequences of genetic variants. Rare variants with large effect sizes are responsible for single-gene disorders, whereas complex polygenic diseases are typically due to multiple genetic variants, each exerting a modest effect size. To illustrate the clinical implications of genetic variants with large effect sizes, 3 common forms of hereditary cardiomyopathies are discussed as prototypic examples of single-gene disorders, including their genetics, clinical manifestations, pathogenesis, and treatment. The genetic basis of complex traits is discussed in a separate paper. PMID:28007145

  5. Clinical application of ACMG-AMP guidelines in HNF1A and GCK variants in a cohort of MODY families.

    PubMed

    Santana, L S; Caetano, L A; Costa-Riquetto, A D; Quedas, E P S; Nery, M; Collett-Solberg, P; Boguszewski, M C S; Vendramini, M F; Crisostomo, L G; Floh, F O; Zarabia, Z I; Kohara, S K; Guastapaglia, L; Passone, C G B; Sewaybricker, L E; Jorge, A A L; Teles, M G

    2017-10-01

    Maturity-onset diabetes of the young (MODY) is a form of monogenic diabetes with autosomal dominant inheritance. GCK-MODY and HNF1A-MODY are the prevalent subtypes. Currently, there is growing concern regarding the correct interpretation of molecular genetic findings. The American College of Medical Genetics and Genomics (ACMG) updated guidelines to interpret and classify molecular variants. This study aimed to determine the prevalence of MODY (GCK/HNF1A) in a large cohort of Brazilian families, to report variants related to phenotype, and to classify them according to ACMG guidelines. One hundred and nine probands were investigated, 45% with clinical suspicion of GCK-MODY and 55% with suspicion of HNF1A-MODY. Twenty-five different variants were identified in the GCK gene (30 probands, 61% positivity), and 7 variants in HNF1A (10 probands, 17% positivity). Fourteen of them were novel (12 GCK, 2 HNF1A). ACMG guidelines were able to classify a large portion of variants as pathogenic (36% GCK, 86% HNF1A) and likely pathogenic (44% GCK, 14% HNF1A), with 16% (5/32) as uncertain significance. This allows us to determine the pathogenicity classification more efficiently, and also reinforces the suspected associations with the phenotype among novel variants. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  6. Whole-Exome Sequencing Identifies Rare and Low-Frequency Coding Variants Associated with LDL Cholesterol

    PubMed Central

    Lange, Leslie A.; Hu, Youna; Zhang, He; Xue, Chenyi; Schmidt, Ellen M.; Tang, Zheng-Zheng; Bizon, Chris; Lange, Ethan M.; Smith, Joshua D.; Turner, Emily H.; Jun, Goo; Kang, Hyun Min; Peloso, Gina; Auer, Paul; Li, Kuo-ping; Flannick, Jason; Zhang, Ji; Fuchsberger, Christian; Gaulton, Kyle; Lindgren, Cecilia; Locke, Adam; Manning, Alisa; Sim, Xueling; Rivas, Manuel A.; Holmen, Oddgeir L.; Gottesman, Omri; Lu, Yingchang; Ruderfer, Douglas; Stahl, Eli A.; Duan, Qing; Li, Yun; Durda, Peter; Jiao, Shuo; Isaacs, Aaron; Hofman, Albert; Bis, Joshua C.; Correa, Adolfo; Griswold, Michael E.; Jakobsdottir, Johanna; Smith, Albert V.; Schreiner, Pamela J.; Feitosa, Mary F.; Zhang, Qunyuan; Huffman, Jennifer E.; Crosby, Jacy; Wassel, Christina L.; Do, Ron; Franceschini, Nora; Martin, Lisa W.; Robinson, Jennifer G.; Assimes, Themistocles L.; Crosslin, David R.; Rosenthal, Elisabeth A.; Tsai, Michael; Rieder, Mark J.; Farlow, Deborah N.; Folsom, Aaron R.; Lumley, Thomas; Fox, Ervin R.; Carlson, Christopher S.; Peters, Ulrike; Jackson, Rebecca D.; van Duijn, Cornelia M.; Uitterlinden, André G.; Levy, Daniel; Rotter, Jerome I.; Taylor, Herman A.; Gudnason, Vilmundur; Siscovick, David S.; Fornage, Myriam; Borecki, Ingrid B.; Hayward, Caroline; Rudan, Igor; Chen, Y. Eugene; Bottinger, Erwin P.; Loos, Ruth J.F.; Sætrom, Pål; Hveem, Kristian; Boehnke, Michael; Groop, Leif; McCarthy, Mark; Meitinger, Thomas; Ballantyne, Christie M.; Gabriel, Stacey B.; O’Donnell, Christopher J.; Post, Wendy S.; North, Kari E.; Reiner, Alexander P.; Boerwinkle, Eric; Psaty, Bruce M.; Altshuler, David; Kathiresan, Sekar; Lin, Dan-Yu; Jarvik, Gail P.; Cupples, L. 
Adrienne; Kooperberg, Charles; Wilson, James G.; Nickerson, Deborah A.; Abecasis, Goncalo R.; Rich, Stephen S.; Tracy, Russell P.; Willer, Cristen J.; Gabriel, Stacey B.; Altshuler, David M.; Abecasis, Gonçalo R.; Allayee, Hooman; Cresci, Sharon; Daly, Mark J.; de Bakker, Paul I.W.; DePristo, Mark A.; Do, Ron; Donnelly, Peter; Farlow, Deborah N.; Fennell, Tim; Garimella, Kiran; Hazen, Stanley L.; Hu, Youna; Jordan, Daniel M.; Jun, Goo; Kathiresan, Sekar; Kang, Hyun Min; Kiezun, Adam; Lettre, Guillaume; Li, Bingshan; Li, Mingyao; Newton-Cheh, Christopher H.; Padmanabhan, Sandosh; Peloso, Gina; Pulit, Sara; Rader, Daniel J.; Reich, David; Reilly, Muredach P.; Rivas, Manuel A.; Schwartz, Steve; Scott, Laura; Siscovick, David S.; Spertus, John A.; Stitziel, Nathaniel O.; Stoletzki, Nina; Sunyaev, Shamil R.; Voight, Benjamin F.; Willer, Cristen J.; Rich, Stephen S.; Akylbekova, Ermeg; Atwood, Larry D.; Ballantyne, Christie M.; Barbalic, Maja; Barr, R. Graham; Benjamin, Emelia J.; Bis, Joshua; Boerwinkle, Eric; Bowden, Donald W.; Brody, Jennifer; Budoff, Matthew; Burke, Greg; Buxbaum, Sarah; Carr, Jeff; Chen, Donna T.; Chen, Ida Y.; Chen, Wei-Min; Concannon, Pat; Crosby, Jacy; Cupples, L. Adrienne; D’Agostino, Ralph; DeStefano, Anita L.; Dreisbach, Albert; Dupuis, Josée; Durda, J. 
Peter; Ellis, Jaclyn; Folsom, Aaron R.; Fornage, Myriam; Fox, Caroline S.; Fox, Ervin; Funari, Vincent; Ganesh, Santhi K.; Gardin, Julius; Goff, David; Gordon, Ora; Grody, Wayne; Gross, Myron; Guo, Xiuqing; Hall, Ira M.; Heard-Costa, Nancy L.; Heckbert, Susan R.; Heintz, Nicholas; Herrington, David M.; Hickson, DeMarc; Huang, Jie; Hwang, Shih-Jen; Jacobs, David R.; Jenny, Nancy S.; Johnson, Andrew D.; Johnson, Craig W.; Kawut, Steven; Kronmal, Richard; Kurz, Raluca; Lange, Ethan M.; Lange, Leslie A.; Larson, Martin G.; Lawson, Mark; Lewis, Cora E.; Levy, Daniel; Li, Dalin; Lin, Honghuang; Liu, Chunyu; Liu, Jiankang; Liu, Kiang; Liu, Xiaoming; Liu, Yongmei; Longstreth, William T.; Loria, Cay; Lumley, Thomas; Lunetta, Kathryn; Mackey, Aaron J.; Mackey, Rachel; Manichaikul, Ani; Maxwell, Taylor; McKnight, Barbara; Meigs, James B.; Morrison, Alanna C.; Musani, Solomon K.; Mychaleckyj, Josyf C.; Nettleton, Jennifer A.; North, Kari; O’Donnell, Christopher J.; O’Leary, Daniel; Ong, Frank; Palmas, Walter; Pankow, James S.; Pankratz, Nathan D.; Paul, Shom; Perez, Marco; Person, Sharina D.; Polak, Joseph; Post, Wendy S.; Psaty, Bruce M.; Quinlan, Aaron R.; Raffel, Leslie J.; Ramachandran, Vasan S.; Reiner, Alexander P.; Rice, Kenneth; Rotter, Jerome I.; Sanders, Jill P.; Schreiner, Pamela; Seshadri, Sudha; Shea, Steve; Sidney, Stephen; Silverstein, Kevin; Smith, Nicholas L.; Sotoodehnia, Nona; Srinivasan, Asoke; Taylor, Herman A.; Taylor, Kent; Thomas, Fridtjof; Tracy, Russell P.; Tsai, Michael Y.; Volcik, Kelly A.; Wassel, Chrstina L.; Watson, Karol; Wei, Gina; White, Wendy; Wiggins, Kerri L.; Wilk, Jemma B.; Williams, O. 
Dale; Wilson, Gregory; Wilson, James G.; Wolf, Phillip; Zakai, Neil A.; Hardy, John; Meschia, James F.; Nalls, Michael; Singleton, Andrew; Worrall, Brad; Bamshad, Michael J.; Barnes, Kathleen C.; Abdulhamid, Ibrahim; Accurso, Frank; Anbar, Ran; Beaty, Terri; Bigham, Abigail; Black, Phillip; Bleecker, Eugene; Buckingham, Kati; Cairns, Anne Marie; Caplan, Daniel; Chatfield, Barbara; Chidekel, Aaron; Cho, Michael; Christiani, David C.; Crapo, James D.; Crouch, Julia; Daley, Denise; Dang, Anthony; Dang, Hong; De Paula, Alicia; DeCelie-Germana, Joan; Drumm, Allen DozorMitch; Dyson, Maynard; Emerson, Julia; Emond, Mary J.; Ferkol, Thomas; Fink, Robert; Foster, Cassandra; Froh, Deborah; Gao, Li; Gershan, William; Gibson, Ronald L.; Godwin, Elizabeth; Gondor, Magdalen; Gutierrez, Hector; Hansel, Nadia N.; Hassoun, Paul M.; Hiatt, Peter; Hokanson, John E.; Howenstine, Michelle; Hummer, Laura K.; Kanga, Jamshed; Kim, Yoonhee; Knowles, Michael R.; Konstan, Michael; Lahiri, Thomas; Laird, Nan; Lange, Christoph; Lin, Lin; Lin, Xihong; Louie, Tin L.; Lynch, David; Make, Barry; Martin, Thomas R.; Mathai, Steve C.; Mathias, Rasika A.; McNamara, John; McNamara, Sharon; Meyers, Deborah; Millard, Susan; Mogayzel, Peter; Moss, Richard; Murray, Tanda; Nielson, Dennis; Noyes, Blakeslee; O’Neal, Wanda; Orenstein, David; O’Sullivan, Brian; Pace, Rhonda; Pare, Peter; Parker, H. 
Worth; Passero, Mary Ann; Perkett, Elizabeth; Prestridge, Adrienne; Rafaels, Nicholas M.; Ramsey, Bonnie; Regan, Elizabeth; Ren, Clement; Retsch-Bogart, George; Rock, Michael; Rosen, Antony; Rosenfeld, Margaret; Ruczinski, Ingo; Sanford, Andrew; Schaeffer, David; Sell, Cindy; Sheehan, Daniel; Silverman, Edwin K.; Sin, Don; Spencer, Terry; Stonebraker, Jackie; Tabor, Holly K.; Varlotta, Laurie; Vergara, Candelaria I.; Weiss, Robert; Wigley, Fred; Wise, Robert A.; Wright, Fred A.; Wurfel, Mark M.; Zanni, Robert; Zou, Fei; Nickerson, Deborah A.; Rieder, Mark J.; Green, Phil; Shendure, Jay; Akey, Joshua M.; Bustamante, Carlos D.; Crosslin, David R.; Eichler, Evan E.; Fox, P. Keolu; Fu, Wenqing; Gordon, Adam; Gravel, Simon; Jarvik, Gail P.; Johnsen, Jill M.; Kan, Mengyuan; Kenny, Eimear E.; Kidd, Jeffrey M.; Lara-Garduno, Fremiet; Leal, Suzanne M.; Liu, Dajiang J.; McGee, Sean; O’Connor, Timothy D.; Paeper, Bryan; Robertson, Peggy D.; Smith, Joshua D.; Staples, Jeffrey C.; Tennessen, Jacob A.; Turner, Emily H.; Wang, Gao; Yi, Qian; Jackson, Rebecca; Peters, Ulrike; Carlson, Christopher S.; Anderson, Garnet; Anton-Culver, Hoda; Assimes, Themistocles L.; Auer, Paul L.; Beresford, Shirley; Bizon, Chris; Black, Henry; Brunner, Robert; Brzyski, Robert; Burwen, Dale; Caan, Bette; Carty, Cara L.; Chlebowski, Rowan; Cummings, Steven; Curb, J. 
David; Eaton, Charles B.; Ford, Leslie; Franceschini, Nora; Fullerton, Stephanie M.; Gass, Margery; Geller, Nancy; Heiss, Gerardo; Howard, Barbara V.; Hsu, Li; Hutter, Carolyn M.; Ioannidis, John; Jiao, Shuo; Johnson, Karen C.; Kooperberg, Charles; Kuller, Lewis; LaCroix, Andrea; Lakshminarayan, Kamakshi; Lane, Dorothy; Lasser, Norman; LeBlanc, Erin; Li, Kuo-Ping; Limacher, Marian; Lin, Dan-Yu; Logsdon, Benjamin A.; Ludlam, Shari; Manson, JoAnn E.; Margolis, Karen; Martin, Lisa; McGowan, Joan; Monda, Keri L.; Kotchen, Jane Morley; Nathan, Lauren; Ockene, Judith; O’Sullivan, Mary Jo; Phillips, Lawrence S.; Prentice, Ross L.; Robbins, John; Robinson, Jennifer G.; Rossouw, Jacques E.; Sangi-Haghpeykar, Haleh; Sarto, Gloria E.; Shumaker, Sally; Simon, Michael S.; Stefanick, Marcia L.; Stein, Evan; Tang, Hua; Taylor, Kira C.; Thomson, Cynthia A.; Thornton, Timothy A.; Van Horn, Linda; Vitolins, Mara; Wactawski-Wende, Jean; Wallace, Robert; Wassertheil-Smoller, Sylvia; Zeng, Donglin; Applebaum-Bowden, Deborah; Feolo, Michael; Gan, Weiniu; Paltoo, Dina N.; Sholinsky, Phyliss; Sturcke, Anne

    2014-01-01

    Elevated low-density lipoprotein cholesterol (LDL-C) is a treatable, heritable risk factor for cardiovascular disease. Genome-wide association studies (GWASs) have identified 157 variants associated with lipid levels but are not well suited to assess the impact of rare and low-frequency variants. To determine whether rare or low-frequency coding variants are associated with LDL-C, we exome sequenced 2,005 individuals, including 554 individuals selected for extreme LDL-C (>98th or <2nd percentile). Follow-up analyses included sequencing of 1,302 additional individuals and genotype-based analysis of 52,221 individuals. We observed significant evidence of association between LDL-C and the burden of rare or low-frequency variants in PNPLA5, encoding a phospholipase-domain-containing protein, and both known and previously unidentified variants in PCSK9, LDLR and APOB, three known lipid-related genes. The effect sizes for the burden of rare variants for each associated gene were substantially higher than those observed for individual SNPs identified from GWASs. We replicated the PNPLA5 signal in an independent large-scale sequencing study of 2,084 individuals. In conclusion, this large whole-exome-sequencing study for LDL-C identified a gene not known to be implicated in LDL-C and provides unique insight into the design and analysis of similar experiments. PMID:24507775

  7. Whole-exome sequencing identifies rare and low-frequency coding variants associated with LDL cholesterol.

    PubMed

    Lange, Leslie A; Hu, Youna; Zhang, He; Xue, Chenyi; Schmidt, Ellen M; Tang, Zheng-Zheng; Bizon, Chris; Lange, Ethan M; Smith, Joshua D; Turner, Emily H; Jun, Goo; Kang, Hyun Min; Peloso, Gina; Auer, Paul; Li, Kuo-Ping; Flannick, Jason; Zhang, Ji; Fuchsberger, Christian; Gaulton, Kyle; Lindgren, Cecilia; Locke, Adam; Manning, Alisa; Sim, Xueling; Rivas, Manuel A; Holmen, Oddgeir L; Gottesman, Omri; Lu, Yingchang; Ruderfer, Douglas; Stahl, Eli A; Duan, Qing; Li, Yun; Durda, Peter; Jiao, Shuo; Isaacs, Aaron; Hofman, Albert; Bis, Joshua C; Correa, Adolfo; Griswold, Michael E; Jakobsdottir, Johanna; Smith, Albert V; Schreiner, Pamela J; Feitosa, Mary F; Zhang, Qunyuan; Huffman, Jennifer E; Crosby, Jacy; Wassel, Christina L; Do, Ron; Franceschini, Nora; Martin, Lisa W; Robinson, Jennifer G; Assimes, Themistocles L; Crosslin, David R; Rosenthal, Elisabeth A; Tsai, Michael; Rieder, Mark J; Farlow, Deborah N; Folsom, Aaron R; Lumley, Thomas; Fox, Ervin R; Carlson, Christopher S; Peters, Ulrike; Jackson, Rebecca D; van Duijn, Cornelia M; Uitterlinden, André G; Levy, Daniel; Rotter, Jerome I; Taylor, Herman A; Gudnason, Vilmundur; Siscovick, David S; Fornage, Myriam; Borecki, Ingrid B; Hayward, Caroline; Rudan, Igor; Chen, Y Eugene; Bottinger, Erwin P; Loos, Ruth J F; Sætrom, Pål; Hveem, Kristian; Boehnke, Michael; Groop, Leif; McCarthy, Mark; Meitinger, Thomas; Ballantyne, Christie M; Gabriel, Stacey B; O'Donnell, Christopher J; Post, Wendy S; North, Kari E; Reiner, Alexander P; Boerwinkle, Eric; Psaty, Bruce M; Altshuler, David; Kathiresan, Sekar; Lin, Dan-Yu; Jarvik, Gail P; Cupples, L Adrienne; Kooperberg, Charles; Wilson, James G; Nickerson, Deborah A; Abecasis, Goncalo R; Rich, Stephen S; Tracy, Russell P; Willer, Cristen J

    2014-02-06

    Elevated low-density lipoprotein cholesterol (LDL-C) is a treatable, heritable risk factor for cardiovascular disease. Genome-wide association studies (GWASs) have identified 157 variants associated with lipid levels but are not well suited to assess the impact of rare and low-frequency variants. To determine whether rare or low-frequency coding variants are associated with LDL-C, we exome sequenced 2,005 individuals, including 554 individuals selected for extreme LDL-C (>98th or <2nd percentile). Follow-up analyses included sequencing of 1,302 additional individuals and genotype-based analysis of 52,221 individuals. We observed significant evidence of association between LDL-C and the burden of rare or low-frequency variants in PNPLA5, encoding a phospholipase-domain-containing protein, and both known and previously unidentified variants in PCSK9, LDLR and APOB, three known lipid-related genes. The effect sizes for the burden of rare variants for each associated gene were substantially higher than those observed for individual SNPs identified from GWASs. We replicated the PNPLA5 signal in an independent large-scale sequencing study of 2,084 individuals. In conclusion, this large whole-exome-sequencing study for LDL-C identified a gene not known to be implicated in LDL-C and provides unique insight into the design and analysis of similar experiments. Copyright © 2014 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.

  8. Population Structure Shapes Copy Number Variation in Malaria Parasites.

    PubMed

    Cheeseman, Ian H; Miller, Becky; Tan, John C; Tan, Asako; Nair, Shalini; Nkhoma, Standwell C; De Donato, Marcos; Rodulfo, Hectorina; Dondorp, Arjen; Branch, Oralee H; Mesia, Lastenia Ruiz; Newton, Paul; Mayxay, Mayfong; Amambua-Ngwa, Alfred; Conway, David J; Nosten, François; Ferdig, Michael T; Anderson, Tim J C

    2016-03-01

    If copy number variants (CNVs) are predominantly deleterious, we would expect them to be more efficiently purged from populations with a large effective population size (Ne) than from populations with a small Ne. Malaria parasites (Plasmodium falciparum) provide an excellent organism to examine this prediction, because this protozoan shows a broad spectrum of population structures within a single species, with large, stable, outbred populations in Africa, small unstable inbred populations in South America and with intermediate population characteristics in South East Asia. We characterized 122 single-clone parasites, without prior laboratory culture, from malaria-infected patients in seven countries in Africa, South East Asia and South America using a high-density single-nucleotide polymorphism/CNV microarray. We scored 134 high-confidence CNVs across the parasite exome, including 33 deletions and 102 amplifications, which ranged in size from <500 bp to 59 kb, as well as 10,107 flanking, biallelic single-nucleotide polymorphisms. Overall, CNVs were rare, small, and skewed toward low frequency variants, consistent with the deleterious model. Relative to African and South East Asian populations, CNVs were significantly more common in South America, showed significantly less skew in allele frequencies, and were significantly larger. On this background of low frequency CNV, we also identified several high-frequency CNVs under putative positive selection using an FST outlier analysis. These included known adaptive CNVs containing rh2b and pfmdr1, and several other CNVs (e.g., DNA helicase and three conserved proteins) that require further investigation. Our data are consistent with a significant impact of genetic structure on CNV burden in an important human pathogen. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
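    The FST outlier idea mentioned in the abstract above can be illustrated with a minimal sketch. It assumes Wright's FST for a single biallelic locus computed from per-population allele frequencies with equal population weights; the function names (wright_fst, top_fst_loci) and the ranking cut-off are hypothetical and are not taken from the study, which may have used a different estimator and significance procedure.

```python
def wright_fst(freqs):
    """Wright's F_ST = (H_T - H_S) / H_T for one biallelic locus.

    freqs: allele frequency of the same variant in each population
    (equal population weights are an assumption of this sketch).
    """
    n = len(freqs)
    p_bar = sum(freqs) / n
    h_t = 2 * p_bar * (1 - p_bar)          # expected heterozygosity, pooled
    if h_t == 0:
        return 0.0                          # monomorphic locus: no differentiation
    h_s = sum(2 * p * (1 - p) for p in freqs) / n  # mean within-population value
    return (h_t - h_s) / h_t


def top_fst_loci(locus_freqs, top_n):
    """Rank loci by F_ST and return the top_n names (a crude outlier screen)."""
    ranked = sorted(locus_freqs, key=lambda name: wright_fst(locus_freqs[name]), reverse=True)
    return ranked[:top_n]


# Illustrative, made-up frequencies in three populations.
example = {
    "cnv_a": [0.05, 0.04, 0.06],   # similar everywhere: low F_ST
    "cnv_b": [0.02, 0.90, 0.05],   # common in one population: high F_ST
    "cnv_c": [0.10, 0.20, 0.15],
}
print(top_fst_loci(example, top_n=1))
```

    A locus that is frequent in one population and rare in the others ranks highest, which is the pattern an outlier scan treats as a candidate for positive selection.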

  9. Comparative analysis of the folding dynamics and kinetics of an engineered knotted protein and its variants derived from HP0242 of Helicobacter pylori

    NASA Astrophysics Data System (ADS)

    Wang, Liang-Wei; Liu, Yu-Nan; Lyu, Ping-Chiang; Jackson, Sophie E.; Hsu, Shang-Te Danny

    2015-09-01

    Understanding the mechanism by which a polypeptide chain threads itself spontaneously to attain a knotted conformation has been a major challenge in the field of protein folding. HP0242 is a homodimeric protein from Helicobacter pylori whose intertwined helices form a unique pseudo-knotted folding topology. A tandem HP0242 repeat has been constructed to become the first engineered trefoil-knotted protein. Its small size renders it a model system for computational analyses to examine its folding and knotting pathways. Here we report a multi-parametric study on the folding stability and kinetics of a library of HP0242 variants, including the trefoil-knotted tandem HP0242 repeat, using far-UV circular dichroism and fluorescence spectroscopy. Equilibrium chemical denaturation of HP0242 variants shows the presence of highly populated dimeric and structurally heterogeneous folding intermediates. Such equilibrium folding intermediates retain a significant amount of helical structure except at the N- and C-terminal regions of the native structure. Stopped-flow fluorescence measurements of HP0242 variants show that spontaneous refolding into knotted structures can be achieved within seconds, which is several orders of magnitude faster than previously observed for other knotted proteins. Nevertheless, the complex chevron plots indicate that HP0242 variants are prone to misfold into kinetic traps, leading to severely rolled-over refolding arms. The experimental observations are in general agreement with the previously reported molecular dynamics simulations. Based on our results, kinetic folding pathways are proposed to qualitatively describe the complex folding processes of HP0242 variants.

  10. Genomic variants in an inbred mouse model predict mania-like behaviors.

    PubMed

    Saul, Michael C; Stevenson, Sharon A; Zhao, Changjiu; Driessen, Terri M; Eisinger, Brian E; Gammie, Stephen C

    2018-01-01

    Contemporary rodent models for bipolar disorders split the bipolar spectrum into complementary behavioral endophenotypes representing mania and depression. Widely accepted mania models typically utilize single gene transgenics or pharmacological manipulations, but inbred rodent strains show great potential as mania models. Their acceptance is often limited by the lack of genotypic data needed to establish construct validity. In this study, we used a unique strategy to inexpensively explore and confirm population allele differences in naturally occurring candidate variants in a manic rodent strain, the Madison (MSN) mouse strain. Variants were identified using whole exome resequencing on a small population of animals. Interesting candidate variants were confirmed in a larger population with genotyping. We enriched these results with observations of locomotor behavior from a previous study. Resequencing identified 447 structural variants that are mostly fixed in the MSN strain relative to control strains. After filtering and annotation, we found 11 non-synonymous MSN variants that we believe alter protein function. The allele frequencies for 6 of these variants were consistent with explanatory variants for the Madison strain's phenotype. The variants are in the Npas2, Cp, Polr3c, Smarca4, Trpv1, and Slc5a7 genes, and many of these genes' products are in pathways implicated in human bipolar disorders. Variants in Smarca4 and Polr3c together explained over 40% of the variance in locomotor behavior in the Hsd:ICR founder strain. These results enhance the MSN strain's construct validity and implicate altered nucleosome structure and transcriptional regulation as a chief molecular system underpinning behavior.

  11. Altools: a user friendly NGS data analyser.

    PubMed

    Camiolo, Salvatore; Sablok, Gaurav; Porceddu, Andrea

    2016-02-17

    Genotyping by re-sequencing has become a standard approach to estimate single nucleotide polymorphism (SNP) diversity, haplotype structure and the biodiversity and has been defined as an efficient approach to address geographical population genomics of several model species. To access core SNPs and insertion/deletion polymorphisms (indels), and to infer the phyletic patterns of speciation, most such approaches map short reads to the reference genome. Variant calling is important to establish patterns of genome-wide association studies (GWAS) for quantitative trait loci (QTLs), and to determine the population and haplotype structure based on SNPs, thus allowing content-dependent trait and evolutionary analysis. Several tools have been developed to investigate such polymorphisms as well as more complex genomic rearrangements such as copy number variations, presence/absence variations and large deletions. The programs available for this purpose have different strengths (e.g. accuracy, sensitivity and specificity) and weaknesses (e.g. low computation speed, complex installation procedure and absence of a user-friendly interface). Here we introduce Altools, a software package that is easy to install and use, which allows the precise detection of polymorphisms and structural variations. Altools uses the BWA/SAMtools/VarScan pipeline to call SNPs and indels, and the dnaCopy algorithm to achieve genome segmentation according to local coverage differences in order to identify copy number variations. It also uses insert size information from the alignment of paired-end reads and detects potential large deletions. A double mapping approach (BWA/BLASTn) identifies precise breakpoints while ensuring rapid elaboration. Finally, Altools implements several processes that yield deeper insight into the genes affected by the detected polymorphisms. 
Altools was used to analyse both simulated and real next-generation sequencing (NGS) data and performed satisfactorily in terms of positive predictive values, sensitivity, the identification of large deletion breakpoints and copy number detection. Altools is fast, reliable and easy to use for the mining of NGS data. The software package also attempts to link identified polymorphisms and structural variants to their biological functions thus providing more valuable information than similar tools.
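
    The abstract notes that insert size information from paired-end alignments is used to detect potential large deletions. A minimal sketch of that general idea follows. It is not Altools code: the function name, the thresholds and the (position, insert size) input format are all assumptions made for illustration. Read pairs whose insert size is far above the library's typical value are clustered by position, and each well-supported cluster is reported as a candidate deletion.

```python
from statistics import median

def candidate_deletions(pairs, k=6.0, min_support=3, max_gap=300):
    """Flag candidate large deletions from paired-end insert sizes.

    pairs: list of (leftmost_position, insert_size) tuples for read pairs
           on one chromosome (hypothetical input format).
    k, min_support, max_gap: illustrative thresholds, not taken from Altools.
    Returns a list of (start, end, n_supporting_pairs).
    """
    sizes = [size for _, size in pairs]
    med = median(sizes)
    # Median absolute deviation: a spread estimate that outliers barely move.
    mad = median(abs(size - med) for size in sizes) or 1.0
    cutoff = med + k * 1.4826 * mad

    # Keep only pairs that map much further apart than the library allows.
    discordant = sorted((pos, size) for pos, size in pairs if size > cutoff)

    # Group discordant pairs whose left ends lie close together.
    clusters, current = [], []
    for pos, size in discordant:
        if current and pos - current[-1][0] > max_gap:
            clusters.append(current)
            current = []
        current.append((pos, size))
    if current:
        clusters.append(current)

    calls = []
    for cluster in clusters:
        if len(cluster) >= min_support:
            start = min(pos for pos, _ in cluster)
            end = max(pos + size for pos, size in cluster)
            calls.append((start, end, len(cluster)))
    return calls
```

    The interval returned here is only approximate. According to the abstract, precise breakpoints come from a separate double mapping step (BWA/BLASTn), which this sketch does not attempt.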

  12. Proteomic characterization of histone variants in the mouse testis by mass spectrometry-based top-down analysis.

    PubMed

    Kwak, Ho-Geun; Dohmae, Naoshi

    2016-11-15

    Various histones, including testis-specific histones, exist during spermatogenesis, and some of them have been reported to play a key role in chromatin remodeling. Mass spectrometry (MS)-based characterization has become an important step in understanding histone structures. Although individual histones or partial histone variant groups have been characterized, a comprehensive analysis of histone variants has not yet been conducted in the mouse testis. Here, we present the comprehensive separation and characterization of histone variants from mouse testes by a top-down approach using MS. Histone variants were successfully separated on a reversed-phase column using high-performance liquid chromatography (HPLC) with an ion-pairing reagent. Increasing concentrations of testis-specific histones were observed in the mouse testis and some somatic histones increased in the epididymis. Specifically, the increase of mass abundance in H3.2 in the epididymis was inversely proportional to the decrease in H3t in the testis, which was approximately 80%. The top-down characterization of intact histone variants in the mouse testis was performed using LC-MS/MS. The masses of separated histone variants and their expected post-translational modifications were calculated by performing deconvolution with information taken from the database. TH2A, TH2B and H3t were characterized by MS/MS fragmentation. Our approach provides comprehensive knowledge for identification of histone variants in the mouse testis that will contribute to the structural and functional research of histone variants during spermatogenesis.

  13. Structural Variation of Type I-F CRISPR RNA Guided DNA Surveillance.

    PubMed

    Pausch, Patrick; Müller-Esparza, Hanna; Gleditzsch, Daniel; Altegoer, Florian; Randau, Lennart; Bange, Gert

    2017-08-17

    CRISPR-Cas systems are prokaryotic immune systems against invading nucleic acids. Type I CRISPR-Cas systems employ highly diverse, multi-subunit surveillance Cascade complexes that facilitate duplex formation between crRNA and complementary target DNA for R-loop formation, retention, and DNA degradation by the subsequently recruited nuclease Cas3. Typically, the large subunit recognizes bona fide targets through the PAM (protospacer adjacent motif), and the small subunit guides the non-target DNA strand. Here, we present the Apo- and target-DNA-bound structures of the I-Fv (type I-F variant) Cascade lacking the small and large subunits. Large and small subunits are functionally replaced by the 5' terminal crRNA cap Cas5fv and the backbone protein Cas7fv, respectively. Cas5fv facilitates PAM recognition from the DNA major groove site, in contrast to all other described type I systems. Comparison of the type I-Fv Cascade with an anti-CRISPR protein-bound I-F Cascade reveals that the type I-Fv structure differs substantially at known anti-CRISPR protein target sites and might therefore be resistant to viral Cascade interception. Copyright © 2017 Elsevier Inc. All rights reserved.

  14. Structure and dynamics of mesophilic variants from the homing endonuclease I-DmoI

    NASA Astrophysics Data System (ADS)

    Alba, Josephine; Marcaida, Maria Jose; Prieto, Jesus; Montoya, Guillermo; Molina, Rafael; D'Abramo, Marco

    2017-12-01

    I-DmoI, from the hyperthermophilic archaeon Desulfurococcus mobilis, belongs to the LAGLIDADG homing endonuclease protein family. Its members are highly specific enzymes capable of recognizing long DNA target sequences, thus providing potential tools for genome manipulation. Working towards this particular application, many efforts have been made to generate mesophilic variants of I-DmoI that function at lower temperatures than the wild-type. Here, we report a structural and computational analysis of two I-DmoI mesophilic mutants. Despite very limited structural variations between the crystal structures of these variants and the wild-type, a different dynamical behaviour near the cleavage sites is observed. In particular, both the dynamics of the water molecules and the protein perturbation effect on the cleavage site correlate well with the changes observed in the experimental enzymatic activity.

  15. Optional elements and variant structures in the productions of bei2 'to give' dative constructions in Cantonese-speaking adults and three-year-old children.

    PubMed

    Wong, Anita M-Y; Chow, Dorcas C-C; McBride-Cheng, Catherine; Stokes, Stephanie F

    2010-01-01

    To express object transfer, Cantonese-speakers use a 'ditransitive' ([V-R-T] or [V-T-R] where V=Verb, T=Theme, R=Recipient), or a more complex prepositional/serial-verb (P/SV) construction. Clausal elements in Cantonese datives can be optional (resulting in 'full' versus 'non-full' forms) or appear in variant orders (full non-canonical and full canonical). We report on usage of dative constructions with the word bei2 'to give' in 86 parents and 53 three-year-old children during conversations. The parents used more P/SV than ditransitive bei2-datives, and vice versa for the children. Both groups showed a similar usage pattern of optional elements and variant structures in their ditransitive and P/SV bei2-datives. The roles of multiple construction types, optional elements and variant structures in children's learning of bei2-dative constructions are described.

  16. Proteolysis of truncated hemolysin A yields a stable dimerization interface

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Novak, Walter R. P.; Bhattacharyya, Basudeb; Grilley, Daniel P.

    2017-02-21

    Wild-type and variant forms of HpmA265 (truncated hemolysin A) from Proteus mirabilis reveal a right-handed, parallel β-helix capped and flanked by segments of antiparallel β-strands. The low-salt crystal structures form a dimeric structure via the implementation of on-edge main-chain hydrogen bonds donated by residues 243–263 of adjacent monomers. Surprisingly, in the high-salt structures of two variants, Y134A and Q125A-Y134A, a new dimeric interface is formed via main-chain hydrogen bonds donated by residues 203–215 of adjacent monomers, and a previously unobserved tetramer is formed. In addition, an eight-stranded antiparallel β-sheet is formed from the flap regions of crystallographically related monomers in the high-salt structures. This new interface is possible owing to additional proteolysis of these variants after Tyr240. The interface formed in the high-salt crystal forms of hemolysin A variants may mimic the on-edge β-strand positioning used in template-assisted hemolytic activity.

  17. Large-Scale Gene-Centric Analysis Identifies Novel Variants for Coronary Artery Disease

    PubMed Central

    2011-01-01

    Coronary artery disease (CAD) has a significant genetic contribution that is incompletely characterized. To complement genome-wide association (GWA) studies, we conducted a large and systematic candidate gene study of CAD susceptibility, including analysis of many uncommon and functional variants. We examined 49,094 genetic variants in ∼2,100 genes of cardiovascular relevance, using a customised gene array in 15,596 CAD cases and 34,992 controls (11,202 cases and 30,733 controls of European descent; 4,394 cases and 4,259 controls of South Asian origin). We attempted to replicate putative novel associations in an additional 17,121 CAD cases and 40,473 controls. Potential mechanisms through which the novel variants could affect CAD risk were explored through association tests with vascular risk factors and gene expression. We confirmed associations of several previously known CAD susceptibility loci (e.g., 9p21.3: p<10^-33; LPA: p<10^-19; 1p13.3: p<10^-17) as well as three recently discovered loci (COL4A1/COL4A2, ZC3HC1, CYP17A1: p<5×10^-7). However, we found essentially null results for most previously suggested CAD candidate genes. In our replication study of 24 promising common variants, we identified novel associations of variants in or near LIPA, IL5, TRIB1, and ABCG5/ABCG8, with per-allele odds ratios for CAD risk with each of the novel variants ranging from 1.06–1.09. Associations with variants at LIPA, TRIB1, and ABCG5/ABCG8 were supported by gene expression data or effects on lipid levels. Apart from the previously reported variants in LPA, none of the other ∼4,500 low frequency and functional variants showed a strong effect. Associations in South Asians did not differ appreciably from those in Europeans, except for 9p21.3 (per-allele odds ratio: 1.14 versus 1.27 respectively; P for heterogeneity = 0.003).
This large-scale gene-centric analysis has identified several novel genes for CAD that relate to diverse biochemical and cellular functions and clarified the literature with regard to many previously suggested genes. PMID:21966275

  18. Functional Assays to Screen and Dissect Genomic Hits: Doubling Down on the National Investment in Genomic Research.

    PubMed

    Musunuru, Kiran; Bernstein, Daniel; Cole, F Sessions; Khokha, Mustafa K; Lee, Frank S; Lin, Shin; McDonald, Thomas V; Moskowitz, Ivan P; Quertermous, Thomas; Sankaran, Vijay G; Schwartz, David A; Silverman, Edwin K; Zhou, Xiaobo; Hasan, Ahmed A K; Luo, Xiao-Zhong James

    2018-04-01

    The National Institutes of Health have made substantial investments in genomic studies and technologies to identify DNA sequence variants associated with human disease phenotypes. The National Heart, Lung, and Blood Institute has been at the forefront of these commitments to ascertain genetic variation associated with heart, lung, blood, and sleep diseases and related clinical traits. Genome-wide association studies, exome- and genome-sequencing studies, and exome-genotyping studies of the National Heart, Lung, and Blood Institute-funded epidemiological and clinical case-control studies are identifying large numbers of genetic variants associated with heart, lung, blood, and sleep phenotypes. However, investigators face challenges in identification of genomic variants that are functionally disruptive among the myriad of computationally implicated variants. Studies to define mechanisms of genetic disruption encoded by computationally identified genomic variants require reproducible, adaptable, and inexpensive methods to screen candidate variant and gene function. High-throughput strategies will permit a tiered variant discovery and genetic mechanism approach that begins with rapid functional screening of a large number of computationally implicated variants and genes for discovery of those that merit mechanistic investigation. As such, improved variant-to-gene and gene-to-function screens, and adequate support for such studies, are critical to accelerating the translation of genomic findings. In this White Paper, we outline the variety of novel technologies, assays, and model systems that are making such screens faster, cheaper, and more accurate, referencing published work and ongoing work supported by the National Heart, Lung, and Blood Institute's R21/R33 Functional Assays to Screen Genomic Hits program. We discuss priorities that can accelerate the impressive but incomplete progress represented by big data genomic research.
© 2018 American Heart Association, Inc.

  19. Clinical Applications of Molecular Genetic Discoveries

    PubMed Central

    Marian, A.J.

    2015-01-01

    Genome-wide association studies (GWAS) of complex traits have mapped more than 15,000 common single nucleotide variants (SNVs). Likewise, applications of massively parallel nucleic acid sequencing technologies, often referred to as Next Generation Sequencing, to molecular genetic studies of complex traits have catalogued a large number of rare variants (population frequency of <0.01) in cases with complex traits. Moreover, high throughput nucleic acid sequencing, variant burden analysis, and linkage studies are illuminating the presence of a large number of SNVs in cases and families with single gene disorders. The plethora of genetic variants has exposed the formidable challenge of identifying the causal and pathogenic variants from the enormous number of innocuous common and rare variants that exist in the population as well as in an individual genome. The arduous task of identifying the causal and pathogenic variants is further compounded by the pleiotropic effects of the variants, complexity of cis and trans interactions in the genome, variability in phenotypic expression of the disease, as well as phenotypic plasticity, and the multifarious determinants of the phenotype. Population genetic studies offer the initial roadmaps and have the potential to elucidate novel pathways involved in the pathogenesis of the disease. However, the genome of an individual is unique, rendering unambiguous identification of the causal or pathogenic variant in a single individual exceedingly challenging. Yet, the focus of the practice of medicine is on the individual, as Sir William Osler elegantly expressed in his insightful quotation: “The good physician treats the disease; the great physician treats the patient who has the disease.” The daunting task facing physicians, patients, and researchers alike is to apply the modern genetic discoveries to care of the individual with or at risk of the disease. PMID:26548329

  20. Investigation of established genetic risk variants for glioma in prediagnostic samples from a population-based nested case-control study.

    PubMed

    Wibom, Carl; Späth, Florentin; Dahlin, Anna M; Langseth, Hilde; Hovig, Eivind; Rajaraman, Preetha; Johannesen, Tom Børge; Andersson, Ulrika; Melin, Beatrice

    2015-05-01

    Although glioma etiology is poorly understood in general, growing evidence indicates a genetic component. Four large genome-wide association studies (GWAS) have linked common genetic variants with an increased glioma risk. However, to date, these studies are based largely on a case-control design, where cases have been recruited at the time of or after diagnosis. They may therefore suffer from a degree of survival bias, introduced when rapidly fatal cases are not included. To confirm glioma risk variants in a prospective setting, we have analyzed 11 previously identified risk variants in a set of prediagnostic serum samples with 598 cases and 595 matched controls. Serum samples were acquired from The Janus Serum Bank, a Norwegian population-based biobank reserved for cancer research. We confirmed the association with glioma risk for variants within five genomic regions: 8q24.21 (CCDC26), 9p21.3 (CDKN2B-AS1), 11q23.3 (PHLDB1), 17p13.1 (TP53), and 20q13.33 (RTEL1). However, previously identified risk variants within the 7p11.2 (EGFR) region were not confirmed by this study. Our results indicate that the risk variants that were confirmed by this study are truly associated with glioma risk and may, consequently, affect gliomagenesis. Though the lack of positive confirmation of EGFR risk variants may be attributable to relatively limited statistical power, it nevertheless raises the question whether they truly are risk variants or markers for glioma prognosis. Our findings indicate the need for further studies to clarify the role of glioma risk loci with respect to prolonged survival versus etiology. ©2015 American Association for Cancer Research.

  1. "weil--das ist eben doch richtig so" Teaching Variant Types of "Weil"- and "Obwohl"-Structures in German

    ERIC Educational Resources Information Center

    Bendig, Ina; Betz, Emma; Huth, Thorsten

    2016-01-01

    Researchers have observed that in spoken German, the conjunctions "weil" and "obwohl" commonly occur with verb-second (V2) instead of verb-final (Vf) word order (Gaumann, 1983; Günthner, 1993, 1996; Uhmann, 1998). Current findings document that this syntactic variant of "weil/obwohl-structures" has an…

  2. Silencing of the Drosophila ortholog of SOX5 leads to abnormal neuronal development and behavioral impairment.

    PubMed

    Li, Airong; Hooli, Basavaraj; Mullin, Kristina; Tate, Rebecca E; Bubnys, Adele; Kirchner, Rory; Chapman, Brad; Hofmann, Oliver; Hide, Winston; Tanzi, Rudolph E

    2017-04-15

    SOX5 encodes a transcription factor that is expressed in multiple tissues including heart, lung and brain. Mutations in SOX5 have been previously found in patients with amyotrophic lateral sclerosis (ALS) and developmental delay, intellectual disability and dysmorphic features. To characterize the neuronal role of SOX5, we silenced the Drosophila ortholog of SOX5, Sox102F, by RNAi in various neuronal subtypes in Drosophila. Silencing of Sox102F led to misorientated and disorganized microchaetes, neurons with shorter dendritic arborization (DA) and reduced complexity, diminished larval peristaltic contractions, loss of neuromuscular junction bouton structures, impaired olfactory perception, and severe neurodegeneration in the brain. Silencing of SOX5 in human SH-SY5Y neuroblastoma cells resulted in a significant repression of WNT signaling activity and altered expression of WNT-related genes. Genetic association and meta-analyses of the results in several large family-based and case-control late-onset familial Alzheimer's disease (LOAD) samples of SOX5 variants revealed several variants that show significant association with AD status. In addition, analysis for rare and highly penetrant functional variants revealed four novel variants/mutations in SOX5, which, taken together with functional prediction analysis, suggest a strong role for SOX5 in causing AD in the carrier families. Collectively, these findings indicate that SOX5 is a novel candidate gene for LOAD with an important role in neuronal function. The genetic findings warrant further studies to identify and characterize SOX5 variants that confer risk for AD, ALS and intellectual disability. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  3. Loss-of-function nuclear factor κB subunit 1 (NFKB1) variants are the most common monogenic cause of common variable immunodeficiency in Europeans.

    PubMed

    Tuijnenburg, Paul; Lango Allen, Hana; Burns, Siobhan O; Greene, Daniel; Jansen, Machiel H; Staples, Emily; Stephens, Jonathan; Carss, Keren J; Biasci, Daniele; Baxendale, Helen; Thomas, Moira; Chandra, Anita; Kiani-Alikhan, Sorena; Longhurst, Hilary J; Seneviratne, Suranjith L; Oksenhendler, Eric; Simeoni, Ilenia; de Bree, Godelieve J; Tool, Anton T J; van Leeuwen, Ester M M; Ebberink, Eduard H T M; Meijer, Alexander B; Tuna, Salih; Whitehorn, Deborah; Brown, Matthew; Turro, Ernest; Thrasher, Adrian J; Smith, Kenneth G C; Thaventhiran, James E; Kuijpers, Taco W

    2018-03-02

    The genetic cause of primary immunodeficiency disease (PID) carries prognostic information. We conducted a whole-genome sequencing study assessing a large proportion of the NIHR BioResource-Rare Diseases cohort. In the predominantly European study population of principally sporadic unrelated PID cases (n = 846), a novel Bayesian method identified nuclear factor κB subunit 1 (NFKB1) as one of the genes most strongly associated with PID, and the association was explained by 16 novel heterozygous truncating, missense, and gene deletion variants. This accounted for 4% of common variable immunodeficiency (CVID) cases (n = 390) in the cohort. Amino acid substitutions predicted to be pathogenic were assessed by means of analysis of structural protein data. Immunophenotyping, immunoblotting, and ex vivo stimulation of lymphocytes determined the functional effects of these variants. Detailed clinical and pedigree information was collected for genotype-phenotype cosegregation analyses. Both sporadic and familial cases demonstrated evidence of the noninfective complications of CVID, including massive lymphadenopathy (24%), unexplained splenomegaly (48%), and autoimmune disease (48%), features prior studies correlated with worse clinical prognosis. Although partial penetrance of clinical symptoms was noted in certain pedigrees, all carriers have a deficiency in B-lymphocyte differentiation. Detailed assessment of B-lymphocyte numbers, phenotype, and function identifies the presence of an increased CD21 low B-cell population. Combined with identification of the disease-causing variant, this distinguishes between healthy subjects, asymptomatic carriers, and clinically affected cases. We show that heterozygous loss-of-function variants in NFKB1 are the most common known monogenic cause of CVID, which results in a temporally progressive defect in the formation of immunoglobulin-producing B cells. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.

  4. ATOH7 mutations cause autosomal recessive persistent hyperplasia of the primary vitreous

    PubMed Central

    Prasov, Lev; Masud, Tehmina; Khaliq, Shagufta; Mehdi, S. Qasim; Abid, Aiysha; Oliver, Edward R.; Silva, Eduardo D.; Lewanda, Amy; Brodsky, Michael C.; Borchert, Mark; Kelberman, Daniel; Sowden, Jane C.; Dattani, Mehul T.; Glaser, Tom

    2012-01-01

    The vertebrate basic helix–loop–helix (bHLH) transcription factor ATOH7 (Math5) is specifically expressed in the embryonic neural retina and is required for the genesis of retinal ganglion cells (RGCs) and optic nerves. In Atoh7 mutant mice, the absence of trophic factors secreted by RGCs prevents the development of the intrinsic retinal vasculature and the regression of fetal blood vessels, causing persistent hyperplasia of the primary vitreous (PHPV). We therefore screened patients with hereditary PHPV, as well as bilateral optic nerve aplasia (ONA) or hypoplasia (ONH), for mutations in ATOH7. We identified a homozygous ATOH7 mutation (N46H) in a large family with an autosomal recessive PHPV disease trait linked to 10q21, and a heterozygous variant (R65G, p.Arg65Gly) in one of five sporadic ONA patients. High-density single-nucleotide polymorphism analysis also revealed a CNTN4 duplication and an OTX2 deletion in the ONA cohort. Functional analysis of ATOH7 bHLH domain substitutions, by electrophoretic mobility shift and luciferase cotransfection assays, revealed that the N46H variant cannot bind DNA or activate transcription, consistent with structural modeling. The N46H variant also failed to rescue RGC development in mouse Atoh7−/− retinal explants. The R65G variant retains all of these activities, similar to wild-type human ATOH7. Our results strongly suggest that autosomal recessive persistent hyperplastic primary vitreous is caused by N46H and is etiologically related to nonsyndromic congenital retinal nonattachment. The R65G allele, however, cannot explain the ONA phenotype. Our study firmly establishes ATOH7 as a retinal disease gene and provides a functional basis to analyze new coding variants. PMID:22645276

  5. H3.3 demarcates GC-rich coding and subtelomeric regions and serves as potential memory mark for virulence gene expression in Plasmodium falciparum

    PubMed Central

    Fraschka, Sabine Anne-Kristin; Henderson, Rob Wilhelmus Maria; Bártfai, Richárd

    2016-01-01

    Histones, by packaging and organizing the DNA into chromatin, serve as essential building blocks for eukaryotic life. The basic structure of the chromatin is established by four canonical histones (H2A, H2B, H3 and H4), while histone variants are more commonly utilized to alter the properties of specific chromatin domains. H3.3, a variant of histone H3, was found to have diverse localization patterns and functions across species but has been rather poorly studied in protists. Here we present the first genome-wide analysis of H3.3 in the malaria-causing, apicomplexan parasite, P. falciparum, which revealed a complex occupancy profile consisting of conserved and parasite-specific features. In contrast to other histone variants, PfH3.3 primarily demarcates euchromatic coding and subtelomeric repetitive sequences. Stable occupancy of PfH3.3 in these regions is largely uncoupled from the transcriptional activity and appears to be primarily dependent on the GC-content of the underlying DNA. Importantly, PfH3.3 specifically marks the promoter region of an active and poised, but not inactive antigenic variation (var) gene, thereby potentially contributing to immune evasion. Collectively, our data suggest that PfH3.3, together with other histone variants, indexes the P. falciparum genome to functionally distinct domains and contribute to a key survival strategy of this deadly pathogen. PMID:27555062

  6. Biochemistry of Microbial Degradation of Hexachlorocyclohexane and Prospects for Bioremediation

    PubMed Central

    Lal, Rup; Pandey, Gunjan; Sharma, Pooja; Kumari, Kirti; Malhotra, Shweta; Pandey, Rinku; Raina, Vishakha; Kohler, Hans-Peter E.; Holliger, Christof; Jackson, Colin; Oakeshott, John G.

    2010-01-01

    Summary: Lindane, the γ-isomer of hexachlorocyclohexane (HCH), is a potent insecticide. Purified lindane or unpurified mixtures of this and α-, β-, and δ-isomers of HCH were widely used as commercial insecticides in the last half of the 20th century. Large dumps of unused HCH isomers now constitute a major hazard because of their long residence times in soil and high nontarget toxicities. The major pathway for the aerobic degradation of HCH isomers in soil is the Lin pathway, and variants of this pathway will degrade all four of the HCH isomers although only slowly. Sequence differences in the primary LinA and LinB enzymes in the pathway play a key role in determining their ability to degrade the different isomers. LinA is a dehydrochlorinase, but little is known of its biochemistry. LinB is a hydrolytic dechlorinase that has been heterologously expressed and crystallized, and there is some understanding of the sequence-structure-function relationships underlying its substrate specificity and kinetics, although there are also some significant anomalies. The kinetics of some LinB variants are reported to be slow even for their preferred isomers. It is important to develop a better understanding of the biochemistries of the LinA and LinB variants and to use that knowledge to build better variants, because field trials of some bioremediation strategies based on the Lin pathway have yielded promising results but would not yet achieve economic levels of remediation. PMID:20197499

  7. Accurate and fast multiple-testing correction in eQTL studies.

    PubMed

    Sul, Jae Hoon; Raj, Towfique; de Jong, Simone; de Bakker, Paul I W; Raychaudhuri, Soumya; Ophoff, Roel A; Stranger, Barbara E; Eskin, Eleazar; Han, Buhm

    2015-06-04

    In studies of expression quantitative trait loci (eQTLs), it is of increasing interest to identify eGenes, the genes whose expression levels are associated with variation at a particular genetic variant. Detecting eGenes is important for follow-up analyses and prioritization because genes are the main entities in biological processes. To detect eGenes, one typically focuses on the genetic variant with the minimum p value among all variants in cis with a gene and corrects for multiple testing to obtain a gene-level p value. For performing multiple-testing correction, a permutation test is widely used. Because of growing sample sizes of eQTL studies, however, the permutation test has become a computational bottleneck in eQTL studies. In this paper, we propose an efficient approach for correcting for multiple testing and assess eGene p values by utilizing a multivariate normal distribution. Our approach properly takes into account the linkage-disequilibrium structure among variants, and its time complexity is independent of sample size. By applying our small-sample correction techniques, our method achieves high accuracy in both small and large studies. We have shown that our method consistently produces extremely accurate p values (accuracy > 98%) for three human eQTL datasets with different sample sizes and SNP densities: the Genotype-Tissue Expression pilot dataset, the multi-region brain dataset, and the HapMap 3 dataset. Copyright © 2015 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
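
    For context, the permutation test that the abstract describes as a computational bottleneck can be sketched as follows. This is the baseline procedure, not the authors' multivariate normal method, and the function names and defaults are hypothetical. The gene-level statistic is the largest absolute correlation between expression and any cis variant, which for a fixed sample size corresponds to the minimum per-variant p value. Its null distribution is obtained by shuffling the expression values while leaving the genotypes untouched, so the linkage-disequilibrium structure among variants is preserved.

```python
import random

def pearson_r(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if undefined)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / (sxx * syy) ** 0.5

def gene_level_p(expression, genotypes, n_perm=999, seed=0):
    """Permutation-based gene-level p value.

    expression: one expression value per sample.
    genotypes:  list of variants, each a list of allele counts per sample.
    n_perm, seed: illustrative defaults, not taken from the paper.
    """
    rng = random.Random(seed)
    observed = max(abs(pearson_r(g, expression)) for g in genotypes)
    shuffled = list(expression)
    hits = 0
    for _ in range(n_perm):
        # Shuffling expression breaks any genotype-expression link
        # but keeps the correlation among variants intact.
        rng.shuffle(shuffled)
        null_stat = max(abs(pearson_r(g, shuffled)) for g in genotypes)
        if null_stat >= observed:
            hits += 1
    # Add-one correction so the estimate is never exactly zero.
    return (hits + 1) / (n_perm + 1)
```

    Each call costs roughly n_perm × variants × samples operations, which is why the run time grows with sample size. The abstract states that the proposed approach avoids this dependence.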

  8. The functional spectrum of low-frequency coding variation.

    PubMed

    Marth, Gabor T; Yu, Fuli; Indap, Amit R; Garimella, Kiran; Gravel, Simon; Leong, Wen Fung; Tyler-Smith, Chris; Bainbridge, Matthew; Blackwell, Tom; Zheng-Bradley, Xiangqun; Chen, Yuan; Challis, Danny; Clarke, Laura; Ball, Edward V; Cibulskis, Kristian; Cooper, David N; Fulton, Bob; Hartl, Chris; Koboldt, Dan; Muzny, Donna; Smith, Richard; Sougnez, Carrie; Stewart, Chip; Ward, Alistair; Yu, Jin; Xue, Yali; Altshuler, David; Bustamante, Carlos D; Clark, Andrew G; Daly, Mark; DePristo, Mark; Flicek, Paul; Gabriel, Stacey; Mardis, Elaine; Palotie, Aarno; Gibbs, Richard

    2011-09-14

    Rare coding variants constitute an important class of human genetic variation, but are underrepresented in current databases that are based on small population samples. Recent studies show that variants altering amino acid sequence and protein function are enriched at low variant allele frequency, 2 to 5%, but because of insufficient sample size it is not clear if the same trend holds for rare variants below 1% allele frequency. The 1000 Genomes Exon Pilot Project has collected deep-coverage exon-capture data in roughly 1,000 human genes, for nearly 700 samples. Although medical whole-exome projects are currently afoot, this is still the deepest reported sampling of a large number of human genes with next-generation technologies. According to the goals of the 1000 Genomes Project, we created effective informatics pipelines to process and analyze the data, and discovered 12,758 exonic SNPs, 70% of them novel, and 74% below 1% allele frequency in the seven population samples we examined. Our analysis confirms that coding variants below 1% allele frequency show increased population-specificity and are enriched for functional variants. This study represents a large step toward detecting and interpreting low frequency coding variation, clearly lays out technical steps for effective analysis of DNA capture data, and articulates functional and population properties of this important class of genetic variation.

  9. Investigation of rare and low-frequency variants using high-throughput sequencing with pooled DNA samples

    PubMed Central

    Wang, Jingwen; Skoog, Tiina; Einarsdottir, Elisabet; Kaartokallio, Tea; Laivuori, Hannele; Grauers, Anna; Gerdhem, Paul; Hytönen, Marjo; Lohi, Hannes; Kere, Juha; Jiao, Hong

    2016-01-01

    High-throughput sequencing using pooled DNA samples can facilitate genome-wide studies on rare and low-frequency variants in a large population. Some major questions concerning the pooling sequencing strategy are whether rare and low-frequency variants can be detected reliably, and whether estimated minor allele frequencies (MAFs) can represent the actual values obtained from individually genotyped samples. In this study, we evaluated MAF estimates using three variant detection tools with two sets of pooled whole exome sequencing (WES) and one set of pooled whole genome sequencing (WGS) data. Both GATK and Freebayes displayed high sensitivity, specificity and accuracy when detecting rare or low-frequency variants. For the WGS study, 56% of the low-frequency variants in Illumina array have identical MAFs and 26% have one allele difference between sequencing and individual genotyping data. The MAF estimates from WGS correlated well (r = 0.94) with those from Illumina arrays. The MAFs from the pooled WES data also showed high concordance (r = 0.88) with those from the individual genotyping data. In conclusion, the MAFs estimated from pooled DNA sequencing data reflect the MAFs in individually genotyped samples well. The pooling strategy can thus be a rapid and cost-effective approach for the initial screening in large-scale association studies. PMID:27633116
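
    The comparison described above can be illustrated with a short sketch. The function names are hypothetical, and reading "one allele difference" as a difference of one in the estimated allele count among the 2N chromosomes in the pool is an assumption, not something the abstract spells out. The pooled MAF is taken as the fraction of reads carrying the alternate allele, and the reference MAF is computed from individual genotypes.

```python
def pooled_maf(alt_reads, total_reads):
    """Minor allele frequency estimated from read counts in a DNA pool."""
    freq = alt_reads / total_reads
    return min(freq, 1 - freq)

def genotype_maf(genotypes):
    """Minor allele frequency from individual genotypes.

    genotypes: alternate-allele count (0, 1 or 2) for each diploid individual.
    """
    freq = sum(genotypes) / (2 * len(genotypes))
    return min(freq, 1 - freq)

def allele_count_difference(alt_reads, total_reads, genotypes):
    """Difference between the pooled and genotyped allele counts.

    The pooled read fraction is scaled to the 2N chromosomes in the pool
    and rounded to a whole number of alleles (an assumed convention).
    """
    n_chromosomes = 2 * len(genotypes)
    estimated = round(alt_reads / total_reads * n_chromosomes)
    return abs(estimated - sum(genotypes))
```

    With 20 pooled individuals, two of them heterozygous, and 26 alternate reads out of 400, the pooled estimate is 0.065 against a genotyped MAF of 0.05, which corresponds to a difference of one allele under this convention.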

  10. Multiplexed resequencing analysis to identify rare variants in pooled DNA with barcode indexing using next-generation sequencer.

    PubMed

    Mitsui, Jun; Fukuda, Yoko; Azuma, Kyo; Tozaki, Hirokazu; Ishiura, Hiroyuki; Takahashi, Yuji; Goto, Jun; Tsuji, Shoji

    2010-07-01

    We have recently found that multiple rare variants of the glucocerebrosidase gene (GBA) confer a robust risk for Parkinson disease, supporting the 'common disease-multiple rare variants' hypothesis. To develop an efficient method of identifying rare variants in a large number of samples, we applied multiplexed resequencing using a next-generation sequencer to identification of rare variants of GBA. Sixteen sets of pooled DNAs from six pooled DNA samples were prepared. Each set of pooled DNAs was subjected to polymerase chain reaction to amplify the target gene (GBA) covering 6.5 kb, pooled into one tube with barcode indexing, and then subjected to extensive sequence analysis using the SOLiD System. Individual samples were also subjected to direct nucleotide sequence analysis. With the optimization of data processing, we were able to extract all the variants from 96 samples with acceptable rates of false-positive single-nucleotide variants.

  11. Genetic Influences on the Development of Alcoholism

    PubMed Central

    Enoch, Mary-Anne

    2014-01-01

    Alcoholism has a substantial heritability yet the detection of specific genetic influences has largely proved elusive. The strongest findings are with genes encoding alcohol metabolizing enzymes. A few candidate genes such as GABRA2 have shown robust associations with alcoholism. Moreover, it has become apparent that variants in stress-related genes such as CRHR1, may only confer risk in individuals exposed to trauma, particularly in early life. Over the past decade there have been tremendous advances in large scale SNP genotyping technologies allowing for genome-wide associations studies (GWAS). As a result, it is now recognized that genetic risk for alcoholism is likely to be due to common variants in very many genes, each of small effect, although rare variants with large effects might also play a role. This has resulted in a paradigm shift away from gene centric studies towards analyses of gene interactions and gene networks within biologically relevant pathways. PMID:24091936

  12. Genetic influences on the development of alcoholism.

    PubMed

    Enoch, Mary-Anne

    2013-11-01

    Alcoholism has a substantial heritability yet the detection of specific genetic influences has largely proved elusive. The strongest findings are with genes encoding alcohol metabolizing enzymes. A few candidate genes such as GABRA2 have shown robust associations with alcoholism. Moreover, it has become apparent that variants in stress-related genes such as CRHR1, may only confer risk in individuals exposed to trauma, particularly in early life. Over the past decade there have been tremendous advances in large scale SNP genotyping technologies allowing for genome-wide associations studies (GWAS). As a result, it is now recognized that genetic risk for alcoholism is likely to be due to common variants in very many genes, each of small effect, although rare variants with large effects might also play a role. This has resulted in a paradigm shift away from gene centric studies toward analyses of gene interactions and gene networks within biologically relevant pathways.

  13. Magnetic Protostars

    NASA Astrophysics Data System (ADS)

    Glagolevskij, Yu. V.

    2015-09-01

    A possible variant of the evolution of magnetic protostars "before the Hayashi phase" is discussed. Arguments are given in support of the following major properties of magnetic stars: (1) global magnetic dipole fields with predominant orientation of the magnetic lines of force in the plane of the equator of revolution; (2) slow rotation; (3) complex, two and three dipole structures of the magnetic field in a large part of the stars; (4) partition of stars into magnetic and normal in a proportion of 1:10 occurs during the period when the protostellar clouds undergo gravitational collapse "before the Hayashi phase."

  14. Quadratic integrand double-hybrid made spin-component-scaled

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Brémond, Éric, E-mail: eric.bremond@iit.it; Savarese, Marika; Sancho-García, Juan C.

    2016-03-28

    We propose two analytical expressions aiming to rationalize the spin-component-scaled (SCS) and spin-opposite-scaled (SOS) schemes for double-hybrid exchange-correlation density-functionals. Their performances are extensively tested within the framework of the nonempirical quadratic integrand double-hybrid (QIDH) model on energetic properties included into the very large GMTKN30 benchmark database, and on structural properties of semirigid medium-sized organic compounds. The SOS variant is revealed as a less computationally demanding alternative to reach the accuracy of the original QIDH model without losing any theoretical background.

  15. Refining the accuracy of validated target identification through coding variant fine-mapping in type 2 diabetes.

    PubMed

    Mahajan, Anubha; Wessel, Jennifer; Willems, Sara M; Zhao, Wei; Robertson, Neil R; Chu, Audrey Y; Gan, Wei; Kitajima, Hidetoshi; Taliun, Daniel; Rayner, N William; Guo, Xiuqing; Lu, Yingchang; Li, Man; Jensen, Richard A; Hu, Yao; Huo, Shaofeng; Lohman, Kurt K; Zhang, Weihua; Cook, James P; Prins, Bram Peter; Flannick, Jason; Grarup, Niels; Trubetskoy, Vassily Vladimirovich; Kravic, Jasmina; Kim, Young Jin; Rybin, Denis V; Yaghootkar, Hanieh; Müller-Nurasyid, Martina; Meidtner, Karina; Li-Gao, Ruifang; Varga, Tibor V; Marten, Jonathan; Li, Jin; Smith, Albert Vernon; An, Ping; Ligthart, Symen; Gustafsson, Stefan; Malerba, Giovanni; Demirkan, Ayse; Tajes, Juan Fernandez; Steinthorsdottir, Valgerdur; Wuttke, Matthias; Lecoeur, Cécile; Preuss, Michael; Bielak, Lawrence F; Graff, Marielisa; Highland, Heather M; Justice, Anne E; Liu, Dajiang J; Marouli, Eirini; Peloso, Gina Marie; Warren, Helen R; Afaq, Saima; Afzal, Shoaib; Ahlqvist, Emma; Almgren, Peter; Amin, Najaf; Bang, Lia B; Bertoni, Alain G; Bombieri, Cristina; Bork-Jensen, Jette; Brandslund, Ivan; Brody, Jennifer A; Burtt, Noël P; Canouil, Mickaël; Chen, Yii-Der Ida; Cho, Yoon Shin; Christensen, Cramer; Eastwood, Sophie V; Eckardt, Kai-Uwe; Fischer, Krista; Gambaro, Giovanni; Giedraitis, Vilmantas; Grove, Megan L; de Haan, Hugoline G; Hackinger, Sophie; Hai, Yang; Han, Sohee; Tybjærg-Hansen, Anne; Hivert, Marie-France; Isomaa, Bo; Jäger, Susanne; Jørgensen, Marit E; Jørgensen, Torben; Käräjämäki, Annemari; Kim, Bong-Jo; Kim, Sung Soo; Koistinen, Heikki A; Kovacs, Peter; Kriebel, Jennifer; Kronenberg, Florian; Läll, Kristi; Lange, Leslie A; Lee, Jung-Jin; Lehne, Benjamin; Li, Huaixing; Lin, Keng-Hung; Linneberg, Allan; Liu, Ching-Ti; Liu, Jun; Loh, Marie; Mägi, Reedik; Mamakou, Vasiliki; McKean-Cowdin, Roberta; Nadkarni, Girish; Neville, Matt; Nielsen, Sune F; Ntalla, Ioanna; Peyser, Patricia A; Rathmann, Wolfgang; Rice, Kenneth; Rich, Stephen S; Rode, Line; Rolandsson, Olov; Schönherr, Sebastian; Selvin, 
Elizabeth; Small, Kerrin S; Stančáková, Alena; Surendran, Praveen; Taylor, Kent D; Teslovich, Tanya M; Thorand, Barbara; Thorleifsson, Gudmar; Tin, Adrienne; Tönjes, Anke; Varbo, Anette; Witte, Daniel R; Wood, Andrew R; Yajnik, Pranav; Yao, Jie; Yengo, Loïc; Young, Robin; Amouyel, Philippe; Boeing, Heiner; Boerwinkle, Eric; Bottinger, Erwin P; Chowdhury, Rajiv; Collins, Francis S; Dedoussis, George; Dehghan, Abbas; Deloukas, Panos; Ferrario, Marco M; Ferrières, Jean; Florez, Jose C; Frossard, Philippe; Gudnason, Vilmundur; Harris, Tamara B; Heckbert, Susan R; Howson, Joanna M M; Ingelsson, Martin; Kathiresan, Sekar; Kee, Frank; Kuusisto, Johanna; Langenberg, Claudia; Launer, Lenore J; Lindgren, Cecilia M; Männistö, Satu; Meitinger, Thomas; Melander, Olle; Mohlke, Karen L; Moitry, Marie; Morris, Andrew D; Murray, Alison D; de Mutsert, Renée; Orho-Melander, Marju; Owen, Katharine R; Perola, Markus; Peters, Annette; Province, Michael A; Rasheed, Asif; Ridker, Paul M; Rivadineira, Fernando; Rosendaal, Frits R; Rosengren, Anders H; Salomaa, Veikko; Sheu, Wayne H-H; Sladek, Rob; Smith, Blair H; Strauch, Konstantin; Uitterlinden, André G; Varma, Rohit; Willer, Cristen J; Blüher, Matthias; Butterworth, Adam S; Chambers, John Campbell; Chasman, Daniel I; Danesh, John; van Duijn, Cornelia; Dupuis, Josée; Franco, Oscar H; Franks, Paul W; Froguel, Philippe; Grallert, Harald; Groop, Leif; Han, Bok-Ghee; Hansen, Torben; Hattersley, Andrew T; Hayward, Caroline; Ingelsson, Erik; Kardia, Sharon L R; Karpe, Fredrik; Kooner, Jaspal Singh; Köttgen, Anna; Kuulasmaa, Kari; Laakso, Markku; Lin, Xu; Lind, Lars; Liu, Yongmei; Loos, Ruth J F; Marchini, Jonathan; Metspalu, Andres; Mook-Kanamori, Dennis; Nordestgaard, Børge G; Palmer, Colin N A; Pankow, James S; Pedersen, Oluf; Psaty, Bruce M; Rauramaa, Rainer; Sattar, Naveed; Schulze, Matthias B; Soranzo, Nicole; Spector, Timothy D; Stefansson, Kari; Stumvoll, Michael; Thorsteinsdottir, Unnur; Tuomi, Tiinamaija; Tuomilehto, Jaakko; Wareham, 
Nicholas J; Wilson, James G; Zeggini, Eleftheria; Scott, Robert A; Barroso, Inês; Frayling, Timothy M; Goodarzi, Mark O; Meigs, James B; Boehnke, Michael; Saleheen, Danish; Morris, Andrew P; Rotter, Jerome I; McCarthy, Mark I

    2018-04-01

    We aggregated coding variant data for 81,412 type 2 diabetes cases and 370,832 controls of diverse ancestry, identifying 40 coding variant association signals (P < 2.2 × 10⁻⁷); of these, 16 map outside known risk-associated loci. We make two important observations. First, only five of these signals are driven by low-frequency variants: even for these, effect sizes are modest (odds ratio ≤1.29). Second, when we used large-scale genome-wide association data to fine-map the associated variants in their regional context, accounting for the global enrichment of complex trait associations in coding sequence, compelling evidence for coding variant causality was obtained for only 16 signals. At 13 others, the associated coding variants clearly represent 'false leads' with potential to generate erroneous mechanistic inference. Coding variant associations offer a direct route to biological insight for complex diseases and identification of validated therapeutic targets; however, appropriate mechanistic inference requires careful specification of their causal contribution to disease predisposition.

  16. Pathogenic Germline Variants in 10,389 Adult Cancers.

    PubMed

    Huang, Kuan-Lin; Mashl, R Jay; Wu, Yige; Ritter, Deborah I; Wang, Jiayin; Oh, Clara; Paczkowska, Marta; Reynolds, Sheila; Wyczalkowski, Matthew A; Oak, Ninad; Scott, Adam D; Krassowski, Michal; Cherniack, Andrew D; Houlahan, Kathleen E; Jayasinghe, Reyka; Wang, Liang-Bo; Zhou, Daniel Cui; Liu, Di; Cao, Song; Kim, Young Won; Koire, Amanda; McMichael, Joshua F; Hucthagowder, Vishwanathan; Kim, Tae-Beom; Hahn, Abigail; Wang, Chen; McLellan, Michael D; Al-Mulla, Fahd; Johnson, Kimberly J; Lichtarge, Olivier; Boutros, Paul C; Raphael, Benjamin; Lazar, Alexander J; Zhang, Wei; Wendl, Michael C; Govindan, Ramaswamy; Jain, Sanjay; Wheeler, David; Kulkarni, Shashikant; Dipersio, John F; Reimand, Jüri; Meric-Bernstam, Funda; Chen, Ken; Shmulevich, Ilya; Plon, Sharon E; Chen, Feng; Ding, Li

    2018-04-05

    We conducted the largest investigation of predisposition variants in cancer to date, discovering 853 pathogenic or likely pathogenic variants in 8% of 10,389 cases from 33 cancer types. Twenty-one genes showed single or cross-cancer associations, including novel associations of SDHA in melanoma and PALB2 in stomach adenocarcinoma. The 659 predisposition variants and 18 additional large deletions in tumor suppressors, including ATM, BRCA1, and NF1, showed low gene expression and frequent (43%) loss of heterozygosity or biallelic two-hit events. We also discovered 33 such variants in oncogenes, including missenses in MET, RET, and PTPN11 associated with high gene expression. We nominated 47 additional predisposition variants from prioritized VUSs supported by multiple evidences involving case-control frequency, loss of heterozygosity, expression effect, and co-localization with mutations and modified residues. Our integrative approach links rare predisposition variants to functional consequences, informing future guidelines of variant classification and germline genetic testing in cancer. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.

  17. Extracavitary/solid variant of primary effusion lymphoma presenting as a gastric mass.

    PubMed

    Liao, Guanghong; Cai, Junchao; Yue, Changjun; Qing, Xin

    2015-12-01

    Primary effusion lymphoma (PEL) is a rare subtype of large B-cell lymphoma associated with human herpesvirus 8 (HHV8). It has the highest incidence in HIV-positive individuals. It often presents as a malignant pleural, peritoneal and/or pericardial effusion without a detectable solid mass. Most cases are co-infected with Epstein-Barr virus (EBV). Rare cases of HHV8-positive lymphoma with features similar to PEL can present as tumor masses and are considered to represent an extracavitary or solid variant of PEL. We report a case of EBV negative, extracavitary/solid variant of primary effusion lymphoma presenting as a gastric mass. A 48-year-old man was admitted to an outside hospital with abdominal pain and weight loss. At the outside hospital, he was found to be HIV positive and have a 3 × 2 cm gastric mass. He was subsequently diagnosed with ALK negative anaplastic large cell lymphoma by gastric biopsy. The patient was referred to Harbor-UCLA Medical Center for further management. Review of the outside slides and additional stains performed at our hospital revealed sheets of large anaplastic lymphoma cells that were positive for CD30, CD138, MUM1 and HHV8, focally weakly positive for CD3, and negative for other T- and B-cell markers and EBER, consistent with extracavitary/solid variant of primary effusion lymphoma. Interestingly, for the first time, cyclin D1 positivity was also demonstrated in PEL. Primary effusion lymphoma, particularly the extracavitary/solid variant, is very rare, and the diagnosis can be challenging. In some cases, when CD30 is uniformly positive, this lymphoma can be misdiagnosed as ALK negative anaplastic large cell lymphoma. This lymphoma can also aberrantly express T-cell markers as seen in this case, making diagnosis even more difficult. Awareness of the existence and the features of solid variant PEL and assessment for HHV8 infection are essential for correct diagnosis. Published by Elsevier Inc.

  18. A Genome-Wide Linkage Study for Chronic Obstructive Pulmonary Disease in a Dutch Genetic Isolate Identifies Novel Rare Candidate Variants.

    PubMed

    Nedeljkovic, Ivana; Terzikhan, Natalie; Vonk, Judith M; van der Plaat, Diana A; Lahousse, Lies; van Diemen, Cleo C; Hobbs, Brian D; Qiao, Dandi; Cho, Michael H; Brusselle, Guy G; Postma, Dirkje S; Boezen, H M; van Duijn, Cornelia M; Amin, Najaf

    2018-01-01

    Chronic obstructive pulmonary disease (COPD) is a complex and heritable disease, associated with multiple genetic variants. Specific familial types of COPD may be explained by rare variants, which have not been widely studied. We aimed to discover rare genetic variants underlying COPD through a genome-wide linkage scan. Affected-only analysis was performed using the 6K Illumina Linkage IV Panel in 142 cases clustered in 27 families from a genetic isolate, the Erasmus Rucphen Family (ERF) study. Potential causal variants were identified by searching for shared rare variants in the exome-sequence data of the affected members of the families contributing most to the linkage peak. The identified rare variants were then tested for association with COPD in a large meta-analysis of several cohorts. Significant evidence for linkage was observed on chromosomes 15q14-15q25 [logarithm of the odds (LOD) score = 5.52], 11p15.4-11q14.1 (LOD = 3.71) and 5q14.3-5q33.2 (LOD = 3.49). In the chromosome 15 peak, that harbors the known COPD locus for nicotinic receptors, and in the chromosome 5 peak we could not identify shared variants. In the chromosome 11 locus, we identified four rare (minor allele frequency (MAF) <0.02), predicted pathogenic, missense variants. These were shared among the affected family members. The identified variants localize to genes including neuroblast differentiation-associated protein (AHNAK), previously associated with blood biomarkers in COPD, phospholipase C Beta 3 (PLCB3), shown to increase airway hyper-responsiveness, solute carrier family 22-A11 (SLC22A11), involved in amino acid metabolism and ion transport, and metallothionein-like protein 5 (MTL5), involved in nicotinate and nicotinamide metabolism. Association of SLC22A11 and MTL5 variants were confirmed in the meta-analysis of 9,888 cases and 27,060 controls. In conclusion, we have identified novel rare variants in plausible genes related to COPD. Further studies utilizing large sample whole-genome sequencing should further confirm the associations at chromosome 11 and investigate the chromosome 15 and 5 linked regions.

  19. Polymerization-Defective Fibrinogen Variant gammaD364A Binds Knob “A” Peptide Mimic

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Bowley, S.; Merenbloom, B.; Heroux, A.

    2008-01-01

    Fibrin polymerization is supported in part by interactions called 'A:a'. Crystallographic studies revealed γ364Asp is part of hole 'a' that interacts with knob 'A' peptide mimic, GPRP. Biochemical studies have shown γ364Asp is critical to polymerization, as polymerization of variants γD364A, γD364H, and γD364V is exceptionally impaired. To understand the molecular basis for the aberrant function, we solved the crystal structure of fragment D from γD364A. Surprisingly, the structure (rfD-γD364A+GP) showed near normal 'A:a' interactions with GPRP bound to hole 'a' and no change in the overall structure of γD364A. Of note, inspection of the structure showed negative electrostatic potential inside hole 'a' was diminished by this substitution. We examined GPRP binding to the γ364Asp variants in solution by plasmin protection assay. We found no protection of either γD364H or γD364V but partial protection of γD364A, indicating the peptide does not bind to either γD364H or γD364V and binds more weakly than normal to γD364A. We also examined protection by calcium and found all variants were indistinguishable from normal, suggesting the global structures of the variants are not markedly different from normal. Our data imply that γ364Asp per se is not required for knob 'A' binding to hole 'a'; rather, this residue's negative charge has a critical role in the electrostatic interactions that facilitate the important first step in fibrin polymerization.

  20. A Multi-scale Computational Platform to Mechanistically Assess the Effect of Genetic Variation on Drug Responses in Human Erythrocyte Metabolism

    PubMed Central

    Bordbar, Aarash; Palsson, Bernhard O.

    2016-01-01

    Progress in systems medicine brings promise to addressing patient heterogeneity and individualized therapies. Recently, genome-scale models of metabolism have been shown to provide insight into the mechanistic link between drug therapies and systems-level off-target effects while being expanded to explicitly include the three-dimensional structure of proteins. The integration of these molecular-level details, such as the physical, structural, and dynamical properties of proteins, notably expands the computational description of biochemical network-level properties and the possibility of understanding and predicting whole cell phenotypes. In this study, we present a multi-scale modeling framework that describes biological processes which range in scale from atomistic details to an entire metabolic network. Using this approach, we can understand how genetic variation, which impacts the structure and reactivity of a protein, influences both native and drug-induced metabolic states. As a proof-of-concept, we study three enzymes (catechol-O-methyltransferase, glucose-6-phosphate dehydrogenase, and glyceraldehyde-3-phosphate dehydrogenase) and their respective genetic variants which have clinically relevant associations. Using all-atom molecular dynamic simulations enables the sampling of long timescale conformational dynamics of the proteins (and their mutant variants) in complex with their respective native metabolites or drug molecules. We find that changes in a protein’s structure due to a mutation influences protein binding affinity to metabolites and/or drug molecules, and inflicts large-scale changes in metabolism. PMID:27467583

  1. A Multi-scale Computational Platform to Mechanistically Assess the Effect of Genetic Variation on Drug Responses in Human Erythrocyte Metabolism.

    PubMed

    Mih, Nathan; Brunk, Elizabeth; Bordbar, Aarash; Palsson, Bernhard O

    2016-07-01

    Progress in systems medicine brings promise to addressing patient heterogeneity and individualized therapies. Recently, genome-scale models of metabolism have been shown to provide insight into the mechanistic link between drug therapies and systems-level off-target effects while being expanded to explicitly include the three-dimensional structure of proteins. The integration of these molecular-level details, such as the physical, structural, and dynamical properties of proteins, notably expands the computational description of biochemical network-level properties and the possibility of understanding and predicting whole cell phenotypes. In this study, we present a multi-scale modeling framework that describes biological processes which range in scale from atomistic details to an entire metabolic network. Using this approach, we can understand how genetic variation, which impacts the structure and reactivity of a protein, influences both native and drug-induced metabolic states. As a proof-of-concept, we study three enzymes (catechol-O-methyltransferase, glucose-6-phosphate dehydrogenase, and glyceraldehyde-3-phosphate dehydrogenase) and their respective genetic variants which have clinically relevant associations. Using all-atom molecular dynamic simulations enables the sampling of long timescale conformational dynamics of the proteins (and their mutant variants) in complex with their respective native metabolites or drug molecules. We find that changes in a protein's structure due to a mutation influences protein binding affinity to metabolites and/or drug molecules, and inflicts large-scale changes in metabolism.

  2. Structure of the DBL3X-DBL4ε region of the VAR2CSA placental malaria vaccine candidate: insight into DBL domain interactions

    PubMed Central

    Gangnard, Stéphane; Lewit-Bentley, Anita; Dechavanne, Sébastien; Srivastava, Anand; Amirat, Faroudja; Bentley, Graham A.; Gamain, Benoît

    2015-01-01

    The human malaria parasite, Plasmodium falciparum, is able to evade spleen-mediated clearing from blood stream by sequestering in peripheral organs. This is due to the adhesive properties conferred by the P. falciparum Erythrocyte Membrane Protein 1 (PfEMP1) family exported by the parasite to the surface of infected erythrocytes. Expression of the VAR2CSA variant of PfEMP1 leads to pregnancy-associated malaria, which occurs when infected erythrocytes massively sequester in the placenta by binding to low-sulfated Chondroitin Sulfate A (CSA) present in the intervillous spaces. VAR2CSA is a 350 kDa protein that carries six Duffy-Binding Like (DBL) domains, one Cysteine-rich Inter-Domain Regions (CIDR) and several inter-domain regions. In the present paper, we report for the first time the crystal structure at 2.9 Å of a VAR2CSA double domain, DBL3X-DBL4ε, from the FCR3 strain. DBL3X and DBL4ε share a large contact interface formed by residues that are invariant or highly conserved in VAR2CSA variants, which suggests that these two central DBL domains (DBL3X-DBL4ε) contribute significantly to the structuring of the functional VAR2CSA extracellular region. We have also examined the antigenicity of peptides corresponding to exposed loop regions of the DBL4ε structure. PMID:26450557

  3. A hetero-micro-seeding strategy for readily crystallizing closely related protein variants.

    PubMed

    Islam, Mohammad M; Kuroda, Yutaka

    2017-11-04

    Protein crystallization remains difficult to rationalize and screening for optimal crystallization conditions is a tedious and time consuming procedure. Here, we report a hetero-micro-seeding strategy for producing high resolution crystals of closely related protein variants, where micro crystals from a readily crystallized variant are used as seeds to develop crystals of other variants less amenable to crystallization. We applied this strategy to Bovine Pancreatic Trypsin Inhibitor (BPTI) variants, which would not crystallize using standard crystallization practice. Out of six variants in our analysis, only one called BPTI-[5,55]A14G formed well behaving crystals; and the remaining five (A14GA38G, A14GA38V, A14GA38L, A14GA38I, and A14GA38K) could be crystallized only using micro-seeds from the BPTI-[5,55]A14G crystal. All hetero-seeded crystals diffracted at high resolution with minimum mosaicity, retaining the same space group and cell dimension. Moreover, hetero-micro-seeding did not introduce any biases into the mutant's structure toward the seed structure, as demonstrated by A14GA38I structures solved using micro-seeds from A14GA38G, A14GA38L and A14GA38I. Though hetero-micro-seeding is a simple and almost naïve strategy, this is the first direct demonstration of its workability. We believe that hetero-micro-seeding, which is contrasting with the popular idea that crystallization requires highly purified proteins, could contribute a new tool for rapidly solving protein structures in mutational analysis studies. Copyright © 2017 Elsevier Inc. All rights reserved.

  4. Using high-resolution variant frequencies to empower clinical genome interpretation.

    PubMed

    Whiffin, Nicola; Minikel, Eric; Walsh, Roddy; O'Donnell-Luria, Anne H; Karczewski, Konrad; Ing, Alexander Y; Barton, Paul J R; Funke, Birgit; Cook, Stuart A; MacArthur, Daniel; Ware, James S

    2017-10-01

    Purpose: Whole-exome and whole-genome sequencing have transformed the discovery of genetic variants that cause human Mendelian disease, but discriminating pathogenic from benign variants remains a daunting challenge. Rarity is recognized as a necessary, although not sufficient, criterion for pathogenicity, but frequency cutoffs used in Mendelian analysis are often arbitrary and overly lenient. Recent very large reference datasets, such as the Exome Aggregation Consortium (ExAC), provide an unprecedented opportunity to obtain robust frequency estimates even for very rare variants. Methods: We present a statistical framework for the frequency-based filtering of candidate disease-causing variants, accounting for disease prevalence, genetic and allelic heterogeneity, inheritance mode, penetrance, and sampling variance in reference datasets. Results: Using the example of cardiomyopathy, we show that our approach reduces by two-thirds the number of candidate variants under consideration in the average exome, without removing true pathogenic variants (false-positive rate < 0.001). Conclusion: We outline a statistically robust framework for assessing whether a variant is "too common" to be causative for a Mendelian disorder of interest. We present precomputed allele frequency cutoffs for all variants in the ExAC dataset.
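
    A frequency-based filter of the kind this abstract describes might be sketched as below. The abstract names the inputs (prevalence, genetic and allelic heterogeneity, inheritance mode, penetrance, sampling variance) but not the formulas, so the dominant-inheritance expression and the Poisson bound on allele count used here are assumptions made for illustration, and every numeric value is hypothetical.

```python
import math


def max_credible_af(prevalence: float, genetic_contribution: float,
                    allelic_contribution: float, penetrance: float) -> float:
    """Largest population allele frequency compatible with a dominant disorder.

    Assumed model: the share of cases explained by one variant, divided by
    penetrance, gives the carrier fraction; each carrier holds the variant on
    one of two chromosomes.
    """
    if not 0 < penetrance <= 1:
        raise ValueError("penetrance must be in (0, 1]")
    carriers = prevalence * genetic_contribution * allelic_contribution / penetrance
    return carriers / 2


def max_allele_count(af: float, n_chromosomes: int, confidence: float = 0.95) -> int:
    """Smallest count k with Poisson CDF(k) >= confidence, for mean af * n_chromosomes."""
    if not 0 < confidence < 1:
        raise ValueError("confidence must be strictly between 0 and 1")
    lam = af * n_chromosomes
    if lam > 700:
        raise ValueError("expected count too large for this simple summation")
    k = 0
    term = math.exp(-lam)      # P(X = 0)
    cumulative = term
    while cumulative < confidence:
        k += 1
        term *= lam / k        # P(X = k) from P(X = k - 1)
        cumulative += term
    return k


def too_common(observed_count: int, af: float, n_chromosomes: int) -> bool:
    """Flag a variant seen more often in the reference set than the bound allows."""
    return observed_count > max_allele_count(af, n_chromosomes)


if __name__ == "__main__":
    # Hypothetical dominant disorder: prevalence 1 in 500, one variant explaining
    # at most 2% of cases, penetrance 0.5, reference set of 120,000 chromosomes.
    af = max_credible_af(1 / 500, 1.0, 0.02, 0.5)
    print(af, max_allele_count(af, 120_000))
```

    Working with an allele-count bound rather than comparing frequencies directly is one way to allow for sampling variance in the reference dataset; other choices of bound are possible.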

  5. Structural Basis for the Altered PAM Recognition by Engineered CRISPR-Cpf1.

    PubMed

    Nishimasu, Hiroshi; Yamano, Takashi; Gao, Linyi; Zhang, Feng; Ishitani, Ryuichiro; Nureki, Osamu

    2017-07-06

    The RNA-guided Cpf1 nuclease cleaves double-stranded DNA targets complementary to the CRISPR RNA (crRNA), and it has been harnessed for genome editing technologies. Recently, Acidaminococcus sp. BV3L6 (AsCpf1) was engineered to recognize altered DNA sequences as the protospacer adjacent motif (PAM), thereby expanding the target range of Cpf1-mediated genome editing. Whereas wild-type AsCpf1 recognizes the TTTV PAM, the RVR (S542R/K548V/N552R) and RR (S542R/K607R) variants can efficiently recognize the TATV and TYCV PAMs, respectively. However, their PAM recognition mechanisms remained unknown. Here we present the 2.0 Å resolution crystal structures of the RVR and RR variants bound to a crRNA and its target DNA. The structures revealed that the RVR and RR variants primarily recognize the PAM-complementary nucleotides via the substituted residues. Our high-resolution structures delineated the altered PAM recognition mechanisms of the AsCpf1 variants, providing a basis for the further engineering of CRISPR-Cpf1. Copyright © 2017 Elsevier Inc. All rights reserved.

  6. Alteration of the α1β2/α2β1 subunit interface contributes to the increased hemoglobin-oxygen affinity of high-altitude deer mice

    PubMed Central

    Inoguchi, Noriko; Mizuno, Nobuhiro; Baba, Seiki; Kumasaka, Takashi; Natarajan, Chandrasekhar; Storz, Jay F.

    2017-01-01

    Background: Deer mice (Peromyscus maniculatus) that are native to high altitudes in the Rocky Mountains have evolved hemoglobins with an increased oxygen-binding affinity relative to those of lowland conspecifics. To elucidate the molecular mechanisms responsible for the evolved increase in hemoglobin-oxygen affinity, the crystal structure of the highland hemoglobin variant was solved and compared with the previously reported structure for the lowland variant. Results: Highland hemoglobin yielded at least two crystal types, in which the longest axes were 507 and 230 Å. Using the smaller unit cell crystal, the structure was solved at 2.2 Å resolution. The asymmetric unit contained two tetrameric hemoglobin molecules. Conclusions: The analyses revealed that αPro50 in the highland hemoglobin variant promoted a stable interaction between αHis45 and heme that was not seen in the αHis50 lowland variant. The αPro50 mutation also altered the nature of atomic contacts at the α1β2/α2β1 intersubunit interfaces. These results demonstrate how affinity-altering changes in intersubunit interactions can be produced by mutations at structurally remote sites. PMID:28362841

  7. Alteration of the α1β2/α2β1 subunit interface contributes to the increased hemoglobin-oxygen affinity of high-altitude deer mice

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Inoguchi, Noriko; Mizuno, Nobuhiro; Baba, Seiki

    2017-03-31

    Deer mice (Peromyscus maniculatus) that are native to high altitudes in the Rocky Mountains have evolved hemoglobins with an increased oxygen-binding affinity relative to those of lowland conspecifics. To elucidate the molecular mechanisms responsible for the evolved increase in hemoglobin-oxygen affinity, the crystal structure of the highland hemoglobin variant was solved and compared with the previously reported structure for the lowland variant. Highland hemoglobin yielded at least two crystal types, in which the longest axes were 507 and 230 Å. Using the smaller unit cell crystal, the structure was solved at 2.2 Å resolution. The asymmetric unit contained two tetrameric hemoglobin molecules. The analyses revealed that αPro50 in the highland hemoglobin variant promoted a stable interaction between αHis45 and heme that was not seen in the αHis50 lowland variant. The αPro50 mutation also altered the nature of atomic contacts at the α1β2/α2β1 intersubunit interfaces. These results demonstrate how affinity-altering changes in intersubunit interactions can be produced by mutations at structurally remote sites.

  8. Focal temporal pole atrophy and network degeneration in semantic variant primary progressive aphasia

    PubMed Central

    Collins, Jessica A; Montal, Victor; Hochberg, Daisy; Quimby, Megan; Mandelli, Maria Luisa; Makris, Nikos; Seeley, William W; Gorno-Tempini, Maria Luisa; Dickerson, Bradford C

    2017-01-01

    A wealth of neuroimaging research has associated semantic variant primary progressive aphasia with distributed cortical atrophy that is most prominent in the left anterior temporal cortex; however, there is little consensus regarding which region within the anterior temporal cortex is most prominently damaged, which may indicate the putative origin of neurodegeneration. In this study, we localized the most prominent and consistent region of atrophy in semantic variant primary progressive aphasia using cortical thickness analysis in two independent patient samples (n = 16 and 28, respectively) relative to age-matched controls (n = 30). Across both samples the point of maximal atrophy was located in the same region of the left temporal pole. This same region was the point of maximal atrophy in 100% of individual patients in both semantic variant primary progressive aphasia samples. Using resting state functional connectivity in healthy young adults (n = 89), we showed that the seed region derived from the semantic variant primary progressive aphasia analysis was strongly connected with a large-scale network that closely resembled the distributed atrophy pattern in semantic variant primary progressive aphasia. In both patient samples, the magnitude of atrophy within a brain region was predicted by that region’s strength of functional connectivity to the temporopolar seed region in healthy adults. These findings suggest that cortical atrophy in semantic variant primary progressive aphasia may follow connectional pathways within a large-scale network that converges on the temporal pole. PMID:28040670

  9. SORL1 variants across Alzheimer's disease European American cohorts.

    PubMed

    Fernández, Maria Victoria; Black, Kathleen; Carrell, David; Saef, Ben; Budde, John; Deming, Yuetiva; Howells, Bill; Del-Aguila, Jorge L; Ma, Shengmei; Bi, Catherine; Norton, Joanne; Chasse, Rachel; Morris, John; Goate, Alison; Cruchaga, Carlos

    2016-12-01

    The accumulation of the toxic Aβ peptide in Alzheimer's disease (AD) largely relies upon an efficient recycling of amyloid precursor protein (APP). Recent genetic association studies have described rare variants in SORL1 with putative pathogenic consequences in the recycling of APP. In this work, we examine the presence of rare coding variants in SORL1 in three different European American cohorts: early-onset, late-onset AD (LOAD) and familial LOAD.

  10. Structural variants in genes associated with human Williams-Beuren syndrome underlie stereotypical hypersociability in domestic dogs.

    PubMed

    vonHoldt, Bridgett M; Shuldiner, Emily; Koch, Ilana Janowitz; Kartzinel, Rebecca Y; Hogan, Andrew; Brubaker, Lauren; Wanser, Shelby; Stahler, Daniel; Wynne, Clive D L; Ostrander, Elaine A; Sinsheimer, Janet S; Udell, Monique A R

    2017-07-01

    Although considerable progress has been made in understanding the genetic basis of morphologic traits (for example, body size and coat color) in dogs and wolves, the genetic basis of their behavioral divergence is poorly understood. An integrative approach using both behavioral and genetic data is required to understand the molecular underpinnings of the various behavioral characteristics associated with domestication. We analyze a 5-Mb genomic region on chromosome 6 previously found to be under positive selection in domestic dog breeds. Deletion of this region in humans is linked to Williams-Beuren syndrome (WBS), a multisystem congenital disorder characterized by hypersocial behavior. We associate quantitative data on behavioral phenotypes symptomatic of WBS in humans with structural changes in the WBS locus in dogs. We find that hypersociability, a central feature of WBS, is also a core element of domestication that distinguishes dogs from wolves. We provide evidence that structural variants in GTF2I and GTF2IRD1, genes previously implicated in the behavioral phenotype of patients with WBS and contained within the WBS locus, contribute to extreme sociability in dogs. This finding suggests that there are commonalities in the genetic architecture of WBS and canine tameness and that directional selection may have targeted a unique set of linked behavioral genes of large phenotypic effect, allowing for rapid behavioral divergence of dogs and wolves, facilitating coexistence with humans.

  11. The curation of genetic variants: difficulties and possible solutions.

    PubMed

    Pandey, Kapil Raj; Maden, Narendra; Poudel, Barsha; Pradhananga, Sailendra; Sharma, Amit Kumar

    2012-12-01

    The curation of genetic variants from biomedical articles is required for various clinical and research purposes. Nowadays, establishment of variant databases that include overall information about variants is becoming quite popular. These databases have immense utility, serving as a user-friendly information storehouse of variants for information seekers. While manual curation is the gold standard method for curation of variants, it can turn out to be time-consuming on a large scale, thus necessitating automation. Curation of variants described in biomedical literature may not be straightforward, mainly due to various nomenclature and expression issues. Though current trends in paper writing on variants are inclined toward the standard nomenclature such that variants can easily be retrieved, we have a massive store of variants in the literature that are present as non-standard names, and the online search engines that are predominantly used may not be capable of finding them. For effective curation of variants, knowledge about the overall process of curation, the nature and types of difficulties in curation, and ways to tackle the difficulties during the task are crucial. Only by effective curation can variants be correctly interpreted. This paper presents the process and difficulties of curation of genetic variants with possible solutions and suggestions from our work experience in the field, including literature support. The paper also highlights aspects of interpretation of genetic variants and the importance of writing papers on variants following standard and retrievable methods.

  12. Negligible impact of rare autoimmune-locus coding-region variants on missing heritability.

    PubMed

    Hunt, Karen A; Mistry, Vanisha; Bockett, Nicholas A; Ahmad, Tariq; Ban, Maria; Barker, Jonathan N; Barrett, Jeffrey C; Blackburn, Hannah; Brand, Oliver; Burren, Oliver; Capon, Francesca; Compston, Alastair; Gough, Stephen C L; Jostins, Luke; Kong, Yong; Lee, James C; Lek, Monkol; MacArthur, Daniel G; Mansfield, John C; Mathew, Christopher G; Mein, Charles A; Mirza, Muddassar; Nutland, Sarah; Onengut-Gumuscu, Suna; Papouli, Efterpi; Parkes, Miles; Rich, Stephen S; Sawcer, Steven; Satsangi, Jack; Simmonds, Matthew J; Trembath, Richard C; Walker, Neil M; Wozniak, Eva; Todd, John A; Simpson, Michael A; Plagnol, Vincent; van Heel, David A

    2013-06-13

    Genome-wide association studies (GWAS) have identified common variants of modest-effect size at hundreds of loci for common autoimmune diseases; however, a substantial fraction of heritability remains unexplained, to which rare variants may contribute. To discover rare variants and test them for association with a phenotype, most studies re-sequence a small initial sample size and then genotype the discovered variants in a larger sample set. This approach fails to analyse a large fraction of the rare variants present in the entire sample set. Here we perform simultaneous amplicon-sequencing-based variant discovery and genotyping for coding exons of 25 GWAS risk genes in 41,911 UK residents of white European origin, comprising 24,892 subjects with six autoimmune disease phenotypes and 17,019 controls, and show that rare coding-region variants at known loci have a negligible role in common autoimmune disease susceptibility. These results do not support the rare-variant synthetic genome-wide-association hypothesis (in which unobserved rare causal variants lead to association detected at common tag variants). Many known autoimmune disease risk loci contain multiple, independently associated, common and low-frequency variants, and so genes at these loci are a priori stronger candidates for harbouring rare coding-region variants than other genes. Our data indicate that the missing heritability for common autoimmune diseases may not be attributable to the rare coding-region variant portion of the allelic spectrum, but perhaps, as others have proposed, may be a result of many common-variant loci of weak effect.

  13. The Curation of Genetic Variants: Difficulties and Possible Solutions

    PubMed Central

    Pandey, Kapil Raj; Maden, Narendra; Poudel, Barsha; Pradhananga, Sailendra; Sharma, Amit Kumar

    2012-01-01

    The curation of genetic variants from biomedical articles is required for various clinical and research purposes. Nowadays, establishment of variant databases that include overall information about variants is becoming quite popular. These databases have immense utility, serving as a user-friendly information storehouse of variants for information seekers. While manual curation is the gold standard method for curation of variants, it can turn out to be time-consuming on a large scale, thus necessitating automation. Curation of variants described in biomedical literature may not be straightforward, mainly due to various nomenclature and expression issues. Though current trends in paper writing on variants are inclined toward the standard nomenclature such that variants can easily be retrieved, we have a massive store of variants in the literature that are present as non-standard names, and the online search engines that are predominantly used may not be capable of finding them. For effective curation of variants, knowledge about the overall process of curation, the nature and types of difficulties in curation, and ways to tackle the difficulties during the task are crucial. Only by effective curation can variants be correctly interpreted. This paper presents the process and difficulties of curation of genetic variants with possible solutions and suggestions from our work experience in the field, including literature support. The paper also highlights aspects of interpretation of genetic variants and the importance of writing papers on variants following standard and retrievable methods. PMID:23317699

  14. Structure–activity correlations of variant forms of the B pentamer of Escherichia coli type II heat-labile enterotoxin LT-IIb with Toll-like receptor 2 binding

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Cody, Vivian, E-mail: cody@hwi.buffalo.edu; University at Buffalo, 700 Ellicott Street, Buffalo, NY 14203; Pace, Jim

    2012-12-01

    Structural data for the S74D variant of the pentameric B subunit of type II heat-labile enterotoxin of Escherichia coli reveal a smaller pore opening that may explain its reduced Toll-like receptor binding affinity compared to that of the wild type enterotoxin. The explanation for the enhanced Toll-like receptor binding affinity of the S74A variant is more complex than simply being attributed to the pore opening. The pentameric B subunit of the type II heat-labile enterotoxin of Escherichia coli (LT-IIb-B₅) is a potent signaling molecule capable of modulating innate immune responses. It has previously been shown that LT-IIb-B₅, but not the LT-IIb-B₅ Ser74Asp variant [LT-IIb-B₅(S74D)], activates Toll-like receptor (TLR2) signaling in macrophages. Consistent with this, the LT-IIb-B₅(S74D) variant failed to bind TLR2, in contrast to LT-IIb-B₅ and the LT-IIb-B₅ Thr13Ile [LT-IIb-B₅(T13I)] and LT-IIb-B₅ Ser74Ala [LT-IIb-B₅(S74A)] variants, which displayed the highest binding activity to TLR2. Crystal structures of the Ser74Asp, Ser74Ala and Thr13Ile variants of LT-IIb-B₅ have been determined to 1.90, 1.40 and 1.90 Å resolution, respectively. The structural data for the Ser74Asp variant reveal that the carboxylate side chain points into the pore, thereby reducing the pore size compared with that of the wild-type or the Ser74Ala variant B pentamer. On the basis of these crystallographic data, the reduced TLR2-binding affinity of the LT-IIb-B₅(S74D) variant may be the result of the pore of the pentamer being closed. On the other hand, the explanation for the enhanced TLR2-binding activity of the LT-IIb-B₅(S74A) variant is more complex as its activity is greater than that of the wild-type B pentamer, which also has an open pore as the Ser74 side chain points away from the pore opening. Data for the LT-IIb-B₅(T13I) variant show that four of the five variant side chains point to the outside surface of the pentamer and one residue points inside. These data are consistent with the lack of binding of the LT-IIb-B₅(T13I) variant to GD1a ganglioside.

  15. Korean Variant Archive (KOVA): a reference database of genetic variations in the Korean population.

    PubMed

    Lee, Sangmoon; Seo, Jihae; Park, Jinman; Nam, Jae-Yong; Choi, Ahyoung; Ignatius, Jason S; Bjornson, Robert D; Chae, Jong-Hee; Jang, In-Jin; Lee, Sanghyuk; Park, Woong-Yang; Baek, Daehyun; Choi, Murim

    2017-06-27

    Despite efforts to interrogate human genome variation through large-scale databases, systematic preference toward populations of Caucasian descendants has resulted in unintended reduction of power in studying non-Caucasians. Here we report a compilation of coding variants from 1,055 healthy Korean individuals (KOVA; Korean Variant Archive). The samples were sequenced to a mean depth of 75x, yielding 101 singleton variants per individual. Population genetics analysis demonstrates that the Korean population is a distinct ethnic group comparable to other discrete ethnic groups in Africa and Europe, providing a rationale for such independent genomic datasets. Indeed, KOVA conferred 22.8% increased variant filtering power in addition to Exome Aggregation Consortium (ExAC) when used on Korean exomes. Functional assessment of nonsynonymous variants supported the presence of purifying selection in Koreans. Analysis of copy number variants detected 5.2 deletions and 10.3 amplifications per individual with an increased fraction of novel variants among smaller and rarer copy number variable segments. We also report a list of germline variants that are associated with increased tumor susceptibility. This catalog can function as a critical addition to the pre-existing variant databases in pursuing genetic studies of Korean individuals.

  16. Rare and Common Variants Conferring Risk of Tooth Agenesis.

    PubMed

    Jonsson, L; Magnusson, T E; Thordarson, A; Jonsson, T; Geller, F; Feenstra, B; Melbye, M; Nohr, E A; Vucic, S; Dhamo, B; Rivadeneira, F; Ongkosuwito, E M; Wolvius, E B; Leslie, E J; Marazita, M L; Howe, B J; Moreno Uribe, L M; Alonso, I; Santos, M; Pinho, T; Jonsson, R; Audolfsson, G; Gudmundsson, L; Nawaz, M S; Olafsson, S; Gustafsson, O; Ingason, A; Unnsteinsdottir, U; Bjornsdottir, G; Walters, G B; Zervas, M; Oddsson, A; Gudbjartsson, D F; Steinberg, S; Stefansson, H; Stefansson, K

    2018-05-01

    We present association results from a large genome-wide association study of tooth agenesis (TA) as well as selective TA, including 1,944 subjects with congenitally missing teeth, excluding third molars, and 338,554 controls, all of European ancestry. We also tested the association of previously identified risk variants, for timing of tooth eruption and orofacial clefts, with TA. We report associations between TA and 9 novel risk variants. Five of these variants associate with selective TA, including a variant conferring risk of orofacial clefts. These results contribute to a deeper understanding of the genetic architecture of tooth development and disease. The few variants previously associated with TA were uncovered through candidate gene studies guided by mouse knockouts. Knowing the etiology and clinical features of TA is important for planning oral rehabilitation that often involves an interdisciplinary approach.

  17. The impact of rare variation on gene expression across tissues.

    PubMed

    Li, Xin; Kim, Yungil; Tsang, Emily K; Davis, Joe R; Damani, Farhan N; Chiang, Colby; Hess, Gaelen T; Zappala, Zachary; Strober, Benjamin J; Scott, Alexandra J; Li, Amy; Ganna, Andrea; Bassik, Michael C; Merker, Jason D; Hall, Ira M; Battle, Alexis; Montgomery, Stephen B

    2017-10-11

    Rare genetic variants are abundant in humans and are expected to contribute to individual disease risk. While genetic association studies have successfully identified common genetic variants associated with susceptibility, these studies are not practical for identifying rare variants. Efforts to distinguish pathogenic variants from benign rare variants have leveraged the genetic code to identify deleterious protein-coding alleles, but no analogous code exists for non-coding variants. Therefore, ascertaining which rare variants have phenotypic effects remains a major challenge. Rare non-coding variants have been associated with extreme gene expression in studies using single tissues, but their effects across tissues are unknown. Here we identify gene expression outliers, or individuals showing extreme expression levels for a particular gene, across 44 human tissues by using combined analyses of whole genomes and multi-tissue RNA-sequencing data from the Genotype-Tissue Expression (GTEx) project v6p release. We find that 58% of underexpression and 28% of overexpression outliers have nearby conserved rare variants compared to 8% of non-outliers. Additionally, we developed RIVER (RNA-informed variant effect on regulation), a Bayesian statistical model that incorporates expression data to predict a regulatory effect for rare variants with higher accuracy than models using genomic annotations alone. Overall, we demonstrate that rare variants contribute to large gene expression changes across tissues and provide an integrative method for interpretation of rare variants in individual genomes.
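    The idea of an expression outlier described above can be illustrated with a small sketch. It assumes a plain Z-score across individuals for a single gene in a single tissue and a hypothetical cutoff of |Z| ≥ 2; the abstract does not state the statistic or threshold actually used, and the sample names and values below are invented for illustration.

```python
from statistics import mean, stdev


def expression_outliers(expression, threshold=2.0):
    """Return {individual: z_score} for individuals whose expression of one
    gene deviates from the cohort mean by at least `threshold` standard
    deviations. `threshold` is an illustrative assumption, not a value
    taken from the abstract."""
    values = list(expression.values())
    if len(values) < 2:
        return {}  # a standard deviation needs at least two samples
    mu = mean(values)
    sd = stdev(values)
    if sd == 0:
        return {}  # no variation, so no outliers
    z_scores = {ind: (val - mu) / sd for ind, val in expression.items()}
    return {ind: z for ind, z in z_scores.items() if abs(z) >= threshold}


# Hypothetical toy data: nine individuals near baseline, one overexpressing.
toy_expression = {f"ind{i}": 10.0 for i in range(1, 10)}
toy_expression["ind10"] = 30.0

outliers = expression_outliers(toy_expression)
print(sorted(outliers))
```

    A negative Z-score in the returned mapping would correspond to underexpression and a positive one to overexpression; a multi-tissue analysis would need to combine such scores across tissues, which this sketch does not attempt.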

  18. The glycan structure of albumin Redhill, a glycosylated variant of human serum albumin.

    PubMed

    Kragh-Hansen, U; Donaldson, D; Jensen, P H

    2001-11-26

    Although human serum albumin is synthesized without carbohydrate, glycosylated variants of the protein can be found. We have determined the structure of the glycan bound to the double-mutant albumin Redhill (-1 Arg, 320 Ala-->Thr). The oligosaccharide was released from the protein using anhydrous hydrazine, and its structure was investigated using neuraminidase and a reagent array analysis method, which is based on the use of specific exoglycosidases. The glycan was shown to be a disialylated biantennary complex type oligosaccharide N-linked to 318 Asn. However, a minor part (11 mol%) of the glycan was without sialic acid. The structure is principally the same as that of glycans bound to two other types of glycosylated albumin variants. Glycosylation can affect, for example, the fatty acid binding properties of albumin. Taking the present information into account, it is apparent that different effects on binding are caused not by different glycan structures but by different locations of attachment, with the possible addition of local conformational changes in the protein molecule.

  19. Using ClinVar as a Resource to Support Variant Interpretations

    PubMed Central

    Harrison, Steven M.; Riggs, Erin R.; Maglott, Donna R.; Lee, Jennifer M.; Azzariti, Danielle R.; Niehaus, Annie; Ramos, Erin M.; Martin, Christa L.; Landrum, Melissa J.; Rehm, Heidi L.

    2016-01-01

    ClinVar is a freely accessible, public archive of reports of the relationships among genomic variants and phenotypes. To facilitate evaluation of the clinical significance of each variant, ClinVar aggregates submissions of the same variant, displays supporting data from each submission, and determines if the submitted clinical interpretations are conflicting or concordant. The unit describes how to (1) identify sequence and structural variants of interest in ClinVar by multiple searching approaches, including Variation Viewer and (2) understand the display of submissions to ClinVar and the evidence supporting each interpretation. By following this protocol, ClinVar users will be able to learn how to incorporate the wealth of resources and knowledge in ClinVar into variant curation and interpretation. PMID:27037489

  20. A hybrid computational strategy to address WGS variant analysis in >5000 samples.

    PubMed

    Huang, Zhuoyi; Rustagi, Navin; Veeraraghavan, Narayanan; Carroll, Andrew; Gibbs, Richard; Boerwinkle, Eric; Venkata, Manjunath Gorentla; Yu, Fuli

    2016-09-10

    The decreasing costs of sequencing are driving the need for cost effective and real time variant calling of whole genome sequencing data. The scale of these projects is far beyond the capacity of typical computing resources available to most research labs. Other infrastructures like the cloud AWS environment and supercomputers also have limitations due to which large scale joint variant calling becomes infeasible, and infrastructure specific variant calling strategies either fail to scale up to large datasets or abandon joint calling strategies. We present a high throughput framework including multiple variant callers for single nucleotide variant (SNV) calling, which leverages hybrid computing infrastructure consisting of cloud AWS, supercomputers and local high performance computing infrastructures. We present a novel binning approach for large scale joint variant calling and imputation which can scale up to over 10,000 samples while producing SNV callsets with high sensitivity and specificity. As a proof of principle, we present results of analysis on Cohorts for Heart And Aging Research in Genomic Epidemiology (CHARGE) WGS freeze 3 dataset in which joint calling, imputation and phasing of over 5300 whole genome samples was produced in under 6 weeks using four state-of-the-art callers. The callers used were SNPTools, GATK-HaplotypeCaller, GATK-UnifiedGenotyper and GotCloud. We used Amazon AWS, a 4000-core in-house cluster at Baylor College of Medicine, IBM power PC Blue BioU at Rice and Rhea at Oak Ridge National Laboratory (ORNL) for the computation. AWS was used for joint calling of 180 TB of BAM files, and ORNL and Rice supercomputers were used for the imputation and phasing step. All other steps were carried out on the local compute cluster. The entire operation used 5.2 million core hours and only transferred a total of 6 TB of data across the platforms. Even with increasing sizes of whole genome datasets, ensemble joint calling of SNVs for low coverage data can be accomplished in a scalable, cost effective and fast manner by using heterogeneous computing platforms without compromising on the quality of variants.
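    The abstract does not spell out its binning scheme, so the following is only a minimal sketch of one common way to partition a genome into independent work units for joint calling: fixed-size, non-overlapping coordinate windows. The function name, the bin size and the toy contig lengths are all hypothetical and are not taken from the paper.

```python
def make_bins(contig_lengths, bin_size):
    """Split each contig into consecutive, non-overlapping 1-based closed
    intervals of at most `bin_size` bases. Each (contig, start, end) tuple
    could then be dispatched as an independent calling job."""
    if bin_size <= 0:
        raise ValueError("bin_size must be positive")
    bins = []
    for contig, length in contig_lengths.items():
        start = 1
        while start <= length:
            end = min(start + bin_size - 1, length)
            bins.append((contig, start, end))
            start = end + 1
    return bins


# Hypothetical toy contigs, far smaller than real chromosomes.
toy_contigs = {"contigA": 2500, "contigB": 1000}
work_units = make_bins(toy_contigs, bin_size=1000)
for unit in work_units:
    print(unit)
```

    Because the intervals never overlap and together cover every base once, results from separate bins can be concatenated in order; any scheme that needs flanking context around bin edges would have to extend this sketch.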

  1. Contribution of TyrB26 to the Function and Stability of Insulin

    PubMed Central

    Pandyarajan, Vijay; Phillips, Nelson B.; Rege, Nischay; Lawrence, Michael C.; Whittaker, Jonathan; Weiss, Michael A.

    2016-01-01

    Crystallographic studies of insulin bound to receptor domains have defined the primary hormone-receptor interface. We investigated the role of TyrB26, a conserved aromatic residue at this interface. To probe the evolutionary basis for such conservation, we constructed 18 variants at B26. Surprisingly, non-aromatic polar or charged side chains (such as Glu, Ser, or ornithine (Orn)) conferred high activity, whereas the weakest-binding analogs contained Val, Ile, and Leu substitutions. Modeling of variant complexes suggested that the B26 side chains pack within a shallow depression at the solvent-exposed periphery of the interface. This interface would disfavor large aliphatic side chains. The analogs with highest activity exhibited reduced thermodynamic stability and heightened susceptibility to fibrillation. Perturbed self-assembly was also demonstrated in studies of the charged variants (Orn and Glu); indeed, the GluB26 analog exhibited aberrant aggregation in either the presence or absence of zinc ions. Thus, although TyrB26 is part of insulin's receptor-binding surface, our results suggest that its conservation has been enjoined by the aromatic ring's contributions to native stability and self-assembly. We envisage that such classical structural relationships reflect the implicit threat of toxic misfolding (rather than hormonal function at the receptor level) as a general evolutionary determinant of extant protein sequences. PMID:27129279

  2. Additive gene-environment effects on hippocampal structure in healthy humans.

    PubMed

    Rabl, Ulrich; Meyer, Bernhard M; Diers, Kersten; Bartova, Lucie; Berger, Andreas; Mandorfer, Dominik; Popovic, Ana; Scharinger, Christian; Huemer, Julia; Kalcher, Klaudius; Pail, Gerald; Haslacher, Helmuth; Perkmann, Thomas; Windischberger, Christian; Brocke, Burkhard; Sitte, Harald H; Pollak, Daniela D; Dreher, Jean-Claude; Kasper, Siegfried; Praschak-Rieder, Nicole; Moser, Ewald; Esterbauer, Harald; Pezawas, Lukas

    2014-07-23

    Hippocampal volume loss has been related to chronic stress as well as genetic factors. Although genetic and environmental variables affecting hippocampal volume have extensively been studied and related to mental illness, limited evidence is available with respect to G × E interactions on hippocampal volume. The present MRI study investigated interaction effects on hippocampal volume between three well-studied functional genetic variants (COMT Val158Met, BDNF Val66Met, 5-HTTLPR) associated with hippocampal volume and a measure of environmental adversity (life events questionnaire) in a large sample of healthy humans (n = 153). All three variants showed significant interactions with environmental adversity with respect to hippocampal volume. Observed effects were additive by nature and driven by both recent as well as early life events. A consecutive analysis of hippocampal subfields revealed a spatially distinct profile for each genetic variant suggesting a specific role of 5-HTTLPR for the subiculum, BDNF Val66Met for CA4/dentate gyrus, and COMT Val158Met for CA2/3 volume changes. The present study underscores the importance of G × E interactions as determinants of hippocampal volume, which is crucial for the neurobiological understanding of stress-related conditions, such as mood disorders or post-traumatic stress disorder (PTSD).

  3. Genetic Epidemiology, Hematological and Clinical Features of Hemoglobinopathies in Iran

    PubMed Central

    Rahimi, Zohreh

    2013-01-01

    There is large variation in the molecular genetics and clinical features of hemoglobinopathies in Iran. Studying structural variants of hemoglobin demonstrated that the β-chain variants of hemoglobin S and D-Punjab are more prevalent in the Fars (southwestern Iran) and Kermanshah (western Iran) provinces, respectively. Also, α-chain variants of Hb Q-Iran and Hb Setif are prevalent in western Iran. The molecular basis and clinical severity of thalassemias are extremely heterogenous among Iranians due to the presence of multiethnic groups in the country. β-Thalassemia is more prevalent in northern and southern Iran. Among 52 different β-thalassemia mutations that have been identified among Iranian populations, IVSII-1 G:A is the most frequent mutation in most parts of the country. The presence of IVS I-5 G:C mutation with high frequency in southeastern Iran might reflect gene flow from neighboring countries. A wide spectrum of α-thalassemia alleles has been detected among Iranians with −α 3.7 kb as the most prevalent α-thalassemia mutation. The prevention program of thalassemia birth in Iran has reduced the birth rate of homozygous β-thalassemia since the implementation of the program in 1997. In this review genetic epidemiology, clinical and hematological aspects of hemoglobinopathies, and the prevention programs of β-thalassemia in Iran will be discussed. PMID:23853772

  4. Rare variants in axonogenesis genes connect three families with sound-color synesthesia.

    PubMed

    Tilot, Amanda K; Kucera, Katerina S; Vino, Arianna; Asher, Julian E; Baron-Cohen, Simon; Fisher, Simon E

    2018-03-20

    Synesthesia is a rare nonpathological phenomenon where stimulation of one sense automatically provokes a secondary perception in another. Hypothesized to result from differences in cortical wiring during development, synesthetes show atypical structural and functional neural connectivity, but the underlying molecular mechanisms are unknown. The trait also appears to be more common among people with autism spectrum disorder and savant abilities. Previous linkage studies searching for shared loci of large effect size across multiple families have had limited success. To address the critical lack of candidate genes, we applied whole-exome sequencing to three families with sound-color (auditory-visual) synesthesia affecting multiple relatives across three or more generations. We identified rare genetic variants that fully cosegregate with synesthesia in each family, uncovering 37 genes of interest. Consistent with reports indicating genetic heterogeneity, no variants were shared across families. Gene ontology analyses highlighted six genes (COL4A1, ITGA2, MYO10, ROBO3, SLC9A6, and SLIT2) associated with axonogenesis and expressed during early childhood when synesthetic associations are formed. These results are consistent with neuroimaging-based hypotheses about the role of hyperconnectivity in the etiology of synesthesia and offer a potential entry point into the neurobiology that organizes our sensory experiences.

  5. The phenotypic spectrum of ARHGEF9 includes intellectual disability, focal epilepsy and febrile seizures.

    PubMed

    Klein, Karl Martin; Pendziwiat, Manuela; Eilam, Anda; Gilad, Ronit; Blatt, Ilan; Rosenow, Felix; Kanaan, Moien; Helbig, Ingo; Afawi, Zaid

    2017-07-01

    Mutations or structural genomic alterations of the X-chromosomal gene ARHGEF9 have been described in male and female patients with intellectual disability. Hyperekplexia and epilepsy were observed to a variable degree, but incompletely described. Here, we expand the phenotypic spectrum of ARHGEF9 by describing a large Ethiopian-Jewish family with epilepsy and intellectual disability. The four affected male siblings, their unaffected parents and two unaffected female siblings were recruited and phenotyped. Parametric linkage analysis was performed using SNP microarrays. Variants from exome sequencing in two affected individuals were confirmed by Sanger sequencing. All affected male siblings had febrile seizures from age 2-3 years and intellectual disability. Three developed afebrile seizures between age 7-17 years. Three showed focal seizure semiology. None had hyperekplexia. A novel ARHGEF9 variant (c.967G>A, p.G323R, NM_015185.2) was hemizygous in all affected male siblings and heterozygous in the mother. This family reveals that the phenotypic spectrum of ARHGEF9 is broader than commonly assumed and includes febrile seizures and focal epilepsy with intellectual disability in the absence of hyperekplexia or other clinically distinguishing features. Our findings suggest that pathogenic variants in ARHGEF9 may be more common than previously assumed in patients with intellectual disability and mild epilepsy.
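    The variant notation in this record (c.967G>A, p.G323R) can be checked for internal consistency with a short worked sketch. It relies only on the standard genetic code and on the convention that coding position 1 is the first base of codon 1; it does not look up the NM_015185.2 sequence, so the reference codon it reports is an inference about what would be consistent, not a statement of the actual transcript sequence. The function names are hypothetical.

```python
# Subset of the standard genetic code (DNA alphabet) needed for this check.
GLYCINE_CODONS = ["GGA", "GGC", "GGG", "GGT"]
AGN_CODONS = {"AGA": "R", "AGG": "R", "AGC": "S", "AGT": "S"}


def codon_position(cdna_pos):
    """Map a 1-based coding-sequence position to
    (codon number, position within the codon), both 1-based."""
    codon = (cdna_pos - 1) // 3 + 1
    offset = (cdna_pos - 1) % 3 + 1
    return codon, offset


def glycine_codons_giving_arginine(offset):
    """List glycine codons for which a G>A change at `offset` (1-based)
    produces an arginine codon, using only the codons tabulated above."""
    hits = []
    for codon in GLYCINE_CODONS:
        if codon[offset - 1] != "G":
            continue  # the reference base at this position must be G
        mutated = codon[: offset - 1] + "A" + codon[offset:]
        if AGN_CODONS.get(mutated) == "R":
            hits.append(codon)
    return hits


codon, offset = codon_position(967)
print(codon, offset)
print(glycine_codons_giving_arginine(offset))
```

    Position 967 falls on the first base of codon 323, and a G>A change there turns a glycine codon GGA or GGG into the arginine codon AGA or AGG, which agrees with the reported p.G323R; GGC or GGT would instead yield serine.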

  6. Functional analysis of human cytochrome P450 21A2 variants involved in congenital adrenal hyperplasia

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wang, Chunxue; Pallan, Pradeep S.; Zhang, Wei

    Cytochrome P450 (P450, CYP) 21A2 is the major steroid 21-hydroxylase, converting progesterone to 11-deoxycorticosterone and 17α-hydroxyprogesterone (17α-OH-progesterone) to 11-deoxycortisol. More than 100 CYP21A2 variants give rise to congenital adrenal hyperplasia (CAH). We previously reported a structure of WT human P450 21A2 with bound progesterone and now present a structure bound to the other substrate (17α-OH-progesterone). We found that the 17α-OH-progesterone- and progesterone-bound complex structures are highly similar, with only some minor differences in surface loop regions. Twelve P450 21A2 variants associated with either salt-wasting or nonclassical forms of CAH were expressed, purified, and analyzed. The catalytic activities of these 12 variants ranged from 0.00009% to 30% of WT P450 21A2 and the extent of heme incorporation from 10% to 95% of the WT. Substrate dissociation constants (Ks) for four variants were 37–13,000-fold higher than for WT P450 21A2. Cytochrome b5, which augments several P450 activities, inhibited P450 21A2 activity. Similar to the WT enzyme, high noncompetitive intermolecular kinetic deuterium isotope effects (≥ 5.5) were observed for all six P450 21A2 variants examined for 21-hydroxylation of 21-d3-progesterone, indicating that C–H bond breaking is a rate-limiting step over a 10^4-fold range of catalytic efficiency. Using UV-visible and CD spectroscopy, we found that P450 21A2 thermal stability assessed in bacterial cells and with purified enzymes differed among salt-wasting- and nonclassical-associated variants, but these differences did not correlate with catalytic activity. Our in-depth investigation of CAH-associated P450 21A2 variants reveals critical insight into the effects of disease-causing mutations on this important enzyme.

  7. Strategic approaches to unraveling genetic causes of cardiovascular diseases

    USDA-ARS's Scientific Manuscript database

    DNA sequence variants are major components of the "causal field" for virtually all medical phenotypes, whether single gene familial disorders or complex traits without a clear familial aggregation. The causal variants in single gene disorders are necessary and sufficient to impart large effects. In ...

  8. Height-reducing variants and selection for short stature in Sardinia

    PubMed Central

    Mulas, Antonella; Steri, Maristella; Busonero, Fabio; Marcus, Joseph H.; Marongiu, Michele; Maschio, Andrea; Ortega Del Vecchyo, Diego; Floris, Matteo; Meloni, Antonella; Delitala, Alessandro; Concas, Maria Pina; Murgia, Federico; Biino, Ginevra; Vaccargiu, Simona; Nagaraja, Ramaiah; Lohmueller, Kirk E.; Timpson, Nicholas J.; Soranzo, Nicole; Tachmazidou, Ioanna; Dedoussis, George; Zeggini, Eleftheria; Uzzau, Sergio; Jones, Chris; Lyons, Robert; Angius, Andrea; Abecasis, Gonçalo R.; Novembre, John; Schlessinger, David; Cucca, Francesco

    2015-01-01

    We report sequencing-based whole-genome association analyses to evaluate the impact of rare and founder variants on stature in 6,307 individuals on the island of Sardinia. We identified two variants with large effects. One is a stop codon in the GHR gene, relatively frequent in Sardinia (0.87% vs <0.01% elsewhere), which in homozygosity causes the short stature Laron syndrome. We find that it reduces height in heterozygotes by an average of 4.2 cm (−0.64 s.d.). The other variant, in the imprinted KCNQ1 gene (MAF = 7.7% vs <1% elsewhere), reduces height by an average of 1.83 cm (−0.31 s.d.) when maternally inherited. Additionally, polygenic scores indicate that known height-decreasing alleles are at systematically higher frequency in Sardinians than would be expected by genetic drift. The findings are consistent with selection toward shorter stature in Sardinia and a suggestive human example of the proposed “island effect” reducing the size of large mammals. PMID:26366551

  9. Variant-aware saturating mutagenesis using multiple Cas9 nucleases identifies regulatory elements at trait-associated loci.

    PubMed

    Canver, Matthew C; Lessard, Samuel; Pinello, Luca; Wu, Yuxuan; Ilboudo, Yann; Stern, Emily N; Needleman, Austen J; Galactéros, Frédéric; Brugnara, Carlo; Kutlar, Abdullah; McKenzie, Colin; Reid, Marvin; Chen, Diane D; Das, Partha Pratim; A Cole, Mitchel; Zeng, Jing; Kurita, Ryo; Nakamura, Yukio; Yuan, Guo-Cheng; Lettre, Guillaume; Bauer, Daniel E; Orkin, Stuart H

    2017-04-01

    Cas9-mediated, high-throughput, saturating in situ mutagenesis permits fine-mapping of function across genomic segments. Disease- and trait-associated variants identified in genome-wide association studies largely cluster at regulatory loci. Here we demonstrate the use of multiple designer nucleases and variant-aware library design to interrogate trait-associated regulatory DNA at high resolution. We developed a computational tool for the creation of saturating-mutagenesis libraries with single or multiple nucleases with incorporation of variants. We applied this methodology to the HBS1L-MYB intergenic region, which is associated with red-blood-cell traits, including fetal hemoglobin levels. This approach identified putative regulatory elements that control MYB expression. Analysis of genomic copy number highlighted potential false-positive regions, thus emphasizing the importance of off-target analysis in the design of saturating-mutagenesis experiments. Together, these data establish a widely applicable high-throughput and high-resolution methodology to identify minimal functional sequences within large disease- and trait-associated regions.

  10. Photoswitchable red fluorescent protein with a large Stokes shift

    PubMed Central

    Piatkevich, Kiryl D.; English, Brian P.; Malashkevich, Vladimir N.; Xiao, Hui; Almo, Steven C.; Singer, Robert H.; Verkhusha, Vladislav V.

    2014-01-01

    A subclass of fluorescent proteins, the large Stokes shift fluorescent proteins, is characterized by an increased spread between the excitation and emission maxima. Here we report a photoswitchable variant of a red fluorescent protein with a large Stokes shift, PSLSSmKate, which initially exhibits excitation/emission at 445/622 nm, but irradiation with violet light photoswitches PSLSSmKate into a common red form with excitation/emission at 573/621 nm. We characterize spectral, photophysical and biochemical properties of PSLSSmKate in vitro and in mammalian cells, and determine its crystal structure in the large Stokes shift form. Mass-spectrometry, mutagenesis and spectroscopic analysis of PSLSSmKate allow us to propose molecular mechanisms for the large Stokes shift, pH dependence and light-induced chromophore transformation. We demonstrate applicability of PSLSSmKate to superresolution PALM microscopy and protein dynamics in live cells. Given its promising properties, we expect that the PSLSSmKate-like phenotype will be further used for photoactivatable imaging and tracking multiple populations of intracellular objects. PMID:25242289

  11. A Novel Genome-Information Content-Based Statistic for Genome-Wide Association Analysis Designed for Next-Generation Sequencing Data

    PubMed Central

    Luo, Li; Zhu, Yun

    2012-01-01

    The genome-wide association studies (GWAS) designed for next-generation sequencing data involve testing association of genomic variants, including common, low frequency, and rare variants. The current strategies for association studies are well developed for identifying association of common variants with the common diseases, but may be ill-suited when large amounts of allelic heterogeneity are present in sequence data. Recently, group tests that analyze their collective frequency differences between cases and controls shift the current variant-by-variant analysis paradigm for GWAS of common variants to the collective test of multiple variants in the association analysis of rare variants. However, group tests ignore differences in genetic effects among SNPs at different genomic locations. As an alternative to group tests, we developed a novel genome-information content-based statistic for testing association of the entire allele frequency spectrum of genomic variation with the diseases. To evaluate the performance of the proposed statistic, we use large-scale simulations based on whole genome low coverage pilot data in the 1000 Genomes Project to calculate the type 1 error rates and power of seven alternative statistics: a genome-information content-based statistic, the generalized T², collapsing method, multivariate and collapsing (CMC) method, individual χ² test, weighted-sum statistic, and variable threshold statistic. Finally, we apply the seven statistics to published resequencing dataset from ANGPTL3, ANGPTL4, ANGPTL5, and ANGPTL6 genes in the Dallas Heart Study. We report that the genome-information content-based statistic has significantly improved type 1 error rates and higher power than the other six statistics in both simulated and empirical datasets. PMID:22651812

  12. A novel genome-information content-based statistic for genome-wide association analysis designed for next-generation sequencing data.

    PubMed

    Luo, Li; Zhu, Yun; Xiong, Momiao

    2012-06-01

    The genome-wide association studies (GWAS) designed for next-generation sequencing data involve testing association of genomic variants, including common, low frequency, and rare variants. The current strategies for association studies are well developed for identifying association of common variants with the common diseases, but may be ill-suited when large amounts of allelic heterogeneity are present in sequence data. Recently, group tests that analyze their collective frequency differences between cases and controls shift the current variant-by-variant analysis paradigm for GWAS of common variants to the collective test of multiple variants in the association analysis of rare variants. However, group tests ignore differences in genetic effects among SNPs at different genomic locations. As an alternative to group tests, we developed a novel genome-information content-based statistic for testing association of the entire allele frequency spectrum of genomic variation with the diseases. To evaluate the performance of the proposed statistic, we use large-scale simulations based on whole genome low coverage pilot data in the 1000 Genomes Project to calculate the type 1 error rates and power of seven alternative statistics: a genome-information content-based statistic, the generalized T², collapsing method, multivariate and collapsing (CMC) method, individual χ² test, weighted-sum statistic, and variable threshold statistic. Finally, we apply the seven statistics to published resequencing dataset from ANGPTL3, ANGPTL4, ANGPTL5, and ANGPTL6 genes in the Dallas Heart Study. We report that the genome-information content-based statistic has significantly improved type 1 error rates and higher power than the other six statistics in both simulated and empirical datasets.

  13. Genetic Architecture of Vitamin B12 and Folate Levels Uncovered Applying Deeply Sequenced Large Datasets

    PubMed Central

    Thorleifsson, Gudmar; Ahluwalia, Tarunveer S.; Steinthorsdottir, Valgerdur; Bjarnason, Helgi; Gudbjartsson, Daniel F.; Magnusson, Olafur T.; Sparsø, Thomas; Albrechtsen, Anders; Kong, Augustine; Masson, Gisli; Tian, Geng; Cao, Hongzhi; Nie, Chao; Kristiansen, Karsten; Husemoen, Lise Lotte; Thuesen, Betina; Li, Yingrui; Nielsen, Rasmus; Linneberg, Allan; Olafsson, Isleifur; Eyjolfsson, Gudmundur I.; Jørgensen, Torben; Wang, Jun; Hansen, Torben; Thorsteinsdottir, Unnur; Stefánsson, Kari; Pedersen, Oluf

    2013-01-01

    Genome-wide association studies have mainly relied on common HapMap sequence variations. Recently, sequencing approaches have allowed analysis of low frequency and rare variants in conjunction with common variants, thereby improving the search for functional variants and thus the understanding of the underlying biology of human traits and diseases. Here, we used a large Icelandic whole genome sequence dataset combined with Danish exome sequence data to gain insight into the genetic architecture of serum levels of vitamin B12 (B12) and folate. Up to 22.9 million sequence variants were analyzed in combined samples of 45,576 and 37,341 individuals with serum B12 and folate measurements, respectively. We found six novel loci associating with serum B12 (CD320, TCN2, ABCD4, MMAA, MMACHC) or folate levels (FOLR3) and confirmed seven loci for these traits (TCN1, FUT6, FUT2, CUBN, CLYBL, MUT, MTHFR). Conditional analyses established that four loci contain additional independent signals. Interestingly, 13 of the 18 identified variants were coding and 11 of the 13 target genes have known functions related to B12 and folate pathways. Contrary to epidemiological studies we did not find consistent association of the variants with cardiovascular diseases, cancers or Alzheimer's disease although some variants demonstrated pleiotropic effects. Although to some degree impeded by low statistical power for some of these conditions, these data suggest that sequence variants that contribute to the population diversity in serum B12 or folate levels do not modify the risk of developing these conditions. Yet, the study demonstrates the value of combining whole genome and exome sequencing approaches to ascertain the genetic and molecular architectures underlying quantitative trait associations. PMID:23754956

  14. The structure of Pseudomonas P51 Cl-muconate lactonizing enzyme: Co-evolution of structure and dynamics with the dehalogenation function

    PubMed Central

    Kajander, Tommi; Lehtiö, Lari; Schlömann, Michael; Goldman, Adrian

    2003-01-01

    Bacterial muconate lactonizing enzymes (MLEs) catalyze the conversion of cis,cis-muconate as a part of the β-ketoadipate pathway, and some MLEs are also able to dehalogenate chlorinated muconates (Cl-MLEs). The basis for the Cl-MLEs dehalogenating activity is still unclear. To further elucidate the differences between MLEs and Cl-MLEs, we have solved the structure of Pseudomonas P51 Cl-MLE at 1.95 Å resolution. Comparison of Pseudomonas MLE and Cl-MLE structures reveals the presence of a large cavity in the Cl-MLEs. The cavity may be related to conformational changes on substrate binding in Cl-MLEs, at Gly52. Site-directed mutagenesis on Pseudomonas MLE core positions to the equivalent Cl-MLE residues showed that the variant Thr52Gly was rather inactive, whereas the Thr52Gly-Phe103Ser variant had regained part of the activity. These residues form a hydrogen bond in the Cl-MLEs. The Cl-MLE structure, as a result of the Thr-to-Gly change, is more flexible than MLE: As a mobile loop closes over the active site, a conformational change at Gly52 is observed in Cl-MLEs. The loose packing and structural motions in Cl-MLE may be required for the rotation of the lactone ring in the active site necessary for the dehalogenating activity of Cl-MLEs. Furthermore, we also suggest that differences in the active site mobile loop sequence between MLEs and Cl-MLEs result in lower active site polarity in Cl-MLEs, possibly affecting catalysis. These changes could result in slower product release from Cl-MLEs and make it a better enzyme for dehalogenation of substrate. PMID:12930985

  15. GRIDSS: sensitive and specific genomic rearrangement detection using positional de Bruijn graph assembly

    PubMed Central

    Do, Hongdo; Molania, Ramyar

    2017-01-01

    The identification of genomic rearrangements with high sensitivity and specificity using massively parallel sequencing remains a major challenge, particularly in precision medicine and cancer research. Here, we describe a new method for detecting rearrangements, GRIDSS (Genome Rearrangement IDentification Software Suite). GRIDSS is a multithreaded structural variant (SV) caller that performs efficient genome-wide break-end assembly prior to variant calling using a novel positional de Bruijn graph-based assembler. By combining assembly, split read, and read pair evidence using a probabilistic scoring, GRIDSS achieves high sensitivity and specificity on simulated, cell line, and patient tumor data, recently winning SV subchallenge #5 of the ICGC-TCGA DREAM8.5 Somatic Mutation Calling Challenge. On human cell line data, GRIDSS halves the false discovery rate compared to other recent methods while matching or exceeding their sensitivity. GRIDSS identifies nontemplate sequence insertions, microhomologies, and large imperfect homologies, estimates a quality score for each breakpoint, stratifies calls into high or low confidence, and supports multisample analysis. PMID:29097403

  16. Chemical Synthesis of the Highly Hydrophobic Antiviral Membrane-Associated Protein IFITM3 and Modified Variants.

    PubMed

    Harmand, Thibault J; Pattabiraman, Vijaya R; Bode, Jeffrey W

    2017-10-02

    Interferon-induced transmembrane protein 3 (IFITM3) is an antiviral transmembrane protein that is thought to serve as the primary factor for inhibiting the replication of a large number of viruses, including West Nile virus, Dengue virus, Ebola virus, and Zika virus. Production of this 14.5 kDa, 133-residue transmembrane protein, especially with essential posttranslational modifications, by recombinant expression is challenging. In this report, we document the chemical synthesis of IFITM3 in multi-milligram quantities (>15 mg) and the preparation of phosphorylated and fluorescent variants. The synthesis was accomplished by using KAHA ligations, which operate under acidic aqueous/organic mixtures that excel at solubilizing even the exceptionally hydrophobic C-terminal region of IFITM3. The synthetic material is readily incorporated into model vesicles and forms the basis for using synthetic, homogeneous IFITM3 and its derivatives for further studying its structure and biological mode of action. © 2017 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.

  17. Genome sequencing reveals loci under artificial selection that underlie disease phenotypes in the laboratory rat.

    PubMed

    Atanur, Santosh S; Diaz, Ana Garcia; Maratou, Klio; Sarkis, Allison; Rotival, Maxime; Game, Laurence; Tschannen, Michael R; Kaisaki, Pamela J; Otto, Georg W; Ma, Man Chun John; Keane, Thomas M; Hummel, Oliver; Saar, Kathrin; Chen, Wei; Guryev, Victor; Gopalakrishnan, Kathirvel; Garrett, Michael R; Joe, Bina; Citterio, Lorena; Bianchi, Giuseppe; McBride, Martin; Dominiczak, Anna; Adams, David J; Serikawa, Tadao; Flicek, Paul; Cuppen, Edwin; Hubner, Norbert; Petretto, Enrico; Gauguier, Dominique; Kwitek, Anne; Jacob, Howard; Aitman, Timothy J

    2013-08-01

    Large numbers of inbred laboratory rat strains have been developed for a range of complex disease phenotypes. To gain insights into the evolutionary pressures underlying selection for these phenotypes, we sequenced the genomes of 27 rat strains, including 11 models of hypertension, diabetes, and insulin resistance, along with their respective control strains. Altogether, we identified more than 13 million single-nucleotide variants, indels, and structural variants across these rat strains. Analysis of strain-specific selective sweeps and gene clusters implicated genes and pathways involved in cation transport, angiotensin production, and regulators of oxidative stress in the development of cardiovascular disease phenotypes in rats. Many of the rat loci that we identified overlap with previously mapped loci for related traits in humans, indicating the presence of shared pathways underlying these phenotypes in rats and humans. These data represent a step change in resources available for evolutionary analysis of complex traits in disease models. Copyright © 2013 The Authors. Published by Elsevier Inc. All rights reserved.

  18. Genome Sequencing Reveals Loci under Artificial Selection that Underlie Disease Phenotypes in the Laboratory Rat

    PubMed Central

    Atanur, Santosh S.; Diaz, Ana Garcia; Maratou, Klio; Sarkis, Allison; Rotival, Maxime; Game, Laurence; Tschannen, Michael R.; Kaisaki, Pamela J.; Otto, Georg W.; Ma, Man Chun John; Keane, Thomas M.; Hummel, Oliver; Saar, Kathrin; Chen, Wei; Guryev, Victor; Gopalakrishnan, Kathirvel; Garrett, Michael R.; Joe, Bina; Citterio, Lorena; Bianchi, Giuseppe; McBride, Martin; Dominiczak, Anna; Adams, David J.; Serikawa, Tadao; Flicek, Paul; Cuppen, Edwin; Hubner, Norbert; Petretto, Enrico; Gauguier, Dominique; Kwitek, Anne; Jacob, Howard; Aitman, Timothy J.

    2013-01-01

    Large numbers of inbred laboratory rat strains have been developed for a range of complex disease phenotypes. To gain insights into the evolutionary pressures underlying selection for these phenotypes, we sequenced the genomes of 27 rat strains, including 11 models of hypertension, diabetes, and insulin resistance, along with their respective control strains. Altogether, we identified more than 13 million single-nucleotide variants, indels, and structural variants across these rat strains. Analysis of strain-specific selective sweeps and gene clusters implicated genes and pathways involved in cation transport, angiotensin production, and regulators of oxidative stress in the development of cardiovascular disease phenotypes in rats. Many of the rat loci that we identified overlap with previously mapped loci for related traits in humans, indicating the presence of shared pathways underlying these phenotypes in rats and humans. These data represent a step change in resources available for evolutionary analysis of complex traits in disease models. PMID:23890820

  19. An ethnically relevant consensus Korean reference genome is a step towards personal reference genomes

    PubMed Central

    Cho, Yun Sung; Kim, Hyunho; Kim, Hak-Min; Jho, Sungwoong; Jun, JeHoon; Lee, Yong Joo; Chae, Kyun Shik; Kim, Chang Geun; Kim, Sangsoo; Eriksson, Anders; Edwards, Jeremy S.; Lee, Semin; Kim, Byung Chul; Manica, Andrea; Oh, Tae-Kwang; Church, George M.; Bhak, Jong

    2016-01-01

    Human genomes are routinely compared against a universal reference. However, this strategy could miss population-specific and personal genomic variations, which may be detected more efficiently using an ethnically relevant or personal reference. Here we report a hybrid assembly of a Korean reference genome (KOREF) for constructing personal and ethnic references by combining sequencing and mapping methods. We also build its consensus variome reference, providing information on millions of variants from 40 additional ethnically homogeneous genomes from the Korean Personal Genome Project. We find that the ethnically relevant consensus reference can be beneficial for efficient variant detection. Systematic comparison of human assemblies shows the importance of assembly quality, suggesting the necessity of new technologies to comprehensively map ethnic and personal genomic structure variations. In the era of large-scale population genome projects, the leveraging of ethnicity-specific genome assemblies as well as the human reference genome will accelerate mapping all human genome diversity. PMID:27882922

  20. An analytical framework for whole-genome sequence association studies and its implications for autism spectrum disorder.

    PubMed

    Werling, Donna M; Brand, Harrison; An, Joon-Yong; Stone, Matthew R; Zhu, Lingxue; Glessner, Joseph T; Collins, Ryan L; Dong, Shan; Layer, Ryan M; Markenscoff-Papadimitriou, Eirene; Farrell, Andrew; Schwartz, Grace B; Wang, Harold Z; Currall, Benjamin B; Zhao, Xuefang; Dea, Jeanselle; Duhn, Clif; Erdman, Carolyn A; Gilson, Michael C; Yadav, Rachita; Handsaker, Robert E; Kashin, Seva; Klei, Lambertus; Mandell, Jeffrey D; Nowakowski, Tomasz J; Liu, Yuwen; Pochareddy, Sirisha; Smith, Louw; Walker, Michael F; Waterman, Matthew J; He, Xin; Kriegstein, Arnold R; Rubenstein, John L; Sestan, Nenad; McCarroll, Steven A; Neale, Benjamin M; Coon, Hilary; Willsey, A Jeremy; Buxbaum, Joseph D; Daly, Mark J; State, Matthew W; Quinlan, Aaron R; Marth, Gabor T; Roeder, Kathryn; Devlin, Bernie; Talkowski, Michael E; Sanders, Stephan J

    2018-05-01

    Genomic association studies of common or rare protein-coding variation have established robust statistical approaches to account for multiple testing. Here we present a comparable framework to evaluate rare and de novo noncoding single-nucleotide variants, insertion/deletions, and all classes of structural variation from whole-genome sequencing (WGS). Integrating genomic annotations at the level of nucleotides, genes, and regulatory regions, we define 51,801 annotation categories. Analyses of 519 autism spectrum disorder families did not identify association with any categories after correction for 4,123 effective tests. Without appropriate correction, biologically plausible associations are observed in both cases and controls. Despite excluding previously identified gene-disrupting mutations, coding regions still exhibited the strongest associations. Thus, in autism, the contribution of de novo noncoding variation is probably modest in comparison to that of de novo coding variants. Robust results from future WGS studies will require large cohorts and comprehensive analytical strategies that consider the substantial multiple-testing burden.
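    The multiple-testing correction described in the abstract above can be illustrated with a minimal sketch. It assumes a Bonferroni-style correction and a family-wise alpha of 0.05; neither the correction procedure nor the alpha level is stated in the abstract, and only the count of 4,123 effective tests comes from it. The function names are hypothetical.

```python
def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Per-test p-value threshold that keeps the family-wise error rate at alpha."""
    if n_tests <= 0:
        raise ValueError("n_tests must be a positive integer")
    return alpha / n_tests


def is_significant(p_value: float, alpha: float, n_tests: int) -> bool:
    """True if p_value survives Bonferroni correction for n_tests tests."""
    return p_value <= bonferroni_threshold(alpha, n_tests)


# 4,123 effective tests is taken from the abstract; alpha = 0.05 is an assumption.
N_EFFECTIVE_TESTS = 4123
ALPHA = 0.05

threshold = bonferroni_threshold(ALPHA, N_EFFECTIVE_TESTS)
print(f"per-test threshold: {threshold:.3e}")

# A nominally significant p-value (hypothetical) does not survive correction.
print(is_significant(1e-4, ALPHA, N_EFFECTIVE_TESTS))
```

    Under these assumptions, a category with a nominal p-value of 1e-4 would look striking in isolation but would not pass the corrected threshold, which is the pattern the abstract reports for uncorrected associations.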

  1. affy2sv: an R package to pre-process Affymetrix CytoScan HD and 750K arrays for SNP, CNV, inversion and mosaicism calling.

    PubMed

    Hernandez-Ferrer, Carles; Quintela Garcia, Ines; Danielski, Katharina; Carracedo, Ángel; Pérez-Jurado, Luis A; González, Juan R

    2015-05-20

    The well-known Genome-Wide Association Studies (GWAS) have led to many scientific discoveries using SNP data. Even so, they have not been able to explain the full heritability of complex diseases. Now, other structural variants such as copy number variants or DNA inversions, either germ-line or in mosaicism events, are being studied. We present the R package affy2sv to pre-process Affymetrix CytoScan HD/750k arrays (and also Genome-Wide SNP 5.0/6.0 and Axiom arrays) in structural variant studies. We illustrate the capabilities of affy2sv using two different complete pipelines on real data. The first performs a GWAS and a mosaic alteration detection study, and the second detects CNVs and performs inversion calling. Both examples presented in the article show how affy2sv can be used as part of more complex pipelines aimed at analyzing Affymetrix SNP array data in genetic association studies where different types of structural variants are considered.

  2. Structural and biophysical properties of metal-free pathogenic SOD1 mutants A4V and G93A

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Galaleldeen, Ahmad; Strange, Richard W.; Whitson, Lisa J.

    2010-07-19

    Amyotrophic lateral sclerosis (ALS) is a fatal, progressive neurodegenerative disease characterized by the destruction of motor neurons in the spinal cord and brain. A subset of ALS cases are linked to dominant mutations in copper-zinc superoxide dismutase (SOD1). The pathogenic SOD1 variants A4V and G93A have been the foci of multiple studies aimed at understanding the molecular basis for SOD1-linked ALS. The A4V variant is responsible for the majority of familial ALS cases in North America, causing rapidly progressing paralysis once symptoms begin, and the G93A SOD1 variant is overexpressed in often studied murine models of the disease. Here we report the three-dimensional structures of metal-free A4V and of metal-bound and metal-free G93A SOD1. In the metal-free structures, the metal-binding loop elements are observed to be severely disordered, suggesting that these variants may share mechanisms of aggregation proposed previously for other pathogenic SOD1 proteins.

  3. CNV-association meta-analysis in 191,161 European adults reveals new loci associated with anthropometric traits.

    PubMed

    Macé, Aurélien; Tuke, Marcus A; Deelen, Patrick; Kristiansson, Kati; Mattsson, Hannele; Nõukas, Margit; Sapkota, Yadav; Schick, Ursula; Porcu, Eleonora; Rüeger, Sina; McDaid, Aaron F; Porteous, David; Winkler, Thomas W; Salvi, Erika; Shrine, Nick; Liu, Xueping; Ang, Wei Q; Zhang, Weihua; Feitosa, Mary F; Venturini, Cristina; van der Most, Peter J; Rosengren, Anders; Wood, Andrew R; Beaumont, Robin N; Jones, Samuel E; Ruth, Katherine S; Yaghootkar, Hanieh; Tyrrell, Jessica; Havulinna, Aki S; Boers, Harmen; Mägi, Reedik; Kriebel, Jennifer; Müller-Nurasyid, Martina; Perola, Markus; Nieminen, Markku; Lokki, Marja-Liisa; Kähönen, Mika; Viikari, Jorma S; Geller, Frank; Lahti, Jari; Palotie, Aarno; Koponen, Päivikki; Lundqvist, Annamari; Rissanen, Harri; Bottinger, Erwin P; Afaq, Saima; Wojczynski, Mary K; Lenzini, Petra; Nolte, Ilja M; Sparsø, Thomas; Schupf, Nicole; Christensen, Kaare; Perls, Thomas T; Newman, Anne B; Werge, Thomas; Snieder, Harold; Spector, Timothy D; Chambers, John C; Koskinen, Seppo; Melbye, Mads; Raitakari, Olli T; Lehtimäki, Terho; Tobin, Martin D; Wain, Louise V; Sinisalo, Juha; Peters, Annette; Meitinger, Thomas; Martin, Nicholas G; Wray, Naomi R; Montgomery, Grant W; Medland, Sarah E; Swertz, Morris A; Vartiainen, Erkki; Borodulin, Katja; Männistö, Satu; Murray, Anna; Bochud, Murielle; Jacquemont, Sébastien; Rivadeneira, Fernando; Hansen, Thomas F; Oldehinkel, Albertine J; Mangino, Massimo; Province, Michael A; Deloukas, Panos; Kooner, Jaspal S; Freathy, Rachel M; Pennell, Craig; Feenstra, Bjarke; Strachan, David P; Lettre, Guillaume; Hirschhorn, Joel; Cusi, Daniele; Heid, Iris M; Hayward, Caroline; Männik, Katrin; Beckmann, Jacques S; Loos, Ruth J F; Nyholt, Dale R; Metspalu, Andres; Eriksson, Johan G; Weedon, Michael N; Salomaa, Veikko; Franke, Lude; Reymond, Alexandre; Frayling, Timothy M; Kutalik, Zoltán

    2017-09-29

    There are few examples of robust associations between rare copy number variants (CNVs) and complex continuous human traits. Here we present a large-scale CNV association meta-analysis on anthropometric traits in up to 191,161 adult samples from 26 cohorts. The study reveals five CNV associations at 1q21.1, 3q29, 7q11.23, 11p14.2, and 18q21.32 and confirms two known loci at 16p11.2 and 22q11.21, implicating at least one anthropometric trait. The discovered CNVs are recurrent and rare (0.01-0.2%), with large effects on height (>2.4 cm), weight (>5 kg), and body mass index (BMI) (>3.5 kg/m²). Burden analysis shows a 0.41 cm decrease in height, a 0.003 increase in waist-to-hip ratio and increase in BMI by 0.14 kg/m² for each Mb of total deletion burden (P = 2.5 × 10⁻¹⁰, 6.0 × 10⁻⁵, and 2.9 × 10⁻³). Our study provides evidence that the same genes (e.g., MC4R, FIBIN, and FMO5) harbor both common and rare variants affecting body size and that anthropometric traits share genetic loci with developmental and psychiatric disorders. Individual SNPs have small effects on anthropometric traits, yet the impact of CNVs has remained largely unknown. Here, Kutalik and co-workers perform a large-scale genome-wide meta-analysis of structural variation and find rare CNVs associated with height, weight and BMI with large effect sizes.
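    As a worked illustration of the burden estimates quoted in the abstract above, the sketch below scales the reported per-Mb effects by a total deletion burden. It assumes the effects are purely linear and additive in megabases, which is an interpretation of "for each Mb" and not something the abstract spells out; the class and function names are hypothetical, and only the three per-Mb coefficients come from the abstract.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BurdenEffects:
    """Per-Mb effects of total deletion burden, as reported in the abstract."""
    height_cm_per_mb: float = -0.41   # decrease in height (cm) per Mb
    whr_per_mb: float = 0.003         # increase in waist-to-hip ratio per Mb
    bmi_per_mb: float = 0.14          # increase in BMI (kg/m^2) per Mb


def expected_shift(deletion_mb: float, effects: BurdenEffects = BurdenEffects()) -> dict:
    """Expected trait shifts for a given deletion burden, assuming linearity."""
    if deletion_mb < 0:
        raise ValueError("deletion burden cannot be negative")
    return {
        "height_cm": effects.height_cm_per_mb * deletion_mb,
        "waist_to_hip_ratio": effects.whr_per_mb * deletion_mb,
        "bmi_kg_m2": effects.bmi_per_mb * deletion_mb,
    }


# Hypothetical carrier with 2 Mb of total deleted sequence.
shift = expected_shift(2.0)
for trait, delta in shift.items():
    print(f"{trait}: {delta:+.3f}")
```

    These are population-average shifts under the stated linearity assumption, not predictions for any individual carrier.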

  4. Structural optimization under overhang constraints imposed by additive manufacturing technologies

    NASA Astrophysics Data System (ADS)

    Allaire, G.; Dapogny, C.; Estevez, R.; Faure, A.; Michailidis, G.

    2017-12-01

    This article addresses one of the major constraints imposed by additive manufacturing processes on shape optimization problems - that of overhangs, i.e. large regions hanging over void without sufficient support from the lower structure. After revisiting the 'classical' geometric criteria used in the literature, based on the angle between the structural boundary and the build direction, we propose a new mechanical constraint functional, which mimics the layer by layer construction process featured by additive manufacturing technologies, and thereby appeals to the physical origin of the difficulties caused by overhangs. This constraint, as well as some variants, is precisely defined; their shape derivatives are computed in the sense of Hadamard's method, and numerical strategies are extensively discussed, in two and three space dimensions, to efficiently deal with the appearance of overhang features in the course of shape optimization processes.

  5. Heliostat cost optimization study

    NASA Astrophysics Data System (ADS)

    von Reeken, Finn; Weinrebe, Gerhard; Keck, Thomas; Balz, Markus

    2016-05-01

    This paper presents a methodology for a heliostat cost optimization study. First, different variants of small, medium-sized and large heliostats are designed. Then the respective costs, tracking and optical quality are determined. For the calculation of optical quality, a structural model of the heliostat is programmed and analyzed using finite element software. The costs are determined based on inquiries and from experience with similar structures. Eventually, the levelised electricity costs (LCOEs) for a reference power tower plant are calculated. Before each annual simulation run, the heliostat field is optimized. The calculated LCOEs are then used to identify the most suitable option(s). Finally, the conclusions and findings of this extensive cost study are used to define the concept of a new cost-efficient heliostat called 'Stellio'.

  6. Integration of bioinformatics and imaging informatics for identifying rare PSEN1 variants in Alzheimer's disease.

    PubMed

    Nho, Kwangsik; Horgusluoglu, Emrin; Kim, Sungeun; Risacher, Shannon L; Kim, Dokyoon; Foroud, Tatiana; Aisen, Paul S; Petersen, Ronald C; Jack, Clifford R; Shaw, Leslie M; Trojanowski, John Q; Weiner, Michael W; Green, Robert C; Toga, Arthur W; Saykin, Andrew J

    2016-08-12

    Pathogenic mutations in PSEN1 are known to cause familial early-onset Alzheimer's disease (EOAD) but common variants in PSEN1 have not been found to strongly influence late-onset AD (LOAD). The association of rare variants in PSEN1 with LOAD-related endophenotypes has received little attention. In this study, we performed a rare variant association analysis of PSEN1 with quantitative biomarkers of LOAD using whole genome sequencing (WGS) by integrating bioinformatics and imaging informatics. A WGS data set (N = 815) from the Alzheimer's Disease Neuroimaging Initiative (ADNI) cohort was used in this analysis. 757 non-Hispanic Caucasian participants underwent WGS from a blood sample and high resolution T1-weighted structural MRI at baseline. An automated MRI analysis technique (FreeSurfer) was used to measure cortical thickness and volume of neuroanatomical structures. We assessed imaging and cerebrospinal fluid (CSF) biomarkers as LOAD-related quantitative endophenotypes. Single variant analyses were performed using PLINK and gene-based analyses of rare variants were performed using the optimal Sequence Kernel Association Test (SKAT-O). A total of 839 rare variants (MAF < 1/√(2N) = 0.0257) were found within a region of ±10 kb from PSEN1. Among them, six exonic (three non-synonymous) variants were observed. A single variant association analysis showed that the PSEN1 p.E318G variant increases the risk of LOAD only in participants carrying APOE ε4 allele where individuals carrying the minor allele of this PSEN1 risk variant have lower CSF Aβ1-42 and higher CSF tau. A gene-based analysis resulted in a significant association of rare but not common (MAF ≥ 0.0257) PSEN1 variants with bilateral entorhinal cortical thickness. This is the first study to show that PSEN1 rare variants collectively show a significant association with the brain atrophy in regions preferentially affected by LOAD, providing further support for a role of PSEN1 in LOAD. The PSEN1 p.E318G variant increases the risk of LOAD only in APOE ε4 carriers. Integrating bioinformatics with imaging informatics for identification of rare variants could help explain the missing heritability in LOAD.
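
    The rare-variant cutoff quoted in the abstract above, MAF < 1/√(2N) = 0.0257, can be checked with a few lines of arithmetic. The sketch below is only an illustration: the function names are hypothetical and not taken from the study, and it assumes N refers to the 757 participants who underwent WGS and MRI, since that value (rather than 815) reproduces the quoted 0.0257.

```python
import math


def rare_variant_maf_threshold(n_samples: int) -> float:
    """Return the minor-allele-frequency cutoff 1/sqrt(2N) for N samples."""
    if n_samples <= 0:
        raise ValueError("n_samples must be positive")
    return 1.0 / math.sqrt(2 * n_samples)


def is_rare(maf: float, n_samples: int) -> bool:
    """A variant counts as rare when its MAF is strictly below the cutoff."""
    return maf < rare_variant_maf_threshold(n_samples)


# Assumption: N = 757 analysed participants (not the 815 in the full data set).
threshold = rare_variant_maf_threshold(757)
print(f"{threshold:.4f}")  # prints 0.0257
```

    With N = 815 the same formula would give a slightly smaller cutoff, so the sample size entering the formula matters when reproducing such a filter.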

  7. Contribution of Large Region Joint Associations to Complex Traits Genetics

    PubMed Central

    Paré, Guillaume; Asma, Senay; Deng, Wei Q.

    2015-01-01

    A polygenic model of inheritance, whereby hundreds or thousands of weakly associated variants contribute to a trait’s heritability, has been proposed to underlie the genetic architecture of complex traits. However, relatively few genetic variants have been positively identified so far and they collectively explain only a small fraction of the predicted heritability. We hypothesized that joint association of multiple weakly associated variants over large chromosomal regions contributes to complex traits variance. Confirmation of such regional associations can help identify new loci and lead to a better understanding of known ones. To test this hypothesis, we first characterized the ability of commonly used genetic association models to identify large region joint associations. Through theoretical derivation and simulation, we showed that multivariate linear models where multiple SNPs are included as independent predictors have the most favorable association profile. Based on these results, we tested for large region association with height in 3,740 European participants from the Health and Retirement Study (HRS). Adjusting for SNPs with known association with height, we demonstrated clustering of weak associations (p = 2 × 10⁻⁴) in regions extending up to 433.0 Kb from known height loci. The contribution of regional associations to phenotypic variance was estimated at 0.172 (95% CI 0.063-0.279; p < 0.001), which compared favorably to 0.129 explained by known height variants. Conversely, we showed that suggestively associated regions are enriched for known height loci. To extend our findings to other traits, we also tested BMI, HDLc and CRP for large region associations, with consistent results for CRP. Our results demonstrate the presence of large region joint associations and suggest these can be used to pinpoint weakly associated SNPs. PMID:25856144
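
    The idea of entering all SNPs of a region as predictors in one multivariate linear model can be illustrated with an ordinary joint F test. This is a minimal sketch under stated assumptions, not the authors' pipeline: the function name, the simulated genotypes and the effect sizes are all hypothetical, and the adjustment for known height SNPs described in the abstract is omitted.

```python
import numpy as np


def joint_region_f_stat(genotypes, trait):
    """F statistic comparing a model with all regional SNPs to an intercept-only model."""
    X = np.asarray(genotypes, dtype=float)
    y = np.asarray(trait, dtype=float)
    n = X.shape[0]
    design = np.column_stack([np.ones(n), X])  # intercept plus one column per SNP
    beta, _, rank, _ = np.linalg.lstsq(design, y, rcond=None)
    rss_full = float(np.sum((y - design @ beta) ** 2))
    rss_null = float(np.sum((y - y.mean()) ** 2))
    df_num = rank - 1   # SNP parameters actually estimable (handles collinear SNPs)
    df_den = n - rank   # residual degrees of freedom
    return ((rss_null - rss_full) / df_num) / (rss_full / df_den)


# Simulated example: 20 SNPs, each with a weak effect on the trait.
rng = np.random.default_rng(0)
n_samples, n_snps = 500, 20
genotypes = rng.binomial(2, 0.3, size=(n_samples, n_snps))  # 0/1/2 allele counts
effects = np.full(n_snps, 0.05)                             # hypothetical weak effects
trait = genotypes @ effects + rng.normal(size=n_samples)
print(joint_region_f_stat(genotypes, trait))
```

    The point of the joint test is that many individually weak effects can add up to a detectable regional signal even when no single SNP would pass a per-variant threshold.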

  8. The QTN program and the alleles that matter for evolution: all that's gold does not glitter.

    PubMed

    Rockman, Matthew V

    2012-01-01

    The search for the alleles that matter, the quantitative trait nucleotides (QTNs) that underlie heritable variation within populations and divergence among them, is a popular pursuit. But what is the question to which QTNs are the answer? Although their pursuit is often invoked as a means of addressing the molecular basis of phenotypic evolution or of estimating the roles of evolutionary forces, the QTNs that are accessible to experimentalists, QTNs of relatively large effect, may be uninformative about these issues if large-effect variants are unrepresentative of the alleles that matter. Although 20th century evolutionary biology generally viewed large-effect variants as atypical, the field has recently undergone a quiet realignment toward a view of readily discoverable large-effect alleles as the primary molecular substrates for evolution. I argue that neither theory nor data justify this realignment. Models and experimental findings covering broad swaths of evolutionary phenomena suggest that evolution often acts via large numbers of small-effect polygenes, individually undetectable. Moreover, these small-effect variants are different in kind, at the molecular level, from the large-effect alleles accessible to experimentalists. Although discoverable QTNs address some fundamental evolutionary questions, they are essentially misleading about many others. © 2011 The Author(s). Evolution © 2011 The Society for the Study of Evolution.

  9. Structural insights into methanol-stable variants of lipase T6 from Geobacillus stearothermophilus.

    PubMed

    Dror, Adi; Kanteev, Margarita; Kagan, Irit; Gihaz, Shalev; Shahar, Anat; Fishman, Ayelet

    2015-11-01

    Enzymatic production of biodiesel by transesterification of triglycerides and alcohol, catalyzed by lipases, offers an environmentally friendly and efficient alternative to the chemically catalyzed process while using low-grade feedstocks. Methanol is utilized frequently as the alcohol in the reaction due to its reactivity and low cost. However, one of the major drawbacks of the enzymatic system is the presence of high methanol concentrations which leads to methanol-induced unfolding and inactivation of the biocatalyst. Therefore, a methanol-stable lipase is of great interest for the biodiesel industry. In this study, protein engineering was applied to substitute charged surface residues with hydrophobic ones to enhance the stability in methanol of a lipase from Geobacillus stearothermophilus T6. We identified a methanol-stable variant, R374W, and combined it with a variant found previously, H86Y/A269T. The triple mutant, H86Y/A269T/R374W, had a half-life value at 70 % methanol of 324 min which reflects an 87-fold enhanced stability compared to the wild type together with elevated thermostability in buffer and in 50 % methanol. This variant also exhibited an improved biodiesel yield from waste chicken oil compared to commercial Lipolase 100L® and Novozyme® CALB. Crystal structures of the wild type and the methanol-stable variants provided insights regarding structure-stability correlations. The most prominent features were the extensive formation of new hydrogen bonds between surface residues directly or mediated by structural water molecules and the stabilization of Zn and Ca binding sites. Mutation sites were also characterized by lower B-factor values calculated from the X-ray structures indicating improved rigidity.

  10. Crystal structure of EML1 reveals the basis for Hsp90 dependence of oncogenic EML4-ALK by disruption of an atypical β-propeller domain

    PubMed Central

    Richards, Mark W.; Law, Edward W. P.; Rennalls, La’Verne P.; Busacca, Sara; O’Regan, Laura; Fry, Andrew M.; Fennell, Dean A.; Bayliss, Richard

    2014-01-01

    Proteins of the echinoderm microtubule-associated protein (EMAP)-like (EML) family contribute to formation of the mitotic spindle and interphase microtubule network. They contain a unique hydrophobic EML protein (HELP) motif and a variable number of WD40 repeats. Recurrent gene rearrangements in nonsmall cell lung cancer fuse EML4 to anaplastic lymphoma kinase (ALK), causing expression of several fusion oncoprotein variants. We have determined a 2.6-Å crystal structure of the representative ∼70-kDa core of EML1, revealing an intimately associated pair of β-propellers, which we term a TAPE (tandem atypical propeller in EMLs) domain. One propeller is highly atypical, having a discontinuous subdomain unrelated to a WD40 motif in place of one of its blades. This unexpected feature shows how a propeller structure can be assembled from subdomains with distinct folds. The HELP motif is not an independent domain but forms part of the hydrophobic core that joins the two β-propellers. The TAPE domain binds α/β-tubulin via its conserved, concave surface, including part of the atypical blade. Mapping the characteristic breakpoints of each EML4-ALK variant onto our structure indicates that the EML4 TAPE domain is truncated in many variants in a manner likely to make the fusion protein structurally unstable. We found that the heat shock protein 90 (Hsp90) inhibitor ganetespib induced degradation of these variants whereas others lacking a partial TAPE domain were resistant in both overexpression models and patient-derived cell lines. The Hsp90-sensitive EML4-ALK variants are exceptions to the rule that oncogenic fusion proteins involve breakpoints in disordered regions of both partners. PMID:24706829

  11. Crystal structure of EML1 reveals the basis for Hsp90 dependence of oncogenic EML4-ALK by disruption of an atypical β-propeller domain.

    PubMed

    Richards, Mark W; Law, Edward W P; Rennalls, La'Verne P; Busacca, Sara; O'Regan, Laura; Fry, Andrew M; Fennell, Dean A; Bayliss, Richard

    2014-04-08

    Proteins of the echinoderm microtubule-associated protein (EMAP)-like (EML) family contribute to formation of the mitotic spindle and interphase microtubule network. They contain a unique hydrophobic EML protein (HELP) motif and a variable number of WD40 repeats. Recurrent gene rearrangements in nonsmall cell lung cancer fuse EML4 to anaplastic lymphoma kinase (ALK), causing expression of several fusion oncoprotein variants. We have determined a 2.6-Å crystal structure of the representative ∼70-kDa core of EML1, revealing an intimately associated pair of β-propellers, which we term a TAPE (tandem atypical propeller in EMLs) domain. One propeller is highly atypical, having a discontinuous subdomain unrelated to a WD40 motif in place of one of its blades. This unexpected feature shows how a propeller structure can be assembled from subdomains with distinct folds. The HELP motif is not an independent domain but forms part of the hydrophobic core that joins the two β-propellers. The TAPE domain binds α/β-tubulin via its conserved, concave surface, including part of the atypical blade. Mapping the characteristic breakpoints of each EML4-ALK variant onto our structure indicates that the EML4 TAPE domain is truncated in many variants in a manner likely to make the fusion protein structurally unstable. We found that the heat shock protein 90 (Hsp90) inhibitor ganetespib induced degradation of these variants whereas others lacking a partial TAPE domain were resistant in both overexpression models and patient-derived cell lines. The Hsp90-sensitive EML4-ALK variants are exceptions to the rule that oncogenic fusion proteins involve breakpoints in disordered regions of both partners.

  12. Proposal for the nomenclature of human plasminogen (PLG) polymorphism.

    PubMed

    Skoda, U; Bertrams, J; Dykes, D; Eiberg, H; Hobart, M; Hummel, K; Kühnl, P; Mauff, G; Nakamura, S; Nishimukai, H

    1986-01-01

    Since its discovery, human plasminogen (PLG) polymorphism has received widespread acceptance in population genetics and forensic haematology. Due to the large number of variant alleles described, a PLG reference typing and Plasminogen Symposium was held, at which a nomenclature proposal was inaugurated. The technology of comparing PLG variants was based on isoelectric focusing and subsequent detection by caseinolytic overlay and 'Western' blotting. Typing results permitted comparison of so far described variant designations and resulted in a new nomenclature proposal for PLG polymorphism. It is recommended that the two most common alleles found in all investigated races be called: PLG*A (previously also PLG*1) and PLG*B (previously also PLG*2), the known variants with acidic pI: PLG*A1 to *A3, intermediate variants: PLG*M1 to *M5, PLG*M5 being functionally inactive, and basic variants: PLG*B1 to *B3. For future classification of newly discovered variants, samples should be compared at any of the laboratories participating in the reference typing.

  13. How the structure of the large subunit controls function in an oxygen-tolerant [NiFe]-hydrogenase

    PubMed Central

    Bowman, Lisa; Flanagan, Lindsey; Fyfe, Paul K.; Parkin, Alison; Hunter, William N.; Sargent, Frank

    2014-01-01

    Salmonella enterica is an opportunistic pathogen that produces a [NiFe]-hydrogenase under aerobic conditions. In the present study, genetic engineering approaches were used to facilitate isolation of this enzyme, termed Hyd-5. The crystal structure was determined to a resolution of 3.2 Å and the hydrogenase was observed to comprise associated large and small subunits. The structure indicated that His229 from the large subunit was close to the proximal [4Fe–3S] cluster in the small subunit. In addition, His229 was observed to lie close to a buried glutamic acid (Glu73), which is conserved in oxygen-tolerant hydrogenases. His229 and Glu73 of the Hyd-5 large subunit were found to be important in both hydrogen oxidation activity and the oxygen-tolerance mechanism. Substitution of His229 or Glu73 with alanine led to a loss in the ability of Hyd-5 to oxidize hydrogen in air. Furthermore, the H229A variant was found to have lost the overpotential requirement for activity that is always observed with oxygen-tolerant [NiFe]-hydrogenases. It is possible that His229 has a role in stabilizing the super-oxidized form of the proximal cluster in the presence of oxygen, and it is proposed that Glu73 could play a supporting role in fine-tuning the chemistry of His229 to enable this function. PMID:24428762

  14. Genome of the Netherlands population-specific imputations identify an ABCA6 variant associated with cholesterol levels

    PubMed Central

    van Leeuwen, Elisabeth M.; Karssen, Lennart C.; Deelen, Joris; Isaacs, Aaron; Medina-Gomez, Carolina; Mbarek, Hamdi; Kanterakis, Alexandros; Trompet, Stella; Postmus, Iris; Verweij, Niek; van Enckevort, David J.; Huffman, Jennifer E.; White, Charles C.; Feitosa, Mary F.; Bartz, Traci M.; Manichaikul, Ani; Joshi, Peter K.; Peloso, Gina M.; Deelen, Patrick; van Dijk, Freerk; Willemsen, Gonneke; de Geus, Eco J.; Milaneschi, Yuri; Penninx, Brenda W.J.H.; Francioli, Laurent C.; Menelaou, Androniki; Pulit, Sara L.; Rivadeneira, Fernando; Hofman, Albert; Oostra, Ben A.; Franco, Oscar H.; Leach, Irene Mateo; Beekman, Marian; de Craen, Anton J.M.; Uh, Hae-Won; Trochet, Holly; Hocking, Lynne J.; Porteous, David J.; Sattar, Naveed; Packard, Chris J.; Buckley, Brendan M.; Brody, Jennifer A.; Bis, Joshua C.; Rotter, Jerome I.; Mychaleckyj, Josyf C.; Campbell, Harry; Duan, Qing; Lange, Leslie A.; Wilson, James F.; Hayward, Caroline; Polasek, Ozren; Vitart, Veronique; Rudan, Igor; Wright, Alan F.; Rich, Stephen S.; Psaty, Bruce M.; Borecki, Ingrid B.; Kearney, Patricia M.; Stott, David J.; Adrienne Cupples, L.; Neerincx, Pieter B.T.; Elbers, Clara C.; Francesco Palamara, Pier; Pe'er, Itsik; Abdellaoui, Abdel; Kloosterman, Wigard P.; van Oven, Mannis; Vermaat, Martijn; Li, Mingkun; Laros, Jeroen F.J.; Stoneking, Mark; de Knijff, Peter; Kayser, Manfred; Veldink, Jan H.; van den Berg, Leonard H.; Byelas, Heorhiy; den Dunnen, Johan T.; Dijkstra, Martijn; Amin, Najaf; Joeri van der Velde, K.; van Setten, Jessica; Kattenberg, Mathijs; van Schaik, Barbera D.C.; Bot, Jan; Nijman, Isaäc J.; Mei, Hailiang; Koval, Vyacheslav; Ye, Kai; Lameijer, Eric-Wubbo; Moed, Matthijs H.; Hehir-Kwa, Jayne Y.; Handsaker, Robert E.; Sunyaev, Shamil R.; Sohail, Mashaal; Hormozdiari, Fereydoun; Marschall, Tobias; Schönhuth, Alexander; Guryev, Victor; Suchiman, H. Eka D.; Wolffenbuttel, Bruce H.; Platteel, Mathieu; Pitts, Steven J.; Potluri, Shobha; Cox, David R.; Li, Qibin; Li, Yingrui; Du, Yuanping; Chen, Ruoyan; Cao, Hongzhi; Li, Ning; Cao, Sujie; Wang, Jun; Bovenberg, Jasper A.; Jukema, J. Wouter; van der Harst, Pim; Sijbrands, Eric J.; Hottenga, Jouke-Jan; Uitterlinden, Andre G.; Swertz, Morris A.; van Ommen, Gert-Jan B.; de Bakker, Paul I.W.; Eline Slagboom, P.; Boomsma, Dorret I.; Wijmenga, Cisca; van Duijn, Cornelia M.

    2015-01-01

    Variants associated with blood lipid levels may be population-specific. To identify low-frequency variants associated with this phenotype, population-specific reference panels may be used. Here we impute nine large Dutch biobanks (~35,000 samples) with the population-specific reference panel created by the Genome of the Netherlands Project and perform association testing with blood lipid levels. We report the discovery of five novel associations at four loci (P value < 6.61 × 10⁻⁴), including a rare missense variant in ABCA6 (rs77542162, p.Cys1359Arg, frequency 0.034), which is predicted to be deleterious. The frequency of this ABCA6 variant is 3.65-fold increased in the Dutch and its effect (βLDL-C = 0.135, βTC = 0.140) is estimated to be very similar to those observed for single variants in well-known lipid genes, such as LDLR. PMID:25751400

  15. The structure and mobility of the intervariant boundaries in 18R martensite in a Cu-Zn-Al alloy

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Zhang, J.X.; Zheng, Y.F.; Zhao, L.C.

    1999-05-28

    Detailed crystallographic analysis was carried out on the martensitic transformation and the various variant combinations in 18R martensite in a Cu-Zn-Al alloy. The self-accommodation of martensitic shear strain is quite perfect within a variant group, but not effective or even does not exist for variant combinations which belong to different groups. Twenty-three unique variant combinations between 24 martensite variants can be divided into four groups, i.e. reflection twin, 180 rotation twin, 120 rotation twin and 90 rotation twin. TEM and HREM observations show that the A C boundary is straight, well-defined and perfectly coherent, the A B boundary is irrational, coherent and gradually curved, and the A D boundary is stepped. The A C and A B boundaries have obvious mobility, and the mobility is not effective for A D boundary. The interplate group boundaries are curved, blurred and immobile. The morphology, structure and mobility of interplate boundary are all related to the degree of self-accommodation and the misorientation of twin boundary.

  16. Large-Scale Exome-wide Association Analysis Identifies Loci for White Blood Cell Traits and Pleiotropy with Immune-Mediated Diseases.

    PubMed

    Tajuddin, Salman M; Schick, Ursula M; Eicher, John D; Chami, Nathalie; Giri, Ayush; Brody, Jennifer A; Hill, W David; Kacprowski, Tim; Li, Jin; Lyytikäinen, Leo-Pekka; Manichaikul, Ani; Mihailov, Evelin; O'Donoghue, Michelle L; Pankratz, Nathan; Pazoki, Raha; Polfus, Linda M; Smith, Albert Vernon; Schurmann, Claudia; Vacchi-Suzzi, Caterina; Waterworth, Dawn M; Evangelou, Evangelos; Yanek, Lisa R; Burt, Amber; Chen, Ming-Huei; van Rooij, Frank J A; Floyd, James S; Greinacher, Andreas; Harris, Tamara B; Highland, Heather M; Lange, Leslie A; Liu, Yongmei; Mägi, Reedik; Nalls, Mike A; Mathias, Rasika A; Nickerson, Deborah A; Nikus, Kjell; Starr, John M; Tardif, Jean-Claude; Tzoulaki, Ioanna; Velez Edwards, Digna R; Wallentin, Lars; Bartz, Traci M; Becker, Lewis C; Denny, Joshua C; Raffield, Laura M; Rioux, John D; Friedrich, Nele; Fornage, Myriam; Gao, He; Hirschhorn, Joel N; Liewald, David C M; Rich, Stephen S; Uitterlinden, Andre; Bastarache, Lisa; Becker, Diane M; Boerwinkle, Eric; de Denus, Simon; Bottinger, Erwin P; Hayward, Caroline; Hofman, Albert; Homuth, Georg; Lange, Ethan; Launer, Lenore J; Lehtimäki, Terho; Lu, Yingchang; Metspalu, Andres; O'Donnell, Chris J; Quarells, Rakale C; Richard, Melissa; Torstenson, Eric S; Taylor, Kent D; Vergnaud, Anne-Claire; Zonderman, Alan B; Crosslin, David R; Deary, Ian J; Dörr, Marcus; Elliott, Paul; Evans, Michele K; Gudnason, Vilmundur; Kähönen, Mika; Psaty, Bruce M; Rotter, Jerome I; Slater, Andrew J; Dehghan, Abbas; White, Harvey D; Ganesh, Santhi K; Loos, Ruth J F; Esko, Tõnu; Faraday, Nauder; Wilson, James G; Cushman, Mary; Johnson, Andrew D; Edwards, Todd L; Zakai, Neil A; Lettre, Guillaume; Reiner, Alex P; Auer, Paul L

    2016-07-07

    White blood cells play diverse roles in innate and adaptive immunity. Genetic association analyses of phenotypic variation in circulating white blood cell (WBC) counts from large samples of otherwise healthy individuals can provide insights into genes and biologic pathways involved in production, differentiation, or clearance of particular WBC lineages (myeloid, lymphoid) and also potentially inform the genetic basis of autoimmune, allergic, and blood diseases. We performed an exome array-based meta-analysis of total WBC and subtype counts (neutrophils, monocytes, lymphocytes, basophils, and eosinophils) in a multi-ancestry discovery and replication sample of ∼157,622 individuals from 25 studies. We identified 16 common variants (8 of which were coding variants) associated with one or more WBC traits, the majority of which are pleiotropically associated with autoimmune diseases. Based on functional annotation, these loci included genes encoding surface markers of myeloid, lymphoid, or hematopoietic stem cell differentiation (CD69, CD33, CD87), transcription factors regulating lineage specification during hematopoiesis (ASXL1, IRF8, IKZF1, JMJD1C, ETS2-PSMG1), and molecules involved in neutrophil clearance/apoptosis (C10orf54, LTA), adhesion (TNXB), or centrosome and microtubule structure/function (KIF9, TUBD1). Together with recent reports of somatic ASXL1 mutations among individuals with idiopathic cytopenias or clonal hematopoiesis of undetermined significance, the identification of a common regulatory 3' UTR variant of ASXL1 suggests that both germline and somatic ASXL1 mutations contribute to lower blood counts in otherwise asymptomatic individuals. These association results shed light on genetic mechanisms that regulate circulating WBC counts and suggest a prominent shared genetic architecture with inflammatory and autoimmune diseases. Copyright © 2016 American Society of Human Genetics. All rights reserved.

  17. Novel Computational Protocols for Functionally Classifying and Characterising Serine Beta-Lactamases

    PubMed Central

    Das, Sayoni; Dawson, Natalie L.; Dobrijevic, Dragana; Orengo, Christine

    2016-01-01

    Beta-lactamases represent the main bacterial mechanism of resistance to beta-lactam antibiotics and are a significant challenge to modern medicine. We have developed an automated classification and analysis protocol that exploits structure- and sequence-based approaches and which allows us to propose a grouping of serine beta-lactamases that more consistently captures and rationalizes the existing three classification schemes: Classes (A, C and D, which vary in their implementation of the mechanism of action); Types (that largely reflect evolutionary distance measured by sequence similarity); and Variant groups (which largely correspond with the Bush-Jacoby clinical groups). Our analysis platform exploits a suite of in-house and public tools to identify Functional Determinants (FDs), i.e. residue sites, responsible for conferring different phenotypes between different classes, different types and different variants. We focused on Class A beta-lactamases, the most highly populated and clinically relevant class, to identify FDs implicated in the distinct phenotypes associated with different Class A Types and Variants. We show that our FunFHMMer method can separate the known beta-lactamase classes and identify those positions likely to be responsible for the different implementations of the mechanism of action in these enzymes. Two novel algorithms, ASSP and SSPA, allow detection of FD sites likely to contribute to the broadening of the substrate profiles. Using our approaches, we recognise 151 Class A types in UniProt. Finally, we used our beta-lactamase FunFams and ASSP profiles to detect 4 novel Class A types in microbiome samples. Our platforms have been validated by literature studies, in silico analysis and some targeted experimental verification. Although developed for the serine beta-lactamases they could be used to classify and analyse any diverse protein superfamily where sub-families have diverged over both long and short evolutionary timescales. PMID:27332861

  18. Convergence between biological, behavioural and genetic determinants of obesity.

    PubMed

    Ghosh, Sujoy; Bouchard, Claude

    2017-12-01

    Multiple biological, behavioural and genetic determinants or correlates of obesity have been identified to date. Genome-wide association studies (GWAS) have contributed to the identification of more than 100 obesity-associated genetic variants, but their roles in causal processes leading to obesity remain largely unknown. Most variants are likely to have tissue-specific regulatory roles through joint contributions to biological pathways and networks, through changes in gene expression that influence quantitative traits, or through the regulation of the epigenome. The recent availability of large-scale functional genomics resources provides an opportunity to re-examine obesity GWAS data to begin elucidating the function of genetic variants. Interrogation of knockout mouse phenotype resources provides a further avenue to test for evidence of convergence between genetic variation and biological or behavioural determinants of obesity.

  19. Novel variants in GNAI3 associated with auriculocondylar syndrome strengthen a common dominant negative effect.

    PubMed

    Romanelli Tavares, Vanessa L; Gordon, Christopher T; Zechi-Ceide, Roseli M; Kokitsu-Nakata, Nancy Mizue; Voisin, Norine; Tan, Tiong Y; Heggie, Andrew A; Vendramini-Pittoli, Siulan; Propst, Evan J; Papsin, Blake C; Torres, Tatiana T; Buermans, Henk; Capelo, Luciane Portas; den Dunnen, Johan T; Guion-Almeida, Maria L; Lyonnet, Stanislas; Amiel, Jeanne; Passos-Bueno, Maria Rita

    2015-04-01

    Auriculocondylar syndrome is a rare craniofacial disorder comprising core features of micrognathia, condyle dysplasia and question mark ear. Causative variants have been identified in PLCB4, GNAI3 and EDN1, which are predicted to function within the EDN1-EDNRA pathway during early pharyngeal arch patterning. To date, two GNAI3 variants in three families have been reported. Here we report three novel GNAI3 variants, one segregating with affected members in a family previously linked to 1p21.1-q23.3 and two de novo variants in simplex cases. Two variants occur in known functional motifs, the G1 and G4 boxes, and the third variant is one amino acid outside of the G1 box. Structural modeling shows that all five altered GNAI3 residues identified to date cluster in a region involved in GDP/GTP binding. We hypothesize that all GNAI3 variants lead to dominant negative effects.

  20. Random forests on Hadoop for genome-wide association studies of multivariate neuroimaging phenotypes

    PubMed Central

    2013-01-01

    Motivation: Multivariate quantitative traits arise naturally in recent neuroimaging genetics studies, in which both structural and functional variability of the human brain is measured non-invasively through techniques such as magnetic resonance imaging (MRI). There is growing interest in detecting genetic variants associated with such multivariate traits, especially in genome-wide studies. Random forests (RFs) classifiers, which are ensembles of decision trees, are amongst the best performing machine learning algorithms and have been successfully employed for the prioritisation of genetic variants in case-control studies. RFs can also be applied to produce gene rankings in association studies with multivariate quantitative traits, and to estimate genetic similarities measures that are predictive of the trait. However, in studies involving hundreds of thousands of SNPs and high-dimensional traits, a very large ensemble of trees must be inferred from the data in order to obtain reliable rankings, which makes the application of these algorithms computationally prohibitive. Results: We have developed a parallel version of the RF algorithm for regression and genetic similarity learning tasks in large-scale population genetic association studies involving multivariate traits, called PaRFR (Parallel Random Forest Regression). Our implementation takes advantage of the MapReduce programming model and is deployed on Hadoop, an open-source software framework that supports data-intensive distributed applications. Notable speed-ups are obtained by introducing a distance-based criterion for node splitting in the tree estimation process. PaRFR has been applied to a genome-wide association study on Alzheimer's disease (AD) in which the quantitative trait consists of a high-dimensional neuroimaging phenotype describing longitudinal changes in the human brain structure. PaRFR provides a ranking of SNPs associated to this trait, and produces pair-wise measures of genetic proximity that can be directly compared to pair-wise measures of phenotypic proximity. Several known AD-related variants have been identified, including APOE4 and TOMM40. We also present experimental evidence supporting the hypothesis of a linear relationship between the number of top-ranked mutated states, or frequent mutation patterns, and an indicator of disease severity. Availability: The Java codes are freely available at http://www2.imperial.ac.uk/~gmontana. PMID:24564704
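
    The abstract above mentions a distance-based criterion for node splitting with a multivariate trait but does not spell it out. The sketch below shows one generic way such a criterion can be written, as an assumption for illustration only and not PaRFR's actual rule: a split on a single SNP is scored by how much it reduces the summed squared Euclidean distance of the trait vectors to their node centroid. All function names are hypothetical.

```python
import numpy as np


def within_node_scatter(Y):
    """Sum of squared Euclidean distances from each trait vector to the node centroid."""
    if len(Y) == 0:
        return 0.0
    return float(np.sum((Y - Y.mean(axis=0)) ** 2))


def best_split(genotype, Y):
    """Return (threshold, gain) for the best split of the form genotype <= threshold.

    gain is the reduction in within-node scatter; (None, 0.0) means no useful split.
    """
    genotype = np.asarray(genotype)
    Y = np.asarray(Y, dtype=float)
    parent = within_node_scatter(Y)
    best = (None, 0.0)
    # Every observed genotype value except the largest is a candidate threshold.
    for threshold in np.unique(genotype)[:-1]:
        left = genotype <= threshold
        gain = parent - within_node_scatter(Y[left]) - within_node_scatter(Y[~left])
        if gain > best[1]:
            best = (threshold, gain)
    return best


# Toy node: six samples, one SNP coded 0/1/2, a two-dimensional trait.
snp = [0, 0, 1, 1, 2, 2]
trait = [[0, 0], [0, 0], [0, 0], [0, 0], [5, 5], [5, 5]]
print(best_split(snp, trait))
```

    In this toy node the trait vectors differ only for samples with genotype 2, so the split at threshold 1 removes all within-node scatter. Because each candidate SNP can be scored independently, this kind of evaluation lends itself to a map step over SNPs followed by a reduce step that keeps the best split.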

  1. Random forests on Hadoop for genome-wide association studies of multivariate neuroimaging phenotypes.

    PubMed

    Wang, Yue; Goh, Wilson; Wong, Limsoon; Montana, Giovanni

    2013-01-01

    Multivariate quantitative traits arise naturally in recent neuroimaging genetics studies, in which both structural and functional variability of the human brain is measured non-invasively through techniques such as magnetic resonance imaging (MRI). There is growing interest in detecting genetic variants associated with such multivariate traits, especially in genome-wide studies. Random forests (RFs) classifiers, which are ensembles of decision trees, are amongst the best performing machine learning algorithms and have been successfully employed for the prioritisation of genetic variants in case-control studies. RFs can also be applied to produce gene rankings in association studies with multivariate quantitative traits, and to estimate genetic similarity measures that are predictive of the trait. However, in studies involving hundreds of thousands of SNPs and high-dimensional traits, a very large ensemble of trees must be inferred from the data in order to obtain reliable rankings, which makes the application of these algorithms computationally prohibitive. We have developed a parallel version of the RF algorithm for regression and genetic similarity learning tasks in large-scale population genetic association studies involving multivariate traits, called PaRFR (Parallel Random Forest Regression). Our implementation takes advantage of the MapReduce programming model and is deployed on Hadoop, an open-source software framework that supports data-intensive distributed applications. Notable speed-ups are obtained by introducing a distance-based criterion for node splitting in the tree estimation process. PaRFR has been applied to a genome-wide association study on Alzheimer's disease (AD) in which the quantitative trait consists of a high-dimensional neuroimaging phenotype describing longitudinal changes in the human brain structure. PaRFR provides a ranking of SNPs associated with this trait, and produces pair-wise measures of genetic proximity that can be directly compared to pair-wise measures of phenotypic proximity. Several known AD-related variants have been identified, including APOE4 and TOMM40. We also present experimental evidence supporting the hypothesis of a linear relationship between the number of top-ranked mutated states, or frequent mutation patterns, and an indicator of disease severity. The Java codes are freely available at http://www2.imperial.ac.uk/~gmontana.

  2. Ischemic Stroke Is Associated with the ABO Locus: The EuroCLOT Study

    PubMed Central

    Williams, Frances M K; Carter, Angela M; Hysi, Pirro G; Surdulescu, Gabriela; Hodgkiss, Dylan; Soranzo, Nicole; Traylor, Matthew; Bevan, Steve; Dichgans, Martin; Rothwell, Peter M W; Sudlow, Cathie; Farrall, Martin; Silander, Kaisa; Kaunisto, Mari; Wagner, Peter; Saarela, Olli; Kuulasmaa, Kari; Virtamo, Jarmo; Salomaa, Veikko; Amouyel, Philippe; Arveiler, Dominique; Ferrieres, Jean; Wiklund, Per-Gunnar; Arfan Ikram, M; Hofman, Albert; Boncoraglio, Giorgio B; Parati, Eugenio A; Helgadottir, Anna; Gretarsdottir, Solveig; Thorsteinsdottir, Unnur; Thorleifsson, Gudmar; Stefansson, Kari; Seshadri, Sudha; DeStefano, Anita; Gschwendtner, Andreas; Psaty, Bruce; Longstreth, Will; Mitchell, Braxton D; Cheng, Yu-Ching; Clarke, Robert; Ferrario, Marco; Bis, Joshua C; Levi, Christopher; Attia, John; Holliday, Elizabeth G; Scott, Rodney J; Fornage, Myriam; Sharma, Pankaj; Furie, Karen L; Rosand, Jonathan; Nalls, Mike; Meschia, James; Mosely, Thomas H; Evans, Alun; Palotie, Aarno; Markus, Hugh S; Grant, Peter J; Spector, Tim D

    2013-01-01

    Objective End-stage coagulation and the structure/function of fibrin are implicated in the pathogenesis of ischemic stroke. We explored whether genetic variants associated with end-stage coagulation in healthy volunteers account for the genetic predisposition to ischemic stroke and examined their influence on stroke subtype. Methods Common genetic variants identified through genome-wide association studies of coagulation factors and fibrin structure/function in healthy twins (n = 2,100, Stage 1) were examined in ischemic stroke (n = 4,200 cases) using 2 independent samples of European ancestry (Stage 2). A third clinical collection having stroke subtyping (total 8,900 cases, 55,000 controls) was used for replication (Stage 3). Results Stage 1 identified 524 single nucleotide polymorphisms (SNPs) from 23 linkage disequilibrium blocks having significant association (p < 5 × 10⁻⁸) with 1 or more coagulation/fibrin phenotypes. The most striking associations included SNP rs5985 with factor XIII activity (p = 2.6 × 10⁻¹⁸⁶), rs10665 with FVII (p = 2.4 × 10⁻⁴⁷), and rs505922 in the ABO gene with both von Willebrand factor (p = 4.7 × 10⁻⁵⁷) and factor VIII (p = 1.2 × 10⁻³⁶). In Stage 2, the 23 independent SNPs were examined in stroke cases/noncases using MOnica Risk, Genetics, Archiving and Monograph (MORGAM) and Wellcome Trust Case Control Consortium 2 collections. SNP rs505922 was nominally associated with ischemic stroke (odds ratio = 0.94, 95% confidence interval = 0.88–0.99, p = 0.023). Independent replication in Meta-Stroke confirmed the rs505922 association with stroke, beta (standard error, SE) = 0.066 (0.02), p = 0.001, a finding specific to large-vessel and cardioembolic stroke (p = 0.001 and p < 0.001, respectively) but not seen with small-vessel stroke (p = 0.811). Interpretation ABO gene variants are associated with large-vessel and cardioembolic stroke but not small-vessel disease. This work sheds light on the different pathogenic mechanisms underpinning stroke subtype. Ann Neurol 2013 PMID:23381943

  3. Somatic Mosaicism: Implications for Disease and Transmission Genetics

    PubMed Central

    Campbell, Ian M.; Shaw, Chad A.; Stankiewicz, Pawel; Lupski, James R.

    2015-01-01

    Nearly all of the genetic material among cells within an organism is identical. However, single nucleotide variants (SNVs), indels, copy number variants (CNVs), and other structural variants (SVs) continually accumulate as cells divide during development. This process results in an organism composed of countless cells, each with its own unique personal genome. Thus, every human is undoubtedly mosaic. Mosaic mutations can go unnoticed, underlie genetic disease or normal human variation, and may be transmitted to the next generation as constitutional variants. Here, we review the influence of the developmental timing of mutations, the mechanisms by which they arise, methods for detecting mosaic variants, and the risk of passing these mutations on to the next generation. PMID:25910407

  4. Genome-wide association identifies genetic variants associated with lentiform nucleus volume in N = 1345 young and elderly subjects.

    PubMed

    Hibar, Derrek P; Stein, Jason L; Ryles, April B; Kohannim, Omid; Jahanshad, Neda; Medland, Sarah E; Hansell, Narelle K; McMahon, Katie L; de Zubicaray, Greig I; Montgomery, Grant W; Martin, Nicholas G; Wright, Margaret J; Saykin, Andrew J; Jack, Clifford R; Weiner, Michael W; Toga, Arthur W; Thompson, Paul M

    2013-06-01

    Deficits in lentiform nucleus volume and morphometry are implicated in a number of genetically influenced disorders, including Parkinson's disease, schizophrenia, and ADHD. Here we performed genome-wide searches to discover common genetic variants associated with differences in lentiform nucleus volume in human populations. We assessed structural MRI scans of the brain in two large genotyped samples: the Alzheimer's Disease Neuroimaging Initiative (ADNI; N = 706) and the Queensland Twin Imaging Study (QTIM; N = 639). Statistics of association from each cohort were combined meta-analytically using a fixed-effects model to boost power and to reduce the prevalence of false positive findings. We identified a number of associations in and around the flavin-containing monooxygenase (FMO) gene cluster. The most highly associated SNP, rs1795240, was located in the FMO3 gene; after meta-analysis, it showed genome-wide significant evidence of association with lentiform nucleus volume (P_MA = 4.79 × 10⁻⁸). This commonly-carried genetic variant accounted for 2.68% and 0.84% of the trait variability in the ADNI and QTIM samples, respectively, even though the QTIM sample was on average 50 years younger. Pathway enrichment analysis revealed significant contributions of this gene to the cytochrome P450 pathway, which is involved in metabolizing numerous therapeutic drugs for pain, seizures, mania, depression, anxiety, and psychosis. The genetic variants we identified provide replicated, genome-wide significant evidence for the FMO gene cluster's involvement in lentiform nucleus volume differences in human populations.
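    The abstract above says only that per-cohort association statistics were combined with a fixed-effects model. A minimal sketch of one common way to do this, inverse-variance weighting, is given below. The weighting scheme, the function name `fixed_effects_meta`, and the input numbers are all assumptions for illustration; they are not taken from the study.

```python
import math


def fixed_effects_meta(betas, ses):
    """Combine per-cohort effect estimates with inverse-variance weights.

    betas: per-cohort effect sizes (hypothetical inputs)
    ses:   matching standard errors
    Returns (pooled beta, pooled SE, z-score, two-sided p-value).
    """
    if not betas or len(betas) != len(ses):
        raise ValueError("betas and ses must be non-empty and equal in length")
    # Each cohort is weighted by the inverse of its sampling variance.
    weights = [1.0 / (se * se) for se in ses]
    total = sum(weights)
    beta = sum(w * b for w, b in zip(weights, betas)) / total
    se = math.sqrt(1.0 / total)
    z = beta / se
    # Two-sided p-value under a standard normal null.
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return beta, se, z, p


if __name__ == "__main__":
    # Two made-up cohorts with equal precision.
    print(fixed_effects_meta([0.10, 0.20], [0.1, 0.1]))
```

    With equal standard errors the pooled estimate is simply the mean of the cohort effects, and the pooled standard error shrinks by a factor of the square root of the number of cohorts, which is the power gain the abstract refers to.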

  5. Nonketotic hyperglycinemia: Functional assessment of missense variants in GLDC to understand phenotypes of the disease.

    PubMed

    Bravo-Alonso, Irene; Navarrete, Rosa; Arribas-Carreira, Laura; Perona, Almudena; Abia, David; Couce, María Luz; García-Cazorla, Angels; Morais, Ana; Domingo, Rosario; Ramos, María Antonia; Swanson, Michael A; Van Hove, Johan L K; Ugarte, Magdalena; Pérez, Belén; Pérez-Cerdá, Celia; Rodríguez-Pombo, Pilar

    2017-06-01

    The rapid analysis of genomic data is providing effective mutational confirmation in patients with clinical and biochemical hallmarks of a specific disease. This is the case for nonketotic hyperglycinemia (NKH), a Mendelian disorder causing seizures in neonates and early-infants, primarily due to mutations in the GLDC gene. However, understanding the impact of missense variants identified in this gene is a major challenge for the application of genomics into clinical practice. Herein, a comprehensive functional and structural analysis of 19 GLDC missense variants identified in a cohort of 26 NKH patients was performed. Mutant cDNA constructs were expressed in COS7 cells followed by enzymatic assays and Western blot analysis of the GCS P-protein to assess the residual activity and mutant protein stability. Structural analysis, based on molecular modeling of the 3D structure of GCS P-protein, was also performed. We identify hypomorphic variants that produce attenuated phenotypes with improved prognosis of the disease. Structural analysis allows us to interpret the effects of mutations on protein stability and catalytic activity, providing molecular evidence for clinical outcome and disease severity. Moreover, we identify an important number of mutants whose loss-of-functionality is associated with instability and, thus, are potential targets for rescue using folding therapeutic approaches. © 2017 Wiley Periodicals, Inc.

  6. Structures of native and affinity-enhanced WT1 epitopes bound to HLA-A*0201: implications for WT1-based cancer therapeutics.

    PubMed

    Borbulevych, Oleg Y; Do, Priscilla; Baker, Brian M

    2010-09-01

    Presentation of peptides by class I or class II major histocompatibility complex (MHC) molecules is required for the initiation and propagation of a T cell-mediated immune response. Peptides from the Wilms Tumor 1 transcription factor (WT1), upregulated in many hematopoietic and solid tumors, can be recognized by T cells and numerous efforts are underway to engineer WT1-based cancer vaccines. Here we determined the structures of the class I MHC molecule HLA-A*0201 bound to the native 126-134 epitope of the WT1 peptide and a recently described variant (R1Y) with improved MHC binding. The R1Y variant, a potential vaccine candidate, alters the positions of MHC charged side chains near the peptide N-terminus and significantly reduces the peptide/MHC electrostatic surface potential. These alterations indicate that the R1Y variant is an imperfect mimic of the native WT1 peptide, and suggest caution in its use as a therapeutic vaccine. Stability measurements revealed how the R1Y substitution enhances MHC binding affinity, and together with the structures suggest a strategy for engineering WT1 variants with improved MHC binding that retain the structural features of the native peptide/MHC complex. Copyright 2010 Elsevier Ltd. All rights reserved.

  7. Feature discrimination/identification based upon SAR return variations

    NASA Technical Reports Server (NTRS)

    Rasco, W. A., Sr.; Pietsch, R.

    1978-01-01

    The look-to-look variation statistics in the returns recorded in flight by a digital, real-time SAR system are analyzed. That the variations in the look-to-look returns from different classes carry information content unique to each class was illustrated by a model based on four variants derived from the four-look in-flight SAR data under study. The model was limited to four classes of returns: mowed grass on an athletic field, rough unmowed grass and weeds on a large vacant field, young fruit trees in a large orchard, and metal mobile homes and storage buildings in a large mobile home park. The data population, in excess of 1000 returns, represented over 250 individual pixels from the four classes. The multivariant discriminant model operated on the set of returns for each pixel and assigned that pixel to one of the four classes, based on the target variants and the probability distribution function of the four variants for each class.

  8. Efficient population-scale variant analysis and prioritization with VAPr.

    PubMed

    Birmingham, Amanda; Mark, Adam M; Mazzaferro, Carlo; Xu, Guorong; Fisch, Kathleen M

    2018-04-06

    With the growing availability of population-scale whole-exome and whole-genome sequencing, demand for reproducible, scalable variant analysis has spread within genomic research communities. To address this need, we introduce the Python package VAPr (Variant Analysis and Prioritization). VAPr leverages existing annotation tools ANNOVAR and MyVariant.info with MongoDB-based flexible storage and filtering functionality. It offers biologists and bioinformatics generalists easy-to-use and scalable analysis and prioritization of genomic variants from large cohort studies. VAPr is developed in Python and is available for free use and extension under the MIT License. An install package is available on PyPI at https://pypi.python.org/pypi/VAPr, while source code and extensive documentation are on GitHub at https://github.com/ucsd-ccbb/VAPr. Contact: kfisch@ucsd.edu.

  9. The UCL low-density lipoprotein receptor gene variant database: pathogenicity update

    PubMed Central

    Futema, Marta; Whittall, Ros; Taylor-Beadling, Alison; Williams, Maggie; den Dunnen, Johan T; Humphries, Steve E

    2017-01-01

    Background Familial hypercholesterolaemia (OMIM 143890) is most frequently caused by variations in the low-density lipoprotein receptor (LDLR) gene. Predicting whether novel variants are pathogenic may not be straightforward, especially for missense and synonymous variants. In 2013, the Association of Clinical Genetic Scientists published guidelines for the classification of variants, with categories 1 and 2 representing clearly not or unlikely pathogenic, respectively, 3 representing variants of unknown significance (VUS), and 4 and 5 representing likely to be or clearly pathogenic, respectively. Here, we update the University College London (UCL) LDLR variant database according to these guidelines. Methods PubMed searches and alerts were used to identify novel LDLR variants for inclusion in the database. Standard in silico tools were used to predict potential pathogenicity. Variants were designated as class 4/5 only when the predictions from the different programs were concordant and as class 3 when predictions were discordant. Results The updated database (http://www.lovd.nl/LDLR) now includes 2925 curated variants, representing 1707 independent events. All 129 nonsense variants, 337 small frame-shifting and 117/118 large rearrangements were classified as 4 or 5. Of the 795 missense variants, 115 were in classes 1 and 2, 605 in class 4 and 75 in class 3. 111/181 intronic variants, 4/34 synonymous variants and 14/37 promoter variants were assigned to classes 4 or 5. Overall, 112 (7%) of reported variants were class 3. Conclusions This study updates the LDLR variant database and identifies a number of reported VUS where additional family and in vitro studies will be required to confirm or refute their pathogenicity. PMID:27821657

  10. Functional alterations due to amino acid changes and evolutionary comparative analysis of ARPKD and ADPKD genes.

    PubMed

    Edrees, Burhan M; Athar, Mohammad; Abduljaleel, Zainularifeen; Al-Allaf, Faisal A; Taher, Mohiuddin M; Khan, Wajahatullah; Bouazzaoui, Abdellatif; Al-Harbi, Naffaa; Safar, Ramzia; Al-Edressi, Howaida; Alansary, Khawala; Anazi, Abulkareem; Altayeb, Naji; Ahmed, Muawia A

    2016-12-01

    A targeted customized sequencing of genes implicated in the autosomal recessive polycystic kidney disease (ARPKD) phenotype was performed to identify candidate variants using Ion Torrent PGM next-generation sequencing. The results identified four potential pathogenic variants in the PKHD1 gene [c.4870C>T, p.(Arg1624Trp); c.5725C>T, p.(Arg1909Trp); c.1736C>T, p.(Thr579Met); and c.10628T>G, p.(Leu3543Trp)] among 12 out of 18 samples. However, one variant, c.4870C>T, p.(Arg1624Trp), was common among eight patients. Some patient samples also showed a few variants in the autosomal dominant polycystic kidney disease (ADPKD) causing genes PKD1 and PKD2, such as c.12433G>A, p.(Val4145Ile) and c.1445T>G, p.(Phe482Cys), respectively. All causative variants were validated by capillary sequencing, which confirmed the presence of a novel homozygous variant, c.10628T>G, p.(Leu3543Trp), in a male proband. We have recently published the results of these studies (Edrees et al., 2016). Here we report for the first time the effect of the common mutation p.(Arg1624Trp), found in eight samples, on the structure and function of the PKHD1 protein, using molecular dynamics simulations. The computational approaches provide tools to predict the phenotypic effect of a variant on the structure and function of the altered protein. The native and mutant modeled proteins carrying the common mutation p.(Arg1624Trp) were also analyzed for solvent accessibility, secondary structure and stabilizing residues to compare the stability of the wild-type and mutant forms. Furthermore, comparative genomics and evolutionary analyses of variants observed in the PKHD1, PKD1, and PKD2 genes were also performed in some mammalian species including human to understand the complexity of genomes among closely related mammalian species. Taken together, the results revealed that the evolutionary comparative analyses and characterization of the PKHD1, PKD1, and PKD2 genes among various related and unrelated mammalian species will provide important insights into their evolutionary process and understanding for further disease characterization and management.

  11. Adaptation of tick-borne encephalitis virus from human brain to different cell cultures induces multiple genomic substitutions.

    PubMed

    Ponomareva, Eugenia P; Ternovoi, Vladimir A; Mikryukova, Tamara P; Protopopova, Elena V; Gladysheva, Anastasia V; Shvalov, Alexander N; Konovalova, Svetlana N; Chausov, Eugene V; Loktev, Valery B

    2017-10-01

    The C11-13 strain from the Siberian subtype of tick-borne encephalitis virus (TBEV) was isolated from human brain using pig embryo kidney (PEK), 293, and Neuro-2a cells. Analysis of the complete viral genome of the C11-13 variants during six passages in these cells revealed that the cell-adapted C11-13 variants had multiple amino acid substitutions as compared to TBEV from human brain. Seven out of eight amino acid substitutions in the high-replicating C11-13(PEK) variant mapped to non-structural proteins; 13 out of 14 substitutions in the well-replicating C11-13(293) variant, and all four substitutions in the low-replicating C11-13(Neuro-2a) variant were also localized in non-structural proteins, predominantly in the NS2a (2), NS3 (6) and NS5 (3) proteins. The substitutions NS2a 1067 (Asn → Asp), NS2a 1168 (Leu → Val) in the N-terminus of NS2a and NS3 1745 (His → Gln) in the helicase domain of NS3 were found in all selected variants. We postulate that multiple substitutions in the NS2a, NS3 and NS5 genes play a key role in adaptation of TBEV to different cells.

  12. Genetic analyses of bone morphogenetic protein 2, 4 and 7 in congenital combined pituitary hormone deficiency.

    PubMed

    Breitfeld, Jana; Martens, Susanne; Klammt, Jürgen; Schlicke, Marina; Pfäffle, Roland; Krause, Kerstin; Weidle, Kerstin; Schleinitz, Dorit; Stumvoll, Michael; Führer, Dagmar; Kovacs, Peter; Tönjes, Anke

    2013-12-01

    The complex process of development of the pituitary gland is regulated by a number of signalling molecules and transcription factors. Mutations in these factors have been identified in rare cases of congenital hypopituitarism but for most subjects with combined pituitary hormone deficiency (CPHD) genetic causes are unknown. Bone morphogenetic proteins (BMPs) affect induction and growth of the pituitary primordium and thus represent plausible candidates for mutational screening of patients with CPHD. We sequenced BMP2, 4 and 7 in 19 subjects with CPHD. For validation purposes, novel genetic variants were genotyped in 1046 healthy subjects. Additionally, potential functional relevance for most promising variants has been assessed by phylogenetic analyses and prediction of effects on protein structure. Sequencing revealed two novel variants and confirmed 30 previously known polymorphisms and mutations in BMP2, 4 and 7. Although phylogenetic analyses indicated that these variants map within strongly conserved gene regions, there was no direct support for their impact on protein structure when applying predictive bioinformatics tools. A mutation in the BMP4 coding region resulting in an amino acid exchange (p.Arg300Pro) appeared most interesting among the identified variants. Further functional analyses are required to ultimately map the relevance of these novel variants in CPHD.

  13. Genetic analyses of bone morphogenetic protein 2, 4 and 7 in congenital combined pituitary hormone deficiency

    PubMed Central

    2013-01-01

    Background The complex process of development of the pituitary gland is regulated by a number of signalling molecules and transcription factors. Mutations in these factors have been identified in rare cases of congenital hypopituitarism but for most subjects with combined pituitary hormone deficiency (CPHD) genetic causes are unknown. Bone morphogenetic proteins (BMPs) affect induction and growth of the pituitary primordium and thus represent plausible candidates for mutational screening of patients with CPHD. Methods We sequenced BMP2, 4 and 7 in 19 subjects with CPHD. For validation purposes, novel genetic variants were genotyped in 1046 healthy subjects. Additionally, potential functional relevance for most promising variants has been assessed by phylogenetic analyses and prediction of effects on protein structure. Results Sequencing revealed two novel variants and confirmed 30 previously known polymorphisms and mutations in BMP2, 4 and 7. Although phylogenetic analyses indicated that these variants map within strongly conserved gene regions, there was no direct support for their impact on protein structure when applying predictive bioinformatics tools. Conclusions A mutation in the BMP4 coding region resulting in an amino acid exchange (p.Arg300Pro) appeared most interesting among the identified variants. Further functional analyses are required to ultimately map the relevance of these novel variants in CPHD. PMID:24289245

  14. Exome sequencing supports a de novo mutational paradigm for schizophrenia

    PubMed Central

    Xu, Bin; Roos, J. Louw; Dexheimer, Phillip; Boone, Braden; Plummer, Brooks; Levy, Shawn; Gogos, Joseph A.; Karayiorgou, Maria

    2011-01-01

    Despite high heritability, a large fraction of cases with schizophrenia do not have a family history of the disease (sporadic cases). Here, we examine the possibility that rare de novo protein-altering mutations contribute to the genetic component of schizophrenia by sequencing the exome of 53 sporadic cases, 22 unaffected controls and their parents. We identified 40 de novo mutations in 27 patients affecting 40 genes including a potentially disruptive mutation in DGCR2, a gene removed by the recurrent schizophrenia-predisposing 22q11.2 microdeletion. Comparison to rare inherited variants revealed that the identified de novo mutations show a large excess of nonsynonymous changes in cases, as well as a greater potential to affect protein structure and function. Our analysis reveals a major role of de novo mutations in schizophrenia and also a large mutational target, which together provide a plausible explanation for the high global incidence and persistence of the disease. PMID:21822266

  15. Functional and Structural Consequence of Rare Exonic Single Nucleotide Polymorphisms: One Story, Two Tales

    PubMed Central

    Gu, Wanjun; Gurguis, Christopher I.; Zhou, Jin J.; Zhu, Yihua; Ko, Eun-A.; Ko, Jae-Hong; Wang, Ting; Zhou, Tong

    2015-01-01

    Genetic variation arising from single nucleotide polymorphisms (SNPs) is ubiquitously found among human populations. While disease-causing variants are known in some cases, identifying functional or causative variants for most human diseases remains a challenging task. Rare SNPs, rather than common ones, are thought to be more important in the pathology of most human diseases. We propose that rare SNPs should be divided into two categories dependent on whether the minor alleles are derived or ancestral. Derived alleles are less likely to have been purified by evolutionary processes and may be more likely to induce deleterious effects. We therefore hypothesized that the rare SNPs with derived minor alleles would be more important for human diseases and predicted that these variants would have larger functional or structural consequences relative to the rare variants for which the minor alleles are ancestral. We systematically investigated the consequences of the exonic SNPs on protein function, mRNA structure, and translation. We found that the functional and structural consequences are more significant for the rare exonic variants for which the minor alleles are derived. However, this pattern is reversed when the minor alleles are ancestral. Thus, the rare exonic SNPs with derived minor alleles are more likely to be deleterious. Age estimation of rare SNPs confirms that these potentially deleterious SNPs are recently evolved in the human population. These results have important implications for understanding the function of genetic variations in human exonic regions and for prioritizing functional SNPs in genome-wide association studies of human diseases. PMID:26454016

  16. Prioritisation of associations between protein domains and complex diseases using domain-domain interaction networks.

    PubMed

    Wang, W; Zhang, W; Jiang, R; Luan, Y

    2010-05-01

    It is of vital importance to find genetic variants that underlie human complex diseases and locate genes that are responsible for these diseases. Since proteins are typically composed of several structural domains, it is reasonable to assume that harmful genetic variants may alter structures of protein domains, affect functions of proteins and eventually cause disorders. With this understanding, the authors explore the possibility of recovering associations between protein domains and complex diseases. The authors define associations between protein domains and disease families on the basis of associations between non-synonymous single nucleotide polymorphisms (nsSNPs) and complex diseases, similarities between diseases, and relations between proteins and domains. Based on a domain-domain interaction network, the authors propose a 'guilt-by-proximity' principle to rank candidate domains according to their average distance to a set of seed domains in the domain-domain interaction network. The authors validate the method through large-scale cross-validation experiments on simulated linkage intervals, random controls and the whole genome. Results show that areas under receiver operating characteristic curves (AUC scores) can be as high as 77.90%, and the mean rank ratios can be as low as 21.82%. The authors further offer a freely accessible web interface for a genome-wide landscape of associations between domains and disease families.
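    The 'guilt-by-proximity' ranking described above can be sketched as follows. The abstract states only that candidates are ranked by their average distance to a set of seed domains in the interaction network; taking that distance to be the unweighted shortest-path length is an assumption here, and the function names and the toy network are hypothetical.

```python
from collections import deque


def bfs_distances(graph, source):
    """Shortest-path hop counts from source in an unweighted graph."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        for neighbour in graph.get(node, ()):
            if neighbour not in dist:
                dist[neighbour] = dist[node] + 1
                queue.append(neighbour)
    return dist


def rank_by_proximity(graph, candidates, seeds):
    """Rank candidates by mean shortest-path distance to the seeds.

    graph: adjacency mapping with every edge listed in both directions.
    Unreachable candidates receive an infinite score and sort last.
    """
    if not seeds:
        raise ValueError("at least one seed domain is required")
    seed_dists = [bfs_distances(graph, s) for s in seeds]
    scored = []
    for cand in candidates:
        ds = [d.get(cand, float("inf")) for d in seed_dists]
        scored.append((cand, sum(ds) / len(ds)))
    # Smaller average distance means stronger predicted association.
    return sorted(scored, key=lambda item: (item[1], item[0]))


if __name__ == "__main__":
    # Toy chain network A-B-C-D-E (made-up domain labels).
    toy = {"A": ["B"], "B": ["A", "C"], "C": ["B", "D"],
           "D": ["C", "E"], "E": ["D"]}
    print(rank_by_proximity(toy, ["E", "C", "D"], ["A", "B"]))
```

    One breadth-first search per seed is enough, because each search yields the distance from that seed to every candidate at once.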

  17. Use of deep whole-genome sequencing data to identify structure risk variants in breast cancer susceptibility genes.

    PubMed

    Guo, Xingyi; Shi, Jiajun; Cai, Qiuyin; Shu, Xiao-Ou; He, Jing; Wen, Wanqing; Allen, Jamie; Pharoah, Paul; Dunning, Alison; Hunter, David J; Kraft, Peter; Easton, Douglas F; Zheng, Wei; Long, Jirong

    2018-03-01

    Functional disruptions of susceptibility genes by large genomic structure variant (SV) deletions in germlines are known to be associated with cancer risk. However, few studies have been conducted to systematically search for SV deletions in breast cancer susceptibility genes. We analysed deep (> 30x) whole-genome sequencing (WGS) data generated in blood samples from 128 breast cancer patients of Asian and European descent with either a strong family history of breast cancer or early cancer onset disease. To identify SV deletions in known or suspected breast cancer susceptibility genes, we used multiple SV calling tools including Genome STRiP, Delly, Manta, BreakDancer and Pindel. SV deletions were detected by at least three of these bioinformatics tools in five genes. Specifically, we identified heterozygous deletions covering a fraction of the coding regions of the BRCA1 (approximately 80 kb, in two patients) and TP53 (∼1.6 kb, in two patients) genes, and of intronic regions (∼1 kb) of the PALB2 (one patient), PTEN (three patients) and RAD51C (one patient) genes. We confirmed the presence of these deletions using real-time quantitative PCR (qPCR). Our study identified novel SV deletions in breast cancer susceptibility genes and the identification of such SV deletions may improve clinical testing.
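    The "detected by at least three tools" criterion above amounts to a simple consensus filter, sketched below. The abstract does not say how calls from different tools were matched to each other (for example by breakpoint tolerance or reciprocal overlap); matching on a (sample, gene) key is an assumption made here for brevity, and the function name, sample IDs and call sets are hypothetical.

```python
from collections import Counter


def consensus_calls(calls_by_tool, min_tools=3):
    """Keep calls reported by at least min_tools distinct callers.

    calls_by_tool: mapping of tool name -> iterable of (sample, gene) keys.
    Returns a dict mapping each retained call to its supporting tool count.
    """
    support = Counter()
    for calls in calls_by_tool.values():
        # A set ensures each tool contributes at most one vote per call.
        support.update(set(calls))
    return {call: n for call, n in support.items() if n >= min_tools}


if __name__ == "__main__":
    # Made-up call sets; only the tool names come from the abstract.
    calls = {
        "Genome STRiP": [("S1", "BRCA1")],
        "Delly": [("S1", "BRCA1"), ("S2", "TP53")],
        "Manta": [("S1", "BRCA1"), ("S2", "TP53")],
        "BreakDancer": [],
        "Pindel": [("S3", "PTEN")],
    }
    print(consensus_calls(calls))
```

    Requiring agreement between several callers trades sensitivity for specificity, which fits a workflow where every retained deletion is then confirmed experimentally.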

  18. Three-dimensional structure of a variant `Termamyl-like' Geobacillus stearothermophilus α-amylase at 1.9 Å resolution.

    PubMed

    Offen, Wendy A; Viksoe-Nielsen, Anders; Borchert, Torben V; Wilson, Keith S; Davies, Gideon J

    2015-01-01

    The enzyme-catalysed degradation of starch is central to many industrial processes, including sugar manufacture and first-generation biofuels. Classical biotechnological platforms involve steam explosion of starch followed by the action of endo-acting glycoside hydrolases termed α-amylases and then exo-acting α-glucosidases (glucoamylases) to yield glucose, which is subsequently processed. A key enzymatic player in this pipeline is the `Termamyl' class of bacterial α-amylases and designed/evolved variants thereof. Here, the three-dimensional structure of one such Termamyl α-amylase variant based upon the parent Geobacillus stearothermophilus α-amylase is presented. The structure has been solved at 1.9 Å resolution, revealing the classical three-domain fold stabilized by Ca2+ and a Ca2+-Na+-Ca2+ triad. As expected, the structure is similar to the G. stearothermophilus α-amylase but with main-chain deviations of up to 3 Å in some regions, reflecting both the mutations and differing crystal-packing environments.

  19. Separation by hydrophobic interaction chromatography and structural determination by mass spectrometry of mannosylated glycoforms of a recombinant transferrin-exendin-4 fusion protein from yeast.

    PubMed

    Zolodz, Melissa D; Herberg, John T; Narepekha, Halyna E; Raleigh, Emily; Farber, Matthew R; Dufield, Robert L; Boyle, Denis M

    2010-01-08

    Obtaining sufficient amounts of pure glycoprotein variants to characterize their structures is an important goal in both functional biology and the biotechnology industry. We have developed preparative HIC conditions that resolve glycoform variants on the basis of overall carbohydrate content for a recombinant transferrin-exendin-4 fusion protein. The fusion protein was expressed in the yeast Saccharomyces cerevisiae by high-density fermentation and is post-translationally modified with mannose sugars through O-glycosidic linkages. Overall hydrophobic behavior appeared to be dominated by the N-terminal 39 amino acids from the exendin-4 and linker peptide sequences as compared to the less hydrophobic behavior of human transferrin alone. In addition, using LC techniques that measure total glycans released from the pure protein combined with new high-resolution technologies using mass spectrometry, we have determined the locations and chain lengths of mannose residues on specific peptides derived from tryptic maps of the transferrin-exendin-4 protein. Though the protein is large (80,488 Da) and contains 78 possible serine and threonine residues as potential sites for sugar addition, mannosylation was observed on only two tryptic peptides located within the first 55 amino acids of the N-terminus. These glycopeptides were highly heterogeneous and contained between 1 and 10 mannose residues scattered among the various serine and threonine sites, which were identified by electron transfer dissociation mass spectrometry. Glycan sequences from 1 to 6 linear mannose residues were detected, but mannose chain lengths of 3 or 4 were more common and formed 80% of the total oligosaccharides. This work introduces new technological capabilities for the purification and characterization of glycosylated variants of therapeutic recombinant proteins. Copyright 2009 Elsevier B.V. All rights reserved.

  20. Clinical detection of deletion structural variants in whole-genome sequences

    PubMed Central

    Noll, Aaron C; Miller, Neil A; Smith, Laurie D; Yoo, Byunggil; Fiedler, Stephanie; Cooley, Linda D; Willig, Laurel K; Petrikin, Josh E; Cakici, Julie; Lesko, John; Newton, Angela; Detherage, Kali; Thiffault, Isabelle; Saunders, Carol J; Farrow, Emily G; Kingsmore, Stephen F

    2016-01-01

    Optimal management of acutely ill infants with monogenetic diseases requires rapid identification of causative haplotypes. Whole-genome sequencing (WGS) has been shown to identify pathogenic nucleotide variants in such infants. Deletion structural variants (DSVs, >50 nt) are implicated in many genetic diseases, and tools have been designed to identify DSVs using short-read WGS. Optimisation and integration of these tools into a WGS pipeline could improve diagnostic sensitivity and specificity of WGS. In addition, it may improve turnaround time when compared with current CNV assays, enhancing utility in acute settings. Here we describe DSV detection methods for use in WGS for rapid diagnosis in acutely ill infants: SKALD (Screening Konsensus and Annotation of Large Deletions) combines calls from two tools (Breakdancer and GenomeStrip) with calibrated filters and clinical interpretation rules. In four WGS runs, the average analytic precision (positive predictive value) of SKALD was 78%, and recall (sensitivity) was 27%, when compared with validated reference DSV calls. When retrospectively applied to a cohort of 36 families with acutely ill infants SKALD identified causative DSVs in two. The first was heterozygous deletion of exons 1–3 of MMP21 in trans with a heterozygous frame-shift deletion in two siblings with transposition of the great arteries and heterotaxy. In a newborn female with dysmorphic features, ventricular septal defect and persistent pulmonary hypertension, SKALD identified the breakpoints of a heterozygous, de novo 1p36.32p36.13 deletion. In summary, consensus DSV calling, implemented in an 8-h computational pipeline with parameterised filtering, has the potential to increase the diagnostic yield of WGS in acutely ill neonates and discover novel disease genes. PMID:29263817
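
    The precision (positive predictive value) and recall (sensitivity) figures quoted above follow the standard definitions, sketched below. The function name and the counts in the usage note are illustrative assumptions and are unrelated to the SKALD evaluation data.

```python
from typing import Tuple


def precision_recall(true_positives: int,
                     false_positives: int,
                     false_negatives: int) -> Tuple[float, float]:
    """Return (precision, recall) for a set of calls scored against a reference.

    precision = TP / (TP + FP): fraction of reported calls that are real.
    recall    = TP / (TP + FN): fraction of reference events that were reported.
    A denominator of zero yields 0.0 rather than raising an error.
    """
    reported = true_positives + false_positives
    expected = true_positives + false_negatives
    precision = true_positives / reported if reported else 0.0
    recall = true_positives / expected if expected else 0.0
    return precision, recall
```

    For example, a caller that reports 10 deletions, 8 of which match a reference set of 20, would have precision 8/10 and recall 8/20. High precision with low recall, as reported for the consensus approach, means that few calls are wrong but many true events are missed.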

  1. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wu, Hong; Zeng, Hong; Lam, Robert

    Mismatch repair prevents the accumulation of erroneous insertions/deletions and non-Watson–Crick base pairs in the genome. Pathogenic mutations in the MLH1 gene are associated with a predisposition to Lynch and Turcot's syndromes. Although genetic testing for these mutations is available, robust classification of variants requires strong clinical and functional support. Here, the first structure of the N-terminus of human MLH1, determined by X-ray crystallography, is described. Lastly, the structure shares a high degree of similarity with previously determined prokaryotic MLH1 homologs; however, this structure affords a more accurate platform for the classification of MLH1 variants.

  2. Chromatin in embryonic stem cell neuronal differentiation.

    PubMed

    Meshorer, E

    2007-03-01

    Chromatin, the basic regulatory unit of the eukaryotic genetic material, is controlled by epigenetic mechanisms including histone modifications, histone variants, DNA methylation and chromatin remodeling. Cellular differentiation involves large changes in gene expression concomitant with alterations in genome organization and chromatin structure. Such changes are particularly evident in self-renewing pluripotent embryonic stem cells, which begin, in terms of cell fate, as a tabula rasa, and through the process of differentiation, acquire distinct identities. Here I describe the changes in chromatin that accompany neuronal differentiation, particularly of embryonic stem cells, and discuss how chromatin serves as the master regulator of cellular destiny.

  3. Polarization-induced Zener tunnel junctions in wide-band-gap heterostructures.

    PubMed

    Simon, John; Zhang, Ze; Goodman, Kevin; Xing, Huili; Kosel, Thomas; Fay, Patrick; Jena, Debdeep

    2009-07-10

    The large electronic polarization in III-V nitrides allows for novel physics not possible in other semiconductor families. In this work, interband Zener tunneling in wide-band-gap GaN heterojunctions is demonstrated by using polarization-induced electric fields. The resulting tunnel diodes are more conductive under reverse bias, which has applications for zero-bias rectification and mm-wave imaging. Since interband tunneling is traditionally prohibitive in wide-band-gap semiconductors, these polarization-induced structures and their variants can enable a number of devices such as multijunction solar cells that can operate under elevated temperatures and high fields.

  4. Variant Review with the Integrative Genomics Viewer.

    PubMed

    Robinson, James T; Thorvaldsdóttir, Helga; Wenger, Aaron M; Zehir, Ahmet; Mesirov, Jill P

    2017-11-01

    Manual review of aligned reads for confirmation and interpretation of variant calls is an important step in many variant calling pipelines for next-generation sequencing (NGS) data. Visual inspection can greatly increase the confidence in calls, reduce the risk of false positives, and help characterize complex events. The Integrative Genomics Viewer (IGV) was one of the first tools to provide NGS data visualization, and it currently provides a rich set of tools for inspection, validation, and interpretation of NGS datasets, as well as other types of genomic data. Here, we present a short overview of IGV's variant review features for both single-nucleotide variants and structural variants, with examples from both cancer and germline datasets. IGV is freely available at https://www.igv.org. Cancer Res; 77(21); e31-34. ©2017 American Association for Cancer Research.

  5. Solubilization of a membrane protein by combinatorial supercharging.

    PubMed

    Hajduczki, Agnes; Majumdar, Sudipta; Fricke, Marie; Brown, Isola A M; Weiss, Gregory A

    2011-04-15

    Hydrophobic and aggregation-prone, membrane proteins often prove too insoluble for conventional in vitro biochemical studies. To engineer soluble variants of human caveolin-1, a phage-displayed library of caveolin variants targeted the hydrophobic intramembrane domain with substitutions to charged residues. Anti-selections for insolubility removed hydrophobic variants, and positive selections for binding to the known caveolin ligand HIV gp41 isolated functional, folded variants. Assays with several caveolin binding partners demonstrated the successful folding and functionality by a solubilized, full-length caveolin variant selected from the library. This caveolin variant allowed assay of the direct interaction between caveolin and cavin. Clustered along one face of a putative helix, the solubilizing mutations suggest a structural model for the intramembrane domain of caveolin. The approach provides a potentially general method for solubilization and engineering of membrane-associated proteins by phage display.

  6. Mass Spectrometric Determination of ILPR G-quadruplex Binding Sites in Insulin and IGF-2

    PubMed Central

    Xiao, JunFeng

    2009-01-01

    The insulin-linked polymorphic region (ILPR) of the human insulin gene promoter region forms G-quadruplex structures in vitro. Previous studies show that insulin and insulin-like growth factor-2 (IGF-2) exhibit high affinity binding in vitro to 2-repeat sequences of ILPR variants a and h, but negligible binding to variant i. Two-repeat sequences of variants a and h form intramolecular G-quadruplex structures that are not evidenced for variant i. Here we report on the use of protein digestion combined with affinity capture and MALDI-MS detection to pinpoint ILPR binding sites in insulin and IGF-2. Peptides captured by ILPR variants a and h were sequenced by MALDI-MS/MS, LC-MS and in silico digestion. On-bead digestion of insulin-ILPR variant a complexes supported the conclusions. The results indicate that the sequence VCG(N)RGF is generally present in the captured peptides and is likely involved in the affinity binding interactions of the proteins with the ILPR G-quadruplexes. The significance of arginine in the interactions was studied by comparing the affinities of synthesized peptides VCGERGF and VCGEAGF with ILPR variant a. Peptides from other regions of the proteins that are connected through disulfide linkages were also detected in some capture experiments. Identification of binding sites could facilitate design of DNA binding ligands for capture and detection of insulin and IGF-2. The interactions may have biological significance as well. PMID:19747845

  7. Cryo-EM of the pathogenic VCP variant R155P reveals long-range conformational changes in the D2 ATPase ring

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Mountassif, Driss; Fabre, Lucien; Zaid, Younes

    Single amino acid mutations in valosin containing protein (VCP/p97), a highly conserved member of the ATPases associated with diverse cellular activities (AAA) family of ATPases, have been linked to a severe degenerative disease affecting brain, muscle and bone tissue. Previous studies have demonstrated the role of VCP mutations in altering the ATPase activity of the D2 ring; however, the structural consequences of these mutations remain unclear. In this study, we report the three-dimensional (3D) map of the pathogenic VCP variant, R155P, as revealed by single-particle Cryo-Electron Microscopy (EM) analysis at 14 Å resolution. We show that the N-terminal R155P mutation induces a large structural reorganisation of the D2 ATPase ring. Results from docking studies using crystal structure data of available wild-type VCP in the EM density maps indicate that the major difference is localized at the interface between two protomers within the D2 ring. Consistent with a conformational change, the VCP R155P variant shifted the isoelectric point of the protein and reduced its interaction with its well-characterized cofactor, nuclear protein localization-4 (Npl4). Together, our results demonstrate that a single amino acid substitution in the N-terminal domain can relay long-range conformational changes to the distal D2 ATPase ring. Our results provide the first structural clues of how VCP mutations may influence the activity and function of the D2 ATPase ring. Highlights: • p97(R155P) and p97(A232E) decrease the ability of p97 to bind to its co-factor Npl4. • p97(R155P) has a different isoelectric point than that of p97(R95G), p97(A232E) and p97(WT). • Mutation R155P changes principally the conformation of the D2 ring. • Mutation R155P modifies the interface between two protomers within the D2 ring.

  8. Cap-independent translation of human SP-A 5′-UTR variants: a double-loop structure and cis-element contribution

    PubMed Central

    Wang, Guirong; Guo, Xiaoxuan; Silveyra, Patricia; Kimball, Scot R.; Floros, Joanna

    2009-01-01

    Human surfactant protein A (hSP-A), a molecule of innate immunity and surfactant-related functions, consists of two functional genes, SP-A1 and SP-A2. SP-A expression is regulated by several factors including environmental stressors. SP-A1 and SP-A2 5′-untranslated region (5′-UTR) splice variants have a differential impact on translation efficiency and mRNA stability. To study whether these variants mediate internal ribosome entry site (IRES) activity (i.e., cap-independent translation), we performed transient transfection experiments in H441 cells with constructs containing one SP-A1 (A′D′, AB′D′, or A′CD′) or SP-A2 (ABD) 5′-UTR splice variant between the Renilla and firefly luciferase genes of a bicistronic reporter vector. We found that 1) variants A′D′, ABD, and AB′D′ exhibit significantly higher IRES activities than negative control (no SP-A 5′-UTR) and A′CD′ has no activity; the order of highest IRES activity was ABD > A′D′ > AB′D′; 2) IRES activity of ABD significantly increased in response to diesel particulate matter (20 μg/ml) but not in response to ozone (1 ppm for 1 h); 3) deletion mutants of ABD revealed regulatory elements associated with IRES activity; one at the end of exon A attenuated activity, whereas a region containing a short adenosine-rich motif in the second half of exon B and the start of exon D enhanced activity; 4) elimination of a predicted double-loop structure or increase in free energy significantly reduced IRES activity; 5) elimination of one or both double-loop structures in A′D′ did not affect cap-dependent translation activity. Thus several factors, including cis-elements and secondary structure type and stability, are required for hSP-A 5′-UTR variant-mediated cap-independent translation. PMID:19181744

  9. Characterization and Expression of the Lucina pectinata Oxygen and Sulfide Binding Hemoglobin Genes

    PubMed Central

    López-Garriga, Juan; Cadilla, Carmen L.

    2016-01-01

    The clam Lucina pectinata lives in sulfide-rich muds and houses intracellular symbiotic bacteria that need to be supplied with hydrogen sulfide and oxygen. This clam possesses three hemoglobins: hemoglobin I (HbI), a sulfide-reactive protein, and hemoglobin II (HbII) and III (HbIII), which are oxygen-reactive. We characterized the complete gene sequence and promoter regions for the oxygen-reactive hemoglobins and the partial structure and promoters of the HbI gene from Lucina pectinata. We show that HbI has two mRNA variants, where the 5′ end had either a sequence of 96 bp (long variant) or 37 bp (short variant). The gene structure of the oxygen-reactive Hbs is defined by having 4 exons/3 introns with conservation of intron location at B12.2 and G7.0 and the presence of pre-coding introns, while the partial gene structure of HbI has the same intron conservation but appears to have a 5-exon/4-intron structure. A search for putative transcription factor binding sites (TFBSs) was done with the promoters for HbII, HbIII, HbI short and HbI long. The HbII, HbIII and HbI long promoters showed similar predicted TFBSs. We also characterized MITE-like elements in the HbI and HbII gene promoters and intronic regions that are similar to sequences found in other mollusk genomes. The gene expression levels of the clam Hbs from sulfide-rich and sulfide-poor environments showed a significant decrease of expression in the symbiont-containing tissue for those clams in a sulfide-poor environment, suggesting that the sulfide concentration may be involved in the regulation of these proteins. Gene expression evaluation of the two HbI mRNA variants indicated that the longer variant is expressed at higher levels than the shorter variant in both environments. PMID:26824233

  10. Design of a radiator shade for testing in a simulated lunar environment

    NASA Technical Reports Server (NTRS)

    Huff, Jaimi; Remington, Randy; Tang, Toan

    1992-01-01

    The National Aeronautics and Space Administration (NASA) and The Universities Space Research Association (USRA) have chosen the parabolic/catenary concept from their sponsored Fall 1991 lunar radiation shade project for further testing and development. NASA asked the design team to build a shading device and support structure for testing in a vacuum chamber. Besides the support structure for the catenary shading device, the design team was asked to develop a system for varying the shade shape so that the device can be tested at different focal lengths. The design team developed concept variants and combined the concept variants to form overall designs. Using a decision matrix, an overall design was selected by the team from several overall design alternatives. Concept variants were developed for three primary functions. The three functions were structural support, shape adjustments, and end shielding. The shade adjustment function was divided into two sub-functions, arc length adjustment, and width adjustment.

  11. NMR backbone resonance assignments of the prodomain variants of BDNF in the urea denatured state.

    PubMed

    Wang, Jing; Bains, Henrietta; Anastasia, Agustin; Bracken, Clay

    2018-04-01

    Brain derived neurotrophic factor (BDNF) is a member of the neurotrophin family of proteins which plays a central role in neuronal survival, growth, plasticity and memory. A single Val66Met variant has been identified in the prodomain of human BDNF that is associated with anxiety, depression and memory disorders. The structural differences within the full-length prodomain Val66 and Met66 isoforms could shed light on the mechanism of action of the Met66 and its impact on the development of neuropsychiatric-associated disorders. In the present study, we report the backbone 1H, 13C, and 15N NMR assignments of both full-length Val66 and Met66 prodomains in the presence of 2 M urea. These conditions were utilized to suppress residual structure and aid subsequent native state structural investigations aimed at mapping and identifying variant-dependent conformational differences under native-state conditions.

  12. Heterozygous RFX6 protein truncating variants are associated with MODY with reduced penetrance.

    PubMed

    Patel, Kashyap A; Kettunen, Jarno; Laakso, Markku; Stančáková, Alena; Laver, Thomas W; Colclough, Kevin; Johnson, Matthew B; Abramowicz, Marc; Groop, Leif; Miettinen, Päivi J; Shepherd, Maggie H; Flanagan, Sarah E; Ellard, Sian; Inagaki, Nobuya; Hattersley, Andrew T; Tuomi, Tiinamaija; Cnop, Miriam; Weedon, Michael N

    2017-10-12

    Finding new causes of monogenic diabetes helps understand glycaemic regulation in humans. To find novel genetic causes of maturity-onset diabetes of the young (MODY), we sequenced MODY cases with unknown aetiology and compared variant frequencies to large public databases. From 36 European patients, we identify two probands with novel RFX6 heterozygous nonsense variants. RFX6 protein truncating variants are enriched in the MODY discovery cohort compared to the European control population within ExAC (odds ratio = 131, P = 1 × 10^-4). We find similar results in non-Finnish European (n = 348, odds ratio = 43, P = 5 × 10^-5) and Finnish (n = 80, odds ratio = 22, P = 1 × 10^-6) replication cohorts. RFX6 heterozygotes have reduced penetrance of diabetes compared to common HNF1A and HNF4A-MODY mutations (27, 70 and 55% at 25 years of age, respectively). The hyperglycaemia results from beta-cell dysfunction and is associated with lower fasting and stimulated gastric inhibitory polypeptide (GIP) levels. Our study demonstrates that heterozygous RFX6 protein truncating variants are associated with MODY with reduced penetrance. Maturity-onset diabetes of the young (MODY) is the most common subtype of familial diabetes. Here, Patel et al. use targeted DNA sequencing of MODY patients and large-scale publicly available data to show that RFX6 heterozygous protein truncating variants cause reduced penetrance MODY.
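
    The enrichment statistic quoted above is an odds ratio computed from a 2×2 table of carriers and non-carriers in cases and controls. The sketch below shows the arithmetic only; the function name, the continuity correction for empty cells, and the counts in the usage note are assumptions made for illustration and are not taken from the study.

```python
def odds_ratio(case_carriers: float,
               case_noncarriers: float,
               control_carriers: float,
               control_noncarriers: float) -> float:
    """Odds ratio for carrying a variant in cases versus controls.

    OR = (case_carriers / case_noncarriers) / (control_carriers / control_noncarriers)

    If any cell is zero, 0.5 is added to every cell (Haldane-Anscombe
    correction) so that the ratio stays finite; this choice is an assumption
    of the sketch, not something stated in the abstract.
    """
    cells = [case_carriers, case_noncarriers, control_carriers, control_noncarriers]
    if min(cells) == 0:
        cells = [c + 0.5 for c in cells]
    a, b, c, d = cells
    return (a * d) / (b * c)
```

    With hypothetical counts of 4 carriers among 10 cases and 2 carriers among 20 controls, the odds ratio is (4 × 18) / (6 × 2) = 6. A P value would additionally require a significance test such as Fisher's exact test, which is not shown here.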

  13. GWAS and fine-mapping of 35 production, reproduction and conformation traits with imputed sequences of 27K Holstein bulls

    USDA-ARS's Scientific Manuscript database

    Fine-mapping of causal variants is becoming feasible for complex traits in livestock GWAS, as an increasing number of animals are sequenced. Imputation has been routinely applied to ascertain sequence variants in large genotyped populations based on small reference populations of sequenced animals. ...

  14. GWAS and fine-mapping of 35 production, reproduction, and conformation traits with imputed sequences of 27K Holstein bulls

    USDA-ARS's Scientific Manuscript database

    Imputation has been routinely applied to ascertain sequence variants in large genotyped populations based on reference populations of sequenced animals. With the implementation of the 1000 Bull Genomes Project and increasing numbers of animals sequenced, fine-mapping of causal variants is becoming f...

  15. Polygenic influences on dyslipidemias.

    PubMed

    Dron, Jacqueline S; Hegele, Robert A

    2018-04-01

    Rare large-effect genetic variants underlie monogenic dyslipidemias, whereas common small-effect genetic variants - single nucleotide polymorphisms (SNPs) - have modest influences on lipid traits. Over the past decade, these small-effect SNPs have been shown to cumulatively exert consistent effects on lipid phenotypes under a polygenic framework, which is the focus of this review. Several groups have reported polygenic risk scores assembled from lipid-associated SNPs, and have applied them to their respective phenotypes. For lipid traits in the normal population distribution, polygenic effects quantified by a score that integrates several common polymorphisms account for about 20-30% of genetic variation. Among individuals at the extremes of the distribution, that is, those with clinical dyslipidemia, the polygenic component includes both rare variants with large effects and common polymorphisms: depending on the trait, 20-50% of susceptibility can be accounted for by this assortment of genetic variants. Accounting for polygenic effects increases the numbers of dyslipidemic individuals who can be explained genetically, but a substantial proportion of susceptibility remains unexplained. Whether documenting the polygenic basis of dyslipidemia will affect outcomes in clinical trials or prospective observational studies remains to be determined.
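
    A polygenic risk score of the kind described above is commonly computed as a weighted sum of effect-allele counts. The sketch below assumes that form; the function name, SNP identifiers and weights are hypothetical and do not correspond to any published lipid score.

```python
from typing import Mapping


def polygenic_score(effect_sizes: Mapping[str, float],
                    dosages: Mapping[str, int]) -> float:
    """Weighted sum of effect-allele counts across SNPs.

    `effect_sizes` maps SNP id -> per-allele effect (for example a GWAS beta).
    `dosages` maps SNP id -> number of effect alleles carried (0, 1 or 2).
    SNPs without a genotype are skipped, which is a simplifying assumption.
    """
    return sum(
        beta * dosages[snp]
        for snp, beta in effect_sizes.items()
        if snp in dosages
    )
```

    For instance, hypothetical weights of 0.5 and -0.25 for two SNPs carried at dosages 2 and 1 give a score of 0.5 × 2 − 0.25 × 1 = 0.75. Published scores often standardise or rescale this sum before comparing individuals, a step omitted here.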

  16. Structural Variation Shapes the Landscape of Recombination in Mouse

    PubMed Central

    Morgan, Andrew P.; Gatti, Daniel M.; Najarian, Maya L.; Keane, Thomas M.; Galante, Raymond J.; Pack, Allan I.; Mott, Richard; Churchill, Gary A.; de Villena, Fernando Pardo-Manuel

    2017-01-01

    Meiotic recombination is an essential feature of sexual reproduction that ensures faithful segregation of chromosomes and redistributes genetic variants in populations. Multiparent populations such as the Diversity Outbred (DO) mouse stock accumulate large numbers of crossover (CO) events between founder haplotypes, and thus present a unique opportunity to study the role of genetic variation in shaping the recombination landscape. We obtained high-density genotype data from 6886 DO mice, and localized 2.2 million CO events to intervals with a median size of 28 kb. The resulting sex-averaged genetic map of the DO population is highly concordant with large-scale (order 10 Mb) features of previously reported genetic maps for mouse. To examine fine-scale (order 10 kb) patterns of recombination in the DO, we overlaid putative recombination hotspots onto our CO intervals. We found that CO intervals are enriched in hotspots compared to the genomic background. However, as many as 26% of CO intervals do not overlap any putative hotspots, suggesting that our understanding of hotspots is incomplete. We also identified coldspots encompassing 329 Mb, or 12% of observable genome, in which there is little or no recombination. In contrast to hotspots, which are a few kilobases in size, and widely scattered throughout the genome, coldspots have a median size of 2.1 Mb and are spatially clustered. Coldspots are strongly associated with copy-number variant (CNV) regions, especially multi-allelic clusters, identified from whole-genome sequencing of 228 DO mice. Genes in these regions have reduced expression, and epigenetic features of closed chromatin in male germ cells, which suggests that CNVs may repress recombination by altering chromatin structure in meiosis. Our findings demonstrate how multiparent populations, by bridging the gap between large-scale and fine-scale genetic mapping, can reveal new features of the recombination landscape. PMID:28592499

  17. Structural Variation Shapes the Landscape of Recombination in Mouse.

    PubMed

    Morgan, Andrew P; Gatti, Daniel M; Najarian, Maya L; Keane, Thomas M; Galante, Raymond J; Pack, Allan I; Mott, Richard; Churchill, Gary A; de Villena, Fernando Pardo-Manuel

    2017-06-01

    Meiotic recombination is an essential feature of sexual reproduction that ensures faithful segregation of chromosomes and redistributes genetic variants in populations. Multiparent populations such as the Diversity Outbred (DO) mouse stock accumulate large numbers of crossover (CO) events between founder haplotypes, and thus present a unique opportunity to study the role of genetic variation in shaping the recombination landscape. We obtained high-density genotype data from 6886 DO mice, and localized 2.2 million CO events to intervals with a median size of 28 kb. The resulting sex-averaged genetic map of the DO population is highly concordant with large-scale (order 10 Mb) features of previously reported genetic maps for mouse. To examine fine-scale (order 10 kb) patterns of recombination in the DO, we overlaid putative recombination hotspots onto our CO intervals. We found that CO intervals are enriched in hotspots compared to the genomic background. However, as many as 26% of CO intervals do not overlap any putative hotspots, suggesting that our understanding of hotspots is incomplete. We also identified coldspots encompassing 329 Mb, or 12% of observable genome, in which there is little or no recombination. In contrast to hotspots, which are a few kilobases in size, and widely scattered throughout the genome, coldspots have a median size of 2.1 Mb and are spatially clustered. Coldspots are strongly associated with copy-number variant (CNV) regions, especially multi-allelic clusters, identified from whole-genome sequencing of 228 DO mice. Genes in these regions have reduced expression, and epigenetic features of closed chromatin in male germ cells, which suggests that CNVs may repress recombination by altering chromatin structure in meiosis. Our findings demonstrate how multiparent populations, by bridging the gap between large-scale and fine-scale genetic mapping, can reveal new features of the recombination landscape. Copyright © 2017 by the Genetics Society of America.

  18. Structure of the human MLH1 N-terminus: implications for predisposition to Lynch syndrome

    DOE PAGES

    Wu, Hong; Zeng, Hong; Lam, Robert; ...

    2015-08-01

    Mismatch repair prevents the accumulation of erroneous insertions/deletions and non-Watson–Crick base pairs in the genome. Pathogenic mutations in the MLH1 gene are associated with a predisposition to Lynch and Turcot's syndromes. Although genetic testing for these mutations is available, robust classification of variants requires strong clinical and functional support. Here, the first structure of the N-terminus of human MLH1, determined by X-ray crystallography, is described. Lastly, the structure shares a high degree of similarity with previously determined prokaryotic MLH1 homologs; however, this structure affords a more accurate platform for the classification of MLH1 variants.

  19. Structure-guided evolution of antigenically distinct adeno-associated virus variants for immune evasion.

    PubMed

    Tse, Longping Victor; Klinc, Kelli A; Madigan, Victoria J; Castellanos Rivera, Ruth M; Wells, Lindsey F; Havlik, L Patrick; Smith, J Kennon; Agbandje-McKenna, Mavis; Asokan, Aravind

    2017-06-13

    Preexisting neutralizing antibodies (NAbs) against adeno-associated viruses (AAVs) pose a major, unresolved challenge that restricts patient enrollment in gene therapy clinical trials using recombinant AAV vectors. Structural studies suggest that despite a high degree of sequence variability, antibody recognition sites or antigenic hotspots on AAVs and other related parvoviruses might be evolutionarily conserved. To test this hypothesis, we developed a structure-guided evolution approach that does not require selective pressure exerted by NAbs. This strategy yielded highly divergent antigenic footprints that do not exist in natural AAV isolates. Specifically, synthetic variants obtained by evolving murine antigenic epitopes on an AAV serotype 1 capsid template can evade NAbs without compromising titer, transduction efficiency, or tissue tropism. One lead AAV variant generated by combining multiple evolved antigenic sites effectively evades polyclonal anti-AAV1 neutralizing sera from immunized mice and rhesus macaques. Furthermore, this variant displays robust immune evasion in nonhuman primate and human serum samples at dilution factors as high as 1:5, currently mandated by several clinical trials. Our results provide evidence that antibody recognition of AAV capsids is conserved across species. This approach can be applied to any AAV strain to evade NAbs in prospective patients for human gene therapy.

  20. Analysis of the [RNQ+] Prion Reveals Stability of Amyloid Fibers as the Key Determinant of Yeast Prion Variant Propagation

    PubMed Central

    Kalastavadi, Tejas; True, Heather L.

    2010-01-01

    Variation in pathology of human prion disease is believed to be caused, in part, by distinct conformations of aggregated protein resulting in different prion strains. Several prions also exist in yeast and maintain different self-propagating structures, referred to as prion variants. Investigation of the yeast prion [PSI+] has been instrumental in deciphering properties of prion variants and modeling the physical basis of their formation. Here, we describe the generation of specific variants of the [RNQ+] prion in yeast transformed with fibers formed in vitro in different conditions. The fibers of the Rnq1p prion-forming domain (PFD) that induce different variants in vivo have distinct biochemical properties. The physical basis of propagation of prion variants has been previously correlated to rates of aggregation and disaggregation. With [RNQ+] prion variants, we found that the prion variant does not correlate with the rate of aggregation as anticipated but does correlate with stability. Interestingly, we found that there are differences in the ability of the [RNQ+] prion variants to faithfully propagate themselves and to template the aggregation of other proteins. Incorporating the mechanism of variant formation elucidated in this study with that previously proposed for [PSI+] variants has provided a framework to separate general characteristics of prion variant properties from those specific to individual prion proteins. PMID:20442412

  1. Functional Regression Models for Epistasis Analysis of Multiple Quantitative Traits.

    PubMed

    Zhang, Futao; Xie, Dan; Liang, Meimei; Xiong, Momiao

    2016-04-01

    To date, most genetic analyses of phenotypes have focused on analyzing single traits or analyzing each phenotype independently. However, joint epistasis analysis of multiple complementary traits will increase statistical power and improve our understanding of the complicated genetic structure of complex diseases. Despite their importance in uncovering the genetic structure of complex traits, statistical methods for identifying epistasis in multiple phenotypes remain fundamentally unexplored. To fill this gap, we formulate a test for interaction between two genes in multiple quantitative trait analysis as a multiple functional regression (MFRG) in which the genotype functions (genetic variant profiles) are defined as a function of the genomic position of the genetic variants. We use large-scale simulations to calculate Type I error rates for testing interaction between two genes with multiple phenotypes and to compare the power with multivariate pairwise interaction analysis and single trait interaction analysis by a single variate functional regression model. To further evaluate performance, the MFRG for epistasis analysis is applied to five phenotypes of exome sequence data from the NHLBI's Exome Sequencing Project (ESP) to detect pleiotropic epistasis. A total of 267 pairs of genes that formed a genetic interaction network showed significant evidence of epistasis influencing five traits. The results demonstrate that the joint interaction analysis of multiple phenotypes has a much higher power to detect interaction than the interaction analysis of a single trait and may open a new direction to fully uncovering the genetic structure of multiple phenotypes.

  2. Structural basis for human NADPH-cytochrome P450 oxidoreductase deficiency

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Xia, Chuanwu; Panda, Satya P.; Marohnic, Christopher C.

    2012-03-15

    NADPH-cytochrome P450 oxidoreductase (CYPOR) is essential for electron donation to microsomal cytochrome P450-mediated monooxygenation in such diverse physiological processes as drug metabolism (approximately 85-90% of therapeutic drugs), steroid biosynthesis, and bioactive metabolite production (vitamin D and retinoic acid metabolites). Expressed by a single gene, CYPOR's role with these multiple redox partners renders it a model for understanding protein-protein interactions at the structural level. Polymorphisms in human CYPOR have been shown to lead to defects in bone development and steroidogenesis, resulting in sexual dimorphisms, the severity of which differs significantly depending on the degree of CYPOR impairment. The atomic structure of human CYPOR is presented, with structures of two naturally occurring missense mutations, V492E and R457H. The overall structures of these CYPOR variants are similar to wild type. However, in both variants, local disruption of H bonding and salt bridging, involving the FAD pyrophosphate moiety, leads to weaker FAD binding, unstable protein, and loss of catalytic activity, which can be rescued by cofactor addition. The modes of polypeptide unfolding in these two variants differ significantly, as revealed by limited trypsin digestion: V492E is less stable but unfolds locally and gradually, whereas R457H is more stable but unfolds globally. FAD addition to either variant prevents trypsin digestion, supporting the role of the cofactor in conferring stability to CYPOR structure. Thus, CYPOR dysfunction in patients harboring these particular mutations may possibly be prevented by riboflavin therapy in utero, if predicted prenatally, or rescued postnatally in less severe cases.

  3. Thermodynamic prediction of protein neutrality.

    PubMed

    Bloom, Jesse D; Silberg, Jonathan J; Wilke, Claus O; Drummond, D Allan; Adami, Christoph; Arnold, Frances H

    2005-01-18

    We present a simple theory that uses thermodynamic parameters to predict the probability that a protein retains the wild-type structure after one or more random amino acid substitutions. Our theory predicts that for large numbers of substitutions the probability that a protein retains its structure will decline exponentially with the number of substitutions, with the severity of this decline determined by properties of the structure. Our theory also predicts that a protein can gain extra robustness to the first few substitutions by increasing its thermodynamic stability. We validate our theory with simulations on lattice protein models and by showing that it quantitatively predicts previously published experimental measurements on subtilisin and our own measurements on variants of TEM1 beta-lactamase. Our work unifies observations about the clustering of functional proteins in sequence space, and provides a basis for interpreting the response of proteins to substitutions in protein engineering applications.
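
The exponential decline described in this abstract can be illustrated with a minimal sketch. It assumes, purely for illustration, that each substitution independently preserves the wild-type structure with a fixed probability `nu`; the names `nu`, `retention_probability`, and `decay_rate` are hypothetical and are not taken from the paper. The sketch covers only the large-number-of-substitutions regime and does not model the extra robustness to the first few substitutions that the abstract attributes to increased stability.

```python
import math


def retention_probability(m, nu):
    """Probability that the fold survives m substitutions.

    Assumption (not from the source): every substitution independently
    preserves the structure with the same probability nu, so the
    probabilities multiply and the decline is exponential in m.
    """
    if not 0.0 < nu <= 1.0:
        raise ValueError("nu must lie in (0, 1]")
    if m < 0:
        raise ValueError("m must be non-negative")
    return nu ** m


def decay_rate(nu):
    """Exponential decay constant k such that nu**m == exp(-k * m)."""
    return -math.log(nu)


if __name__ == "__main__":
    # Illustrative value only; a real nu would depend on the structure.
    for m in range(0, 6):
        print(m, round(retention_probability(m, 0.6), 4))
```

A smaller `nu` gives a larger `decay_rate`, which is one way to read the statement that the severity of the decline is determined by properties of the structure.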

  4. Thermodynamic prediction of protein neutrality

    PubMed Central

    Bloom, Jesse D.; Silberg, Jonathan J.; Wilke, Claus O.; Drummond, D. Allan; Adami, Christoph; Arnold, Frances H.

    2005-01-01

    We present a simple theory that uses thermodynamic parameters to predict the probability that a protein retains the wild-type structure after one or more random amino acid substitutions. Our theory predicts that for large numbers of substitutions the probability that a protein retains its structure will decline exponentially with the number of substitutions, with the severity of this decline determined by properties of the structure. Our theory also predicts that a protein can gain extra robustness to the first few substitutions by increasing its thermodynamic stability. We validate our theory with simulations on lattice protein models and by showing that it quantitatively predicts previously published experimental measurements on subtilisin and our own measurements on variants of TEM1 β-lactamase. Our work unifies observations about the clustering of functional proteins in sequence space, and provides a basis for interpreting the response of proteins to substitutions in protein engineering applications. PMID:15644440

  5. Enhanced expression of Rubisco activase splicing variants differentially affects Rubisco activity during low temperature treatment in Lolium perenne.

    PubMed

    Jurczyk, Barbara; Pociecha, Ewa; Grzesiak, Maciej; Kalita, Katarzyna; Rapacz, Marcin

    2016-07-01

    Alternative splicing of the Rubisco activase gene was shown to be a point for optimization of photosynthetic carbon assimilation. It can be expected to be a stress-regulated event that depends on plant freezing tolerance. The aim of the study was to examine the relationships among Rubisco activity, the expression of two Rubisco activase splicing variants and photoacclimation to low temperature. The experiment was performed on two Lolium perenne genotypes with contrasting levels of freezing tolerance. The study investigated the effect of pre-hardening (15°C) and cold acclimation (4°C) on net photosynthesis, photosystem II photochemical activity, Rubisco activity and the expression of two splicing variants of the Rubisco activase gene. The results showed an induction of Rubisco activity at both 15°C and 4°C only in a highly freezing-tolerant genotype. The enhanced Rubisco activity after pre-hardening corresponded to increased expression of the splicing variant representing the large isoform, while the increase in Rubisco activity during cold acclimation was due to the activation of both transcript variants. These boosts in Rubisco activity also corresponded to activation of a non-photochemical mechanism of photoacclimation induced at low temperature exclusively in the highly freezing-tolerant genotype. In conclusion, enhanced expression of Rubisco activase splicing variants caused an increase in Rubisco activity during pre-hardening and cold acclimation in the more freezing-tolerant Lolium perenne genotype. The induction of the transcript variant representing the large isoform may be an important element of increasing the carbon assimilation rate supporting the photochemical mechanism of photosynthetic acclimation to cold. Copyright © 2016 Elsevier GmbH. All rights reserved.

  6. Improved methods for multi-trait fine mapping of pleiotropic risk loci.

    PubMed

    Kichaev, Gleb; Roytman, Megan; Johnson, Ruth; Eskin, Eleazar; Lindström, Sara; Kraft, Peter; Pasaniuc, Bogdan

    2017-01-15

    Genome-wide association studies (GWAS) have identified thousands of regions in the genome that contain genetic variants that increase risk for complex traits and diseases. However, the variants uncovered in GWAS are typically not biologically causal, but rather, correlated to the true causal variant through linkage disequilibrium (LD). To discern the true causal variant(s), a variety of statistical fine-mapping methods have been proposed to prioritize variants for functional validation. In this work we introduce a new approach, fastPAINTOR, that leverages evidence across correlated traits, as well as functional annotation data, to improve fine-mapping accuracy at pleiotropic risk loci. To improve computational efficiency, we describe a new importance sampling scheme to perform model inference. First, we demonstrate in simulations that by leveraging functional annotation data, fastPAINTOR increases fine-mapping resolution relative to existing methods. Next, we show that jointly modeling pleiotropic risk regions improves fine-mapping resolution compared to standard single trait and pleiotropic fine mapping strategies. We report a reduction in the number of SNPs required for follow-up in order to capture 90% of the causal variants from 23 SNPs per locus using a single trait to 12 SNPs when fine-mapping two traits simultaneously. Finally, we analyze summary association data from a large-scale GWAS of lipids and show that these improvements are largely sustained in real data. The fastPAINTOR framework is implemented in the PAINTOR v3.0 package which is publicly available to the research community http://bogdan.bioinformatics.ucla.edu/software/paintor. Contact: gkichaev@ucla.edu. Supplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
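
The idea of a follow-up list of SNPs chosen to reach a target coverage can be sketched as below. This is a generic illustration, not the fastPAINTOR procedure: it assumes that per-SNP posterior probabilities of causality are already available for a locus, and the function name `credible_set` and the toy values are hypothetical.

```python
def credible_set(posteriors, coverage=0.90):
    """Return SNP ids, highest posterior first, until the running sum
    of posterior probabilities reaches the requested coverage.

    posteriors: dict mapping SNP id -> posterior probability of causality
    (assumed to be supplied by some upstream fine-mapping method).
    """
    ranked = sorted(posteriors.items(), key=lambda kv: kv[1], reverse=True)
    chosen = []
    total = 0.0
    for snp, prob in ranked:
        chosen.append(snp)
        total += prob
        if total >= coverage:
            break
    return chosen


# Hypothetical posteriors for a single locus (illustrative values only).
toy_locus = {
    "snp_a": 0.55,
    "snp_b": 0.25,
    "snp_c": 0.12,
    "snp_d": 0.05,
    "snp_e": 0.03,
}

if __name__ == "__main__":
    print(credible_set(toy_locus))
```

Under this reading, sharper posteriors (for example from modeling two traits jointly) concentrate probability on fewer SNPs, so the list needed to reach the same coverage becomes shorter.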

  7. Biophysical Analysis of Apolipoprotein E3 Variants Linked with Development of Type III Hyperlipoproteinemia

    PubMed Central

    Georgiadou, Dimitra; Chroni, Angeliki; Vezeridis, Alexander; Zannis, Vassilis I.; Stratikos, Efstratios

    2011-01-01

    Background Apolipoprotein E (apoE) is a major protein of the lipoprotein transport system that plays important roles in lipid homeostasis and protection from atherosclerosis. ApoE is characterized by structural plasticity and thermodynamic instability and can undergo significant structural rearrangements as part of its biological function. Mutations in the 136–150 region of the N-terminal domain of apoE reduce its low density lipoprotein (LDL) receptor binding capacity and have been linked with lipoprotein disorders, such as type III hyperlipoproteinemia (HLP) in humans. However, the LDL-receptor binding defects for these apoE variants do not correlate well with the severity of dyslipidemia, indicating that these variants may carry additional properties that contribute to their pathogenic potential. Methodology/Principal Findings In this study we examined whether three type III HLP predisposing apoE3 variants, namely R136S, R145C and K146E, affect the biophysical properties of the protein. Circular dichroism (CD) spectroscopy revealed that these mutations do not significantly alter the secondary structure of the protein. Thermal and chemical unfolding analysis revealed small thermodynamic alterations in each variant compared to wild-type apoE3, as well as effects in the reversibility of the unfolding transition. All variants were able to remodel multilamellar 1,2-Dimyristoyl-sn-glycero-3-phosphocholine (DMPC) vesicles, but R136S and R145C had reduced kinetics. Dynamic light scattering analysis indicated that the variant R136S exists in a higher-order oligomerization state in solution. Finally, 1-anilinonaphthalene-8-sulfonic acid (ANS) binding suggested that the variant R145C exposes a larger amount of hydrophobic surface to the solvent. Conclusions/Significance Overall, our findings suggest that single amino acid changes in the functionally important region 136–150 of apoE3 can affect the molecule's stability and conformation in solution and may underlie functional consequences. However, the magnitude and the non-concerted nature of these changes make it unlikely that they constitute a distinct unifying mechanism leading to type III HLP pathogenesis. PMID:22069485

  8. Characterisation of myosin heavy chain gene variants in the fast and slow muscle fibres of gammarid amphipods.

    PubMed

    Whiteley, N M; Magnay, J L; McCleary, S J; Nia, S Khazraee; El Haj, A J; Rock, J

    2010-10-01

    Recent molecular work has revealed a large diversity of myosin heavy chain (MyHC) gene variants in the abdominal musculature of gammarid amphipods. An unusual truncated MyHC transcript from the loop 1 region (Variant A(3)) was consistently observed in multiple species and populations. The current study aimed to determine whether this MyHC variant is specific to a particular muscle fibre type, as a change in net charge to the loop 1 region of Variant A(3) could be functionally significant. The localisation of different fibre types within the abdominal musculature of several gammarid species revealed that the deep flexor and extensor muscles are fast-twitch muscle fibres. The dorsal superficial muscles were identified as slow fibres and the muscles extrinsic to the pleopods were identified as intermediate fibres. Amplification of loop 1 region mRNA from isolated superficial extensor and deep flexor muscles, and subsequent liquid chromatography and sequence analysis revealed that Variant A(3) was the primary MyHC variant in slow muscles, and the conserved A(1) sequence was the primary variant in fast muscles. The specific role of Variant A(3) in the slow muscles remains to be investigated. 2010 Elsevier Inc. All rights reserved.

  9. Variants of uncertain significance in newborn screening disorders: implications for large-scale genomic sequencing.

    PubMed

    Narravula, Alekhya; Garber, Kathryn B; Askree, S Hussain; Hegde, Madhuri; Hall, Patricia L

    2017-01-01

    As exome and genome sequencing using high-throughput sequencing technologies move rapidly into the diagnostic process, laboratories and clinicians need to develop a strategy for dealing with uncertain findings. A commitment must be made to minimize these findings, and all parties may need to make adjustments to their processes. The information required to reclassify these variants is often available but not communicated to all relevant parties. To illustrate these issues, we focused on three well-characterized monogenic, metabolic disorders included in newborn screens: classic galactosemia, caused by GALT variants; phenylketonuria, caused by PAH variants; and medium-chain acyl-CoA dehydrogenase (MCAD) deficiency, caused by ACADM variants. In 10 years of clinical molecular testing, we have observed 134 unique GALT variants, 46 of which were variants of uncertain significance (VUS). In PAH, we observed 132 variants, including 17 VUS, and for ACADM, we observed 64 unique variants, of which 33 were uncertain. After this review, 17 VUS (37%; 7 in ACADM, 9 in GALT, and 1 in PAH) were reclassified from uncertain (6 to benign or likely benign and 11 to pathogenic or likely pathogenic). We identified common types of missing information that would have helped make a definitive classification and categorized this information by ease and cost to obtain. Genet Med 19 1, 77-82.
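
The per-gene share of uncertain variants follows directly from the counts reported in this abstract. The short sketch below only restates that arithmetic; the dictionary layout and the function name `vus_fraction` are illustrative choices, not part of the study.

```python
# (total unique variants, variants of uncertain significance) per gene,
# using the counts stated in the abstract.
observed = {
    "GALT": (134, 46),
    "PAH": (132, 17),
    "ACADM": (64, 33),
}


def vus_fraction(total, vus):
    """Fraction of observed variants classified as uncertain."""
    if total <= 0:
        raise ValueError("total must be positive")
    return vus / total


if __name__ == "__main__":
    for gene, (total, vus) in observed.items():
        print(f"{gene}: {vus}/{total} = {vus_fraction(total, vus):.1%}")

    all_total = sum(total for total, _ in observed.values())
    all_vus = sum(vus for _, vus in observed.values())
    print(f"All three genes: {all_vus}/{all_total} = {all_vus / all_total:.1%}")
```

Computed this way, the uncertain share differs considerably between the three genes, which is the kind of spread the abstract uses to motivate a shared strategy for handling VUS.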

  10. Application of a 5-tiered scheme for standardized classification of 2,360 unique mismatch repair gene variants in the InSiGHT locus-specific database.

    PubMed

    Thompson, Bryony A; Spurdle, Amanda B; Plazzer, John-Paul; Greenblatt, Marc S; Akagi, Kiwamu; Al-Mulla, Fahd; Bapat, Bharati; Bernstein, Inge; Capellá, Gabriel; den Dunnen, Johan T; du Sart, Desiree; Fabre, Aurelie; Farrell, Michael P; Farrington, Susan M; Frayling, Ian M; Frebourg, Thierry; Goldgar, David E; Heinen, Christopher D; Holinski-Feder, Elke; Kohonen-Corish, Maija; Robinson, Kristina Lagerstedt; Leung, Suet Yi; Martins, Alexandra; Moller, Pal; Morak, Monika; Nystrom, Minna; Peltomaki, Paivi; Pineda, Marta; Qi, Ming; Ramesar, Rajkumar; Rasmussen, Lene Juel; Royer-Pokora, Brigitte; Scott, Rodney J; Sijmons, Rolf; Tavtigian, Sean V; Tops, Carli M; Weber, Thomas; Wijnen, Juul; Woods, Michael O; Macrae, Finlay; Genuardi, Maurizio

    2014-02-01

    The clinical classification of hereditary sequence variants identified in disease-related genes directly affects clinical management of patients and their relatives. The International Society for Gastrointestinal Hereditary Tumours (InSiGHT) undertook a collaborative effort to develop, test and apply a standardized classification scheme to constitutional variants in the Lynch syndrome-associated genes MLH1, MSH2, MSH6 and PMS2. Unpublished data submission was encouraged to assist in variant classification and was recognized through microattribution. The scheme was refined by multidisciplinary expert committee review of the clinical and functional data available for variants, applied to 2,360 sequence alterations, and disseminated online. Assessment using validated criteria altered classifications for 66% of 12,006 database entries. Clinical recommendations based on transparent evaluation are now possible for 1,370 variants that were not obviously protein truncating from nomenclature. This large-scale endeavor will facilitate the consistent management of families suspected to have Lynch syndrome and demonstrates the value of multidisciplinary collaboration in the curation and classification of variants in public locus-specific databases.

  11. Application of a five-tiered scheme for standardized classification of 2,360 unique mismatch repair gene variants lodged on the InSiGHT locus-specific database

    PubMed Central

    Plazzer, John-Paul; Greenblatt, Marc S.; Akagi, Kiwamu; Al-Mulla, Fahd; Bapat, Bharati; Bernstein, Inge; Capellá, Gabriel; den Dunnen, Johan T.; du Sart, Desiree; Fabre, Aurelie; Farrell, Michael P.; Farrington, Susan M.; Frayling, Ian M.; Frebourg, Thierry; Goldgar, David E.; Heinen, Christopher D.; Holinski-Feder, Elke; Kohonen-Corish, Maija; Robinson, Kristina Lagerstedt; Leung, Suet Yi; Martins, Alexandra; Moller, Pal; Morak, Monika; Nystrom, Minna; Peltomaki, Paivi; Pineda, Marta; Qi, Ming; Ramesar, Rajkumar; Rasmussen, Lene Juel; Royer-Pokora, Brigitte; Scott, Rodney J.; Sijmons, Rolf; Tavtigian, Sean V.; Tops, Carli M.; Weber, Thomas; Wijnen, Juul; Woods, Michael O.; Macrae, Finlay; Genuardi, Maurizio

    2015-01-01

    Clinical classification of sequence variants identified in hereditary disease genes directly affects clinical management of patients and their relatives. The International Society for Gastrointestinal Hereditary Tumours (InSiGHT) undertook a collaborative effort to develop, test and apply a standardized classification scheme to constitutional variants in the Lynch Syndrome genes MLH1, MSH2, MSH6 and PMS2. Unpublished data submission was encouraged to assist variant classification, and recognized by microattribution. The scheme was refined by multidisciplinary expert committee review of clinical and functional data available for variants, applied to 2,360 sequence alterations, and disseminated online. Assessment using validated criteria altered classifications for 66% of 12,006 database entries. Clinical recommendations based on transparent evaluation are now possible for 1,370 variants not obviously protein-truncating from nomenclature. This large-scale endeavor will facilitate consistent management of suspected Lynch Syndrome families, and demonstrates the value of multidisciplinary collaboration for curation and classification of variants in public locus-specific databases. PMID:24362816

  12. Investigation of exomic variants associated with overall survival in ovarian cancer

    PubMed Central

    Ann Chen, Yian; Larson, Melissa C; Fogarty, Zachary C; Earp, Madalene A; Anton-Culver, Hoda; Bandera, Elisa V; Cramer, Daniel; Doherty, Jennifer A; Goodman, Marc T; Gronwald, Jacek; Karlan, Beth Y; Kjaer, Susanne K; Levine, Douglas A; Menon, Usha; Ness, Roberta B; Pearce, Celeste L; Pejovic, Tanja; Rossing, Mary Anne; Wentzensen, Nicolas; Bean, Yukie T; Bisogna, Maria; Brinton, Louise A; Carney, Michael E; Cunningham, Julie M; Cybulski, Cezary; deFazio, Anna; Dicks, Ed M; Edwards, Robert P; Gayther, Simon A; Gentry-Maharaj, Aleksandra; Gore, Martin; Iversen, Edwin S; Jensen, Allan; Johnatty, Sharon E; Lester, Jenny; Lin, Hui-Yi; Lissowska, Jolanta; Lubinski, Jan; Menkiszak, Janusz; Modugno, Francesmary; Moysich, Kirsten B; Orlow, Irene; Pike, Malcolm C; Ramus, Susan J; Song, Honglin; Terry, Kathryn L; Thompson, Pamela J; Tyrer, Jonathan P; van den Berg, David J; Vierkant, Robert A; Vitonis, Allison F; Walsh, Christine; Wilkens, Lynne R; Wu, Anna H; Yang, Hannah; Ziogas, Argyrios; Berchuck, Andrew; Chenevix-Trench, Georgia; Schildkraut, Joellen M; Permuth-Wey, Jennifer; Phelan, Catherine M; Pharoah, Paul D P; Fridley, Brooke L

    2016-01-01

    Background While numerous susceptibility loci for epithelial ovarian cancer (EOC) have been identified, few associations have been reported with overall survival. In the absence of common prognostic genetic markers, we hypothesize that rare coding variants may be associated with overall EOC survival and assessed their contribution in two exome-based genotyping projects of the Ovarian Cancer Association Consortium (OCAC). Methods The primary patient set (Set 1) included 14 independent EOC studies (4293 patients) and 227,892 variants, and a secondary patient set (Set 2) included six additional EOC studies (1744 patients) and 114,620 variants. Because power to detect rare variants individually is reduced, gene-level tests were conducted. Sets were analyzed separately at individual variants and by gene, and then combined with meta-analyses (73,203 variants and 13,163 genes overlapped). Results No individual variant reached genome-wide statistical significance. A SNP previously implicated to be associated with EOC risk and, to a lesser extent, survival, rs8170, showed the strongest evidence of association with survival and similar effect size estimates across sets (Pmeta=1.1E-6, HRSet1=1.17, HRSet2=1.14). Rare variants in ATG2B, an autophagy gene important for apoptosis, were significantly associated with survival after multiple testing correction (Pmeta=1.1E-6; Pcorrected=0.01). Conclusions Common variant rs8170 and rare variants in ATG2B may be associated with EOC overall survival, although further study is needed. Impact This study represents the first exome-wide association study of EOC survival to include rare variant analyses, and suggests that complementary single variant and gene-level analyses in large studies are needed to identify rare variants that warrant follow-up study. PMID:26747452

  13. Yield of the RYR2 Genetic Test in Suspected Catecholaminergic Polymorphic Ventricular Tachycardia and Implications for Test Interpretation.

    PubMed

    Kapplinger, Jamie D; Pundi, Krishna N; Larson, Nicholas B; Callis, Thomas E; Tester, David J; Bikker, Hennie; Wilde, Arthur A M; Ackerman, Michael J

    2018-02-01

    Pathogenic RYR2 variants account for ≈60% of clinically definite cases of catecholaminergic polymorphic ventricular tachycardia. However, the rate of rare benign RYR2 variants identified in the general population remains a challenge for genetic test interpretation. Therefore, we examined the results of the RYR2 genetic test among patients referred for commercial genetic testing and examined factors impacting variant interpretability. Frequency and location comparisons were made for RYR2 variants identified among 1355 total patients of varying clinical certainty and 60 706 Exome Aggregation Consortium controls. The impact of the clinical phenotype on the yield of RYR2 variants was examined. Six in silico tools were assessed using patient- and control-derived variants. A total of 18.2% (218/1200) of patients referred for commercial testing hosted rare RYR2 variants, statistically less than the 59% (46/78) yield among clinically definite cases, resulting in a much higher potential genetic false discovery rate among referrals considering the 3.2% background rate of rare, benign RYR2 variants. Exclusion of clearly putative pathogenic variants further complicates the interpretation of the next novel RYR2 variant. Exonic/topologic analyses revealed overrepresentation of patient variants in exons covering only one third of the protein. In silico tools largely failed to show evidence toward enhancement of variant interpretation. Current expert recommendations have resulted in increased use of RYR2 genetic testing in patients with questionable clinical phenotypes. Using the largest to date catecholaminergic polymorphic ventricular tachycardia patient versus control comparison, this study highlights important variables in the interpretation of variants to overcome the 3.2% background rate that confounds RYR2 variant interpretation. © 2018 American Heart Association, Inc.

  14. DMET-Miner: Efficient discovery of association rules from pharmacogenomic data.

    PubMed

    Agapito, Giuseppe; Guzzi, Pietro H; Cannataro, Mario

    2015-08-01

    Microarray platforms enable the investigation of allelic variants that may be correlated to phenotypes. Among those, the Affymetrix DMET (Drug Metabolism Enzymes and Transporters) platform enables the simultaneous investigation of all the genes that are related to drug absorption, distribution, metabolism and excretion (ADME). Although recent studies demonstrated the effectiveness of the use of DMET data for studying drug response or toxicity in clinical studies, there is a lack of tools for the automatic analysis of DMET data. In a previous work we developed DMET-Analyzer, a methodology and a supporting platform able to automatize the statistical study of allelic variants, that has been validated in several clinical studies. Although DMET-Analyzer is able to correlate a single variant for each probe (related to a portion of a gene) through the use of the Fisher test, it is unable to discover multiple associations among allelic variants, due to its underlying statistical analysis strategy, which focuses on a single variant at a time. To overcome those limitations, here we propose a new analysis methodology for DMET data based on Association Rules mining, and an efficient implementation of this methodology, named DMET-Miner. DMET-Miner extends the DMET-Analyzer tool with data mining capabilities and correlates the presence of a set of allelic variants with the conditions of patient samples by exploiting association rules. To face the high number of frequent itemsets generated when considering large clinical studies based on DMET data, DMET-Miner uses an efficient data structure and implements an optimized search strategy that reduces the search space and the execution time. Preliminary experiments on synthetic DMET datasets show how DMET-Miner outperforms off-the-shelf data mining suites such as the FP-Growth algorithms available in Weka and RapidMiner. To demonstrate the biological relevance of the extracted association rules and the effectiveness of the proposed approach from a medical point of view, some preliminary studies on a real clinical dataset are currently under medical investigation. Copyright © 2015 Elsevier Inc. All rights reserved.
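
The two standard quantities behind association rules, support and confidence, can be sketched on a toy table. This is a generic illustration of the concept, not DMET-Miner's data structure or search strategy; the probe names, genotype labels, and sample classes below are invented for the example.

```python
def support(samples, itemset):
    """Fraction of samples that contain every item in itemset."""
    items = frozenset(itemset)
    return sum(items <= sample for sample in samples) / len(samples)


def confidence(samples, antecedent, consequent):
    """Confidence of the rule antecedent -> consequent:
    support(antecedent and consequent) / support(antecedent)."""
    base = support(samples, antecedent)
    if base == 0:
        raise ValueError("antecedent never occurs in the samples")
    return support(samples, set(antecedent) | set(consequent)) / base


# Hypothetical samples: each is a set of probe genotypes plus a class label.
toy_samples = [
    {"probe1=A/A", "probe2=G/T", "responder"},
    {"probe1=A/A", "probe2=G/T", "responder"},
    {"probe1=A/A", "probe2=G/G", "non_responder"},
    {"probe1=A/C", "probe2=G/T", "non_responder"},
]

if __name__ == "__main__":
    pair = {"probe1=A/A", "probe2=G/T"}
    print("support(pair):", support(toy_samples, pair))
    print("confidence(pair -> responder):",
          confidence(toy_samples, pair, {"responder"}))
    print("confidence(probe1=A/A -> responder):",
          confidence(toy_samples, {"probe1=A/A"}, {"responder"}))
```

In this toy table the two-variant rule has higher confidence than the single-variant rule, which is the kind of multi-variant association that a one-variant-at-a-time test would not surface.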

  15. KinView: A visual comparative sequence analysis tool for integrated kinome research

    PubMed Central

    McSkimming, Daniel Ian; Dastgheib, Shima; Baffi, Timothy R.; Byrne, Dominic P.; Ferries, Samantha; Scott, Steven Thomas; Newton, Alexandra C.; Eyers, Claire E.; Kochut, Krzysztof J.; Eyers, Patrick A.

    2017-01-01

    Multiple sequence alignments (MSAs) are a fundamental analysis tool used throughout biology to investigate relationships between protein sequence, structure, function, evolutionary history, and patterns of disease-associated variants. However, their widespread application in systems biology research is currently hindered by the lack of user-friendly tools to simultaneously visualize, manipulate and query the information conceptualized in large sequence alignments, and the challenges in integrating MSAs with multiple orthogonal data such as cancer variants and post-translational modifications, which are often stored in heterogeneous data sources and formats. Here, we present the Multiple Sequence Alignment Ontology (MSAOnt), which represents a profile or consensus alignment in an ontological format. Subsets of the alignment are easily selected through the SPARQL Protocol and RDF Query Language for downstream statistical analysis or visualization. We have also created the Kinome Viewer (KinView), an interactive integrative visualization that places eukaryotic protein kinase cancer variants in the context of natural sequence variation and experimentally determined post-translational modifications, which play central roles in the regulation of cellular signaling pathways. Using KinView, we identified differential phosphorylation patterns between tyrosine and serine/threonine kinases in the activation segment, a major kinase regulatory region that is often mutated in proliferative diseases. We discuss cancer variants that disrupt phosphorylation sites in the activation segment, and show how KinView can be used as a comparative tool to identify differences and similarities in natural variation, cancer variants and post-translational modifications between kinase groups, families and subfamilies. Based on KinView comparisons, we identify and experimentally characterize a regulatory tyrosine (Y177 of PLK4) in the PLK4 C-terminal activation segment region termed the P+1 loop. To further demonstrate the application of KinView in hypothesis generation and testing, we formulate and validate a hypothesis explaining a novel predicted loss-of-function variant (D523N of PKCβ) in the regulatory spine of PKCβ, a recently identified tumor suppressor kinase. KinView provides a novel, extensible interface for performing comparative analyses between subsets of kinases and for integrating multiple types of residue-specific annotations in user-friendly formats. PMID:27731453

  16. Evaluation: A Qualitative Pilot Study of Novel Information Technology Infrastructure to Communicate Genetic Variant Updates.

    PubMed

    Klinkenberg-Ramirez, Stephanie; Neri, Pamela M; Volk, Lynn A; Samaha, Sara J; Newmark, Lisa P; Pollard, Stephanie; Varugheese, Matthew; Baxter, Samantha; Aronson, Samuel J; Rehm, Heidi L; Bates, David W

    2016-01-01

    Partners HealthCare Personalized Medicine developed GeneInsight Clinic (GIC), a tool designed to communicate updated variant information from laboratory geneticists to treating clinicians through automated alerts, categorized by level of variant interpretation change. The study aimed to evaluate feedback from the initial users of the GIC, including the advantages and challenges of receiving this variant information and using this technology at the point of care. Healthcare professionals from two clinics that ordered genetic testing for cardiomyopathy and related disorders were invited to participate in one-hour semi-structured interviews and/or a one-hour focus group. Using a Grounded Theory approach, transcript concepts were coded and organized into themes. Two genetic counselors and two physicians from two treatment clinics participated in individual interviews. Focus group participants included one genetic counselor and four physicians. Analysis resulted in 8 major themes related to structuring and communicating variant knowledge, GIC's impact on the clinic, and suggestions for improvements. The interview analysis identified longitudinal patient care, family data, and growth in genetic testing content as potential challenges to optimization of the GIC infrastructure. Participants agreed that GIC implementation increased efficiency and effectiveness of the clinic through increased access to genetic variant information at the point of care. Development of information technology (IT) infrastructure to aid in the organization and management of genetic variant knowledge will be critical as the genetic field moves towards whole exome and whole genome sequencing. Findings from this study could be applied to future development of IT support for genetic variant knowledge management that would serve to improve clinicians' ability to manage and care for patients.

  17. Rare NaV1.7 variants associated with painful diabetic peripheral neuropathy

    PubMed Central

    Blesneac, Iulia; Themistocleous, Andreas C.; Fratter, Carl; Conrad, Linus J.; Ramirez, Juan D.; Cox, James J.; Tesfaye, Solomon; Shillo, Pallai R.; Rice, Andrew S.C.; Tucker, Stephen J.

    2018-01-01

    Diabetic peripheral neuropathy (DPN) is a common disabling complication of diabetes. Almost half of the patients with DPN develop neuropathic pain (NeuP) for which current analgesic treatments are inadequate. Understanding the role of genetic variability in the development of painful DPN is needed for improved understanding of pain pathogenesis, for better patient stratification in clinical trials, and to target therapy more appropriately. Here, we examined the relationship between variants in the voltage-gated sodium channel NaV1.7 and NeuP in a deeply phenotyped cohort of patients with DPN. Although no rare variants were found in 78 participants with painless DPN, we identified 12 rare NaV1.7 variants in 10 (out of 111) study participants with painful DPN. Five of these variants had previously been described in the context of other NeuP disorders and 7 have not previously been linked to NeuP. Those patients with rare variants reported more severe pain and greater sensitivity to pressure stimuli on quantitative sensory testing. Electrophysiological characterization of 2 of the novel variants (M1852T and T1596I) demonstrated gain-of-function changes as a consequence of markedly impaired channel fast inactivation. Using a structural model of NaV1.7, we were also able to provide further insight into the structural mechanisms underlying fast inactivation and the role of the C-terminal domain in this process. Our observations suggest that rare NaV1.7 variants contribute to the development of NeuP in patients with DPN. Their identification should aid understanding of sensory phenotype, patient stratification, and help target treatments effectively. PMID:29176367
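
    The carrier counts quoted in this abstract (10 of 111 participants with painful DPN versus 0 of 78 with painless DPN) form a 2x2 table. The abstract does not say which statistical test, if any, the authors applied to these counts, so the following is only an illustrative sketch of how such a table could be examined with a two-sided Fisher's exact test. The function name `fisher_exact_two_sided` is hypothetical, and the code uses only the Python standard library.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test for the 2x2 table [[a, b], [c, d]]."""
    row1, row2, col1 = a + b, c + d, a + c
    total = row1 + row2
    denom = comb(total, col1)

    def prob(x):
        # Hypergeometric probability of x in the top-left cell, margins fixed
        return comb(row1, x) * comb(row2, col1 - x) / denom

    observed = prob(a)
    lo, hi = max(0, col1 - row2), min(row1, col1)
    # Sum the probabilities of all tables no more likely than the observed one
    # (the small tolerance guards against floating-point ties)
    return sum(
        p for p in (prob(x) for x in range(lo, hi + 1))
        if p <= observed * (1 + 1e-9)
    )


# Rows: painful DPN, painless DPN; columns: carriers, non-carriers
# Counts are taken from the abstract (10 of 111 and 0 of 78).
p_value = fisher_exact_two_sided(10, 101, 0, 78)
print(f"two-sided p = {p_value:.4f}")
```

    A test on carrier counts alone ignores covariates and the deep phenotyping described in the study, so it should be read as a rough check rather than a reanalysis.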

  18. m6ASNP: a tool for annotating genetic variants by m6A function.

    PubMed

    Jiang, Shuai; Xie, Yubin; He, Zhihao; Zhang, Ya; Zhao, Yuli; Chen, Li; Zheng, Yueyuan; Miao, Yanyan; Zuo, Zhixiang; Ren, Jian

    2018-05-01

    Large-scale genome sequencing projects have identified many genetic variants for diverse diseases. A major goal of these projects is to characterize these genetic variants to provide insight into their function and roles in diseases. N6-methyladenosine (m6A) is one of the most abundant RNA modifications in eukaryotes. Recent studies have revealed that aberrant m6A modifications are involved in many diseases. In this study, we present a user-friendly web server called "m6ASNP" that is dedicated to the identification of genetic variants that target m6A modification sites. A random forest model was implemented in m6ASNP to predict whether the methylation status of an m6A site is altered by the variants that surround the site. In m6ASNP, genetic variants in a standard variant call format (VCF) are accepted as the input data, and the output includes an interactive table that contains the genetic variants annotated by m6A function. In addition, statistical diagrams and a genome browser are provided to visualize the characteristics and to annotate the genetic variants. We believe that m6ASNP is a very convenient tool that can be used to boost further functional studies investigating genetic variants. The web server "m6ASNP" is implemented in JAVA and PHP and is freely available at [60].
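
    The abstract states that m6ASNP accepts variants in variant call format (VCF). As a rough illustration of what that input looks like, the sketch below reads the first five tab-separated columns of VCF data lines (CHROM, POS, ID, REF, ALT). It is not m6ASNP's own parser; the `Variant` type, the `parse_vcf` function, and the example records are all hypothetical.

```python
from typing import NamedTuple


class Variant(NamedTuple):
    chrom: str
    pos: int
    ref: str
    alt: str


def parse_vcf(text):
    """Return one Variant per ALT allele found in VCF-formatted text."""
    variants = []
    for line in text.splitlines():
        if not line or line.startswith("#"):  # skip blank, meta, and header lines
            continue
        chrom, pos, _id, ref, alts = line.split("\t")[:5]
        for alt in alts.split(","):  # multi-allelic sites list ALT alleles comma-separated
            variants.append(Variant(chrom, int(pos), ref, alt))
    return variants


# Made-up records, for illustration only
example = (
    "##fileformat=VCFv4.2\n"
    "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO\n"
    "chr1\t12345\t.\tA\tG\t.\tPASS\t.\n"
    "chr2\t500\t.\tC\tT,G\t.\tPASS\t.\n"
)
variants = parse_vcf(example)
for v in variants:
    print(v)
```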

  19. Common 5S rRNA variants are likely to be accepted in many sequence contexts

    NASA Technical Reports Server (NTRS)

    Zhang, Zhengdong; D'Souza, Lisa M.; Lee, Youn-Hyung; Fox, George E.

    2003-01-01

    Over evolutionary time RNA sequences which are successfully fixed in a population are selected from among those that satisfy the structural and chemical requirements imposed by the function of the RNA. These sequences together comprise the structure space of the RNA. In principle, a comprehensive understanding of RNA structure and function would make it possible to enumerate which specific RNA sequences belong to a particular structure space and which do not. We are using bacterial 5S rRNA as a model system to attempt to identify principles that can be used to predict which sequences do or do not belong to the 5S rRNA structure space. One promising idea is the very intuitive notion that frequently seen sequence changes in an aligned data set of naturally occurring 5S rRNAs would be widely accepted in many other 5S rRNA sequence contexts. To test this hypothesis, we first developed well-defined operational definitions for a Vibrio region of the 5S rRNA structure space and what is meant by a highly variable position. Fourteen sequence variants (10 point changes and 4 base-pair changes) were identified in this way, which, by the hypothesis, would be expected to incorporate successfully in any of the known sequences in the Vibrio region. All 14 of these changes were constructed and separately introduced into the Vibrio proteolyticus 5S rRNA sequence where they are not normally found. Each variant was evaluated for its ability to function as a valid 5S rRNA in an E. coli cellular context. It was found that 93% (13/14) of the variants tested are likely valid 5S rRNAs in this context. In addition, seven variants were constructed that, although present in the Vibrio region, did not meet the stringent criteria for a highly variable position. In this case, 86% (6/7) are likely valid. As a control we also examined seven variants that are seldom or never seen in the Vibrio region of 5S rRNA sequence space. In this case only two of seven were found to be potentially valid. 
    The results demonstrate that changes that occur multiple times in a local region of RNA sequence space in fact usually will be accepted in any sequence context in that same local region.

  20. Comparison of statistical tests for association between rare variants and binary traits.

    PubMed

    Bacanu, Silviu-Alin; Nelson, Matthew R; Whittaker, John C

    2012-01-01

    Genome-wide association studies have found thousands of common genetic variants associated with a wide variety of diseases and other complex traits. However, a large portion of the predicted genetic contribution to many traits remains unknown. One plausible explanation is that some of the missing variation is due to the effects of rare variants. Nonetheless, the statistical analysis of rare variants is challenging. A commonly used method is to contrast, within the same region (gene), the frequency of minor alleles at rare variants between cases and controls. However, this strategy is most useful under the assumption that the tested variants have similar effects. We previously proposed a method that can accommodate heterogeneous effects in the analysis of quantitative traits. Here we extend this method to include binary traits that can accommodate covariates. We use simulations for a variety of causal and covariate impact scenarios to compare the performance of the proposed method to standard logistic regression, C-alpha, SKAT, and EREC. We found that (i) logistic regression methods perform well when the heterogeneity of the effects is not extreme and (ii) SKAT and EREC have good performance under all tested scenarios but they can be computationally intensive. Consequently, it would be more computationally desirable to use a two-step strategy by (i) selecting promising genes by faster methods and (ii) analyzing selected genes using SKAT/EREC. To select promising genes one can use (1) regression methods when effect heterogeneity is assumed to be low and the covariates explain a non-negligible part of trait variability, (2) C-alpha when heterogeneity is assumed to be large and covariates explain a small fraction of trait variability and (3) the proposed trend and heterogeneity test when the heterogeneity is assumed to be non-trivial and the covariates explain a large fraction of trait variability.
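
    The "commonly used method" mentioned in this abstract, contrasting the frequency of minor alleles at rare variants within a gene between cases and controls, can be sketched as a simple burden test. The version below is a generic illustration with a permutation p-value. It is not the authors' proposed trend and heterogeneity test, nor C-alpha, SKAT, or EREC, and it ignores covariates. The function name, the toy genotypes, and the choice of 999 permutations are assumptions made for the example.

```python
import random


def burden_permutation_test(genotypes, is_case, n_perm=999, seed=0):
    """One-sided permutation test for excess rare-allele burden in cases.

    genotypes: one list per individual of minor-allele counts (0, 1, 2)
               at the rare variants of a single gene.
    is_case:   parallel list of booleans (True = case, False = control).
    """
    burden = [sum(g) for g in genotypes]  # collapse variants into one score
    total_burden = sum(burden)
    n_cases = sum(is_case)
    n_controls = len(is_case) - n_cases

    def mean_diff(labels):
        case_sum = sum(b for b, c in zip(burden, labels) if c)
        return case_sum / n_cases - (total_burden - case_sum) / n_controls

    observed = mean_diff(is_case)
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    labels = list(is_case)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(labels)  # break any genotype-phenotype link
        if mean_diff(labels) >= observed:
            hits += 1
    return (hits + 1) / (n_perm + 1)


# Toy data: every case carries one rare allele, no control does
toy_genotypes = [[1, 0, 0]] * 10 + [[0, 0, 0]] * 10
toy_status = [True] * 10 + [False] * 10
p_burden = burden_permutation_test(toy_genotypes, toy_status)
print(f"permutation p = {p_burden:.3f}")
```

    Because all variants are summed into one score, this kind of test loses power when effects differ in size or direction, which is the limitation the abstract raises.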

  1. Common susceptibility variants are shared between schizophrenia and psoriasis in the Han Chinese population

    PubMed Central

    Yin, Xianyong; Wineinger, Nathan E.; Wang, Kai; Yue, Weihua; Norgren, Nina; Wang, Ling; Yao, Weiyi; Jiang, Xiaoyun; Wu, Bo; Cui, Yong; Shen, Changbing; Cheng, Hui; Zhou, Fusheng; Chen, Gang; Zuo, Xianbo; Zheng, Xiaodong; Fan, Xing; Wang, Hongyan; Wang, Lifang; Lee, Jimmy; Lam, Max; Tai, E. Shyong; Zhang, Zheng; Huang, Qiong; Sun, Liangdan; Xu, Jinhua; Yang, Sen; Wilhelmsen, Kirk C.; Liu, Jianjun; Schork, Nicholas J.; Zhang, Xuejun

    2016-01-01

    Background: Previous studies have shown that individuals with schizophrenia have a greater risk for psoriasis than a typical person. This suggests that there might be a shared genetic etiology between the 2 conditions. We aimed to characterize the potential shared genetic susceptibility between schizophrenia and psoriasis using genome-wide marker genotype data. Methods: We obtained genetic data on individuals with psoriasis, schizophrenia and control individuals. We applied a marker-based coheritability estimation procedure, polygenic score analysis, a gene set enrichment test and a least absolute shrinkage and selection operator regression model to estimate the potential shared genetic etiology between the 2 diseases. We validated the results in independent schizophrenia and psoriasis cohorts from Singapore. Results: We included 1139 individuals with psoriasis, 744 with schizophrenia and 1678 controls in our analysis, and we validated the results in independent cohorts, including 441 individuals with psoriasis (and 2420 controls) and 1630 with schizophrenia (and 1860 controls). We estimated that a large fraction of schizophrenia and psoriasis risk could be attributed to common variants (h²SNP = 29% ± 5.0%, p = 2.00 × 10^−8), with a coheritability estimate between the traits of 21%. We identified 5 variants within the human leukocyte antigen (HLA) gene region, which were most likely to be associated with both diseases and collectively conferred a significant risk effect (odds ratio of highest risk quartile = 6.03, p < 2.00 × 10^−16). We discovered that variants contributing most to the shared heritable component between psoriasis and schizophrenia were enriched in antigen processing and cell endoplasmic reticulum. Limitations: Our sample size was relatively small. The findings of 5 HLA gene variants were complicated by the complex structure in the HLA region. Conclusion: We found evidence for a shared genetic etiology between schizophrenia and psoriasis.
    The mechanism for this shared genetic basis likely involves immune and calcium signalling pathways. PMID:27091718
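
    Polygenic score analysis, one of the methods named in this abstract, generally sums an individual's risk-allele dosages weighted by per-marker effect estimates. The abstract does not describe how the authors built their scores, so the sketch below shows only the generic calculation. The marker names, weights, and dosages are hypothetical.

```python
def polygenic_score(dosages, weights):
    """Weighted sum of risk-allele dosages (0, 1, or 2) over the markers in `weights`.

    Markers missing from `dosages` are counted as 0 here, which is a
    simplification; real pipelines usually impute or exclude them.
    """
    return sum(weights[snp] * dosages.get(snp, 0) for snp in weights)


# Hypothetical per-marker effect estimates (for example, log odds ratios)
weights = {"snp_a": 0.12, "snp_b": -0.05, "snp_c": 0.30}
# Hypothetical genotype of one individual, as risk-allele counts
person = {"snp_a": 2, "snp_b": 1, "snp_c": 0}

score = polygenic_score(person, weights)
print(f"polygenic score = {score:.2f}")
```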

  2. Pathogenicity of Different Rabies Virus Variants Inversely Correlates with Apoptosis and Rabies Virus Glycoprotein Expression in Infected Primary Neuron Cultures

    PubMed Central

    Morimoto, Kinjiro; Hooper, D. Craig; Spitsin, Sergei; Koprowski, Hilary; Dietzschold, Bernhard

    1999-01-01

    The mouse-adapted rabies virus strain CVS-24 has stable variants, CVS-B2c and CVS-N2c, which differ greatly in their pathogenicity for normal adult mice and in their ability to infect nonneuronal cells. The glycoprotein (G protein), which has previously been implicated in rabies virus pathogenicity, shows substantial structural differences between these variants. Although prior studies have identified antigenic site III of the G protein as the major pathogenicity determinant, CVS-B2c and CVS-N2c do not vary at this site. The possibility that pathogenicity is inversely related to G protein expression levels is suggested by the finding that CVS-B2c, the less pathogenic variant, expresses at least fourfold-higher levels of G protein than CVS-N2c in infected neurons. Although there is some difference between CVS-B2c- and CVS-N2c-infected neurons in G protein mRNA expression levels, the differential expression of G protein appears to be largely determined by posttranslational mechanisms that affect G protein stability. Pulse-chase experiments indicated that the G protein of CVS-B2c is degraded more slowly than that of CVS-N2c. The accumulation of G protein correlated with the induction of programmed cell death in CVS-B2c-infected neurons. The extent of apoptosis was considerably lower in CVS-N2c-infected neurons, where G protein expression was minimal. While nucleoprotein (N protein) expression levels were similar in neurons infected with either variant, the transport of N protein into neuronal processes was strongly inhibited in CVS-B2c-infected cells. Thus, downregulation of G protein expression in neuronal cells evidently contributes to rabies virus pathogenesis by preventing apoptosis and the apparently associated failure of the axonal transport of N protein. PMID:9847357

  3. Topology of AspT, the aspartate:alanine antiporter of Tetragenococcus halophilus, determined by site-directed fluorescence labeling.

    PubMed

    Nanatani, Kei; Fujiki, Takashi; Kanou, Kazuhiko; Takeda-Shitaka, Mayuko; Umeyama, Hideaki; Ye, Liwen; Wang, Xicheng; Nakajima, Tasuku; Uchida, Takafumi; Maloney, Peter C; Abe, Keietsu

    2007-10-01

    The gram-positive lactic acid bacterium Tetragenococcus halophilus catalyzes the decarboxylation of L-aspartate (Asp) with release of L-alanine (Ala) and CO2. The decarboxylation reaction consists of two steps: electrogenic exchange of Asp for Ala catalyzed by an aspartate:alanine antiporter (AspT) and intracellular decarboxylation of the transported Asp catalyzed by an L-aspartate-beta-decarboxylase (AspD). AspT belongs to the newly classified aspartate:alanine exchanger family (transporter classification no. 2.A.81) of transporters. In this study, we were interested in the relationship between the structure and function of AspT and thus analyzed the topology by means of the substituted-cysteine accessibility method using the impermeant, fluorescent, thiol-specific probe Oregon Green 488 maleimide (OGM) and the impermeant, nonfluorescent, thiol-specific probe [2-(trimethylammonium)ethyl]methanethiosulfonate bromide. We generated 23 single-cysteine variants from a six-histidine-tagged cysteineless AspT template. A cysteine position was assigned an external location if the corresponding single-cysteine variant reacted with OGM added to intact cells, and a position was assigned an internal location if OGM labeling required cell lysis. The topology analyses revealed that AspT has a unique topology; the protein has 10 transmembrane helices (TMs), a large hydrophilic cytoplasmic loop (about 180 amino acids) between TM5 and TM6, N and C termini that face the periplasm, and a positively charged residue (arginine 76) within TM3. Moreover, the three-dimensional structure constructed by means of the full automatic modeling system indicates that the large hydrophilic cytoplasmic loop of AspT possesses a TrkA_C domain and a TrkA_C-like domain and that the three-dimensional structures of these domains are similar to each other even though their amino acid sequences show low similarity.

  4. Topology of AspT, the Aspartate:Alanine Antiporter of Tetragenococcus halophilus, Determined by Site-Directed Fluorescence Labeling

    PubMed Central

    Nanatani, Kei; Fujiki, Takashi; Kanou, Kazuhiko; Takeda-Shitaka, Mayuko; Umeyama, Hideaki; Ye, Liwen; Wang, Xicheng; Nakajima, Tasuku; Uchida, Takafumi; Maloney, Peter C.; Abe, Keietsu

    2007-01-01

    The gram-positive lactic acid bacterium Tetragenococcus halophilus catalyzes the decarboxylation of l-aspartate (Asp) with release of l-alanine (Ala) and CO2. The decarboxylation reaction consists of two steps: electrogenic exchange of Asp for Ala catalyzed by an aspartate:alanine antiporter (AspT) and intracellular decarboxylation of the transported Asp catalyzed by an l-aspartate-β-decarboxylase (AspD). AspT belongs to the newly classified aspartate:alanine exchanger family (transporter classification no. 2.A.81) of transporters. In this study, we were interested in the relationship between the structure and function of AspT and thus analyzed the topology by means of the substituted-cysteine accessibility method using the impermeant, fluorescent, thiol-specific probe Oregon Green 488 maleimide (OGM) and the impermeant, nonfluorescent, thiol-specific probe [2-(trimethylammonium)ethyl]methanethiosulfonate bromide. We generated 23 single-cysteine variants from a six-histidine-tagged cysteineless AspT template. A cysteine position was assigned an external location if the corresponding single-cysteine variant reacted with OGM added to intact cells, and a position was assigned an internal location if OGM labeling required cell lysis. The topology analyses revealed that AspT has a unique topology; the protein has 10 transmembrane helices (TMs), a large hydrophilic cytoplasmic loop (about 180 amino acids) between TM5 and TM6, N and C termini that face the periplasm, and a positively charged residue (arginine 76) within TM3. Moreover, the three-dimensional structure constructed by means of the full automatic modeling system indicates that the large hydrophilic cytoplasmic loop of AspT possesses a TrkA_C domain and a TrkA_C-like domain and that the three-dimensional structures of these domains are similar to each other even though their amino acid sequences show low similarity. PMID:17660287

  5. Retarded protein folding of deficient human α1-antitrypsin D256V and L41P variants

    PubMed Central

    Jung, Chan-Hun; Na, Yu-Ran; Im, Hana

    2004-01-01

    α1-Antitrypsin is the most abundant protease inhibitor in plasma and is the archetype of the serine protease inhibitor superfamily. Genetic variants of human α1-antitrypsin are associated with early-onset emphysema and liver cirrhosis. However, the detailed molecular mechanism for the pathogenicity of most variant α1-antitrypsin molecules is not known. Here we examined the structural basis of a dozen deficient α1-antitrypsin variants. Unlike most α1-antitrypsin variants, which were unstable, D256V and L41P variants exhibited extremely retarded protein folding as compared with the wild-type molecule. Once folded, however, the stability and inhibitory activity of these variant proteins were comparable to those of the wild-type molecule. Retarded protein folding may promote protein aggregation by allowing the accumulation of aggregation-prone folding intermediates. Repeated observations of retarded protein folding indicate that it is an important mechanism causing α1-antitrypsin deficiency by variant molecules, which have to fold into the metastable native form to be functional. PMID:14767073

  6. Clinical polyomavirus BK variants with agnogene deletion are non-functional but rescued by trans-complementation

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Myhre, Marit Renee; Olsen, Gunn-Hege; Gosert, Rainer

    High-level replication of polyomavirus BK (BKV) in kidney transplant recipients is associated with the emergence of BKV variants with rearranged (rr) non-coding control region (NCCR) increasing viral early gene expression and cytopathology. Cloning and sequencing revealed the presence of a BKV quasispecies which included non-functional variants when assayed in a recombinant virus assay. Here we report that the rr-NCCR of BKV variants RH-3 and RH-12, both bearing a NCCR deletion including the 5' end of the agnoprotein coding sequence, mediated early and late viral reporter gene expression in kidney cells. However, in a recombinant virus they failed to produce infectious progeny despite large T-antigen and VP1 expression and the formation of nuclear virus-like particles. Infectious progeny was generated when the agnogene was reconstructed in cis or agnoprotein provided in trans from a co-existing BKV rr-NCCR variant. We conclude that complementation can rescue non-functional BKV variants in vitro and possibly in vivo.

  7. Fine-mapping inflammatory bowel disease loci to single variant resolution

    PubMed Central

    Huang, Hailiang; Fang, Ming; Jostins, Luke; Mirkov, Maša Umićević; Boucher, Gabrielle; Anderson, Carl A; Andersen, Vibeke; Cleynen, Isabelle; Cortes, Adrian; Crins, François; D'Amato, Mauro; Deffontaine, Valérie; Dimitrieva, Julia; Docampo, Elisa; Elansary, Mahmoud; Farh, Kyle Kai-How; Franke, Andre; Gori, Ann-Stephan; Goyette, Philippe; Halfvarson, Jonas; Haritunians, Talin; Knight, Jo; Lawrance, Ian C; Lees, Charlie W; Louis, Edouard; Mariman, Rob; Meuwissen, Theo; Mni, Myriam; Momozawa, Yukihide; Parkes, Miles; Spain, Sarah L; Théâtre, Emilie; Trynka, Gosia; Satsangi, Jack; van Sommeren, Suzanne; Vermeire, Severine; Xavier, Ramnik J; Weersma, Rinse K; Duerr, Richard H; Mathew, Christopher G; Rioux, John D; McGovern, Dermot PB; Cho, Judy H; Georges, Michel; Daly, Mark J; Barrett, Jeffrey C

    2017-01-01

    Summary: The inflammatory bowel diseases (IBD) are chronic gastrointestinal inflammatory disorders that affect millions worldwide. Genome-wide association studies have identified 200 IBD-associated loci, but few have been conclusively resolved to specific functional variants. Here we report fine-mapping of 94 IBD loci using high-density genotyping in 67,852 individuals. We pinpointed 18 associations to a single causal variant with >95% certainty, and an additional 27 associations to a single variant with >50% certainty. These 45 variants are significantly enriched for protein-coding changes (n=13), direct disruption of transcription factor binding sites (n=3) and tissue-specific epigenetic marks (n=10), with the latter category showing enrichment in specific immune cells among associations stronger in CD and in gut mucosa among associations stronger in UC. The results of this study suggest that high-resolution fine-mapping in large samples can convert many GWAS discoveries into statistically convincing causal variants, providing a powerful substrate for experimental elucidation of disease mechanisms. PMID:28658209

  8. Decision-making and addiction (part II): myopia for the future or hypersensitivity to reward?

    PubMed

    Bechara, Antoine; Dolan, Sara; Hindes, Andrea

    2002-01-01

    On a decision-making instrument known as the "gambling task" (GT), a subgroup of substance dependent individuals (SDI) opted for choices that yield high immediate gains in spite of higher future losses. This resembles the behavior of patients with ventromedial (VM) prefrontal cortex lesions. In this study, we addressed the possibility that hypersensitivity to reward may account for the "myopia" for the future in this subgroup of SDI. We used a variant version of the GT, in which the good decks yielded high immediate punishment but higher delayed reward. The bad decks yielded low immediate punishment and lower delayed reward. We measured the skin conductance response (SCR) of subjects after receiving reward (reward SCR) and during their pondering from which deck to choose (anticipatory SCR). A subgroup of SDI who was not impaired on the original GT performed normally on the variant GT. The subgroup of SDI who was impaired on the original GT showed two levels of performance on the variant GT. One subgroup (36% of the sample) performed poorly on the variant GT, and showed similar behavioral and physiological impairments to VM patients. The other subgroup of SDI (64% of the sample) performed normally on the variant task, but had abnormally large physiological responses to reward, i.e. large SCR after receiving reward (reward SCR) and large SCR in anticipation of outcomes that yield large reward. Thus, the combined cognitive and physiological approach of assessing decision-making characterizes three sub-populations of SDI. One sub-population is without impairments that can be detected by any measure of the GT paradigm. Another sub-population is similar to VM patients in that they are insensitive to the future, both positive and negative. A third sub-population is hypersensitive to reward, so that the presence of reward, or the prospect of receiving it, dominates their behavior.

  9. The counsellees' view of an unclassified variant in BRCA1/2: recall, interpretation, and impact on life.

    PubMed

    Vos, Joël; Otten, Wilma; van Asperen, Christi; Jansen, Anna; Menko, Fred; Tibben, Aad

    2008-08-01

    Unclassified variants (UVs, variants of uncertain clinical significance) are found in 13% of all BRCA1/2 mutation analyses. Little is known about the counsellees' recall and interpretation of a UV, and its psychosocial/medical impact. Retrospective semi-structured interviews with open questions and five-point Likert scales were carried out in 24 counsellees who received a UV result 3 years before (sd=1.9). Sixty-seven percent (16/24) recalled the UV result as a non-informative DNA result; 29% recalled a pathogenic result. However, 79% of all counsellees interpreted the UV result as a genetic predisposition for cancer. Variations in recall and interpretation were unexplained by demographics, cancer history of themselves and relatives, and communication aspects of UV disclosure. Sixty-seven percent perceived genetic counselling as completed, whereas 71% expected to receive new DNA information. Although most counsellees reported that UV disclosure had changed their lives little in general, one in three counsellees reported large changes in specific life domains, especially in surveillance behavior and medical decisions. Ten out of 19 participants who interpreted the UV as pathogenic had undergone preventive surgery, compared with none of the 5 counsellees who interpreted the UV as non-informative. Counsellors and researchers need to address discrepancies between the counsellees' factual recall and their subjective interpretation of non-informative BRCA1/2-test results.

  10. Sensitivity to sequencing depth in single-cell cancer genomics.

    PubMed

    Alves, João M; Posada, David

    2018-04-16

    Querying cancer genomes at single-cell resolution is expected to provide a powerful framework to understand in detail the dynamics of cancer evolution. However, given the high costs currently associated with single-cell sequencing, together with the inevitable technical noise arising from single-cell genome amplification, cost-effective strategies that maximize the quality of single-cell data are critically needed. Taking advantage of previously published single-cell whole-genome and whole-exome cancer datasets, we studied the impact of sequencing depth and sampling effort towards single-cell variant detection. Five single-cell whole-genome and whole-exome cancer datasets were independently downscaled to 25, 10, 5, and 1× sequencing depth. For each depth level, ten technical replicates were generated, resulting in a total of 6280 single-cell BAM files. The sensitivity of variant detection, including structural and driver mutations, genotyping, clonal inference, and phylogenetic reconstruction to sequencing depth was evaluated using recent tools specifically designed for single-cell data. Altogether, our results suggest that for relatively large sample sizes (25 or more cells) sequencing single tumor cells at depths > 5× does not drastically improve somatic variant discovery, characterization of clonal genotypes, or estimation of single-cell phylogenies. We suggest that sequencing multiple individual tumor cells at a modest depth represents an effective alternative to explore the mutational landscape and clonal evolutionary patterns of cancer genomes.
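
    The downscaling design described in this abstract (four target depths, ten technical replicates per depth, 6280 BAM files in total) can be checked with a little arithmetic. The abstract does not state how many cells were used; the count derived below is an inference that assumes every cell was downscaled to every depth in every replicate. The helper `keep_fraction` and the 50× starting depth in the usage line are hypothetical and only illustrate how a read-retention fraction would follow from a target depth.

```python
# Values quoted in the abstract
target_depths = [25, 10, 5, 1]   # sequencing depths after downscaling (x)
replicates_per_depth = 10
total_bam_files = 6280

# Files generated per cell if every cell gets every depth and replicate
files_per_cell = len(target_depths) * replicates_per_depth

# Inferred, not stated in the abstract: number of cells implied by the totals
implied_cells, remainder = divmod(total_bam_files, files_per_cell)
print(f"{files_per_cell} files per cell, {implied_cells} cells implied, remainder {remainder}")


def keep_fraction(original_depth, target_depth):
    """Fraction of reads to retain so that mean depth drops to target_depth."""
    if original_depth <= 0 or target_depth <= 0:
        raise ValueError("depths must be positive")
    if target_depth > original_depth:
        raise ValueError("cannot downscale to a depth above the original")
    return target_depth / original_depth


# Hypothetical starting depth of 50x, for illustration only
fractions = {d: keep_fraction(50, d) for d in target_depths}
print(fractions)
```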

  11. CDKL5 variants: Improving our understanding of a rare neurologic disorder.

    PubMed

    Hector, Ralph D; Kalscheuer, Vera M; Hennig, Friederike; Leonard, Helen; Downs, Jenny; Clarke, Angus; Benke, Tim A; Armstrong, Judith; Pineda, Mercedes; Bailey, Mark E S; Cobb, Stuart R

    2017-12-01

    To provide new insights into the interpretation of genetic variants in a rare neurologic disorder, CDKL5 deficiency, in the contexts of population sequencing data and an updated characterization of the CDKL5 gene. We analyzed all known potentially pathogenic CDKL5 variants by combining data from large-scale population sequencing studies with CDKL5 variants from new and all available clinical cohorts and combined this with computational methods to predict pathogenicity. The study has identified several variants that can be reclassified as benign or likely benign. With the addition of novel CDKL5 variants, we confirm that pathogenic missense variants cluster in the catalytic domain of CDKL5 and reclassify a purported missense variant as having a splicing consequence. We provide further evidence that missense variants in the final 3 exons are likely to be benign and not important to disease pathology. We also describe benign splicing and nonsense variants within these exons, suggesting that isoform hCDKL5_5 is likely to have little or no neurologic significance. We also use the available data to make a preliminary estimate of minimum incidence of CDKL5 deficiency. These findings have implications for genetic diagnosis, providing evidence for the reclassification of specific variants previously thought to result in CDKL5 deficiency. Together, these analyses support the view that the predominant brain isoform in humans (hCDKL5_1) is crucial for normal neurodevelopment and that the catalytic domain is the primary functional domain.

  12. CDKL5 variants

    PubMed Central

    Kalscheuer, Vera M.; Hennig, Friederike; Leonard, Helen; Downs, Jenny; Clarke, Angus; Benke, Tim A.; Armstrong, Judith; Pineda, Mercedes; Bailey, Mark E.S.; Cobb, Stuart R.

    2017-01-01

    Objective: To provide new insights into the interpretation of genetic variants in a rare neurologic disorder, CDKL5 deficiency, in the contexts of population sequencing data and an updated characterization of the CDKL5 gene. Methods: We analyzed all known potentially pathogenic CDKL5 variants by combining data from large-scale population sequencing studies with CDKL5 variants from new and all available clinical cohorts and combined this with computational methods to predict pathogenicity. Results: The study has identified several variants that can be reclassified as benign or likely benign. With the addition of novel CDKL5 variants, we confirm that pathogenic missense variants cluster in the catalytic domain of CDKL5 and reclassify a purported missense variant as having a splicing consequence. We provide further evidence that missense variants in the final 3 exons are likely to be benign and not important to disease pathology. We also describe benign splicing and nonsense variants within these exons, suggesting that isoform hCDKL5_5 is likely to have little or no neurologic significance. We also use the available data to make a preliminary estimate of minimum incidence of CDKL5 deficiency. Conclusions: These findings have implications for genetic diagnosis, providing evidence for the reclassification of specific variants previously thought to result in CDKL5 deficiency. Together, these analyses support the view that the predominant brain isoform in humans (hCDKL5_1) is crucial for normal neurodevelopment and that the catalytic domain is the primary functional domain. PMID:29264392

  13. The Association between Pediatric NAFLD and Common Genetic Variants

    PubMed Central

    Umano, Giuseppina Rosaria; Martino, Mariangela; Santoro, Nicola

    2017-01-01

    Non-alcoholic fatty liver disease (NAFLD) is one of the most common complications of obesity. Several studies have shown that genetic predisposition probably plays an important role in its pathogenesis. In fact, in the last few years a large number of genetic studies have provided compelling evidence that some gene variants, especially those in genes encoding proteins regulating lipid metabolism, are associated with intra-hepatic fat accumulation. Here we provide a comprehensive review of the gene variants that have affected the natural history of the disease. PMID:28629152

  14. Mobile Interspersed Repeats Are Major Structural Variants in the Human Genome

    PubMed Central

    Huang, Cheng Ran Lisa; Schneider, Anna M.; Lu, Yunqi; Niranjan, Tejasvi; Shen, Peilin; Robinson, Matoya A.; Steranka, Jared P.; Valle, David; Civin, Curt I.; Wang, Tao; Wheelan, Sarah J.; Ji, Hongkai; Boeke, Jef D.; Burns, Kathleen H.

    2010-01-01

    Summary: Characterizing structural variants in the human genome is of great importance, but a genome-wide analysis to detect interspersed repeats has not been done. Thus, the degree to which mobile DNAs contribute to genetic diversity, heritable disease, and oncogenesis remains speculative. We perform transposon insertion profiling by microarray (TIP-chip) to map human L1(Ta) retrotransposons (LINE-1s) genome-wide. This identified numerous novel human L1(Ta) insertional polymorphisms with highly variant allelic frequencies. We also explored TIP-chip's usefulness to identify candidate alleles associated with different phenotypes in clinical cohorts. Our data suggest that the occurrence of new insertions is twice as high as previously estimated, and that these repeats are under-recognized as sources of human genomic and phenotypic diversity. We have just begun to probe the universe of human L1(Ta) polymorphisms, and as TIP-chip is applied to other insertions such as Alu SINEs, it will expand the catalog of genomic variants even further. PMID:20602999

  15. Fine structure characterization of martensite/austenite constituent in low-carbon low-alloy steel by transmission electron forward scatter diffraction.

    PubMed

    Li, C W; Han, L Z; Luo, X M; Liu, Q D; Gu, J F

    2016-11-01

    Transmission electron forward scatter diffraction and other characterization techniques were used to investigate the fine structure and the variant relationship of the martensite/austenite (M/A) constituent of the granular bainite in low-carbon low-alloy steel. The results demonstrated that the M/A constituents were distributed in clusters throughout the bainitic ferrite. Lath martensite was the main component of the M/A constituent, where the relationship between the martensite variants was consistent with the Nishiyama-Wassermann orientation relationship and only three variants were found in the M/A constituent, suggesting that the variants had formed in the M/A constituent according to a specific mechanism. Furthermore, the Σ3 boundaries in the M/A constituent were much longer than their counterparts in the bainitic ferrite region. The results indicate that transmission electron forward scatter diffraction is an effective method of crystallographic analysis for nanolaths in M/A constituents. © 2016 The Authors Journal of Microscopy © 2016 Royal Microscopical Society.

  16. Spatially variant periodic structures in electromagnetics.

    PubMed

    Rumpf, Raymond C; Pazos, Javier J; Digaum, Jennefir L; Kuebler, Stephen M

    2015-08-28

    Spatial transforms are a popular technique for designing periodic structures that are macroscopically inhomogeneous. The structures are often required to be anisotropic, provide a magnetic response, and to have extreme values for the constitutive parameters in Maxwell's equations. Metamaterials and photonic crystals are capable of providing these, although sometimes only approximately. The problem still remains about how to generate the geometry of the final lattice when it is functionally graded, or spatially varied. This paper describes a simple numerical technique to spatially vary any periodic structure while minimizing deformations to the unit cells that would weaken or destroy the electromagnetic properties. New developments in this algorithm are disclosed that increase efficiency, improve the quality of the lattices and provide the ability to design aplanatic metasurfaces. The ability to spatially vary a lattice in this manner enables new design paradigms that are not possible using spatial transforms, three of which are discussed here. First, spatially variant self-collimating photonic crystals are shown to flow unguided waves around very tight bends using ordinary materials with low refractive index. Second, multi-mode waveguides in spatially variant band gap materials are shown to guide waves around bends without mixing power between the modes. Third, spatially variant anisotropic materials are shown to sculpt the near-field around electric components. This can be used to improve electromagnetic compatibility between components in close proximity. © 2015 The Author(s) Published by the Royal Society. All rights reserved.

  17. Spatially variant periodic structures in electromagnetics

    PubMed Central

    Rumpf, Raymond C.; Pazos, Javier J.; Digaum, Jennefir L.; Kuebler, Stephen M.

    2015-01-01

    Spatial transforms are a popular technique for designing periodic structures that are macroscopically inhomogeneous. The structures are often required to be anisotropic, provide a magnetic response, and to have extreme values for the constitutive parameters in Maxwell's equations. Metamaterials and photonic crystals are capable of providing these, although sometimes only approximately. The problem still remains about how to generate the geometry of the final lattice when it is functionally graded, or spatially varied. This paper describes a simple numerical technique to spatially vary any periodic structure while minimizing deformations to the unit cells that would weaken or destroy the electromagnetic properties. New developments in this algorithm are disclosed that increase efficiency, improve the quality of the lattices and provide the ability to design aplanatic metasurfaces. The ability to spatially vary a lattice in this manner enables new design paradigms that are not possible using spatial transforms, three of which are discussed here. First, spatially variant self-collimating photonic crystals are shown to flow unguided waves around very tight bends using ordinary materials with low refractive index. Second, multi-mode waveguides in spatially variant band gap materials are shown to guide waves around bends without mixing power between the modes. Third, spatially variant anisotropic materials are shown to sculpt the near-field around electric components. This can be used to improve electromagnetic compatibility between components in close proximity. PMID:26217058

  18. Recurrent emergence of structural variants of LTR retrotransposon CsRn1 evolving novel expression strategy and their selective expansion in a carcinogenic liver fluke, Clonorchis sinensis.

    PubMed

    Kim, Seon-Hee; Kong, Yoon; Bae, Young-An

    2017-06-01

    Autonomous retrotransposons, in which replication and transcription are coupled, encode the essential gag and pol genes as a fusion or separate overlapping form(s) that are expressed in single transcripts regulated by a common upstream promoter. The element-specific expression strategies have driven development of relevant translational recoding mechanisms including ribosomal frameshifting to satisfy the protein stoichiometry critical for the assembly of infectious virus-like particles. Retrotransposons with different recoding strategies exhibit a mosaic distribution pattern across the diverse families of reverse transcribing elements, even though their respective distributions are substantially skewed towards certain family groups. However, only a few investigations to date have focused on the emergence of retrotransposons evolving novel expression strategy and causal genetic drivers of the structural variants. In this study, the bulk of genomic and transcribed sequences of a Ty3/gypsy-like CsRn1 retrotransposon in Clonorchis sinensis were analyzed for the comprehensive examination of its expression strategy. Our results demonstrated that structural variants with single open reading frame (ORF) have recurrently emerged from precedential CsRn1 copies encoding overlapping gag-pol ORFs by a single-nucleotide insertion in an upstream region of gag stop codon. In the parasite genome, some of the newly evolved variants appeared to undergo proliferative burst as active master lineages together with their ancestral copies. The genetic event was similarly observed in Opisthorchis viverrini, the closest neighbor of C. sinensis, whereas the resulting structural variants might have failed to overcome purifying selection and comprised minor remnant copies in the Opisthorchis genome. Copyright © 2017 Elsevier B.V. All rights reserved.

  19. Structure-Function Analysis of Friedreich's Ataxia Mutants Reveals Determinants of Frataxin Binding and Activation of the Fe-S Assembly Complex

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Bridwell-Rabb, Jennifer; Winn, Andrew M; Barondeau, David P

    2012-08-01

    Friedreich's ataxia (FRDA) is a progressive neurodegenerative disease associated with the loss of function of the protein frataxin (FXN) that results from low FXN levels due to a GAA triplet repeat expansion or, occasionally, from missense mutations in the FXN gene. Here biochemical and structural properties of FXN variants, including three FRDA missense mutations (N146K, Q148R, and R165C) and three related mutants (N146A, Q148G, and Q153A), were determined in an effort to understand the structural basis for the loss of function. In vitro assays revealed that although the three FRDA missense mutations exhibited similar losses of cysteine desulfurase and Fe-S cluster assembly activities, the causes for these activation defects were distinct. The R165C variant exhibited a kcat/KM higher than that of native FXN but weak binding to the NFS1, ISD11, and ISCU2 (SDU) complex, whereas the Q148R variant exhibited the lowest kcat/KM of the six tested FXN variants and only a modest binding deficiency. The order of the FXN binding affinities for the SDU Fe-S assembly complex was as follows: FXN > Q148R > N146A > Q148G > N146K > Q153A > R165C. Four different classes of FXN variants were identified on the basis of their biochemical properties. Together, these structure-function studies reveal determinants for the binding and allosteric activation of the Fe-S assembly complex and provide insight into how FRDA missense mutations are functionally compromised.

  20. Differential effects of PCSK9 variants on risk of coronary disease and ischaemic stroke

    PubMed Central

    Hopewell, Jemma C; Malik, Rainer; Valdés-Márquez, Elsa; Worrall, Bradford B; Collins, Rory

    2018-01-01

    Aims: PCSK9 genetic variants that have large effects on low-density lipoprotein cholesterol (LDL-C) and coronary heart disease (CHD) have prompted the development of therapeutic PCSK9-inhibition. However, there is limited evidence that PCSK9 variants are associated with ischaemic stroke (IS). Methods and results: Associations of the loss-of-function PCSK9 genetic variant (rs11591147; R46L), and five additional PCSK9 variants, with IS and IS subtypes (cardioembolic, large vessel, and small vessel) were estimated in a meta-analysis involving 10 307 IS cases and 19 326 controls of European ancestry. They were then compared with the associations of these variants with LDL-C levels (in up to 172 970 individuals) and CHD (in up to 60 801 CHD cases and 123 504 controls). The rs11591147 T allele was associated with 0.5 mmol/L lower LDL-C level (P = 9 × 10^-143) and 23% lower CHD risk [odds ratio (OR): 0.77, 95% confidence interval (CI): 0.69–0.87, P = 7 × 10^-6]. However, it was not associated with risk of IS (OR: 1.04, 95% CI: 0.84–1.28, P = 0.74) or IS subtypes. Information from additional PCSK9 variants also indicated consistently weaker effects on IS than on CHD. Conclusion: PCSK9 genetic variants that confer life-long lower PCSK9 and LDL-C levels appear to have significantly weaker, if any, associations with risk of IS than with risk of CHD. By contrast, similar proportional reductions in risks of IS and CHD have been observed in randomized trials of therapeutic PCSK9-inhibition. These findings have implications for our understanding of when Mendelian randomization can be relied upon to predict the effects of therapeutic interventions. PMID:29020353
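
    The odds ratios and confidence intervals reported in this abstract can be related to their P values with a short calculation. The sketch below is not the authors' method; it assumes a normal approximation on the log odds-ratio scale and a symmetric 95% interval, and the function names are made up for illustration. Because the published figures are rounded, the recomputed P values should only be of the same order as the reported ones.

```python
import math

def se_from_ci(lower, upper, z_crit=1.96):
    """Standard error of log(OR), recovered from a 95% confidence interval."""
    return (math.log(upper) - math.log(lower)) / (2 * z_crit)

def two_sided_p(odds_ratio, se):
    """Two-sided P value for log(OR) under a normal approximation."""
    z = math.log(odds_ratio) / se
    # 2 * (1 - Phi(|z|)) written with the complementary error function
    return math.erfc(abs(z) / math.sqrt(2))

# Values taken from the abstract: CHD (OR 0.77, CI 0.69-0.87)
# and ischaemic stroke (OR 1.04, CI 0.84-1.28).
se_chd = se_from_ci(0.69, 0.87)
p_chd = two_sided_p(0.77, se_chd)
se_is = se_from_ci(0.84, 1.28)
p_is = two_sided_p(1.04, se_is)

print(f"CHD: SE(log OR) = {se_chd:.3f}, P = {p_chd:.1e}")
print(f"IS:  SE(log OR) = {se_is:.3f}, P = {p_is:.2f}")
```

    The wider interval for IS translates into a larger standard error, which is why an odds ratio close to 1 gives a P value far from significance.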

  1. [Antigen differences of genetic variants Abent+ and Abent- poliovirus vaccine strain of III type].

    PubMed

    Shyrobokov, V P; Kostenko, I H; Nikolaienko, I V

    2003-01-01

    Hybridomas producing monoclonal antibodies (MAbs) that differentiate the Abent+ and Abent- variants of the poliovirus vaccine strain in the virus neutralization reaction were obtained. Using this panel, changes in the epitope structure of the capsid proteins of poliovirus variants (dissociants), which appeared during reproduction in cell culture, were found. This shows that essential antigenic differences exist in the surface proteins of the virion and that they arise during the process of dissociation.

  2. Detection of de novo single nucleotide variants in offspring of atomic-bomb survivors close to the hypocenter by whole-genome sequencing.

    PubMed

    Horai, Makiko; Mishima, Hiroyuki; Hayashida, Chisa; Kinoshita, Akira; Nakane, Yoshibumi; Matsuo, Tatsuki; Tsuruda, Kazuto; Yanagihara, Katsunori; Sato, Shinya; Imanishi, Daisuke; Imaizumi, Yoshitaka; Hata, Tomoko; Miyazaki, Yasushi; Yoshiura, Koh-Ichiro

    2018-03-01

    Ionizing radiation released by the atomic bombs at Hiroshima and Nagasaki, Japan, in 1945 caused many long-term illnesses, including increased risks of malignancies such as leukemia and solid tumours. Radiation has demonstrated genetic effects in animal models, leading to concerns over the potential hereditary effects of atomic bomb-related radiation. However, no direct analyses of whole DNA have yet been reported. We therefore investigated de novo variants in offspring of atomic-bomb survivors by whole-genome sequencing (WGS). We collected peripheral blood from three trios, each comprising a father (atomic-bomb survivor with acute radiation symptoms), a non-exposed mother, and their child, none of whom had any past history of haematological disorders. One trio of non-exposed individuals was included as a control. DNA was extracted and the numbers of de novo single nucleotide variants in the children were counted by WGS with sequencing confirmation. Gross structural variants were also analysed. Written informed consent was obtained from all participants prior to the study. There were 62, 81, and 42 de novo single nucleotide variants in the children of atomic-bomb survivors, compared with 48 in the control trio. There were no gross structural variants in any trio. These findings are in accord with previously published results that also showed no significant genetic effects of atomic-bomb radiation on second-generation survivors.
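
    The abstract does not describe the variant-calling pipeline, so the following is only a naive sketch of the core idea behind counting de novo variants in a trio: a site is a candidate when the child carries exactly one allele that neither parent carries. The genotype encoding (tuples of allele indices, 0 for the reference allele, None for a missing call) and the function name are assumptions for illustration. Real analyses also filter on read depth and quality, and the study confirmed its calls by sequencing.

```python
def is_candidate_de_novo(child, father, mother):
    """Return True if the child has exactly one allele absent from both parents.

    Genotypes are tuples of allele indices, e.g. (0, 1); None marks a missing call.
    """
    if any(allele is None for gt in (child, father, mother) for allele in gt):
        return False  # cannot judge a site with a missing call
    parental_alleles = set(father) | set(mother)
    novel = [allele for allele in child if allele not in parental_alleles]
    return len(novel) == 1

# Hypothetical genotypes at four sites: (child, father, mother)
sites = [
    ((0, 1), (0, 0), (0, 0)),     # novel allele in the child: candidate
    ((0, 1), (0, 1), (0, 0)),     # inherited from the father
    ((1, 1), (0, 0), (0, 0)),     # two novel copies: more likely an artefact
    ((0, 1), (0, None), (0, 0)),  # missing paternal call
]
n_candidates = sum(is_candidate_de_novo(*trio) for trio in sites)
print(n_candidates)
```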

  3. Protective V127 prion variant prevents prion disease by interrupting the formation of dimer and fibril from molecular dynamics simulations.

    PubMed

    Zhou, Shuangyan; Shi, Danfeng; Liu, Xuewei; Liu, Huanxiang; Yao, Xiaojun

    2016-02-24

    Recent studies uncovered a novel protective prion protein variant, the V127 variant, which was reported to be intrinsically resistant to prion conversion and propagation. However, the structural basis of its protective effect is still unknown. To uncover the origin of the protective role of the V127 variant, molecular dynamics simulations were performed to explore the influence of the G127V mutation on two key processes of prion propagation: dimerization and fibril formation. The simulation results indicate that the V127 variant disfavors dimer formation by reducing the main-chain H-bond interactions. The simulations of formed fibrils consisting of the β1 strand show that the V127 variant makes the formed fibril unstable and disordered. The weaker interaction energies between layers and the reduced number of H-bonds for the V127 variant reveal that this mutation is unfavorable to the formation of a stable fibril. Consequently, we find that the V127 variant is unfavorable not only to the formation of the dimer but also to the formation of a stable core and fibril, which can explain the mechanism of the protective role of the V127 variant at the molecular level. Our findings can deepen the understanding of prion disease and may guide the design of peptide mimetics or small molecules to mimic the protective effect of the V127 variant.

  4. Genetic polymorphisms of pharmacogenomic VIP variants in the Yi population from China.

    PubMed

    Yan, Mengdan; Li, Dianzhen; Zhao, Guige; Li, Jing; Niu, Fanglin; Li, Bin; Chen, Peng; Jin, Tianbo

    2018-03-30

    Drug response and target therapeutic dosage are different among individuals. The variability is largely genetically determined. With the development of pharmacogenetics and pharmacogenomics, widespread research has provided us a wealth of information on drug-related genetic polymorphisms, and the very important pharmacogenetic (VIP) variants have been identified for the major populations around the world, whereas less is known regarding minorities in China, including the Yi ethnic group. Our research aims to screen the potential genetic variants in the Yi population on pharmacogenomics and provide a theoretical basis for future medication guidance. In the present study, 80 VIP variants (selected from the PharmGKB database) were genotyped in 100 unrelated and healthy Yi adults recruited for our research. Through statistical analysis, we made a comparison between the Yi and the other 11 populations listed in the HapMap database to detect significant SNPs. Two specific SNPs were subsequently enrolled in an observation on global allele distribution with the frequencies downloaded from the ALlele FREquency Database. Moreover, F-statistics (Fst), genetic structure and phylogenetic tree analyses were conducted to determine the genetic similarity between the 12 ethnic groups. Using χ² tests, rs1128503 (ABCB1), rs7294 (VKORC1), rs9934438 (VKORC1), rs1540339 (VDR) and rs689466 (PTGS2) were identified as the significantly different loci for further analysis. The global allele distribution revealed that the allele "A" of rs1540339 and rs9934438 was more frequent in Yi people, which was consistent with most populations in East Asia. F-statistics (Fst), genetic structure and phylogenetic tree analyses demonstrated that the Yi and CHD shared the closest relationship in their genetic backgrounds. Additionally, the Yi were considered similar to the Han people from Shaanxi province among the domestic ethnic populations in China. Our results demonstrated significant differences in several polymorphic SNPs and supplement the pharmacogenomic information for the Yi population, which could provide new strategies for optimizing clinical medication in accordance with the genetic determinants of drug toxicity and efficacy. Copyright © 2018 Elsevier B.V. All rights reserved.
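
    The abstract above uses F-statistics (Fst) to compare populations but does not say which estimator was applied. As a hedged illustration only, the sketch below computes Wright's Fst for a single biallelic locus in two equally weighted populations, as (Ht - Hs) / Ht, where Hs is the mean expected heterozygosity within populations and Ht is the expected heterozygosity of the pooled population. The allele frequencies are invented for the example and are not data from the study.

```python
def fst_two_populations(p1, p2):
    """Wright's Fst at one biallelic locus for two equally weighted populations.

    p1 and p2 are the frequencies of the same allele in each population.
    """
    p_bar = (p1 + p2) / 2
    h_total = 2 * p_bar * (1 - p_bar)  # expected heterozygosity, pooled
    if h_total == 0:
        return 0.0                      # locus is fixed in both populations
    h_within = (2 * p1 * (1 - p1) + 2 * p2 * (1 - p2)) / 2
    return (h_total - h_within) / h_total

# Hypothetical allele frequencies for three population pairs
identical = fst_two_populations(0.5, 0.5)   # no differentiation
moderate = fst_two_populations(0.2, 0.8)
fixed = fst_two_populations(0.0, 1.0)       # complete differentiation
print(identical, round(moderate, 2), fixed)
```

    Values near 0 indicate similar allele frequencies, which is the sense in which two groups "share a close relationship" in analyses of this kind.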

  5. Structural basis for binding of fluorinated glucose and galactose to Trametes multicolor pyranose 2-oxidase variants with improved galactose conversion.

    PubMed

    Tan, Tien Chye; Spadiut, Oliver; Gandini, Rosaria; Haltrich, Dietmar; Divne, Christina

    2014-01-01

    Each year, about six million tons of lactose are generated from liquid whey as industrial byproduct, and optimally this large carbohydrate waste should be used for the production of value-added products. Trametes multicolor pyranose 2-oxidase (TmP2O) catalyzes the oxidation of various monosaccharides to the corresponding 2-keto sugars. Thus, a potential use of TmP2O is to convert the products from lactose hydrolysis, D-glucose and D-galactose, to more valuable products such as tagatose. Oxidation of glucose is however strongly favored over galactose, and oxidation of both substrates at more equal rates is desirable. Characterization of TmP2O variants (H450G, V546C, H450G/V546C) with improved D-galactose conversion has been given earlier, of which H450G displayed the best relative conversion between the substrates. To rationalize the changes in conversion rates, we have analyzed high-resolution crystal structures of the aforementioned mutants with bound 2- and 3-fluorinated glucose and galactose. Binding of glucose and galactose in the productive 2-oxidation binding mode is nearly identical in all mutants, suggesting that this binding mode is essentially unaffected by the mutations. For the competing glucose binding mode, enzyme variants carrying the H450G replacement stabilize glucose as the α-anomer in position for 3-oxidation. The backbone relaxation at position 450 allows the substrate-binding loop to fold tightly around the ligand. V546C, however, stabilizes glucose as the β-anomer using an open loop conformation. Improved binding of galactose is enabled by subtle relaxation effects at key active-site backbone positions. The competing binding mode for galactose 2-oxidation by V546C stabilizes the β-anomer for oxidation at C1, whereas H450G variants stabilize the 3-oxidation binding mode of the galactose α-anomer. The present study provides a detailed description of binding modes that rationalize changes in the relative conversion rates of D-glucose and D-galactose and can be used to refine future enzyme designs for more efficient use of lactose-hydrolysis byproducts.

  6. Structural Basis for Binding of Fluorinated Glucose and Galactose to Trametes multicolor Pyranose 2-Oxidase Variants with Improved Galactose Conversion

    PubMed Central

    Gandini, Rosaria; Haltrich, Dietmar; Divne, Christina

    2014-01-01

    Each year, about six million tons of lactose are generated from liquid whey as industrial byproduct, and optimally this large carbohydrate waste should be used for the production of value-added products. Trametes multicolor pyranose 2-oxidase (TmP2O) catalyzes the oxidation of various monosaccharides to the corresponding 2-keto sugars. Thus, a potential use of TmP2O is to convert the products from lactose hydrolysis, D-glucose and D-galactose, to more valuable products such as tagatose. Oxidation of glucose is however strongly favored over galactose, and oxidation of both substrates at more equal rates is desirable. Characterization of TmP2O variants (H450G, V546C, H450G/V546C) with improved D-galactose conversion has been given earlier, of which H450G displayed the best relative conversion between the substrates. To rationalize the changes in conversion rates, we have analyzed high-resolution crystal structures of the aforementioned mutants with bound 2- and 3-fluorinated glucose and galactose. Binding of glucose and galactose in the productive 2-oxidation binding mode is nearly identical in all mutants, suggesting that this binding mode is essentially unaffected by the mutations. For the competing glucose binding mode, enzyme variants carrying the H450G replacement stabilize glucose as the α-anomer in position for 3-oxidation. The backbone relaxation at position 450 allows the substrate-binding loop to fold tightly around the ligand. V546C, however, stabilizes glucose as the β-anomer using an open loop conformation. Improved binding of galactose is enabled by subtle relaxation effects at key active-site backbone positions. The competing binding mode for galactose 2-oxidation by V546C stabilizes the β-anomer for oxidation at C1, whereas H450G variants stabilize the 3-oxidation binding mode of the galactose α-anomer. The present study provides a detailed description of binding modes that rationalize changes in the relative conversion rates of D-glucose and D-galactose and can be used to refine future enzyme designs for more efficient use of lactose-hydrolysis byproducts. PMID:24466218

  7. Structural characterization of O- and C-glycosylating variants of the landomycin glycosyltransferase LanGT2.

    PubMed

    Tam, Heng Keat; Härle, Johannes; Gerhardt, Stefan; Rohr, Jürgen; Wang, Guojun; Thorson, Jon S; Bigot, Aurélien; Lutterbeck, Monika; Seiche, Wolfgang; Breit, Bernhard; Bechthold, Andreas; Einsle, Oliver

    2015-02-23

    The structures of the O-glycosyltransferase LanGT2 and the engineered, C-C bond-forming variant LanGT2S8Ac show how the replacement of a single loop can change the functionality of the enzyme. Crystal structures of the enzymes in complex with a nonhydrolyzable nucleotide-sugar analogue revealed that there is a conformational transition to create the binding sites for the aglycon substrate. This induced-fit transition was explored by molecular docking experiments with various aglycon substrates. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  8. The variant call format and VCFtools.

    PubMed

    Danecek, Petr; Auton, Adam; Abecasis, Goncalo; Albers, Cornelis A; Banks, Eric; DePristo, Mark A; Handsaker, Robert E; Lunter, Gerton; Marth, Gabor T; Sherry, Stephen T; McVean, Gilean; Durbin, Richard

    2011-08-01

    The variant call format (VCF) is a generic format for storing DNA polymorphism data such as SNPs, insertions, deletions and structural variants, together with rich annotations. VCF is usually stored in a compressed manner and can be indexed for fast data retrieval of variants from a range of positions on the reference genome. The format was developed for the 1000 Genomes Project, and has also been adopted by other projects such as UK10K, dbSNP and the NHLBI Exome Project. VCFtools is a software suite that implements various utilities for processing VCF files, including validation, merging, comparing and also provides a general Perl API. http://vcftools.sourceforge.net
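
    To make the format description concrete, here is a minimal sketch of reading one VCF data line. It is not part of VCFtools (which the abstract says offers a Perl API); it is a hand-written Python illustration that handles only the eight fixed, tab-separated columns (CHROM, POS, ID, REF, ALT, QUAL, FILTER, INFO). The function name and the sample record, a symbolic deletion allele with SVTYPE and END annotations, are invented for the example.

```python
def parse_vcf_line(line):
    """Split one tab-separated VCF data line into a dict of its fixed fields."""
    fields = line.rstrip("\n").split("\t")
    chrom, pos, variant_id, ref, alt, qual, filt, info = fields[:8]

    # INFO is a ';'-separated list of KEY=VALUE pairs or bare flags.
    info_dict = {}
    if info != ".":
        for entry in info.split(";"):
            key, _, value = entry.partition("=")
            info_dict[key] = value if value else True

    return {
        "chrom": chrom,
        "pos": int(pos),
        "id": variant_id,
        "ref": ref,
        "alt": alt.split(","),  # several ALT alleles are comma-separated
        "qual": None if qual == "." else float(qual),
        "filter": filt,
        "info": info_dict,
    }

# Made-up structural-variant record using a symbolic <DEL> allele
example = "1\t2827694\t.\tN\t<DEL>\t.\tPASS\tSVTYPE=DEL;END=2827762;IMPRECISE"
record = parse_vcf_line(example)
print(record["chrom"], record["pos"], record["alt"], record["info"]["SVTYPE"])
```

    Header lines (those starting with "#") and per-sample genotype columns are deliberately left out; a production parser would need both.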

  9. Structural comparisons of two allelic variants of human placental alkaline phosphatase.

    PubMed

    Millán, J L; Stigbrand, T; Jörnvall, H

    1985-01-01

    A simple immunosorbent purification scheme based on monoclonal antibodies has been devised for human placental alkaline phosphatase. The two most common allelic variants, S and F, have similar amino acid compositions with identical N-terminal amino acid sequences through the first 13 residues. Both variants have identical lectin binding properties towards concanavalin A, lentil-lectin, wheat germ agglutinin, phytohemagglutinin and soybean agglutinin, and identical carbohydrate contents as revealed by methylation analysis. CNBr fragments of the variants demonstrate identical high performance liquid chromatography patterns. The carbohydrate containing fragment is different from the 32P-labeled active site fragment and the N-terminal fragment.

  10. Distribution of Disease-Associated Copy Number Variants across Distinct Disorders of Cognitive Development

    ERIC Educational Resources Information Center

    Pescosolido, Matthew F.; Gamsiz, Ece D.; Nagpal, Shailender; Morrow, Eric M.

    2013-01-01

    Objective: The purpose of the present study was to discover the extent to which distinct "DSM" disorders share large, highly recurrent copy number variants (CNVs) as susceptibility factors. We also sought to identify gene mechanisms common to groups of diagnoses and/or specific to a given diagnosis based on associations with CNVs. Method:…

  11. Improving longleaf pine mortality predictions in the Southern Variant of the Forest Vegetation Simulator

    Treesearch

    R. Justin DeRose; John D. Shaw; Giorgio Vacchiano; James N. Long

    2008-01-01

    The Southern Variant of the Forest Vegetation Simulator (FVS-SN) is made up of individual submodels that predict tree growth, recruitment and mortality. Forest managers on Ft. Bragg, North Carolina, discovered biologically unrealistic longleaf pine (Pinus palustris) size-density predictions at large diameters when using FVS-SN to project red-cockaded...

  12. 3' rapid amplification of cDNA ends (RACE) walking for rapid structural analysis of large transcripts.

    PubMed

    Ozawa, Tatsuhiko; Kondo, Masato; Isobe, Masaharu

    2004-01-01

    The 3' rapid amplification of cDNA ends (3' RACE) is widely used to isolate the cDNA of unknown 3' flanking sequences. However, the conventional 3' RACE often fails to amplify cDNA from a large transcript if there is a long distance between the 5' gene-specific primer and poly(A) stretch, since the conventional 3' RACE utilizes 3' oligo-dT-containing primer complementary to the poly(A) tail of mRNA at the first strand cDNA synthesis. To overcome this problem, we have developed an improved 3' RACE method suitable for the isolation of cDNA derived from very large transcripts. By using the oligonucleotide-containing random 9mer together with the GC-rich sequence for the suppression PCR technology at the first strand of cDNA synthesis, we have been able to amplify the cDNA from a very large transcript, such as the microtubule-actin crosslinking factor 1 (MACF1) gene, which codes a transcript of 20 kb in size. When there is no splicing variant, our highly specific amplification allows us to perform the direct sequencing of 3' RACE products without requiring cloning in bacterial hosts. Thus, this stepwise 3' RACE walking will help rapid characterization of the 3' structure of a gene, even when it encodes a very large transcript.

  13. Tunable electroresistance and electro-optic effects of transparent molecular ferroelectrics

    DOE PAGES

    Zhang, Zhuolei; Li, Peng-Fei; Tang, Yuan-Yuan; ...

    2017-08-30

    Recent progress in molecular ferroelectrics (MOFEs) has been overshadowed by the lack of high-quality thin films for device integration. We report a water-based air-processable technique to prepare large-area MOFE thin films, controlled by supersaturation growth at the liquid-air interface under a temperature gradient and external water partial pressure. We used this technique to fabricate ImClO4 thin films and found a large, tunable room temperature electroresistance: a 20-fold resistance variation upon polarization switching. The as-grown films are transparent and consist of a bamboo-like structure of $(2,\overline{1},0)$ and $(1,0,\overline{2})$ structural variants of R3m symmetry with a reversible polarization of 6.7 μC/cm². The resulting ferroelectric domain structure leads to a reversible electromechanical response of d33 = 38.8 pm/V. Polarization switching results in a change of the refractive index, n, of single domains, $\Delta n/n$ = 0.3. The remarkable combination of these characteristics renders MOFEs a prime candidate material for new nanoelectronic devices. The information that we present in this work will open a new area of MOFE thin-film technologies.

  14. Tunable electroresistance and electro-optic effects of transparent molecular ferroelectrics

    PubMed Central

    Zhang, Zhuolei; Li, Peng-Fei; Tang, Yuan-Yuan; Wilson, Andrew J.; Willets, Katherine; Wuttig, Manfred; Xiong, Ren-Gen; Ren, Shenqiang

    2017-01-01

    Recent progress in molecular ferroelectrics (MOFEs) has been overshadowed by the lack of high-quality thin films for device integration. We report a water-based air-processable technique to prepare large-area MOFE thin films, controlled by supersaturation growth at the liquid-air interface under a temperature gradient and external water partial pressure. We used this technique to fabricate ImClO4 thin films and found a large, tunable room temperature electroresistance: a 20-fold resistance variation upon polarization switching. The as-grown films are transparent and consist of a bamboo-like structure of (2,1̄,0) and (1,0,2̄) structural variants of R3m symmetry with a reversible polarization of 6.7 μC/cm². The resulting ferroelectric domain structure leads to a reversible electromechanical response of d33 = 38.8 pm/V. Polarization switching results in a change of the refractive index, n, of single domains, Δn/n = 0.3. The remarkable combination of these characteristics renders MOFEs a prime candidate material for new nanoelectronic devices. The information that we present in this work will open a new area of MOFE thin-film technologies. PMID:28875167

  15. Post-mortem testing; germline BRCA1/2 variant detection using archival FFPE non-tumor tissue. A new paradigm in genetic counseling.

    PubMed

    Petersen, Annabeth Høgh; Aagaard, Mads Malik; Nielsen, Henriette Roed; Steffensen, Karina Dahl; Waldstrøm, Marianne; Bojesen, Anders

    2016-08-01

    Accurate estimation of cancer risk in HBOC families often requires BRCA1/2 testing, but this may be impossible in deceased family members. Previously, testing archival formalin-fixed, paraffin-embedded (FFPE) tissue for germline BRCA1/2 variants was unsuccessful, except for the Jewish founder mutations. A high-throughput method to systematically test for variants in all coding regions of BRCA1/2 in archival FFPE samples of non-tumor tissue is described, using HaloPlex target enrichment and next-generation sequencing. In a validation study, correct identification of variants or wild-type was possible in 25 out of 30 (83%) FFPE samples (age range 1-14 years), with a known variant status in BRCA1/2. No false positive was found. Unsuccessful identification was due to highly degraded DNA or presence of large intragenic deletions. In clinical use, a total of 201 FFPE samples (aged 0-43 years) were processed. Thirty-six samples were rejected because of highly degraded DNA or failed library preparation. Fifteen samples were investigated to search for a known variant. In the remaining 150 samples (aged 0-38 years), three variants known to affect function and one variant likely to affect function in BRCA1, six variants known to affect function and one variant likely to affect function in BRCA2, as well as four variants of unknown significance (VUS) in BRCA1 and three VUS in BRCA2 were discovered. It is now possible to test for germline BRCA1/2 variants in deceased persons, using archival FFPE samples from non-tumor tissue. Accurate genetic counseling is achievable in families where variant testing would otherwise be impossible.

  16. Post-mortem testing; germline BRCA1/2 variant detection using archival FFPE non-tumor tissue. A new paradigm in genetic counseling

    PubMed Central

    Petersen, Annabeth Høgh; Aagaard, Mads Malik; Nielsen, Henriette Roed; Steffensen, Karina Dahl; Waldstrøm, Marianne; Bojesen, Anders

    2016-01-01

    Accurate estimation of cancer risk in HBOC families often requires BRCA1/2 testing, but this may be impossible in deceased family members. Previously, testing archival formalin-fixed, paraffin-embedded (FFPE) tissue for germline BRCA1/2 variants was unsuccessful, except for the Jewish founder mutations. A high-throughput method to systematically test for variants in all coding regions of BRCA1/2 in archival FFPE samples of non-tumor tissue is described, using HaloPlex target enrichment and next-generation sequencing. In a validation study, correct identification of variants or wild-type was possible in 25 out of 30 (83%) FFPE samples (age range 1–14 years), with a known variant status in BRCA1/2. No false positive was found. Unsuccessful identification was due to highly degraded DNA or presence of large intragenic deletions. In clinical use, a total of 201 FFPE samples (aged 0–43 years) were processed. Thirty-six samples were rejected because of highly degraded DNA or failed library preparation. Fifteen samples were investigated to search for a known variant. In the remaining 150 samples (aged 0–38 years), three variants known to affect function and one variant likely to affect function in BRCA1, six variants known to affect function and one variant likely to affect function in BRCA2, as well as four variants of unknown significance (VUS) in BRCA1 and three VUS in BRCA2 were discovered. It is now possible to test for germline BRCA1/2 variants in deceased persons, using archival FFPE samples from non-tumor tissue. Accurate genetic counseling is achievable in families where variant testing would otherwise be impossible. PMID:26733283

  17. Mining the LIPG Allelic Spectrum Reveals the Contribution of Rare and Common Regulatory Variants to HDL Cholesterol

    PubMed Central

    Raghavan, Avanthi; Neeli, Hemanth; Jin, Weijun; Badellino, Karen O.; Demissie, Serkalem; Manning, Alisa K.; DerOhannessian, Stephanie L.; Wolfe, Megan L.; Cupples, L. Adrienne; Li, Mingyao; Kathiresan, Sekar; Rader, Daniel J.

    2011-01-01

    Genome-wide association studies (GWAS) have successfully identified loci associated with quantitative traits, such as blood lipids. Deep resequencing studies are being utilized to catalogue the allelic spectrum at GWAS loci. The goal of these studies is to identify causative variants and missing heritability, including heritability due to low frequency and rare alleles with large phenotypic impact. Whereas rare variant efforts have primarily focused on nonsynonymous coding variants, we hypothesized that noncoding variants in these loci are also functionally important. Using the HDL-C gene LIPG as an example, we explored the effect of regulatory variants identified through resequencing of subjects at HDL-C extremes on gene expression, protein levels, and phenotype. Resequencing a portion of the LIPG promoter and 5′ UTR in human subjects with extreme HDL-C, we identified several rare variants in individuals from both extremes. Luciferase reporter assays were used to measure the effect of these rare variants on LIPG expression. Variants conferring opposing effects on gene expression were enriched in opposite extremes of the phenotypic distribution. Minor alleles of a common regulatory haplotype and noncoding GWAS SNPs were associated with reduced plasma levels of the LIPG gene product endothelial lipase (EL), consistent with its role in HDL-C catabolism. Additionally, we found that a common nonfunctional coding variant associated with HDL-C (rs2000813) is in linkage disequilibrium with a 5′ UTR variant (rs34474737) that decreases LIPG promoter activity. We attribute the gene regulatory role of rs34474737 to the observed association of the coding variant with plasma EL levels and HDL-C. Taken together, the findings show that both rare and common noncoding regulatory variants are important contributors to the allelic spectrum in complex trait loci. PMID:22174694

  18. Impact of genetic variation on three dimensional structure and function of proteins

    PubMed Central

    Bhattacharya, Roshni; Rose, Peter W.; Burley, Stephen K.

    2017-01-01

    The Protein Data Bank (PDB; http://wwpdb.org) was established in 1971 as the first open access digital data resource in biology with seven protein structures as its initial holdings. The global PDB archive now contains more than 126,000 experimentally determined atomic level three-dimensional (3D) structures of biological macromolecules (proteins, DNA, RNA), all of which are freely accessible via the Internet. Knowledge of the 3D structure of the gene product can help in understanding its function and role in disease. Of particular interest in the PDB archive are proteins for which 3D structures of genetic variant proteins have been determined, thus revealing atomic-level structural differences caused by the variation at the DNA level. Herein, we present a systematic and qualitative analysis of such cases. We observe a wide range of structural and functional changes caused by single amino acid differences, including changes in enzyme activity, aggregation propensity, structural stability, binding, and dissociation, some in the context of large assemblies. Structural comparison of wild type and mutated proteins, when both are available, provides insights into atomic-level structural differences caused by the genetic variation. PMID:28296894

  19. Population- and individual-specific regulatory variation in Sardinia.

    PubMed

    Pala, Mauro; Zappala, Zachary; Marongiu, Mara; Li, Xin; Davis, Joe R; Cusano, Roberto; Crobu, Francesca; Kukurba, Kimberly R; Gloudemans, Michael J; Reinier, Frederic; Berutti, Riccardo; Piras, Maria G; Mulas, Antonella; Zoledziewska, Magdalena; Marongiu, Michele; Sorokin, Elena P; Hess, Gaelen T; Smith, Kevin S; Busonero, Fabio; Maschio, Andrea; Steri, Maristella; Sidore, Carlo; Sanna, Serena; Fiorillo, Edoardo; Bassik, Michael C; Sawcer, Stephen J; Battle, Alexis; Novembre, John; Jones, Chris; Angius, Andrea; Abecasis, Gonçalo R; Schlessinger, David; Cucca, Francesco; Montgomery, Stephen B

    2017-05-01

    Genetic studies of complex traits have mainly identified associations with noncoding variants. To further determine the contribution of regulatory variation, we combined whole-genome and transcriptome data for 624 individuals from Sardinia to identify common and rare variants that influence gene expression and splicing. We identified 21,183 expression quantitative trait loci (eQTLs) and 6,768 splicing quantitative trait loci (sQTLs), including 619 new QTLs. We identified high-frequency QTLs and found evidence of selection near genes involved in malarial resistance and increased multiple sclerosis risk, reflecting the epidemiological history of Sardinia. Using family relationships, we identified 809 segregating expression outliers (median z score of 2.97), averaging 13.3 genes per individual. Outlier genes were enriched for proximal rare variants, providing a new approach to study large-effect regulatory variants and their relevance to traits. Our results provide insight into the effects of regulatory variants and their relationship to population history and individual genetic risk.

  20. Leapfrog variants of iterative methods for linear algebra equations

    NASA Technical Reports Server (NTRS)

    Saylor, Paul E.

    1988-01-01

    Two iterative methods are considered, Richardson's method and a general second order method. For both methods, a variant of the method is derived for which only even numbered iterates are computed. The variant is called a leapfrog method. Comparisons between the conventional form of the methods and the leapfrog form are made under the assumption that the number of unknowns is large. In the case of Richardson's method, it is possible to express the final iterate in terms of only the initial approximation, a variant of the iteration called the grand-leap method. In the case of the grand-leap variant, a set of parameters is required. An algorithm is presented to compute these parameters that is related to algorithms to compute the weights and abscissas for Gaussian quadrature. General algorithms to implement the leapfrog and grand-leap methods are presented. Algorithms for the important special case of the Chebyshev method are also given.
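
    The leapfrog idea for Richardson's method can be illustrated with a minimal sketch. It assumes the stationary iteration x_{k+1} = x_k + α(b − A x_k) with a single fixed parameter α, which is a simplification chosen for illustration and not necessarily the parameter scheme of the report. Substituting one step into the next gives x_{k+2} = x_k + 2α r_k − α² A r_k with r_k = b − A x_k, so only even-numbered iterates are formed. The function names, the test matrix, and the value of α below are all hypothetical.

```python
def matvec(A, v):
    """Dense matrix-vector product for a matrix stored as a list of rows."""
    return [sum(a * x for a, x in zip(row, v)) for row in A]


def residual(A, b, x):
    """Residual r = b - A x."""
    return [bi - yi for bi, yi in zip(b, matvec(A, x))]


def richardson(A, b, x0, alpha, steps):
    """Conventional stationary Richardson iteration (assumed fixed alpha)."""
    x = list(x0)
    for _ in range(steps):
        r = residual(A, b, x)
        x = [xi + alpha * ri for xi, ri in zip(x, r)]
    return x


def richardson_leapfrog(A, b, x0, alpha, double_steps):
    """Advance two Richardson steps per pass; odd iterates are never formed."""
    x = list(x0)
    for _ in range(double_steps):
        r = residual(A, b, x)   # residual at the current even iterate
        Ar = matvec(A, r)       # needed for the alpha^2 correction term
        x = [xi + 2.0 * alpha * ri - alpha ** 2 * ari
             for xi, ri, ari in zip(x, r, Ar)]
    return x


# Hypothetical 2x2 symmetric positive definite system; alpha = 0.2 is
# below 2 / (largest eigenvalue of A), so the iteration converges.
A = [[4.0, 1.0], [1.0, 3.0]]
b = [1.0, 2.0]
x0 = [0.0, 0.0]

x_conventional = richardson(A, b, x0, 0.2, 10)
x_leapfrog = richardson_leapfrog(A, b, x0, 0.2, 5)
x_converged = richardson_leapfrog(A, b, x0, 0.2, 50)
```

    Ten conventional steps and five leapfrog passes reach the same iterate up to rounding. This sketch does not cover the grand-leap variant or the Chebyshev parameters mentioned in the abstract.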
