Sample records for exome sequencing project

  1. Modeling read counts for CNV detection in exome sequencing data.

    PubMed

    Love, Michael I; Myšičková, Alena; Sun, Ruping; Kalscheuer, Vera; Vingron, Martin; Haas, Stefan A

    2011-11-08

    Varying depth of high-throughput sequencing reads along a chromosome makes it possible to observe copy number variants (CNVs) in a sample relative to a reference. In exome and other targeted sequencing projects, technical factors increase variation in read depth while reducing the number of observed locations, adding difficulty to the problem of identifying CNVs. We present a hidden Markov model for detecting CNVs from raw read count data, using background read depth from a control set as well as other positional covariates such as GC-content. The model, exomeCopy, is applied to a large chromosome X exome sequencing project identifying a list of large unique CNVs. CNVs predicted by the model and experimentally validated are then recovered using a cross-platform control set from publicly available exome sequencing data. Simulations show high sensitivity for detecting heterozygous and homozygous CNVs, outperforming normalization and state-of-the-art segmentation methods.

  2. Development of a Prognostic Marker for Lung Cancer Using Analysis of Tumor Evolution

    DTIC Science & Technology

    2017-08-01

    SUPPLEMENTARY NOTES 14. ABSTRACT The goal of this project is to sequence the exomes of single tumor cells from tumors in order to construct evolutionary trees...dissociation, tumor cell isolation, whole genome amplification, and exome sequencing. We have begun to sequence the exomes of single cells and to...of populations, the evolution of tumor cells within a tumor can be diagrammed on a phylogenetic tree. The more diverse a tumor’s phylogenetic tree

  3. MTTE: an innovative strategy for the evaluation of targeted/exome enrichment efficiency

    PubMed Central

    Klonowska, Katarzyna; Handschuh, Luiza; Swiercz, Aleksandra; Figlerowicz, Marek; Kozlowski, Piotr

    2016-01-01

    Although currently available strategies for the preparation of exome-enriched libraries are well established, a final validation of the libraries in terms of exome enrichment efficiency prior to the sequencing step is of considerable importance. Here, we present a strategy for the evaluation of exome enrichment, i.e., the Multipoint Test for Targeted-enrichment Efficiency (MTTE), PCR-based approach utilizing multiplex ligation-dependent probe amplification with capillary electrophoresis separation. We used MTTE for the analysis of subsequent steps of the Illumina TruSeq Exome Enrichment procedure. The calculated values of enrichment-associated parameters (i.e., relative enrichment, relative clearance, overall clearance, and fold enrichment) and the comparison of MTTE results with the actual enrichment revealed the high reliability of our assay. Additionally, the MTTE assay enabled the determination of the sequence-associated features that may confer bias in the enrichment of different targets. Importantly, the MTTE is low cost method that can be easily adapted to the region of interest important for a particular project. Thus, the MTTE strategy is attractive for post-capture validation in a variety of targeted/exome enrichment NGS projects. PMID:27572310

  4. MTTE: an innovative strategy for the evaluation of targeted/exome enrichment efficiency.

    PubMed

    Klonowska, Katarzyna; Handschuh, Luiza; Swiercz, Aleksandra; Figlerowicz, Marek; Kozlowski, Piotr

    2016-10-11

    Although currently available strategies for the preparation of exome-enriched libraries are well established, a final validation of the libraries in terms of exome enrichment efficiency prior to the sequencing step is of considerable importance. Here, we present a strategy for the evaluation of exome enrichment, i.e., the Multipoint Test for Targeted-enrichment Efficiency (MTTE), PCR-based approach utilizing multiplex ligation-dependent probe amplification with capillary electrophoresis separation. We used MTTE for the analysis of subsequent steps of the Illumina TruSeq Exome Enrichment procedure. The calculated values of enrichment-associated parameters (i.e., relative enrichment, relative clearance, overall clearance, and fold enrichment) and the comparison of MTTE results with the actual enrichment revealed the high reliability of our assay. Additionally, the MTTE assay enabled the determination of the sequence-associated features that may confer bias in the enrichment of different targets. Importantly, the MTTE is low cost method that can be easily adapted to the region of interest important for a particular project. Thus, the MTTE strategy is attractive for post-capture validation in a variety of targeted/exome enrichment NGS projects.

  5. Exome sequencing and genome-wide linkage analysis in 17 families illustrate the complex contribution of TTN truncating variants to dilated cardiomyopathy.

    PubMed

    Norton, Nadine; Li, Duanxiang; Rampersaud, Evadnie; Morales, Ana; Martin, Eden R; Zuchner, Stephan; Guo, Shengru; Gonzalez, Michael; Hedges, Dale J; Robertson, Peggy D; Krumm, Niklas; Nickerson, Deborah A; Hershberger, Ray E

    2013-04-01

    BACKGROUND- Familial dilated cardiomyopathy (DCM) is a genetically heterogeneous disease with >30 known genes. TTN truncating variants were recently implicated in a candidate gene study to cause 25% of familial and 18% of sporadic DCM cases. METHODS AND RESULTS- We used an unbiased genome-wide approach using both linkage analysis and variant filtering across the exome sequences of 48 individuals affected with DCM from 17 families to identify genetic cause. Linkage analysis ranked the TTN region as falling under the second highest genome-wide multipoint linkage peak, multipoint logarithm of odds, 1.59. We identified 6 TTN truncating variants carried by individuals affected with DCM in 7 of 17 DCM families (logarithm of odds, 2.99); 2 of these 7 families also had novel missense variants that segregated with disease. Two additional novel truncating TTN variants did not segregate with DCM. Nucleotide diversity at the TTN locus, including missense variants, was comparable with 5 other known DCM genes. The average number of missense variants in the exome sequences from the DCM cases or the ≈5400 cases from the Exome Sequencing Project was ≈23 per individual. The average number of TTN truncating variants in the Exome Sequencing Project was 0.014 per individual. We also identified a region (chr9q21.11-q22.31) with no known DCM genes with a maximum heterogeneity logarithm of odds score of 1.74. CONCLUSIONS- These data suggest that TTN truncating variants contribute to DCM cause. However, the lack of segregation of all identified TTN truncating variants illustrates the challenge of determining variant pathogenicity even with full exome sequencing.

  6. Exome sequencing of extreme phenotypes identifies DCTN4 as a modifier of chronic Pseudomonas aeruginosa infection in cystic fibrosis.

    PubMed

    Emond, Mary J; Louie, Tin; Emerson, Julia; Zhao, Wei; Mathias, Rasika A; Knowles, Michael R; Wright, Fred A; Rieder, Mark J; Tabor, Holly K; Nickerson, Deborah A; Barnes, Kathleen C; Gibson, Ronald L; Bamshad, Michael J

    2012-07-08

    Exome sequencing has become a powerful and effective strategy for the discovery of genes underlying Mendelian disorders. However, use of exome sequencing to identify variants associated with complex traits has been more challenging, partly because the sample sizes needed for adequate power may be very large. One strategy to increase efficiency is to sequence individuals who are at both ends of a phenotype distribution (those with extreme phenotypes). Because the frequency of alleles that contribute to the trait are enriched in one or both phenotype extremes, a modest sample size can potentially be used to identify novel candidate genes and/or alleles. As part of the National Heart, Lung, and Blood Institute (NHLBI) Exome Sequencing Project (ESP), we used an extreme phenotype study design to discover that variants in DCTN4, encoding a dynactin protein, are associated with time to first P. aeruginosa airway infection, chronic P. aeruginosa infection and mucoid P. aeruginosa in individuals with cystic fibrosis.

  7. An efficient and scalable analysis framework for variant extraction and refinement from population-scale DNA sequence data.

    PubMed

    Jun, Goo; Wing, Mary Kate; Abecasis, Gonçalo R; Kang, Hyun Min

    2015-06-01

    The analysis of next-generation sequencing data is computationally and statistically challenging because of the massive volume of data and imperfect data quality. We present GotCloud, a pipeline for efficiently detecting and genotyping high-quality variants from large-scale sequencing data. GotCloud automates sequence alignment, sample-level quality control, variant calling, filtering of likely artifacts using machine-learning techniques, and genotype refinement using haplotype information. The pipeline can process thousands of samples in parallel and requires less computational resources than current alternatives. Experiments with whole-genome and exome-targeted sequence data generated by the 1000 Genomes Project show that the pipeline provides effective filtering against false positive variants and high power to detect true variants. Our pipeline has already contributed to variant detection and genotyping in several large-scale sequencing projects, including the 1000 Genomes Project and the NHLBI Exome Sequencing Project. We hope it will now prove useful to many medical sequencing studies. © 2015 Jun et al.; Published by Cold Spring Harbor Laboratory Press.

  8. Reference genotype and exome data from an Australian Aboriginal population for health-based research

    PubMed Central

    Tang, Dave; Anderson, Denise; Francis, Richard W.; Syn, Genevieve; Jamieson, Sarra E.; Lassmann, Timo; Blackwell, Jenefer M.

    2016-01-01

    Genetic analyses, including genome-wide association studies and whole exome sequencing (WES), provide powerful tools for the analysis of complex and rare genetic diseases. To date there are no reference data for Aboriginal Australians to underpin the translation of health-based genomic research. Here we provide a catalogue of variants called after sequencing the exomes of 72 Aboriginal individuals to a depth of 20X coverage in ∼80% of the sequenced nucleotides. We determined 320,976 single nucleotide variants (SNVs) and 47,313 insertions/deletions using the Genome Analysis Toolkit. We had previously genotyped a subset of the Aboriginal individuals (70/72) using the Illumina Omni2.5 BeadChip platform and found ~99% concordance at overlapping sites, which suggests high quality genotyping. Finally, we compared our SNVs to six publicly available variant databases, such as dbSNP and the Exome Sequencing Project, and 70,115 of our SNVs did not overlap any of the single nucleotide polymorphic sites in all the databases. Our data set provides a useful reference point for genomic studies on Aboriginal Australians. PMID:27070114

  9. Reference genotype and exome data from an Australian Aboriginal population for health-based research.

    PubMed

    Tang, Dave; Anderson, Denise; Francis, Richard W; Syn, Genevieve; Jamieson, Sarra E; Lassmann, Timo; Blackwell, Jenefer M

    2016-04-12

    Genetic analyses, including genome-wide association studies and whole exome sequencing (WES), provide powerful tools for the analysis of complex and rare genetic diseases. To date there are no reference data for Aboriginal Australians to underpin the translation of health-based genomic research. Here we provide a catalogue of variants called after sequencing the exomes of 72 Aboriginal individuals to a depth of 20X coverage in ∼80% of the sequenced nucleotides. We determined 320,976 single nucleotide variants (SNVs) and 47,313 insertions/deletions using the Genome Analysis Toolkit. We had previously genotyped a subset of the Aboriginal individuals (70/72) using the Illumina Omni2.5 BeadChip platform and found ~99% concordance at overlapping sites, which suggests high quality genotyping. Finally, we compared our SNVs to six publicly available variant databases, such as dbSNP and the Exome Sequencing Project, and 70,115 of our SNVs did not overlap any of the single nucleotide polymorphic sites in all the databases. Our data set provides a useful reference point for genomic studies on Aboriginal Australians.

  10. Whole Exome Sequencing in Pediatric Neurology Patients: Clinical Implications and Estimated Cost Analysis.

    PubMed

    Nolan, Danielle; Carlson, Martha

    2016-06-01

    Genetic heterogeneity in neurologic disorders has been an obstacle to phenotype-based diagnostic testing. The authors hypothesized that information compiled via whole exome sequencing will improve clinical diagnosis and management of pediatric neurology patients. The authors performed a retrospective chart review of patients evaluated in the University of Michigan Pediatric Neurology clinic between 6/2011 and 6/2015. The authors recorded previous diagnostic testing, indications for whole exome sequencing, and whole exome sequencing results. Whole exome sequencing was recommended for 135 patients and obtained in 53 patients. Insurance barriers often precluded whole exome sequencing. The most common indication for whole exome sequencing was neurodevelopmental disorders. Whole exome sequencing improved the presumptive diagnostic rate in the patient cohort from 25% to 48%. Clinical implications included family planning, medication selection, and systemic investigation. Compared to current second tier testing, whole exome sequencing can result in lower long-term charges and more timely diagnosis. Overcoming barriers related to whole exome sequencing insurance authorization could allow for more efficient and fruitful diagnostic neurological evaluations. © The Author(s) 2016.

  11. The (in)famous GWAS P-value threshold revisited and updated for low-frequency variants.

    PubMed

    Fadista, João; Manning, Alisa K; Florez, Jose C; Groop, Leif

    2016-08-01

    Genome-wide association studies (GWAS) have long relied on proposed statistical significance thresholds to be able to differentiate true positives from false positives. Although the genome-wide significance P-value threshold of 5 × 10(-8) has become a standard for common-variant GWAS, it has not been updated to cope with the lower allele frequency spectrum used in many recent array-based GWAS studies and sequencing studies. Using a whole-genome- and -exome-sequencing data set of 2875 individuals of European ancestry from the Genetics of Type 2 Diabetes (GoT2D) project and a whole-exome-sequencing data set of 13 000 individuals from five ancestries from the GoT2D and T2D-GENES (Type 2 Diabetes Genetic Exploration by Next-generation sequencing in multi-Ethnic Samples) projects, we describe guidelines for genome- and exome-wide association P-value thresholds needed to correct for multiple testing, explaining the impact of linkage disequilibrium thresholds for distinguishing independent variants, minor allele frequency and ancestry characteristics. We emphasize the advantage of studying recent genetic isolate populations when performing rare and low-frequency genetic association analyses, as the multiple testing burden is diminished due to higher genetic homogeneity.

  12. Imputation of Exome Sequence Variants into Population- Based Samples and Blood-Cell-Trait-Associated Loci in African Americans: NHLBI GO Exome Sequencing Project

    PubMed Central

    Auer, Paul L.; Johnsen, Jill M.; Johnson, Andrew D.; Logsdon, Benjamin A.; Lange, Leslie A.; Nalls, Michael A.; Zhang, Guosheng; Franceschini, Nora; Fox, Keolu; Lange, Ethan M.; Rich, Stephen S.; O’Donnell, Christopher J.; Jackson, Rebecca D.; Wallace, Robert B.; Chen, Zhao; Graubert, Timothy A.; Wilson, James G.; Tang, Hua; Lettre, Guillaume; Reiner, Alex P.; Ganesh, Santhi K.; Li, Yun

    2012-01-01

    Researchers have successfully applied exome sequencing to discover causal variants in selected individuals with familial, highly penetrant disorders. We demonstrate the utility of exome sequencing followed by imputation for discovering low-frequency variants associated with complex quantitative traits. We performed exome sequencing in a reference panel of 761 African Americans and then imputed newly discovered variants into a larger sample of more than 13,000 African Americans for association testing with the blood cell traits hemoglobin, hematocrit, white blood count, and platelet count. First, we illustrate the feasibility of our approach by demonstrating genome-wide-significant associations for variants that are not covered by conventional genotyping arrays; for example, one such association is that between higher platelet count and an MPL c.117G>T (p.Lys39Asn) variant encoding a p.Lys39Asn amino acid substitution of the thrombpoietin receptor gene (p = 1.5 × 10−11). Second, we identified an association between missense variants of LCT and higher white blood count (p = 4 × 10−13). Third, we identified low-frequency coding variants that might account for allelic heterogeneity at several known blood cell-associated loci: MPL c.754T>C (p.Tyr252His) was associated with higher platelet count; CD36 c.975T>G (p.Tyr325∗) was associated with lower platelet count; and several missense variants at the α-globin gene locus were associated with lower hemoglobin. By identifying low-frequency missense variants associated with blood cell traits not previously reported by genome-wide association studies, we establish that exome sequencing followed by imputation is a powerful approach to dissecting complex, genetically heterogeneous traits in large population-based studies. PMID:23103231

  13. India Allele Finder: a web-based annotation tool for identifying common alleles in next-generation sequencing data of Indian origin.

    PubMed

    Zhang, Jimmy F; James, Francis; Shukla, Anju; Girisha, Katta M; Paciorkowski, Alex R

    2017-06-27

    We built India Allele Finder, an online searchable database and command line tool, that gives researchers access to variant frequencies of Indian Telugu individuals, using publicly available fastq data from the 1000 Genomes Project. Access to appropriate population-based genomic variant annotation can accelerate the interpretation of genomic sequencing data. In particular, exome analysis of individuals of Indian descent will identify population variants not reflected in European exomes, complicating genomic analysis for such individuals. India Allele Finder offers improved ease-of-use to investigators seeking to identify and annotate sequencing data from Indian populations. We describe the use of India Allele Finder to identify common population variants in a disease quartet whole exome dataset, reducing the number of candidate single nucleotide variants from 84 to 7. India Allele Finder is freely available to investigators to annotate genomic sequencing data from Indian populations. Use of India Allele Finder allows efficient identification of population variants in genomic sequencing data, and is an example of a population-specific annotation tool that simplifies analysis and encourages international collaboration in genomics research.

  14. SIMPLEX: Cloud-Enabled Pipeline for the Comprehensive Analysis of Exome Sequencing Data

    PubMed Central

    Fischer, Maria; Snajder, Rene; Pabinger, Stephan; Dander, Andreas; Schossig, Anna; Zschocke, Johannes; Trajanoski, Zlatko; Stocker, Gernot

    2012-01-01

    In recent studies, exome sequencing has proven to be a successful screening tool for the identification of candidate genes causing rare genetic diseases. Although underlying targeted sequencing methods are well established, necessary data handling and focused, structured analysis still remain demanding tasks. Here, we present a cloud-enabled autonomous analysis pipeline, which comprises the complete exome analysis workflow. The pipeline combines several in-house developed and published applications to perform the following steps: (a) initial quality control, (b) intelligent data filtering and pre-processing, (c) sequence alignment to a reference genome, (d) SNP and DIP detection, (e) functional annotation of variants using different approaches, and (f) detailed report generation during various stages of the workflow. The pipeline connects the selected analysis steps, exposes all available parameters for customized usage, performs required data handling, and distributes computationally expensive tasks either on a dedicated high-performance computing infrastructure or on the Amazon cloud environment (EC2). The presented application has already been used in several research projects including studies to elucidate the role of rare genetic diseases. The pipeline is continuously tested and is publicly available under the GPL as a VirtualBox or Cloud image at http://simplex.i-med.ac.at; additional supplementary data is provided at http://www.icbi.at/exome. PMID:22870267

  15. The GENCODE exome: sequencing the complete human exome

    PubMed Central

    Coffey, Alison J; Kokocinski, Felix; Calafato, Maria S; Scott, Carol E; Palta, Priit; Drury, Eleanor; Joyce, Christopher J; LeProust, Emily M; Harrow, Jen; Hunt, Sarah; Lehesjoki, Anna-Elina; Turner, Daniel J; Hubbard, Tim J; Palotie, Aarno

    2011-01-01

    Sequencing the coding regions, the exome, of the human genome is one of the major current strategies to identify low frequency and rare variants associated with human disease traits. So far, the most widely used commercial exome capture reagents have mainly targeted the consensus coding sequence (CCDS) database. We report the design of an extended set of targets for capturing the complete human exome, based on annotation from the GENCODE consortium. The extended set covers an additional 5594 genes and 10.3 Mb compared with the current CCDS-based sets. The additional regions include potential disease genes previously inaccessible to exome resequencing studies, such as 43 genes linked to ion channel activity and 70 genes linked to protein kinase activity. In total, the new GENCODE exome set developed here covers 47.9 Mb and performed well in sequence capture experiments. In the sample set used in this study, we identified over 5000 SNP variants more in the GENCODE exome target (24%) than in the CCDS-based exome sequencing. PMID:21364695

  16. Evaluation of exome variants using the Ion Proton Platform to sequence error-prone regions.

    PubMed

    Seo, Heewon; Park, Yoomi; Min, Byung Joo; Seo, Myung Eui; Kim, Ju Han

    2017-01-01

    The Ion Proton sequencer from Thermo Fisher accurately determines sequence variants from target regions with a rapid turnaround time at a low cost. However, misleading variant-calling errors can occur. We performed a systematic evaluation and manual curation of read-level alignments for the 675 ultrarare variants reported by the Ion Proton sequencer from 27 whole-exome sequencing data but that are not present in either the 1000 Genomes Project and the Exome Aggregation Consortium. We classified positive variant calls into 393 highly likely false positives, 126 likely false positives, and 156 likely true positives, which comprised 58.2%, 18.7%, and 23.1% of the variants, respectively. We identified four distinct error patterns of variant calling that may be bioinformatically corrected when using different strategies: simplicity region, SNV cluster, peripheral sequence read, and base inversion. Local de novo assembly successfully corrected 201 (38.7%) of the 519 highly likely or likely false positives. We also demonstrate that the two sequencing kits from Thermo Fisher (the Ion PI Sequencing 200 kit V3 and the Ion PI Hi-Q kit) exhibit different error profiles across different error types. A refined calling algorithm with better polymerase may improve the performance of the Ion Proton sequencing platform.

  17. Single-Exome sequencing identified a novel RP2 mutation in a child with X-linked retinitis pigmentosa.

    PubMed

    Lim, Hassol; Park, Young-Mi; Lee, Jong-Keuk; Taek Lim, Hyun

    2016-10-01

    To present an efficient and successful application of a single-exome sequencing study in a family clinically diagnosed with X-linked retinitis pigmentosa. Exome sequencing study based on clinical examination data. An 8-year-old proband and his family. The proband and his family members underwent comprehensive ophthalmologic examinations. Exome sequencing was undertaken in the proband using Agilent SureSelect Human All Exon Kit and Illumina HiSeq 2000 platform. Bioinformatic analysis used Illumina pipeline with Burrows-Wheeler Aligner-Genome Analysis Toolkit (BWA-GATK), followed by ANNOVAR to perform variant functional annotation. All variants passing filter criteria were validated by Sanger sequencing to confirm familial segregation. Analysis of exome sequence data identified a novel frameshift mutation in RP2 gene resulting in a premature stop codon (c.665delC, p.Pro222fsTer237). Sanger sequencing revealed this mutation co-segregated with the disease phenotype in the child's family. We identified a novel causative mutation in RP2 from a single proband's exome sequence data analysis. This study highlights the effectiveness of the whole-exome sequencing in the genetic diagnosis of X-linked retinitis pigmentosa, over the conventional sequencing methods. Even using a single exome, exome sequencing technology would be able to pinpoint pathogenic variant(s) for X-linked retinitis pigmentosa, when properly applied with aid of adequate variant filtering strategy. Copyright © 2016 Canadian Ophthalmological Society. Published by Elsevier Inc. All rights reserved.

  18. Assessing copy number from exome sequencing and exome array CGH based on CNV spectrum in a large clinical cohort.

    PubMed

    Retterer, Kyle; Scuffins, Julie; Schmidt, Daniel; Lewis, Rachel; Pineda-Alvarez, Daniel; Stafford, Amanda; Schmidt, Lindsay; Warren, Stephanie; Gibellini, Federica; Kondakova, Anastasia; Blair, Amanda; Bale, Sherri; Matyakhina, Ludmila; Meck, Jeanne; Aradhya, Swaroop; Haverfield, Eden

    2015-08-01

    Detection of copy-number variation (CNV) is important for investigating many genetic disorders. Testing a large clinical cohort by array comparative genomic hybridization provides a deep perspective on the spectrum of pathogenic CNV. In this context, we describe a bioinformatics approach to extract CNV information from whole-exome sequencing and demonstrate its utility in clinical testing. Exon-focused arrays and whole-genome chromosomal microarray analysis were used to test 14,228 and 14,000 individuals, respectively. Based on these results, we developed an algorithm to detect deletions/duplications in whole-exome sequencing data and a novel whole-exome array. In the exon array cohort, we observed a positive detection rate of 2.4% (25 duplications, 318 deletions), of which 39% involved one or two exons. Chromosomal microarray analysis identified 3,345 CNVs affecting single genes (18%). We demonstrate that our whole-exome sequencing algorithm resolves CNVs of three or more exons. These results demonstrate the clinical utility of single-exon resolution in CNV assays. Our whole-exome sequencing algorithm approaches this resolution but is complemented by a whole-exome array to unambiguously identify intragenic CNVs and single-exon changes. These data illustrate the next advancements in CNV analysis through whole-exome sequencing and whole-exome array.Genet Med 17 8, 623-629.

  19. A gradient-boosting approach for filtering de novo mutations in parent-offspring trios.

    PubMed

    Liu, Yongzhuang; Li, Bingshan; Tan, Renjie; Zhu, Xiaolin; Wang, Yadong

    2014-07-01

    Whole-genome and -exome sequencing on parent-offspring trios is a powerful approach to identifying disease-associated genes by detecting de novo mutations in patients. Accurate detection of de novo mutations from sequencing data is a critical step in trio-based genetic studies. Existing bioinformatic approaches usually yield high error rates due to sequencing artifacts and alignment issues, which may either miss true de novo mutations or call too many false ones, making downstream validation and analysis difficult. In particular, current approaches have much worse specificity than sensitivity, and developing effective filters to discriminate genuine from spurious de novo mutations remains an unsolved challenge. In this article, we curated 59 sequence features in whole genome and exome alignment context which are considered to be relevant to discriminating true de novo mutations from artifacts, and then employed a machine-learning approach to classify candidates as true or false de novo mutations. Specifically, we built a classifier, named De Novo Mutation Filter (DNMFilter), using gradient boosting as the classification algorithm. We built the training set using experimentally validated true and false de novo mutations as well as collected false de novo mutations from an in-house large-scale exome-sequencing project. We evaluated DNMFilter's theoretical performance and investigated relative importance of different sequence features on the classification accuracy. Finally, we applied DNMFilter on our in-house whole exome trios and one CEU trio from the 1000 Genomes Project and found that DNMFilter could be coupled with commonly used de novo mutation detection approaches as an effective filtering approach to significantly reduce false discovery rate without sacrificing sensitivity. The software DNMFilter implemented using a combination of Java and R is freely available from the website at http://humangenome.duke.edu/software. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  20. Genetic evolution of pancreatic cancer: lessons learnt from the pancreatic cancer genome sequencing project

    PubMed Central

    Iacobuzio-Donahue, Christine A

    2012-01-01

    Pancreatic cancer is a disease caused by the accumulation of genetic alterations in specific genes. Elucidation of the human genome sequence, in conjunction with technical advances in the ability to perform whole exome sequencing, have provided new insight into the mutational spectra characteristic of this lethal tumour type. Most recently, exomic sequencing has been used to clarify the clonal evolution of pancreatic cancer as well as provide time estimates of pancreatic carcinogenesis, indicating that a long window of opportunity may exist for early detection of this disease while in the curative stage. Moving forward, these mutational analyses indicate potential targets for personalised diagnostic and therapeutic intervention as well as the optimal timing for intervention based on the natural history of pancreatic carcinogenesis and progression. PMID:21749982

  1. Best Practices and Joint Calling of the HumanExome BeadChip: The CHARGE Consortium

    PubMed Central

    Grove, Megan L.; Yu, Bing; Cochran, Barbara J.; Haritunians, Talin; Bis, Joshua C.; Taylor, Kent D.; Hansen, Mark; Borecki, Ingrid B.; Cupples, L. Adrienne; Fornage, Myriam; Gudnason, Vilmundur; Harris, Tamara B.; Kathiresan, Sekar; Kraaij, Robert; Launer, Lenore J.; Levy, Daniel; Liu, Yongmei; Mosley, Thomas; Peloso, Gina M.; Psaty, Bruce M.; Rich, Stephen S.; Rivadeneira, Fernando; Siscovick, David S.; Smith, Albert V.; Uitterlinden, Andre; van Duijn, Cornelia M.; Wilson, James G.; O’Donnell, Christopher J.; Rotter, Jerome I.; Boerwinkle, Eric

    2013-01-01

    Genotyping arrays are a cost effective approach when typing previously-identified genetic polymorphisms in large numbers of samples. One limitation of genotyping arrays with rare variants (e.g., minor allele frequency [MAF] <0.01) is the difficulty that automated clustering algorithms have to accurately detect and assign genotype calls. Combining intensity data from large numbers of samples may increase the ability to accurately call the genotypes of rare variants. Approximately 62,000 ethnically diverse samples from eleven Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) Consortium cohorts were genotyped with the Illumina HumanExome BeadChip across seven genotyping centers. The raw data files for the samples were assembled into a single project for joint calling. To assess the quality of the joint calling, concordance of genotypes in a subset of individuals having both exome chip and exome sequence data was analyzed. After exclusion of low performing SNPs on the exome chip and non-overlap of SNPs derived from sequence data, genotypes of 185,119 variants (11,356 were monomorphic) were compared in 530 individuals that had whole exome sequence data. A total of 98,113,070 pairs of genotypes were tested and 99.77% were concordant, 0.14% had missing data, and 0.09% were discordant. We report that joint calling allows the ability to accurately genotype rare variation using array technology when large sample sizes are available and best practices are followed. The cluster file from this experiment is available at www.chargeconsortium.com/main/exomechip. PMID:23874508

  2. Rare coding variation in paraoxonase-1 is associated with ischemic stroke in the NHLBI Exome Sequencing Project[S

    PubMed Central

    Kim, Daniel Seung; Crosslin, David R.; Auer, Paul L.; Suzuki, Stephanie M.; Marsillach, Judit; Burt, Amber A.; Gordon, Adam S.; Meschia, James F.; Nalls, Mike A.; Worrall, Bradford B.; Longstreth, W. T.; Gottesman, Rebecca F.; Furlong, Clement E.; Peters, Ulrike; Rich, Stephen S.; Nickerson, Deborah A.; Jarvik, Gail P.

    2014-01-01

    HDL-associated paraoxonase-1 (PON1) is an enzyme whose activity is associated with cerebrovascular disease. Common PON1 genetic variants have not been consistently associated with cerebrovascular disease. Rare coding variation that likely alters PON1 enzyme function may be more strongly associated with stroke. The National Heart, Lung, and Blood Institute Exome Sequencing Project sequenced the coding regions (exomes) of the genome for heart, lung, and blood-related phenotypes (including ischemic stroke). In this sample of 4,204 unrelated participants, 496 had verified, noncardioembolic ischemic stroke. After filtering, 28 nonsynonymous PON1 variants were identified. Analysis with the sequence kernel association test, adjusted for covariates, identified significant associations between PON1 variants and ischemic stroke (P = 3.01 × 10−3). Stratified analyses demonstrated a stronger association of PON1 variants with ischemic stroke in African ancestry (AA) participants (P = 5.03 × 10−3). Ethnic differences in the association between PON1 variants with stroke could be due to the effects of PON1Val109Ile (overall P = 7.88 × 10−3; AA P = 6.52 × 10−4), found at higher frequency in AA participants (1.16% vs. 0.02%) and whose protein is less stable than the common allele. In summary, rare genetic variation in PON1 was associated with ischemic stroke, with stronger associations identified in those of AA. Increased focus on PON1 enzyme function and its role in cerebrovascular disease is warranted. PMID:24711634

  3. CIDR

    Science.gov Websites

    NIH CIDR Program Studies For whole exome sequencing projects, we pretest all samples using a high -density SNP array (>200,000 markers). For custom targeted sequencing, we pretest all samples using a 96 pretest samples using a 96 SNP GoldenGate assay. This extensive pretesting allows us to unambiguously tie

  4. Jannovar: a java library for exome annotation.

    PubMed

    Jäger, Marten; Wang, Kai; Bauer, Sebastian; Smedley, Damian; Krawitz, Peter; Robinson, Peter N

    2014-05-01

    Transcript-based annotation and pedigree analysis are two basic steps in the computational analysis of whole-exome sequencing experiments in genetic diagnostics and disease-gene discovery projects. Here, we present Jannovar, a stand-alone Java application as well as a Java library designed to be used in larger software frameworks for exome and genome analysis. Jannovar uses an interval tree to identify all transcripts affected by a given variant, and provides Human Genome Variation Society-compliant annotations both for variants affecting coding sequences and splice junctions as well as untranslated regions and noncoding RNA transcripts. Jannovar can also perform family-based pedigree analysis with Variant Call Format (VCF) files with data from members of a family segregating a Mendelian disorder. Using a desktop computer, Jannovar requires a few seconds to annotate a typical VCF file with exome data. Jannovar is freely available under the BSD2 license. Source code as well as the Java application and library file can be downloaded from http://compbio.charite.de (with tutorial) and https://github.com/charite/jannovar. © 2014 WILEY PERIODICALS, INC.

  5. Targeted Re-Sequencing Emulsion PCR Panel for Myopathies: Results in 94 Cases.

    PubMed

    Punetha, Jaya; Kesari, Akanchha; Uapinyoying, Prech; Giri, Mamta; Clarke, Nigel F; Waddell, Leigh B; North, Kathryn N; Ghaoui, Roula; O'Grady, Gina L; Oates, Emily C; Sandaradura, Sarah A; Bönnemann, Carsten G; Donkervoort, Sandra; Plotz, Paul H; Smith, Edward C; Tesi-Rocha, Carolina; Bertorini, Tulio E; Tarnopolsky, Mark A; Reitter, Bernd; Hausmanowa-Petrusewicz, Irena; Hoffman, Eric P

    2016-05-27

    Molecular diagnostics in the genetic myopathies often requires testing of the largest and most complex transcript units in the human genome (DMD, TTN, NEB). Iteratively targeting single genes for sequencing has traditionally entailed high costs and long turnaround times. Exome sequencing has begun to supplant single targeted genes, but there are concerns regarding coverage and needed depth of the very large and complex genes that frequently cause myopathies. To evaluate efficiency of next-generation sequencing technologies to provide molecular diagnostics for patients with previously undiagnosed myopathies. We tested a targeted re-sequencing approach, using a 45 gene emulsion PCR myopathy panel, with subsequent sequencing on the Illumina platform in 94 undiagnosed patients. We compared the targeted re-sequencing approach to exome sequencing for 10 of these patients studied. We detected likely pathogenic mutations in 33 out of 94 patients with a molecular diagnostic rate of approximately 35%. The remaining patients showed variants of unknown significance (35/94 patients) or no mutations detected in the 45 genes tested (26/94 patients). Mutation detection rates for targeted re-sequencing vs. whole exome were similar in both methods; however exome sequencing showed better distribution of reads and fewer exon dropouts. Given that costs of highly parallel re-sequencing and whole exome sequencing are similar, and that exome sequencing now takes considerably less laboratory processing time than targeted re-sequencing, we recommend exome sequencing as the standard approach for molecular diagnostics of myopathies.

  6. A comparative analysis of exome capture.

    PubMed

    Parla, Jennifer S; Iossifov, Ivan; Grabill, Ian; Spector, Mona S; Kramer, Melissa; McCombie, W Richard

    2011-09-29

    Human exome resequencing using commercial target capture kits has been and is being used for sequencing large numbers of individuals to search for variants associated with various human diseases. We rigorously evaluated the capabilities of two solution exome capture kits. These analyses help clarify the strengths and limitations of those data as well as systematically identify variables that should be considered in the use of those data. Each exome kit performed well at capturing the targets they were designed to capture, which mainly corresponds to the consensus coding sequences (CCDS) annotations of the human genome. In addition, based on their respective targets, each capture kit coupled with high coverage Illumina sequencing produced highly accurate nucleotide calls. However, other databases, such as the Reference Sequence collection (RefSeq), define the exome more broadly, and so not surprisingly, the exome kits did not capture these additional regions. Commercial exome capture kits provide a very efficient way to sequence select areas of the genome at very high accuracy. Here we provide the data to help guide critical analyses of sequencing data derived from these products.

  7. Understanding the Human Genome Project -- A Fact Sheet

    MedlinePlus

    ... cost of sequencing whole exomes or genomes, groundbreaking comparative genomic studies are now identifiying the causes of ... the role of ethical, legal, and social implications research more important than ever. National Human Genome Research ...

  8. Integrated sequencing of exome and mRNA of large-sized single cells.

    PubMed

    Wang, Lily Yan; Guo, Jiajie; Cao, Wei; Zhang, Meng; He, Jiankui; Li, Zhoufang

    2018-01-10

    Current approaches of single cell DNA-RNA integrated sequencing are difficult to call SNPs, because a large amount of DNA and RNA is lost during DNA-RNA separation. Here, we performed simultaneous single-cell exome and transcriptome sequencing on individual mouse oocytes. Using microinjection, we kept the nuclei intact to avoid DNA loss, while retaining the cytoplasm inside the cell membrane, to maximize the amount of DNA and RNA captured from the single cell. We then conducted exome-sequencing on the isolated nuclei and mRNA-sequencing on the enucleated cytoplasm. For single oocytes, exome-seq can cover up to 92% of exome region with an average sequencing depth of 10+, while mRNA-sequencing reveals more than 10,000 expressed genes in enucleated cytoplasm, with similar performance for intact oocytes. This approach provides unprecedented opportunities to study DNA-RNA regulation, such as RNA editing at single nucleotide level in oocytes. In future, this method can also be applied to other large cells, including neurons, large dendritic cells and large tumour cells for integrated exome and transcriptome sequencing.

  9. Rare coding variation in paraoxonase-1 is associated with ischemic stroke in the NHLBI Exome Sequencing Project.

    PubMed

    Kim, Daniel Seung; Crosslin, David R; Auer, Paul L; Suzuki, Stephanie M; Marsillach, Judit; Burt, Amber A; Gordon, Adam S; Meschia, James F; Nalls, Mike A; Worrall, Bradford B; Longstreth, W T; Gottesman, Rebecca F; Furlong, Clement E; Peters, Ulrike; Rich, Stephen S; Nickerson, Deborah A; Jarvik, Gail P

    2014-06-01

    HDL-associated paraoxonase-1 (PON1) is an enzyme whose activity is associated with cerebrovascular disease. Common PON1 genetic variants have not been consistently associated with cerebrovascular disease. Rare coding variation that likely alters PON1 enzyme function may be more strongly associated with stroke. The National Heart, Lung, and Blood Institute Exome Sequencing Project sequenced the coding regions (exomes) of the genome for heart, lung, and blood-related phenotypes (including ischemic stroke). In this sample of 4,204 unrelated participants, 496 had verified, noncardioembolic ischemic stroke. After filtering, 28 nonsynonymous PON1 variants were identified. Analysis with the sequence kernel association test, adjusted for covariates, identified significant associations between PON1 variants and ischemic stroke (P = 3.01 × 10(-3)). Stratified analyses demonstrated a stronger association of PON1 variants with ischemic stroke in African ancestry (AA) participants (P = 5.03 × 10(-3)). Ethnic differences in the association between PON1 variants with stroke could be due to the effects of PON1Val109Ile (overall P = 7.88 × 10(-3); AA P = 6.52 × 10(-4)), found at higher frequency in AA participants (1.16% vs. 0.02%) and whose protein is less stable than the common allele. In summary, rare genetic variation in PON1 was associated with ischemic stroke, with stronger associations identified in those of AA. Increased focus on PON1 enzyme function and its role in cerebrovascular disease is warranted.

  10. Exome sequencing reveals novel genetic loci influencing obesity-related traits in Hispanic children

    USDA-ARS?s Scientific Manuscript database

    To perform whole exome sequencing in 928 Hispanic children and identify variants and genes associated with childhood obesity.Single-nucleotide variants (SNVs) were identified from Illumina whole exome sequencing data using integrated read mapping, variant calling, and an annotation pipeline (Mercury...

  11. Whole exome sequencing to estimate alloreactivity potential between donors and recipients in stem cell transplantation

    PubMed Central

    Sampson, Juliana K.; Sheth, Nihar U.; Koparde, Vishal N.; Scalora, Allison F.; Serrano, Myrna G.; Lee, Vladimir; Roberts, Catherine H.; Jameson-Lee, Max; Ferreira-Gonzalez, Andrea; Manjili, Masoud H.; Buck, Gregory A.; Neale, Michael C.; Toor, Amir A.

    2016-01-01

    Summary Whole exome sequencing (WES) was performed on stem cell transplant donor-recipient (D-R) pairs to determine the extent of potential antigenic variation at a molecular level. In a small cohort of D-R pairs, a high frequency of sequence variation was observed between the donor and recipient exomes independent of human leucocyte antigen (HLA) matching. Nonsynonymous, nonconservative single nucleotide polymorphisms were approximately twice as frequent in HLA-matched unrelated, compared with related D-R pairs. When mapped to individual chromosomes, these polymorphic nucleotides were uniformly distributed across the entire exome. In conclusion, WES reveals extensive nucleotide sequence variation in the exomes of HLA-matched donors and recipients. PMID:24749631

  12. Rare and Coding Region Genetic Variants Associated With Risk of Ischemic Stroke: The NHLBI Exome Sequence Project.

    PubMed

    Auer, Paul L; Nalls, Mike; Meschia, James F; Worrall, Bradford B; Longstreth, W T; Seshadri, Sudha; Kooperberg, Charles; Burger, Kathleen M; Carlson, Christopher S; Carty, Cara L; Chen, Wei-Min; Cupples, L Adrienne; DeStefano, Anita L; Fornage, Myriam; Hardy, John; Hsu, Li; Jackson, Rebecca D; Jarvik, Gail P; Kim, Daniel S; Lakshminarayan, Kamakshi; Lange, Leslie A; Manichaikul, Ani; Quinlan, Aaron R; Singleton, Andrew B; Thornton, Timothy A; Nickerson, Deborah A; Peters, Ulrike; Rich, Stephen S

    2015-07-01

    Stroke is the second leading cause of death and the third leading cause of years of life lost. Genetic factors contribute to stroke prevalence, and candidate gene and genome-wide association studies (GWAS) have identified variants associated with ischemic stroke risk. These variants often have small effects without obvious biological significance. Exome sequencing may discover predicted protein-altering variants with a potentially large effect on ischemic stroke risk. To investigate the contribution of rare and common genetic variants to ischemic stroke risk by targeting the protein-coding regions of the human genome. The National Heart, Lung, and Blood Institute (NHLBI) Exome Sequencing Project (ESP) analyzed approximately 6000 participants from numerous cohorts of European and African ancestry. For discovery, 365 cases of ischemic stroke (small-vessel and large-vessel subtypes) and 809 European ancestry controls were sequenced; for replication, 47 affected sibpairs concordant for stroke subtype and an African American case-control series were sequenced, with 1672 cases and 4509 European ancestry controls genotyped. The ESP's exome sequencing and genotyping started on January 1, 2010, and continued through June 30, 2012. Analyses were conducted on the full data set between July 12, 2012, and July 13, 2013. Discovery of new variants or genes contributing to ischemic stroke risk and subtype (primary analysis) and determination of support for protein-coding variants contributing to risk in previously published candidate genes (secondary analysis). We identified 2 novel genes associated with an increased risk of ischemic stroke: a protein-coding variant in PDE4DIP (rs1778155; odds ratio, 2.15; P = 2.63 × 10(-8)) with an intracellular signal transduction mechanism and in ACOT4 (rs35724886; odds ratio, 2.04; P = 1.24 × 10(-7)) with a fatty acid metabolism; confirmation of PDE4DIP was observed in affected sibpair families with large-vessel stroke subtype and in African Americans. Replication of protein-coding variants in candidate genes was observed for 2 previously reported GWAS associations: ZFHX3 (cardioembolic stroke) and ABCA1 (large-vessel stroke). Exome sequencing discovered 2 novel genes and mechanisms, PDE4DIP and ACOT4, associated with increased risk for ischemic stroke. In addition, ZFHX3 and ABCA1 were discovered to have protein-coding variants associated with ischemic stroke. These results suggest that genetic variation in novel pathways contributes to ischemic stroke risk and serves as a target for prediction, prevention, and therapy.

  13. Whole exome or genome sequencing: nurses need to prepare families for the possibilities.

    PubMed

    Prows, Cynthia A; Tran, Grace; Blosser, Beverly

    2014-12-01

    A discussion of whole exome sequencing and the type of possible results patients and families should be aware of before samples are obtained. To find the genetic cause of a rare disorder, whole exome sequencing analyses all known and suspected human genes from a single sample. Over 20,000 detected DNA variants in each individual exome must be considered as possibly causing disease or disregarded as not relevant to the person's disease. In the process, unexpected gene variants associated with known diseases unrelated to the primary purpose of the test may be incidentally discovered. Because family members' DNA samples are often needed, gene variants associated with known genetic diseases or predispositions for diseases can also be discovered in their samples. Discussion paper. PubMed 2009-2013, list of references in retrieved articles, Google Scholar. Nurses need a general understanding of the scope of potential genomic information that may be revealed with whole exome sequencing to provide support and guidance to individuals and families during their decision-making process, while waiting for results and after disclosure. Nurse scientists who want to use whole exome sequencing in their study design and methods must decide early in study development if they will return primary whole exome sequencing research results and if they will give research participants choices about learning incidental research results. It is critical that nurses translate their knowledge about whole exome sequencing into their patient education and patient advocacy roles and relevant programmes of research. © 2014 John Wiley & Sons Ltd.

  14. Whole exome sequencing to estimate alloreactivity potential between donors and recipients in stem cell transplantation.

    PubMed

    Sampson, Juliana K; Sheth, Nihar U; Koparde, Vishal N; Scalora, Allison F; Serrano, Myrna G; Lee, Vladimir; Roberts, Catherine H; Jameson-Lee, Max; Ferreira-Gonzalez, Andrea; Manjili, Masoud H; Buck, Gregory A; Neale, Michael C; Toor, Amir A

    2014-08-01

    Whole exome sequencing (WES) was performed on stem cell transplant donor-recipient (D-R) pairs to determine the extent of potential antigenic variation at a molecular level. In a small cohort of D-R pairs, a high frequency of sequence variation was observed between the donor and recipient exomes independent of human leucocyte antigen (HLA) matching. Nonsynonymous, nonconservative single nucleotide polymorphisms were approximately twice as frequent in HLA-matched unrelated, compared with related D-R pairs. When mapped to individual chromosomes, these polymorphic nucleotides were uniformly distributed across the entire exome. In conclusion, WES reveals extensive nucleotide sequence variation in the exomes of HLA-matched donors and recipients. © 2014 John Wiley & Sons Ltd.

  15. Deep whole-genome sequencing of 90 Han Chinese genomes.

    PubMed

    Lan, Tianming; Lin, Haoxiang; Zhu, Wenjuan; Laurent, Tellier Christian Asker Melchior; Yang, Mengcheng; Liu, Xin; Wang, Jun; Wang, Jian; Yang, Huanming; Xu, Xun; Guo, Xiaosen

    2017-09-01

    Next-generation sequencing provides a high-resolution insight into human genetic information. However, the focus of previous studies has primarily been on low-coverage data due to the high cost of sequencing. Although the 1000 Genomes Project and the Haplotype Reference Consortium have both provided powerful reference panels for imputation, low-frequency and novel variants remain difficult to discover and call with accuracy on the basis of low-coverage data. Deep sequencing provides an optimal solution for the problem of these low-frequency and novel variants. Although whole-exome sequencing is also a viable choice for exome regions, it cannot account for noncoding regions, sometimes resulting in the absence of important, causal variants. For Han Chinese populations, the majority of variants have been discovered based upon low-coverage data from the 1000 Genomes Project. However, high-coverage, whole-genome sequencing data are limited for any population, and a large amount of low-frequency, population-specific variants remain uncharacterized. We have performed whole-genome sequencing at a high depth (∼×80) of 90 unrelated individuals of Chinese ancestry, collected from the 1000 Genomes Project samples, including 45 Northern Han Chinese and 45 Southern Han Chinese samples. Eighty-three of these 90 have been sequenced by the 1000 Genomes Project. We have identified 12 568 804 single nucleotide polymorphisms, 2 074 210 short InDels, and 26 142 structural variations from these 90 samples. Compared to the Han Chinese data from the 1000 Genomes Project, we have found 7 000 629 novel variants with low frequency (defined as minor allele frequency < 5%), including 5 813 503 single nucleotide polymorphisms, 1 169 199 InDels, and 17 927 structural variants. Using deep sequencing data, we have built a greatly expanded spectrum of genetic variation for the Han Chinese genome. Compared to the 1000 Genomes Project, these Han Chinese deep sequencing data enhance the characterization of a large number of low-frequency, novel variants. This will be a valuable resource for promoting Chinese genetics research and medical development. Additionally, it will provide a valuable supplement to the 1000 Genomes Project, as well as to other human genome projects. © The Authors 2017. Published by Oxford University Press.

  16. XLID-Causing Mutations and Associated Genes Challenged in Light of Data From Large-Scale Human Exome Sequencing

    PubMed Central

    Piton, Amélie; Redin, Claire; Mandel, Jean-Louis

    2013-01-01

    Because of the unbalanced sex ratio (1.3–1.4 to 1) observed in intellectual disability (ID) and the identification of large ID-affected families showing X-linked segregation, much attention has been focused on the genetics of X-linked ID (XLID). Mutations causing monogenic XLID have now been reported in over 100 genes, most of which are commonly included in XLID diagnostic gene panels. Nonetheless, the boundary between true mutations and rare non-disease-causing variants often remains elusive. The sequencing of a large number of control X chromosomes, required for avoiding false-positive results, was not systematically possible in the past. Such information is now available thanks to large-scale sequencing projects such as the National Heart, Lung, and Blood (NHLBI) Exome Sequencing Project, which provides variation information on 10,563 X chromosomes from the general population. We used this NHLBI cohort to systematically reassess the implication of 106 genes proposed to be involved in monogenic forms of XLID. We particularly question the implication in XLID of ten of them (AGTR2, MAGT1, ZNF674, SRPX2, ATP6AP2, ARHGEF6, NXF5, ZCCHC12, ZNF41, and ZNF81), in which truncating variants or previously published mutations are observed at a relatively high frequency within this cohort. We also highlight 15 other genes (CCDC22, CLIC2, CNKSR2, FRMPD4, HCFC1, IGBP1, KIAA2022, KLF8, MAOA, NAA10, NLGN3, RPL10, SHROOM4, ZDHHC15, and ZNF261) for which replication studies are warranted. We propose that similar reassessment of reported mutations (and genes) with the use of data from large-scale human exome sequencing would be relevant for a wide range of other genetic diseases. PMID:23871722

  17. MACARON: A python framework to identify and re-annotate multi-base affected codons in whole genome/exome sequence data.

    PubMed

    Khan, Waqasuddin; Saripella, Ganapathi Varma-; Ludwig, Thomas; Cuppens, Tania; Thibord, Florian; Génin, Emmanuelle; Deleuze, Jean-Francois; Trégouët, David-Alexandre

    2018-05-03

    Predicted deleteriousness of coding variants is a frequently used criterion to filter out variants detected in next-generation sequencing projects and to select candidates impacting on the risk of human diseases. Most available dedicated tools implement a base-to-base annotation approach that could be biased in presence of several variants in the same genetic codon. We here proposed the MACARON program that, from a standard VCF file, identifies, re-annotates and predicts the amino acid change resulting from multiple single nucleotide variants (SNVs) within the same genetic codon. Applied to the whole exome dataset of 573 individuals, MACARON identifies 114 situations where multiple SNVs within a genetic codon induce an amino acid change that is different from those predicted by standard single SNV annotation tool. Such events are not uncommon and deserve to be studied in sequencing projects with inconclusive findings. MACARON is written in python with codes available on the GENMED website (www.genmed.fr). david-alexandre.tregouet@inserm.fr. Supplementary data are available at Bioinformatics online.

  18. Personal Genome Sequencing in Ostensibly Healthy Individuals and the PeopleSeq Consortium

    PubMed Central

    Linderman, Michael D.; Nielsen, Daiva E.; Green, Robert C.

    2016-01-01

    Thousands of ostensibly healthy individuals have had their exome or genome sequenced, but a much smaller number of these individuals have received any personal genomic results from that sequencing. We term those projects in which ostensibly healthy participants can receive sequencing-derived genetic findings and may also have access to their genomic data as participatory predispositional personal genome sequencing (PPGS). Here we are focused on genome sequencing applied in a pre-symptomatic context and so define PPGS to exclude diagnostic genome sequencing intended to identify the molecular cause of suspected or diagnosed genetic disease. In this report we describe the design of completed and underway PPGS projects, briefly summarize the results reported to date and introduce the PeopleSeq Consortium, a newly formed collaboration of PPGS projects designed to collect much-needed longitudinal outcome data. PMID:27023617

  19. Illustrative case studies in the return of exome and genome sequencing results

    PubMed Central

    Amendola, Laura M; Lautenbach, Denise; Scollon, Sarah; Bernhardt, Barbara; Biswas, Sawona; East, Kelly; Everett, Jessica; Gilmore, Marian J; Himes, Patricia; Raymond, Victoria M; Wynn, Julia; Hart, Ragan; Jarvik, Gail P

    2015-01-01

    Whole genome and exome sequencing tests are increasingly being ordered in clinical practice, creating a need for research exploring the return of results from these tests. A goal of the Clinical Sequencing and Exploratory Research (CSER) consortium is to gain experience with this process to develop best practice recommendations for offering exome and genome testing and returning results. Genetic counselors in the CSER consortium have an integral role in the return of results from these genomic sequencing tests and have gained valuable insight. We present seven emerging themes related to return of exome and genome sequencing results accompanied by case descriptions illustrating important lessons learned, counseling challenges specific to these tests and considerations for future research and practice. PMID:26478737

  20. Identifying Mendelian disease genes with the Variant Effect Scoring Tool

    PubMed Central

    2013-01-01

    Background Whole exome sequencing studies identify hundreds to thousands of rare protein coding variants of ambiguous significance for human health. Computational tools are needed to accelerate the identification of specific variants and genes that contribute to human disease. Results We have developed the Variant Effect Scoring Tool (VEST), a supervised machine learning-based classifier, to prioritize rare missense variants with likely involvement in human disease. The VEST classifier training set comprised ~ 45,000 disease mutations from the latest Human Gene Mutation Database release and another ~45,000 high frequency (allele frequency >1%) putatively neutral missense variants from the Exome Sequencing Project. VEST outperforms some of the most popular methods for prioritizing missense variants in carefully designed holdout benchmarking experiments (VEST ROC AUC = 0.91, PolyPhen2 ROC AUC = 0.86, SIFT4.0 ROC AUC = 0.84). VEST estimates variant score p-values against a null distribution of VEST scores for neutral variants not included in the VEST training set. These p-values can be aggregated at the gene level across multiple disease exomes to rank genes for probable disease involvement. We tested the ability of an aggregate VEST gene score to identify candidate Mendelian disease genes, based on whole-exome sequencing of a small number of disease cases. We used whole-exome data for two Mendelian disorders for which the causal gene is known. Considering only genes that contained variants in all cases, the VEST gene score ranked dihydroorotate dehydrogenase (DHODH) number 2 of 2253 genes in four cases of Miller syndrome, and myosin-3 (MYH3) number 2 of 2313 genes in three cases of Freeman Sheldon syndrome. Conclusions Our results demonstrate the potential power gain of aggregating bioinformatics variant scores into gene-level scores and the general utility of bioinformatics in assisting the search for disease genes in large-scale exome sequencing studies. VEST is available as a stand-alone software package at http://wiki.chasmsoftware.org and is hosted by the CRAVAT web server at http://www.cravat.us PMID:23819870

  1. Effect of Next-Generation Exome Sequencing Depth for Discovery of Diagnostic Variants.

    PubMed

    Kim, Kyung; Seong, Moon-Woo; Chung, Won-Hyong; Park, Sung Sup; Leem, Sangseob; Park, Won; Kim, Jihyun; Lee, KiYoung; Park, Rae Woong; Kim, Namshin

    2015-06-01

    Sequencing depth, which is directly related to the cost and time required for the generation, processing, and maintenance of next-generation sequencing data, is an important factor in the practical utilization of such data in clinical fields. Unfortunately, identifying an exome sequencing depth adequate for clinical use is a challenge that has not been addressed extensively. Here, we investigate the effect of exome sequencing depth on the discovery of sequence variants for clinical use. Toward this, we sequenced ten germ-line blood samples from breast cancer patients on the Illumina platform GAII(x) at a high depth of ~200×. We observed that most function-related diverse variants in the human exonic regions could be detected at a sequencing depth of 120×. Furthermore, investigation using a diagnostic gene set showed that the number of clinical variants identified using exome sequencing reached a plateau at an average sequencing depth of about 120×. Moreover, the phenomena were consistent across the breast cancer samples.

  2. CEQer: a graphical tool for copy number and allelic imbalance detection from whole-exome sequencing data.

    PubMed

    Piazza, Rocco; Magistroni, Vera; Pirola, Alessandra; Redaelli, Sara; Spinelli, Roberta; Redaelli, Serena; Galbiati, Marta; Valletta, Simona; Giudici, Giovanni; Cazzaniga, Giovanni; Gambacorti-Passerini, Carlo

    2013-01-01

    Copy number alterations (CNA) are common events occurring in leukaemias and solid tumors. Comparative Genome Hybridization (CGH) is actually the gold standard technique to analyze CNAs; however, CGH analysis requires dedicated instruments and is able to perform only low resolution Loss of Heterozygosity (LOH) analyses. Here we present CEQer (Comparative Exome Quantification analyzer), a new graphical, event-driven tool for CNA/allelic-imbalance (AI) coupled analysis of exome sequencing data. By using case-control matched exome data, CEQer performs a comparative digital exonic quantification to generate CNA data and couples this information with exome-wide LOH and allelic imbalance detection. This data is used to build mixed statistical/heuristic models allowing the identification of CNA/AI events. To test our tool, we initially used in silico generated data, then we performed whole-exome sequencing from 20 leukemic specimens and corresponding matched controls and we analyzed the results using CEQer. Taken globally, these analyses showed that the combined use of comparative digital exon quantification and LOH/AI allows generating very accurate CNA data. Therefore, we propose CEQer as an efficient, robust and user-friendly graphical tool for the identification of CNA/AI in the context of whole-exome sequencing data.

  3. Probable Diagnosis of a Patient with Niemann-Pick Disease Type C: Managing Pitfalls of Exome Sequencing.

    PubMed

    Zeiger, William A; Jamal, Nasheed I; Scheuner, Maren T; Pittman, Patricia; Raymond, Kimiyo M; Morra, Massimo; Mishra, Shri K

    2018-02-17

    Here, we present a case of a 31-year-old man with progressive cognitive decline, ataxia, and dystonia. Extensive laboratory, radiographic, and targeted genetic studies over the course of several years failed to yield a diagnosis. Initial whole exome sequencing through a commercial laboratory identified several variants of uncertain significance; however, follow-up clinical examination and testing ruled each of these out. Eventually, repeat whole exome sequencing identified a known pathogenic intronic variant in the NPC1 gene (NM_000271.4, c.1554-1009G>A) and an additional heterozygous exonic variant of uncertain significance in the NPC1 gene (NM_000271.4, c.2524T>C). Follow-up biochemical testing was consistent with a diagnosis of probable Niemann-Pick disease Type C (NP-C). This case illustrates the potential of whole exome sequencing for diagnosing rare complex neurologic diseases. It also identifies several potential common pitfalls that must be navigated by clinicians when interpreting commercial whole exome sequencing results.

  4. Ion Torren Semiconductor Sequencing Allows Rapid, Low Cost Sequencing of the Human Exome (7th Annual SFAF Meeting, 2012)

    ScienceCinema

    Jenkins, David

    2018-01-10

    David Jenkins on "Ion Torrent semiconductor sequencing allows rapid, low-cost sequencing of the human exome" at the 2012 Sequencing, Finishing, Analysis in the Future Meeting held June 5-7, 2012 in Santa Fe, New Mexico.

  5. Ion Torren Semiconductor Sequencing Allows Rapid, Low Cost Sequencing of the Human Exome (7th Annual SFAF Meeting, 2012)

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Jenkins, David

    David Jenkins on "Ion Torrent semiconductor sequencing allows rapid, low-cost sequencing of the human exome" at the 2012 Sequencing, Finishing, Analysis in the Future Meeting held June 5-7, 2012 in Santa Fe, New Mexico.

  6. Whole exome sequencing identifies a homozygous nonsense variation in ALMS1 gene in a patient with syndromic obesity.

    PubMed

    Das Bhowmik, Aneek; Gupta, Neerja; Dalal, Ashwin; Kabra, Madhulika

    In the present study we report on genetic analysis in a patient with developmental delay, truncal obesity and vision problem, to find the causative mutation. Whole exome sequencing was performed on genomic DNA extracted from whole blood of the patient which revealed a homozygous nonsense variant (c.2816T>A) in exon 8 of ALMS1 gene that results in a stop codon and premature truncation at codon 939 (p.L939Ter) of the protein. The mutation was confirmed by Sanger sequencing. Exome sequencing was helpful in establishing diagnosis of Alstrom syndrome in this patient. This case highlights the utility of exome sequencing in clinical practice. Copyright © 2016 Asia Oceania Association for the Study of Obesity. Published by Elsevier Ltd. All rights reserved.

  7. Scalable whole-exome sequencing of cell-free DNA reveals high concordance with metastatic tumors.

    PubMed

    Adalsteinsson, Viktor A; Ha, Gavin; Freeman, Samuel S; Choudhury, Atish D; Stover, Daniel G; Parsons, Heather A; Gydush, Gregory; Reed, Sarah C; Rotem, Denisse; Rhoades, Justin; Loginov, Denis; Livitz, Dimitri; Rosebrock, Daniel; Leshchiner, Ignaty; Kim, Jaegil; Stewart, Chip; Rosenberg, Mara; Francis, Joshua M; Zhang, Cheng-Zhong; Cohen, Ofir; Oh, Coyin; Ding, Huiming; Polak, Paz; Lloyd, Max; Mahmud, Sairah; Helvie, Karla; Merrill, Margaret S; Santiago, Rebecca A; O'Connor, Edward P; Jeong, Seong H; Leeson, Rachel; Barry, Rachel M; Kramkowski, Joseph F; Zhang, Zhenwei; Polacek, Laura; Lohr, Jens G; Schleicher, Molly; Lipscomb, Emily; Saltzman, Andrea; Oliver, Nelly M; Marini, Lori; Waks, Adrienne G; Harshman, Lauren C; Tolaney, Sara M; Van Allen, Eliezer M; Winer, Eric P; Lin, Nancy U; Nakabayashi, Mari; Taplin, Mary-Ellen; Johannessen, Cory M; Garraway, Levi A; Golub, Todd R; Boehm, Jesse S; Wagle, Nikhil; Getz, Gad; Love, J Christopher; Meyerson, Matthew

    2017-11-06

    Whole-exome sequencing of cell-free DNA (cfDNA) could enable comprehensive profiling of tumors from blood but the genome-wide concordance between cfDNA and tumor biopsies is uncertain. Here we report ichorCNA, software that quantifies tumor content in cfDNA from 0.1× coverage whole-genome sequencing data without prior knowledge of tumor mutations. We apply ichorCNA to 1439 blood samples from 520 patients with metastatic prostate or breast cancers. In the earliest tested sample for each patient, 34% of patients have ≥10% tumor-derived cfDNA, sufficient for standard coverage whole-exome sequencing. Using whole-exome sequencing, we validate the concordance of clonal somatic mutations (88%), copy number alterations (80%), mutational signatures, and neoantigens between cfDNA and matched tumor biopsies from 41 patients with ≥10% cfDNA tumor content. In summary, we provide methods to identify patients eligible for comprehensive cfDNA profiling, revealing its applicability to many patients, and demonstrate high concordance of cfDNA and metastatic tumor whole-exome sequencing.

  8. Exome sequencing for prenatal diagnosis of fetuses with sonographic abnormalities.

    PubMed

    Drury, Suzanne; Williams, Hywel; Trump, Natalie; Boustred, Christopher; Lench, Nicholas; Scott, Richard H; Chitty, Lyn S

    2015-10-01

    In the absence of aneuploidy or other pathogenic cytogenetic abnormality, fetuses with increased nuchal translucency (NT ≥ 3.5 mm) and/or other sonographic abnormalities have a greater incidence of genetic syndromes, but defining the underlying pathology can be challenging. Here, we investigate the value of whole exome sequencing in fetuses with sonographic abnormalities but normal microarray analysis. Whole exome sequencing was performed on DNA extracted from chorionic villi or amniocytes in 24 fetuses with unexplained ultrasound findings. In the first 14 cases sequencing was initially performed on fetal DNA only. For the remaining 10, the trio of fetus, mother and father was sequenced simultaneously. In 21% (5/24) cases, exome sequencing provided definitive diagnoses (Milroy disease, hypophosphatasia, achondrogenesis type 2, Freeman-Sheldon syndrome and Baraitser-Winter Syndrome). In a further case, a plausible diagnosis of orofaciodigital syndrome type 6 was made. In two others, a single mutation in an autosomal recessive gene was identified, but incomplete sequencing coverage precluded exclusion of the presence of a second mutation. Whole exome sequencing improves prenatal diagnosis in euploid fetuses with abnormal ultrasound scans. In order to expedite interpretation of results, trio sequencing should be employed, but interpretation can still be compromised by incomplete coverage of relevant genes. © 2015 John Wiley & Sons, Ltd.

  9. CEQer: A Graphical Tool for Copy Number and Allelic Imbalance Detection from Whole-Exome Sequencing Data

    PubMed Central

    Piazza, Rocco; Magistroni, Vera; Pirola, Alessandra; Redaelli, Sara; Spinelli, Roberta; Redaelli, Serena; Galbiati, Marta; Valletta, Simona; Giudici, Giovanni; Cazzaniga, Giovanni; Gambacorti-Passerini, Carlo

    2013-01-01

    Copy number alterations (CNA) are common events occurring in leukaemias and solid tumors. Comparative Genome Hybridization (CGH) is actually the gold standard technique to analyze CNAs; however, CGH analysis requires dedicated instruments and is able to perform only low resolution Loss of Heterozygosity (LOH) analyses. Here we present CEQer (Comparative Exome Quantification analyzer), a new graphical, event-driven tool for CNA/allelic-imbalance (AI) coupled analysis of exome sequencing data. By using case-control matched exome data, CEQer performs a comparative digital exonic quantification to generate CNA data and couples this information with exome-wide LOH and allelic imbalance detection. This data is used to build mixed statistical/heuristic models allowing the identification of CNA/AI events. To test our tool, we initially used in silico generated data, then we performed whole-exome sequencing from 20 leukemic specimens and corresponding matched controls and we analyzed the results using CEQer. Taken globally, these analyses showed that the combined use of comparative digital exon quantification and LOH/AI allows generating very accurate CNA data. Therefore, we propose CEQer as an efficient, robust and user-friendly graphical tool for the identification of CNA/AI in the context of whole-exome sequencing data. PMID:24124457

  10. Leveraging the Power of High Performance Computing for Next Generation Sequencing Data Analysis: Tricks and Twists from a High Throughput Exome Workflow

    PubMed Central

    Wonczak, Stephan; Thiele, Holger; Nieroda, Lech; Jabbari, Kamel; Borowski, Stefan; Sinha, Vishal; Gunia, Wilfried; Lang, Ulrich; Achter, Viktor; Nürnberg, Peter

    2015-01-01

    Next generation sequencing (NGS) has been a great success and is now a standard method of research in the life sciences. With this technology, dozens of whole genomes or hundreds of exomes can be sequenced in rather short time, producing huge amounts of data. Complex bioinformatics analyses are required to turn these data into scientific findings. In order to run these analyses fast, automated workflows implemented on high performance computers are state of the art. While providing sufficient compute power and storage to meet the NGS data challenge, high performance computing (HPC) systems require special care when utilized for high throughput processing. This is especially true if the HPC system is shared by different users. Here, stability, robustness and maintainability are as important for automated workflows as speed and throughput. To achieve all of these aims, dedicated solutions have to be developed. In this paper, we present the tricks and twists that we utilized in the implementation of our exome data processing workflow. It may serve as a guideline for other high throughput data analysis projects using a similar infrastructure. The code implementing our solutions is provided in the supporting information files. PMID:25942438

  11. Outcomes of Diagnostic Exome Sequencing in Patients With Diagnosed or Suspected Autism Spectrum Disorders.

    PubMed

    Rossi, Mari; El-Khechen, Dima; Black, Mary Helen; Farwell Hagman, Kelly D; Tang, Sha; Powis, Zöe

    2017-05-01

    Exome sequencing has recently been proved to be a successful diagnostic method for complex neurodevelopmental disorders. However, the diagnostic yield of exome sequencing for autism spectrum disorders has not been extensively evaluated in large cohorts to date. We performed diagnostic exome sequencing in a cohort of 163 individuals with autism spectrum disorder (66.3%) or autistic features (33.7%). The diagnostic yield observed in patients in our cohort was 25.8% (42 of 163) for positive or likely positive findings in characterized disease genes, while a candidate genetic etiology was reported for an additional 3.3% (4 of 120) of patients. Among the positive findings in the patients with autism spectrum disorder or autistic features, 61.9% were the result of de novo mutations. Patients presenting with psychiatric conditions or ataxia or paraplegia in addition to autism spectrum disorder or autistic features were significantly more likely to receive positive results compared with patients without these clinical features (95.6% vs 27.1%, P < 0.0001; 83.3% vs 21.2%, P < 0.0001, respectively). The majority of the positive findings were in recently identified autism spectrum disorder genes, supporting the importance of diagnostic exome sequencing for patients with autism spectrum disorder or autistic features as the causative genes might evade traditional sequential or panel testing. These results suggest that diagnostic exome sequencing would be an efficient primary diagnostic method for patients with autism spectrum disorders or autistic features. Moreover, our data may aid clinicians to better determine which subset of patients with autism spectrum disorder with additional clinical features would benefit the most from diagnostic exome sequencing. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.

  12. Impact of exome sequencing in inflammatory bowel disease

    PubMed Central

    Cardinale, Christopher J; Kelsen, Judith R; Baldassano, Robert N; Hakonarson, Hakon

    2013-01-01

    Approaches to understanding the genetic contribution to inflammatory bowel disease (IBD) have continuously evolved from family- and population-based epidemiology, to linkage analysis, and most recently, to genome-wide association studies (GWAS). The next stage in this evolution seems to be the sequencing of the exome, that is, the regions of the human genome which encode proteins. The GWAS approach has been very fruitful in identifying at least 163 loci as being associated with IBD, and now, exome sequencing promises to take our genetic understanding to the next level. In this review we will discuss the possible contributions that can be made by an exome sequencing approach both at the individual patient level to aid with disease diagnosis and future therapies, as well as in advancing knowledge of the pathogenesis of IBD. PMID:24187447

  13. XLID-causing mutations and associated genes challenged in light of data from large-scale human exome sequencing.

    PubMed

    Piton, Amélie; Redin, Claire; Mandel, Jean-Louis

    2013-08-08

    Because of the unbalanced sex ratio (1.3-1.4 to 1) observed in intellectual disability (ID) and the identification of large ID-affected families showing X-linked segregation, much attention has been focused on the genetics of X-linked ID (XLID). Mutations causing monogenic XLID have now been reported in over 100 genes, most of which are commonly included in XLID diagnostic gene panels. Nonetheless, the boundary between true mutations and rare non-disease-causing variants often remains elusive. The sequencing of a large number of control X chromosomes, required for avoiding false-positive results, was not systematically possible in the past. Such information is now available thanks to large-scale sequencing projects such as the National Heart, Lung, and Blood (NHLBI) Exome Sequencing Project, which provides variation information on 10,563 X chromosomes from the general population. We used this NHLBI cohort to systematically reassess the implication of 106 genes proposed to be involved in monogenic forms of XLID. We particularly question the implication in XLID of ten of them (AGTR2, MAGT1, ZNF674, SRPX2, ATP6AP2, ARHGEF6, NXF5, ZCCHC12, ZNF41, and ZNF81), in which truncating variants or previously published mutations are observed at a relatively high frequency within this cohort. We also highlight 15 other genes (CCDC22, CLIC2, CNKSR2, FRMPD4, HCFC1, IGBP1, KIAA2022, KLF8, MAOA, NAA10, NLGN3, RPL10, SHROOM4, ZDHHC15, and ZNF261) for which replication studies are warranted. We propose that similar reassessment of reported mutations (and genes) with the use of data from large-scale human exome sequencing would be relevant for a wide range of other genetic diseases. Copyright © 2013 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.

  14. Simple and efficient identification of rare recessive pathologically important sequence variants from next generation exome sequence data.

    PubMed

    Carr, Ian M; Morgan, Joanne; Watson, Christopher; Melnik, Svitlana; Diggle, Christine P; Logan, Clare V; Harrison, Sally M; Taylor, Graham R; Pena, Sergio D J; Markham, Alexander F; Alkuraya, Fowzan S; Black, Graeme C M; Ali, Manir; Bonthron, David T

    2013-07-01

    Massively parallel ("next generation") DNA sequencing (NGS) has quickly become the method of choice for seeking pathogenic mutations in rare uncharacterized monogenic diseases. Typically, before DNA sequencing, protein-coding regions are enriched from patient genomic DNA, representing either the entire genome ("exome sequencing") or selected mapped candidate loci. Sequence variants, identified as differences between the patient's and the human genome reference sequences, are then filtered according to various quality parameters. Changes are screened against datasets of known polymorphisms, such as dbSNP and the 1000 Genomes Project, in the effort to narrow the list of candidate causative variants. An increasing number of commercial services now offer to both generate and align NGS data to a reference genome. This potentially allows small groups with limited computing infrastructure and informatics skills to utilize this technology. However, the capability to effectively filter and assess sequence variants is still an important bottleneck in the identification of deleterious sequence variants in both research and diagnostic settings. We have developed an approach to this problem comprising a user-friendly suite of programs that can interactively analyze, filter and screen data from enrichment-capture NGS data. These programs ("Agile Suite") are particularly suitable for small-scale gene discovery or for diagnostic analysis. © 2013 WILEY PERIODICALS, INC.

  15. Rapid-Onset Obesity with Hypothalamic Dysfunction, Hypoventilation, and Autonomic Dysregulation (ROHHAD): exome sequencing of trios, monozygotic twins and tumours.

    PubMed

    Barclay, Sarah F; Rand, Casey M; Borch, Lauren A; Nguyen, Lisa; Gray, Paul A; Gibson, William T; Wilson, Richard J A; Gordon, Paul M K; Aung, Zaw; Berry-Kravis, Elizabeth M; Ize-Ludlow, Diego; Weese-Mayer, Debra E; Bech-Hansen, N Torben

    2015-08-25

    Rapid-onset Obesity with Hypothalamic Dysfunction, Hypoventilation, and Autonomic Dysregulation (ROHHAD) is thought to be a genetic disease caused by de novo mutations, though causative mutations have yet to be identified. We searched for de novo coding mutations among a carefully-diagnosed and clinically homogeneous cohort of 35 ROHHAD patients. We sequenced the exomes of seven ROHHAD trios, plus tumours from four of these patients and the unaffected monozygotic (MZ) twin of one (discovery cohort), to identify constitutional and somatic de novo sequence variants. We further analyzed this exome data to search for candidate genes under autosomal dominant and recessive models, and to identify structural variations. Candidate genes were tested by exome or Sanger sequencing in a replication cohort of 28 ROHHAD singletons. The analysis of the trio-based exomes found 13 de novo variants. However, no two patients had de novo variants in the same gene, and additional patient exomes and mutation analysis in the replication cohort did not provide strong genetic evidence to implicate any of these sequence variants in ROHHAD. Somatic comparisons revealed no coding differences between any blood and tumour samples, or between the two discordant MZ twins. Neither autosomal dominant nor recessive analysis yielded candidate genes for ROHHAD, and we did not identify any potentially causative structural variations. Clinical exome sequencing is highly unlikely to be a useful diagnostic test in patients with true ROHHAD. As ROHHAD has a high risk for fatality if not properly managed, it remains imperative to expand the search for non-exomic genetic risk factors, as well as to investigate other possible mechanisms of disease. In so doing, we will be able to confirm objectively the ROHHAD diagnosis and to contribute to our understanding of obesity, respiratory control, hypothalamic function, and autonomic regulation.

  16. Applications of the 1000 Genomes Project resources

    PubMed Central

    Zheng-Bradley, Xiangqun

    2017-01-01

    Abstract The 1000 Genomes Project created a valuable, worldwide reference for human genetic variation. Common uses of the 1000 Genomes dataset include genotype imputation supporting Genome-wide Association Studies, mapping expression Quantitative Trait Loci, filtering non-pathogenic variants from exome, whole genome and cancer genome sequencing projects, and genetic analysis of population structure and molecular evolution. In this article, we will highlight some of the multiple ways that the 1000 Genomes data can be and has been utilized for genetic studies. PMID:27436001

  17. Exome Sequencing in Suspected Monogenic Dyslipidemias

    PubMed Central

    Stitziel, Nathan O.; Peloso, Gina M.; Abifadel, Marianne; Cefalu, Angelo B.; Fouchier, Sigrid; Motazacker, M. Mahdi; Tada, Hayato; Larach, Daniel B.; Awan, Zuhier; Haller, Jorge F.; Pullinger, Clive R.; Varret, Mathilde; Rabès, Jean-Pierre; Noto, Davide; Tarugi, Patrizia; Kawashiri, Masa-aki; Nohara, Atsushi; Yamagishi, Masakazu; Risman, Marjorie; Deo, Rahul; Ruel, Isabelle; Shendure, Jay; Nickerson, Deborah A.; Wilson, James G.; Rich, Stephen S.; Gupta, Namrata; Farlow, Deborah N.; Neale, Benjamin M.; Daly, Mark J.; Kane, John P.; Freeman, Mason W.; Genest, Jacques; Rader, Daniel J.; Mabuchi, Hiroshi; Kastelein, John J.P.; Hovingh, G. Kees; Averna, Maurizio R.; Gabriel, Stacey; Boileau, Catherine; Kathiresan, Sekar

    2015-01-01

    Background Exome sequencing is a promising tool for gene mapping in Mendelian disorders. We utilized this technique in an attempt to identify novel genes underlying monogenic dyslipidemias. Methods and Results We performed exome sequencing on 213 selected family members from 41 kindreds with suspected Mendelian inheritance of extreme levels of low-density lipoprotein (LDL) cholesterol (after candidate gene sequencing excluded known genetic causes for high LDL cholesterol families) or high-density lipoprotein (HDL) cholesterol. We used standard analytic approaches to identify candidate variants and also assigned a polygenic score to each individual in order to account for their burden of common genetic variants known to influence lipid levels. In nine families, we identified likely pathogenic variants in known lipid genes (ABCA1, APOB, APOE, LDLR, LIPA, and PCSK9); however, we were unable to identify obvious genetic etiologies in the remaining 32 families despite follow-up analyses. We identified three factors that limited novel gene discovery: (1) imperfect sequencing coverage across the exome hid potentially causal variants; (2) large numbers of shared rare alleles within families obfuscated causal variant identification; and (3) individuals from 15% of families carried a significant burden of common lipid-related alleles, suggesting complex inheritance can masquerade as monogenic disease. Conclusions We identified the genetic basis of disease in nine of 41 families; however, none of these represented novel gene discoveries. Our results highlight the promise and limitations of exome sequencing as a discovery technique in suspected monogenic dyslipidemias. Considering the confounders identified may inform the design of future exome sequencing studies. PMID:25632026

  18. Exome sequencing for simultaneous mutation screening in children with hemophagocytic lymphohistiocytosis.

    PubMed

    Mukda, Ekchol; Trachoo, Objoon; Pasomsub, Ekawat; Tiyasirichokchai, Rawiphorn; Iemwimangsa, Nareenart; Sosothikul, Darintr; Chantratita, Wasun; Pakakasama, Samart

    2017-08-01

    In the present study, we used exome sequencing to analyze PRF1, UNC13D, STX11, and STXBP2, as well as genes associated with primary immunodeficiency disease (RAB27A, LYST, AP3B1, SH2D1A, ITK, CD27, XIAP, and MAGT1) in Thai children with hemophagocytic lymphohistiocytosis (HLH). We performed mutation analysis of HLH-associated genes in 25 Thai children using an exome sequencing method. Genetic variations found within these target genes were compared to exome sequencing data from 133 healthy individuals. Variants identified with minor allele frequencies <5% and novel mutations were confirmed using Sanger sequencing. Exome sequencing data revealed 101 non-synonymous single nucleotide polymorphisms (SNPs) in all subjects. These SNPs were classified as pathogenic (n = 1), likely pathogenic (n = 16), variant of unknown significance (n = 12), or benign variant (n = 72). Homozygous, compound heterozygous, and double-gene heterozygous variants, involving mutations in PRF1 (n = 3), UNC13D (n = 2), STXBP2 (n = 3), LYST (n = 3), XIAP (n = 2), AP3B1 (n = 1), RAB27A (n = 1), and MAGT1 (n = 1), were demonstrated in 12 patients. Novel mutations were found in most patients in this study. In conclusion, exome sequencing demonstrated the ability to identify rare genetic variants in HLH patients. This method is useful in the detection of mutations in multi-gene associated diseases.

  19. Extensive sequencing of seven human genomes to characterize benchmark reference materials

    PubMed Central

    Zook, Justin M.; Catoe, David; McDaniel, Jennifer; Vang, Lindsay; Spies, Noah; Sidow, Arend; Weng, Ziming; Liu, Yuling; Mason, Christopher E.; Alexander, Noah; Henaff, Elizabeth; McIntyre, Alexa B.R.; Chandramohan, Dhruva; Chen, Feng; Jaeger, Erich; Moshrefi, Ali; Pham, Khoa; Stedman, William; Liang, Tiffany; Saghbini, Michael; Dzakula, Zeljko; Hastie, Alex; Cao, Han; Deikus, Gintaras; Schadt, Eric; Sebra, Robert; Bashir, Ali; Truty, Rebecca M.; Chang, Christopher C.; Gulbahce, Natali; Zhao, Keyan; Ghosh, Srinka; Hyland, Fiona; Fu, Yutao; Chaisson, Mark; Xiao, Chunlin; Trow, Jonathan; Sherry, Stephen T.; Zaranek, Alexander W.; Ball, Madeleine; Bobe, Jason; Estep, Preston; Church, George M.; Marks, Patrick; Kyriazopoulou-Panagiotopoulou, Sofia; Zheng, Grace X.Y.; Schnall-Levin, Michael; Ordonez, Heather S.; Mudivarti, Patrice A.; Giorda, Kristina; Sheng, Ying; Rypdal, Karoline Bjarnesdatter; Salit, Marc

    2016-01-01

    The Genome in a Bottle Consortium, hosted by the National Institute of Standards and Technology (NIST) is creating reference materials and data for human genome sequencing, as well as methods for genome comparison and benchmarking. Here, we describe a large, diverse set of sequencing data for seven human genomes; five are current or candidate NIST Reference Materials. The pilot genome, NA12878, has been released as NIST RM 8398. We also describe data from two Personal Genome Project trios, one of Ashkenazim Jewish ancestry and one of Chinese ancestry. The data come from 12 technologies: BioNano Genomics, Complete Genomics paired-end and LFR, Ion Proton exome, Oxford Nanopore, Pacific Biosciences, SOLiD, 10X Genomics GemCode WGS, and Illumina exome and WGS paired-end, mate-pair, and synthetic long reads. Cell lines, DNA, and data from these individuals are publicly available. Therefore, we expect these data to be useful for revealing novel information about the human genome and improving sequencing technologies, SNP, indel, and structural variant calling, and de novo assembly. PMID:27271295

  20. Use Massive Parallel Sequencing and Exome Capture Technology to Sequence the Exome of Fanconi Anemia Children and Their Patents

    ClinicalTrials.gov

    2013-11-21

    Fanconi Anemia; Autosomal or Sex Linked Recessive Genetic Disease; Bone Marrow Hematopoiesis Failure, Multiple Congenital Abnormalities, and Susceptibility to Neoplastic Diseases.; Hematopoiesis Maintainance.

  1. Refining the structure and content of clinical genomic reports.

    PubMed

    Dorschner, Michael O; Amendola, Laura M; Shirts, Brian H; Kiedrowski, Lesli; Salama, Joseph; Gordon, Adam S; Fullerton, Stephanie M; Tarczy-Hornoch, Peter; Byers, Peter H; Jarvik, Gail P

    2014-03-01

    To effectively articulate the results of exome and genome sequencing we refined the structure and content of molecular test reports. To communicate results of a randomized control trial aimed at the evaluation of exome sequencing for clinical medicine, we developed a structured narrative report. With feedback from genetics and non-genetics professionals, we developed separate indication-specific and incidental findings reports. Standard test report elements were supplemented with research study-specific language, which highlighted the limitations of exome sequencing and provided detailed, structured results, and interpretations. The report format we developed to communicate research results can easily be transformed for clinical use by removal of research-specific statements and disclaimers. The development of clinical reports for exome sequencing has shown that accurate and open communication between the clinician and laboratory is ideally an ongoing process to address the increasing complexity of molecular genetic testing. © 2014 Wiley Periodicals, Inc.

  2. Refining the Structure and Content of Clinical Genomic Reports

    PubMed Central

    DORSCHNER, MICHAEL O.; AMENDOLA, LAURA M.; SHIRTS, BRIAN H.; KIEDROWSKI, LESLI; SALAMA, JOSEPH; GORDON, ADAM S.; FULLERTON, STEPHANIE M.; TARCZY-HORNOCH, PETER; BYERS, PETER H.; JARVIK, GAIL P.

    2014-01-01

    To effectively articulate the results of exome and genome sequencing we refined the structure and content of molecular test reports. To communicate results of a randomized control trial aimed at the evaluation of exome sequencing for clinical medicine, we developed a structured narrative report. With feedback from genetics and non-genetics professionals, we developed separate indication-specific and incidental findings reports. Standard test report elements were supplemented with research study-specific language, which highlighted the limitations of exome sequencing and provided detailed, structured results, and interpretations. The report format we developed to communicate research results can easily be transformed for clinical use by removal of research-specific statements and disclaimers. The development of clinical reports for exome sequencing has shown that accurate and open communication between the clinician and laboratory is ideally an ongoing process to address the increasing complexity of molecular genetic testing. PMID:24616401

  3. The rapid evolution of molecular genetic diagnostics in neuromuscular diseases.

    PubMed

    Volk, Alexander E; Kubisch, Christian

    2017-10-01

    The development of massively parallel sequencing (MPS) has revolutionized molecular genetic diagnostics in monogenic disorders. The present review gives a brief overview of different MPS-based approaches used in clinical diagnostics of neuromuscular disorders (NMDs) and highlights their advantages and limitations. MPS-based approaches like gene panel sequencing, (whole) exome sequencing, (whole) genome sequencing, and RNA sequencing have been used to identify the genetic cause in NMDs. Although gene panel sequencing has evolved as a standard test for heterogeneous diseases, it is still debated, mainly because of financial issues and unsolved problems of variant interpretation, whether genome sequencing (and to a lesser extent also exome sequencing) of single patients can already be regarded as routine diagnostics. However, it has been shown that the inclusion of parents and additional family members often leads to a substantial increase in the diagnostic yield in exome-wide/genome-wide MPS approaches. In addition, MPS-based RNA sequencing just enters the research and diagnostic scene. Next-generation sequencing increasingly enables the detection of the genetic cause in highly heterogeneous diseases like NMDs in an efficient and affordable way. Gene panel sequencing and family-based exome sequencing have been proven as potent and cost-efficient diagnostic tools. Although clinical validation and interpretation of genome sequencing is still challenging, diagnostic RNA sequencing represents a promising tool to bypass some hurdles of diagnostics using genomic DNA.

  4. Identification of rare paired box 3 variant in strabismus by whole exome sequencing

    PubMed Central

    Gong, Hui-Min; Wang, Jing; Xu, Jing; Zhou, Zhan-Yu; Li, Jing-Wen; Chen, Shu-Fang

    2017-01-01

    AIM To identify the potentially pathogenic gene variants that contributes to the etiology of strabismus. METHODS A Chinese pedigree with strabismus was collected and the exomes of two affected individuals were sequenced using the next-generation sequencing technology. The resulting variants from exome sequencing were filtered by subsequent bioinformatics methods and the candidate mutation was verified as heterozygous in the affected proposita and her mother by sanger sequencing. RESULTS Whole exome sequencing and filtering identified a nonsynonymous mutation c.434G-T transition in paired box 3 (PAX3) in the two affected individuals, which were predicted to be deleterious by more than 4 bioinformatics programs. This altered amino acid residue was located in the conserved PAX domain of PAX3. This gene encodes a member of the PAX family of transcription factors, which play critical roles during fetal development. Mutations in PAX3 were associated with Waardenburg syndrome with strabismus. CONCLUSION Our results report that the c.434G-T mutation (p.R145L) in PAX3 may contribute to strabismus, expanding our understanding of the causally relevant genes for this disorder. PMID:28861346

  5. Identification of rare paired box 3 variant in strabismus by whole exome sequencing.

    PubMed

    Gong, Hui-Min; Wang, Jing; Xu, Jing; Zhou, Zhan-Yu; Li, Jing-Wen; Chen, Shu-Fang

    2017-01-01

    To identify the potentially pathogenic gene variants that contributes to the etiology of strabismus. A Chinese pedigree with strabismus was collected and the exomes of two affected individuals were sequenced using the next-generation sequencing technology. The resulting variants from exome sequencing were filtered by subsequent bioinformatics methods and the candidate mutation was verified as heterozygous in the affected proposita and her mother by sanger sequencing. Whole exome sequencing and filtering identified a nonsynonymous mutation c.434G-T transition in paired box 3 (PAX3) in the two affected individuals, which were predicted to be deleterious by more than 4 bioinformatics programs. This altered amino acid residue was located in the conserved PAX domain of PAX3. This gene encodes a member of the PAX family of transcription factors, which play critical roles during fetal development. Mutations in PAX3 were associated with Waardenburg syndrome with strabismus. Our results report that the c.434G-T mutation (p.R145L) in PAX3 may contribute to strabismus, expanding our understanding of the causally relevant genes for this disorder.

  6. Quantifying rare, deleterious variation in 12 human cytochrome P450 drug-metabolism genes in a large-scale exome dataset.

    PubMed

    Gordon, Adam S; Tabor, Holly K; Johnson, Andrew D; Snively, Beverly M; Assimes, Themistocles L; Auer, Paul L; Ioannidis, John P A; Peters, Ulrike; Robinson, Jennifer G; Sucheston, Lara E; Wang, Danxin; Sotoodehnia, Nona; Rotter, Jerome I; Psaty, Bruce M; Jackson, Rebecca D; Herrington, David M; O'Donnell, Christopher J; Reiner, Alexander P; Rich, Stephen S; Rieder, Mark J; Bamshad, Michael J; Nickerson, Deborah A

    2014-04-15

    The study of genetic influences on drug response and efficacy ('pharmacogenetics') has existed for over 50 years. Yet, we still lack a complete picture of how genetic variation, both common and rare, affects each individual's responses to medications. Exome sequencing is a promising alternative method for pharmacogenetic discovery as it provides information on both common and rare variation in large numbers of individuals. Using exome data from 2203 AA and 4300 Caucasian individuals through the NHLBI Exome Sequencing Project, we conducted a survey of coding variation within 12 Cytochrome P450 (CYP) genes that are collectively responsible for catalyzing nearly 75% of all known Phase I drug oxidation reactions. In addition to identifying many polymorphisms with known pharmacogenetic effects, we discovered over 730 novel nonsynonymous alleles across the 12 CYP genes of interest. These alleles include many with diverse functional effects such as premature stop codons, aberrant splicesites and mutations at conserved active site residues. Our analysis considering both novel, predicted functional alleles as well as known, actionable CYP alleles reveals that rare, deleterious variation contributes markedly to the overall burden of pharmacogenetic alleles within the populations considered, and that the contribution of rare variation to this burden is over three times greater in AA individuals as compared with Caucasians. While most of these impactful alleles are individually rare, 7.6-11.7% of individuals interrogated in the study carry at least one newly described potentially deleterious alleles in a major drug-metabolizing CYP.

  7. Using whole-exome sequencing to identify variants inherited from mosaic parents

    PubMed Central

    Rios, Jonathan J; Delgado, Mauricio R

    2015-01-01

    Whole-exome sequencing (WES) has allowed the discovery of genes and variants causing rare human disease. This is often achieved by comparing nonsynonymous variants between unrelated patients, and particularly for sporadic or recessive disease, often identifies a single or few candidate genes for further consideration. However, despite the potential for this approach to elucidate the genetic cause of rare human disease, a majority of patients fail to realize a genetic diagnosis using standard exome analysis methods. Although genetic heterogeneity contributes to the difficulty of exome sequence analysis between patients, it remains plausible that rare human disease is not caused by de novo or recessive variants. Multiple human disorders have been described for which the variant was inherited from a phenotypically normal mosaic parent. Here we highlight the potential for exome sequencing to identify a reasonable number of candidate genes when dominant disease variants are inherited from a mosaic parent. We show the power of WES to identify a limited number of candidate genes using this disease model and how sequence coverage affects identification of mosaic variants by WES. We propose this analysis as an alternative to discover genetic causes of rare human disorders for which typical WES approaches fail to identify likely pathogenic variants. PMID:24986828

  8. Unexpected allelic heterogeneity and spectrum of mutations in Fowler syndrome revealed by next-generation exome sequencing.

    PubMed

    Lalonde, Emilie; Albrecht, Steffen; Ha, Kevin C H; Jacob, Karine; Bolduc, Nathalie; Polychronakos, Constantin; Dechelotte, Pierre; Majewski, Jacek; Jabado, Nada

    2010-08-01

    Protein coding genes constitute approximately 1% of the human genome but harbor 85% of the mutations with large effects on disease-related traits. Therefore, efficient strategies for selectively sequencing complete coding regions (i.e., "whole exome") have the potential to contribute our understanding of human diseases. We used a method for whole-exome sequencing coupling Agilent whole-exome capture to the Illumina DNA-sequencing platform, and investigated two unrelated fetuses from nonconsanguineous families with Fowler Syndrome (FS), a stereotyped phenotype lethal disease. We report novel germline mutations in feline leukemia virus subgroup C cellular-receptor-family member 2, FLVCR2, which has recently been shown to cause FS. Using this technology, we identified three types of genetic abnormalities: point-mutations, insertions-deletions, and intronic splice-site changes (first pathogenic report using this technology), in the fetuses who both were compound heterozygotes for the disease. Although revealing a high level of allelic heterogeneity and mutational spectrum in FS, this study further illustrates the successful application of whole-exome sequencing to uncover genetic defects in rare Mendelian disorders. Of importance, we show that we can identify genes underlying rare, monogenic and recessive diseases using a limited number of patients (n=2), in the absence of shared genetic heritage and in the presence of allelic heterogeneity.

  9. Whole exome sequencing identifies genetic variants in inherited thrombocytopenia with secondary qualitative function defects

    PubMed Central

    Johnson, Ben; Lowe, Gillian C.; Futterer, Jane; Lordkipanidzé, Marie; MacDonald, David; Simpson, Michael A.; Sanchez-Guiú, Isabel; Drake, Sian; Bem, Danai; Leo, Vincenzo; Fletcher, Sarah J.; Dawood, Ban; Rivera, José; Allsup, David; Biss, Tina; Bolton-Maggs, Paula HB; Collins, Peter; Curry, Nicola; Grimley, Charlotte; James, Beki; Makris, Mike; Motwani, Jayashree; Pavord, Sue; Talks, Katherine; Thachil, Jecko; Wilde, Jonathan; Williams, Mike; Harrison, Paul; Gissen, Paul; Mundell, Stuart; Mumford, Andrew; Daly, Martina E.; Watson, Steve P.; Morgan, Neil V.

    2016-01-01

    Inherited thrombocytopenias are a heterogeneous group of disorders characterized by abnormally low platelet counts which can be associated with abnormal bleeding. Next-generation sequencing has previously been employed in these disorders for the confirmation of suspected genetic abnormalities, and more recently in the discovery of novel disease-causing genes. However its full potential has not yet been exploited. Over the past 6 years we have sequenced the exomes from 55 patients, including 37 index cases and 18 additional family members, all of whom were recruited to the UK Genotyping and Phenotyping of Platelets study. All patients had inherited or sustained thrombocytopenia of unknown etiology with platelet counts varying from 11×109/L to 186×109/L. Of the 51 patients phenotypically tested, 37 (73%), had an additional secondary qualitative platelet defect. Using whole exome sequencing analysis we have identified “pathogenic” or “likely pathogenic” variants in 46% (17/37) of our index patients with thrombocytopenia. In addition, we report variants of uncertain significance in 12 index cases, including novel candidate genetic variants in previously unreported genes in four index cases. These results demonstrate that whole exome sequencing is an efficient method for elucidating potential pathogenic genetic variants in inherited thrombocytopenia. Whole exome sequencing also has the added benefit of discovering potentially pathogenic genetic variants for further study in novel genes not previously implicated in inherited thrombocytopenia. PMID:27479822

  10. Whole-Exome Sequencing Reveals GPIHBP1 Mutations in Infantile Colitis With Severe Hypertriglyceridemia

    PubMed Central

    Gonzaga-Jauregui, Claudia; Mir, Sabina; Penney, Samantha; Jhangiani, Shalini; Midgen, Craig; Finegold, Milton; Muzny, Donna M.; Wang, Min; Bacino, Carlos A.; Gibbs, Richard A.; Lupski, James R.; Kellermayer, Richard; Hanchard, Neil A.

    2014-01-01

    Severe congenital hypertriglyceridemia (HTG) is a rare disorder caused by mutations in genes affecting lipoprotein lipase (LPL) activity. Here we report a 5-week-old Hispanic girl with severe HTG (12,031 mg/dL, normal limit 150 mg/dL) who presented with the unusual combination of lower gastrointestinal bleeding and milky plasma. Initial colonoscopy was consistent with colitis, which resolved with reduction of triglycerides. After negative sequencing of the LPL gene, whole-exome sequencing revealed novel compound heterozygous mutations in GPIHBP1. Our study broadens the phenotype of GPIHBP1-associated HTG, reinforces the effectiveness of whole-exome sequencing in Mendelian diagnoses, and implicates triglycer-ides in gastrointestinal mucosal injury. PMID:24614124

  11. Whole-exome sequencing reveals GPIHBP1 mutations in infantile colitis with severe hypertriglyceridemia.

    PubMed

    Gonzaga-Jauregui, Claudia; Mir, Sabina; Penney, Samantha; Jhangiani, Shalini; Midgen, Craig; Finegold, Milton; Muzny, Donna M; Wang, Min; Bacino, Carlos A; Gibbs, Richard A; Lupski, James R; Kellermayer, Richard; Hanchard, Neil A

    2014-07-01

    Severe congenital hypertriglyceridemia (HTG) is a rare disorder caused by mutations in genes affecting lipoprotein lipase (LPL) activity. Here we report a 5-week-old Hispanic girl with severe HTG (12,031 mg/dL, normal limit 150 mg/dL) who presented with the unusual combination of lower gastrointestinal bleeding and milky plasma. Initial colonoscopy was consistent with colitis, which resolved with reduction of triglycerides. After negative sequencing of the LPL gene, whole-exome sequencing revealed novel compound heterozygous mutations in GPIHBP1. Our study broadens the phenotype of GPIHBP1-associated HTG, reinforces the effectiveness of whole-exome sequencing in Mendelian diagnoses, and implicates triglycerides in gastrointestinal mucosal injury.

  12. Self-guided management of exome and whole-genome sequencing results: changing the results return model.

    PubMed

    Yu, Joon-Ho; Jamal, Seema M; Tabor, Holly K; Bamshad, Michael J

    2013-09-01

    Researchers and clinicians face the practical and ethical challenge of if and how to offer for return the wide and varied scope of results available from individual exome sequencing and whole-genome sequencing. We argue that rather than viewing individual exome sequencing and whole-genome sequencing as a test for which results need to be "returned," that the technology should instead be framed as a dynamic resource of information from which results should be "managed" over the lifetime of an individual. We further suggest that individual exome sequencing and whole-genome sequencing results management is optimized using a self-guided approach that enables individuals to self-select among results offered for return in a convenient, confidential, personalized context that is responsive to their value system. This approach respects autonomy, allows individuals to maximize potential benefits of genomic information (beneficence) and minimize potential harms (nonmaleficence), and also preserves their right to an open future to the extent they desire or think is appropriate. We describe key challenges and advantages of such a self-guided management system and offer guidance on implementation using an information systems approach.

  13. cnvScan: a CNV screening and annotation tool to improve the clinical utility of computational CNV prediction from exome sequencing data.

    PubMed

    Samarakoon, Pubudu Saneth; Sorte, Hanne Sørmo; Stray-Pedersen, Asbjørg; Rødningen, Olaug Kristin; Rognes, Torbjørn; Lyle, Robert

    2016-01-14

    With advances in next generation sequencing technology and analysis methods, single nucleotide variants (SNVs) and indels can be detected with high sensitivity and specificity in exome sequencing data. Recent studies have demonstrated the ability to detect disease-causing copy number variants (CNVs) in exome sequencing data. However, exonic CNV prediction programs have shown high false positive CNV counts, which is the major limiting factor for the applicability of these programs in clinical studies. We have developed a tool (cnvScan) to improve the clinical utility of computational CNV prediction in exome data. cnvScan can accept input from any CNV prediction program. cnvScan consists of two steps: CNV screening and CNV annotation. CNV screening evaluates CNV prediction using quality scores and refines this using an in-house CNV database, which greatly reduces the false positive rate. The annotation step provides functionally and clinically relevant information using multiple source datasets. We assessed the performance of cnvScan on CNV predictions from five different prediction programs using 64 exomes from Primary Immunodeficiency (PIDD) patients, and identified PIDD-causing CNVs in three individuals from two different families. In summary, cnvScan reduces the time and effort required to detect disease-causing CNVs by reducing the false positive count and providing annotation. This improves the clinical utility of CNV detection in exome data.

  14. Applications of the 1000 Genomes Project resources.

    PubMed

    Zheng-Bradley, Xiangqun; Flicek, Paul

    2017-05-01

    The 1000 Genomes Project created a valuable, worldwide reference for human genetic variation. Common uses of the 1000 Genomes dataset include genotype imputation supporting Genome-wide Association Studies, mapping expression Quantitative Trait Loci, filtering non-pathogenic variants from exome, whole genome and cancer genome sequencing projects, and genetic analysis of population structure and molecular evolution. In this article, we will highlight some of the multiple ways that the 1000 Genomes data can be and has been utilized for genetic studies. © The Author 2016. Published by Oxford University Press.

  15. Exome copy number variation detection: Use of a pool of unrelated healthy tissue as reference sample.

    PubMed

    Wenric, Stephane; Sticca, Tiberio; Caberg, Jean-Hubert; Josse, Claire; Fasquelle, Corinne; Herens, Christian; Jamar, Mauricette; Max, Stéphanie; Gothot, André; Caers, Jo; Bours, Vincent

    2017-01-01

    An increasing number of bioinformatic tools designed to detect CNVs (copy number variants) in tumor samples based on paired exome data where a matched healthy tissue constitutes the reference have been published in the recent years. The idea of using a pool of unrelated healthy DNA as reference has previously been formulated but not thoroughly validated. As of today, the gold standard for CNV calling is still aCGH but there is an increasing interest in detecting CNVs by exome sequencing. We propose to design a metric allowing the comparison of two CNV profiles, independently of the technique used and assessed the validity of using a pool of unrelated healthy DNA instead of a matched healthy tissue as reference in exome-based CNV detection. We compared the CNV profiles obtained with three different approaches (aCGH, exome sequencing with a matched healthy tissue as reference, exome sequencing with a pool of eight unrelated healthy tissue as reference) on three multiple myeloma samples. We show that the usual analyses performed to compare CNV profiles (deletion/amplification ratios and CNV size distribution) lack in precision when confronted with low LRR values, as they only consider the binary status of each CNV. We show that the metric-based distance constitutes a more accurate comparison of two CNV profiles. Based on these analyses, we conclude that a reliable picture of CNV alterations in multiple myeloma samples can be obtained from whole-exome sequencing in the absence of a matched healthy sample. © 2016 WILEY PERIODICALS, INC.

  16. Exome capture sequencing identifies a novel mutation in BBS4

    PubMed Central

    Wang, Hui; Chen, Xianfeng; Dudinsky, Lynn; Patenia, Claire; Chen, Yiyun; Li, Yumei; Wei, Yue; Abboud, Emad B.; Al-Rajhi, Ali A.; Lewis, Richard Alan; Lupski, James R.; Mardon, Graeme; Gibbs, Richard A.; Perkins, Brian D.

    2011-01-01

    Purpose Leber congenital amaurosis (LCA) is one of the most severe eye dystrophies characterized by severe vision loss at an early stage and accounts for approximately 5% of all retinal dystrophies. The purpose of this study was to identify a novel LCA disease allele or gene and to develop an approach combining genetic mapping with whole exome sequencing. Methods Three patients from King Khaled Eye Specialist Hospital (KKESH205) underwent whole genome single nucleotide polymorphism genotyping, and a single candidate region was identified. Taking advantage of next-generation high-throughput DNA sequencing technologies, whole exome capture sequencing was performed on patient KKESH205#7. Sanger direct sequencing was used during the validation step. The zebrafish model was used to examine the function of the mutant allele. Results A novel missense mutation in Bardet-Biedl syndrome 4 protein (BBS4) was identified in a consanguineous family from Saudi Arabia. This missense mutation in the fifth exon (c.253G>C;p.E85Q) of BBS4 is likely a disease-causing mutation as it segregates with the disease. The mutation is not found in the single nucleotide polymorphism (SNP) database, the 1000 Genomes Project, or matching normal controls. Functional analysis of this mutation in zebrafish indicates that the G253C allele is pathogenic. Coinjection of the G253C allele cannot rescue the mislocalization of rhodopsin in the retina when BBS4 is knocked down by morpholino injection. Immunofluorescence analysis in cell culture shows that this missense mutation in BBS4 does not cause obvious defects in protein expression or pericentriolar localization. Conclusions This mutation likely mainly reduces or abolishes BBS4 function in the retina. Further studies of this allele will provide important insights concerning the pleiotropic nature of BBS4 function. PMID:22219648

  17. Identification of a Heterozygous SPG11 Mutation by Clinical Exome Sequencing in a Patient With Hereditary Spastic Paraplegia: A Case Report.

    PubMed

    Oh, Ja-Young; Do, Hyun Jung; Lee, Seungok; Jang, Ja-Hyun; Cho, Eun-Hae; Jang, Dae-Hyun

    2016-12-01

    Next-generation sequencing, such as whole-genome sequencing, whole-exome sequencing, and targeted panel sequencing have been applied for diagnosis of many genetic diseases, and are in the process of replacing the traditional methods of genetic analysis. Clinical exome sequencing (CES), which provides not only sequence variation data but also clinical interpretation, aids in reaching a final conclusion with regards to genetic diagnosis. Sequencing of genes with clinical relevance rather than whole exome sequencing might be more suitable for the diagnosis of known hereditary disease with genetic heterogeneity. Here, we present the clinical usefulness of CES for the diagnosis of hereditary spastic paraplegia (HSP). We report a case of patient who was strongly suspected of having HSP based on her clinical manifestations. HSP is one of the diseases with high genetic heterogeneity, the 72 different loci and 59 discovered genes identified so far. Therefore, traditional approach for diagnosis of HSP with genetic analysis is very challenging and time-consuming. CES with TruSight One Sequencing Panel, which enriches about 4,800 genes with clinical relevance, revealed compound heterozygous mutations in SPG11 . One workflow and one procedure can provide the results of genetic analysis, and CES with enrichment of clinically relevant genes is a cost-effective and time-saving diagnostic tool for diseases with genetic heterogeneity, including HSP.

  18. Exome analysis of a family with Wolff-Parkinson-White syndrome identifies a novel disease locus.

    PubMed

    Bowles, Neil E; Jou, Chuanchau J; Arrington, Cammon B; Kennedy, Brett J; Earl, Aubree; Matsunami, Norisada; Meyers, Lindsay L; Etheridge, Susan P; Saarel, Elizabeth V; Bleyl, Steven B; Yost, H Joseph; Yandell, Mark; Leppert, Mark F; Tristani-Firouzi, Martin; Gruber, Peter J

    2015-12-01

    Wolff-Parkinson-White (WPW) syndrome is a common cause of supraventricular tachycardia that carries a risk of sudden cardiac death. To date, mutations in only one gene, PRKAG2, which encodes the 5'-AMP-activated protein kinase subunit γ-2, have been identified as causative for WPW. DNA samples from five members of a family with WPW were analyzed by exome sequencing. We applied recently designed prioritization strategies (VAAST/pedigree VAAST) coupled with an ontology-based algorithm (Phevor) that reduced the number of potentially damaging variants to 10: a variant in KCNE2 previously associated with Long QT syndrome was also identified. Of these 11 variants, only MYH6 p.E1885K segregated with the WPW phenotype in all affected individuals and was absent in 10 unaffected family members. This variant was predicted to be damaging by in silico methods and is not present in the 1,000 genome and NHLBI exome sequencing project databases. Screening of a replication cohort of 47 unrelated WPW patients did not identify other likely causative variants in PRKAG2 or MYH6. MYH6 variants have been identified in patients with atrial septal defects, cardiomyopathies, and sick sinus syndrome. Our data highlight the pleiotropic nature of phenotypes associated with defects in this gene. © 2015 Wiley Periodicals, Inc.

  19. Exome Analysis of a Family with Wolff–Parkinson–White Syndrome Identifies a Novel Disease Locus

    PubMed Central

    Bowles, Neil E.; Jou, Chuanchau J.; Arrington, Cammon B.; Kennedy, Brett J.; Earl, Aubree; Matsunami, Norisada; Meyers, Lindsay L.; Etheridge, Susan P.; Saarel, Elizabeth V.; Bleyl, Steven B.; Yost, H. Joseph; Yandell, Mark; Leppert, Mark F.; Tristani-Firouzi, Martin; Gruber, Peter J.

    2016-01-01

    Wolff–Parkinson–White (WPW) syndrome is a common cause of supraventricular tachycardia that carries a risk of sudden cardiac death. To date, mutations in only one gene, PRKAG2, which encodes the 5’ -AMP-activated protein kinase subunit γ-2, have been identified as causative for WPW. DNA samples from five members of a family with WPW were analyzed by exome sequencing. We applied recently designed prioritization strategies (VAAST/pedigree VAAST) coupled with an ontology-based algorithm (Phevor) that reduced the number of potentially damaging variants to 10: a variant in KCNE2 previously associated with Long QT syndrome was also identified. Of these 11 variants, only MYH6 p.E1885K segregated with the WPW phenotype in all affected individuals and was absent in 10 unaffected family members. This variant was predicted to be damaging by in silico methods and is not present in the 1,000 genome and NHLBI exome sequencing project databases. Screening of a replication cohort of 47 unrelated WPW patients did not identify other likely causative variants in PRKAG2 or MYH6. MYH6 variants have been identified in patients with atrial septal defects, cardiomyopathies, and sick sinus syndrome. Our data highlight the pleiotropic nature of phenotypes associated with defects in this gene. PMID:26284702

  20. An automatic and efficient pipeline for disease gene identification through utilizing family-based sequencing data.

    PubMed

    Song, Dandan; Li, Ning; Liao, Lejian

    2015-01-01

    Due to the generation of enormous amounts of data at both lower costs as well as in shorter times, whole-exome sequencing technologies provide dramatic opportunities for identifying disease genes implicated in Mendelian disorders. Since upwards of thousands genomic variants can be sequenced in each exome, it is challenging to filter pathogenic variants in protein coding regions and reduce the number of missing true variants. Therefore, an automatic and efficient pipeline for finding disease variants in Mendelian disorders is designed by exploiting a combination of variants filtering steps to analyze the family-based exome sequencing approach. Recent studies on the Freeman-Sheldon disease are revisited and show that the proposed method outperforms other existing candidate gene identification methods.

  1. Whole exome sequencing identifies genetic variants in inherited thrombocytopenia with secondary qualitative function defects.

    PubMed

    Johnson, Ben; Lowe, Gillian C; Futterer, Jane; Lordkipanidzé, Marie; MacDonald, David; Simpson, Michael A; Sanchez-Guiú, Isabel; Drake, Sian; Bem, Danai; Leo, Vincenzo; Fletcher, Sarah J; Dawood, Ban; Rivera, José; Allsup, David; Biss, Tina; Bolton-Maggs, Paula Hb; Collins, Peter; Curry, Nicola; Grimley, Charlotte; James, Beki; Makris, Mike; Motwani, Jayashree; Pavord, Sue; Talks, Katherine; Thachil, Jecko; Wilde, Jonathan; Williams, Mike; Harrison, Paul; Gissen, Paul; Mundell, Stuart; Mumford, Andrew; Daly, Martina E; Watson, Steve P; Morgan, Neil V

    2016-10-01

    Inherited thrombocytopenias are a heterogeneous group of disorders characterized by abnormally low platelet counts which can be associated with abnormal bleeding. Next-generation sequencing has previously been employed in these disorders for the confirmation of suspected genetic abnormalities, and more recently in the discovery of novel disease-causing genes. However its full potential has not yet been exploited. Over the past 6 years we have sequenced the exomes from 55 patients, including 37 index cases and 18 additional family members, all of whom were recruited to the UK Genotyping and Phenotyping of Platelets study. All patients had inherited or sustained thrombocytopenia of unknown etiology with platelet counts varying from 11×10 9 /L to 186×10 9 /L. Of the 51 patients phenotypically tested, 37 (73%), had an additional secondary qualitative platelet defect. Using whole exome sequencing analysis we have identified "pathogenic" or "likely pathogenic" variants in 46% (17/37) of our index patients with thrombocytopenia. In addition, we report variants of uncertain significance in 12 index cases, including novel candidate genetic variants in previously unreported genes in four index cases. These results demonstrate that whole exome sequencing is an efficient method for elucidating potential pathogenic genetic variants in inherited thrombocytopenia. Whole exome sequencing also has the added benefit of discovering potentially pathogenic genetic variants for further study in novel genes not previously implicated in inherited thrombocytopenia. Copyright© Ferrata Storti Foundation.

  2. Whole exome sequencing for familial bicuspid aortic valve identifies putative variants.

    PubMed

    Martin, Lisa J; Pilipenko, Valentina; Kaufman, Kenneth M; Cripe, Linda; Kottyan, Leah C; Keddache, Mehdi; Dexheimer, Phillip; Weirauch, Matthew T; Benson, D Woodrow

    2014-10-01

    Bicuspid aortic valve (BAV) is the most common congenital cardiovascular malformation. Although highly heritable, few causal variants have been identified. The purpose of this study was to identify genetic variants underlying BAV by whole exome sequencing a multiplex BAV kindred. Whole exome sequencing was performed on 17 individuals from a single family (BAV=3; other cardiovascular malformation, 3). Postvariant calling error control metrics were established after examining the relationship between Mendelian inheritance error rate and coverage, quality score, and call rate. To determine the most effective approach to identifying susceptibility variants from among 54 674 variants passing error control metrics, we evaluated 3 variant selection strategies frequently used in whole exome sequencing studies plus extended family linkage. No putative rare, high-effect variants were identified in all affected but no unaffected individuals. Eight high-effect variants were identified by ≥2 of the commonly used selection strategies; however, these were either common in the general population (>10%) or present in the majority of the unaffected family members. However, using extended family linkage, 3 synonymous variants were identified; all 3 variants were identified by at least one other strategy. These results suggest that traditional whole exome sequencing approaches, which assume causal variants alter coding sense, may be insufficient for BAV and other complex traits. Identification of disease-associated variants is facilitated by the use of segregation within families. © 2014 American Heart Association, Inc.

  3. Whole-exome sequencing identified a variant in EFTUD2 gene in establishing a genetic diagnosis.

    PubMed

    Rengasamy Venugopalan, S; Farrow, E G; Lypka, M

    2017-06-01

    Craniofacial anomalies are complex and have an overlapping phenotype. Mandibulofacial Dysostosis and Oculo-Auriculo-Vertebral Spectrum are conditions that share common craniofacial phenotype and present a challenge in arriving at a diagnosis. In this report, we present a case of female proband who was given a differential diagnosis of Treacher Collins syndrome or Hemifacial Microsomia without certainty. Prior genetic testing reported negative for 22q deletion and FGFR screenings. The objective of this study was to demonstrate the critical role of whole-exome sequencing in establishing a genetic diagnosis of the proband. The participants were 14½-year-old affected female proband/parent trio. Proband/parent trio were enrolled in the study. Surgical tissue sample from the proband and parental blood samples were collected and prepared for whole-exome sequencing. Illumina HiSeq 2500 instrument was used for sequencing (125 nucleotide reads/84X coverage). Analyses of variants were performed using custom-developed software, RUNES and VIKING. Variant analyses following whole-exome sequencing identified a heterozygous de novo pathogenic variant, c.259C>T (p.Gln87*), in EFTUD2 (NM_004247.3) gene in the proband. Previous studies have reported that the variants in EFTUD2 gene were associated with Mandibulofacial Dysostosis with Microcephaly. Patients with facial asymmetry, micrognathia, choanal atresia and microcephaly should be analyzed for variants in EFTUD2 gene. Next-generation sequencing techniques, such as whole-exome sequencing offer great promise to improve the understanding of etiologies of sporadic genetic diseases. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  4. Allele-specific copy-number discovery from whole-genome and whole-exome sequencing

    PubMed Central

    Wang, WeiBo; Wang, Wei; Sun, Wei; Crowley, James J.; Szatkiewicz, Jin P.

    2015-01-01

    Copy-number variants (CNVs) are a major form of genetic variation and a risk factor for various human diseases, so it is crucial to accurately detect and characterize them. It is conceivable that allele-specific reads from high-throughput sequencing data could be leveraged to both enhance CNV detection and produce allele-specific copy number (ASCN) calls. Although statistical methods have been developed to detect CNVs using whole-genome sequence (WGS) and/or whole-exome sequence (WES) data, information from allele-specific read counts has not yet been adequately exploited. In this paper, we develop an integrated method, called AS-GENSENG, which incorporates allele-specific read counts in CNV detection and estimates ASCN using either WGS or WES data. To evaluate the performance of AS-GENSENG, we conducted extensive simulations, generated empirical data using existing WGS and WES data sets and validated predicted CNVs using an independent methodology. We conclude that AS-GENSENG not only predicts accurate ASCN calls but also improves the accuracy of total copy number calls, owing to its unique ability to exploit information from both total and allele-specific read counts while accounting for various experimental biases in sequence data. Our novel, user-friendly and computationally efficient method and a complete analytic protocol is freely available at https://sourceforge.net/projects/asgenseng/. PMID:25883151

  5. Integrating multiple genomic data to predict disease-causing nonsynonymous single nucleotide variants in exome sequencing studies.

    PubMed

    Wu, Jiaxin; Li, Yanda; Jiang, Rui

    2014-03-01

    Exome sequencing has been widely used in detecting pathogenic nonsynonymous single nucleotide variants (SNVs) for human inherited diseases. However, traditional statistical genetics methods are ineffective in analyzing exome sequencing data, due to such facts as the large number of sequenced variants, the presence of non-negligible fraction of pathogenic rare variants or de novo mutations, and the limited size of affected and normal populations. Indeed, prevalent applications of exome sequencing have been appealing for an effective computational method for identifying causative nonsynonymous SNVs from a large number of sequenced variants. Here, we propose a bioinformatics approach called SPRING (Snv PRioritization via the INtegration of Genomic data) for identifying pathogenic nonsynonymous SNVs for a given query disease. Based on six functional effect scores calculated by existing methods (SIFT, PolyPhen2, LRT, MutationTaster, GERP and PhyloP) and five association scores derived from a variety of genomic data sources (gene ontology, protein-protein interactions, protein sequences, protein domain annotations and gene pathway annotations), SPRING calculates the statistical significance that an SNV is causative for a query disease and hence provides a means of prioritizing candidate SNVs. With a series of comprehensive validation experiments, we demonstrate that SPRING is valid for diseases whose genetic bases are either partly known or completely unknown and effective for diseases with a variety of inheritance styles. In applications of our method to real exome sequencing data sets, we show the capability of SPRING in detecting causative de novo mutations for autism, epileptic encephalopathies and intellectual disability. We further provide an online service, the standalone software and genome-wide predictions of causative SNVs for 5,080 diseases at http://bioinfo.au.tsinghua.edu.cn/spring.

  6. A computational method for estimating the PCR duplication rate in DNA and RNA-seq experiments.

    PubMed

    Bansal, Vikas

    2017-03-14

    PCR amplification is an important step in the preparation of DNA sequencing libraries prior to high-throughput sequencing. PCR amplification introduces redundant reads in the sequence data and estimating the PCR duplication rate is important to assess the frequency of such reads. Existing computational methods do not distinguish PCR duplicates from "natural" read duplicates that represent independent DNA fragments and therefore, over-estimate the PCR duplication rate for DNA-seq and RNA-seq experiments. In this paper, we present a computational method to estimate the average PCR duplication rate of high-throughput sequence datasets that accounts for natural read duplicates by leveraging heterozygous variants in an individual genome. Analysis of simulated data and exome sequence data from the 1000 Genomes project demonstrated that our method can accurately estimate the PCR duplication rate on paired-end as well as single-end read datasets which contain a high proportion of natural read duplicates. Further, analysis of exome datasets prepared using the Nextera library preparation method indicated that 45-50% of read duplicates correspond to natural read duplicates likely due to fragmentation bias. Finally, analysis of RNA-seq datasets from individuals in the 1000 Genomes project demonstrated that 70-95% of read duplicates observed in such datasets correspond to natural duplicates sampled from genes with high expression and identified outlier samples with a 2-fold greater PCR duplication rate than other samples. The method described here is a useful tool for estimating the PCR duplication rate of high-throughput sequence datasets and for assessing the fraction of read duplicates that correspond to natural read duplicates. An implementation of the method is available at https://github.com/vibansal/PCRduplicates .

  7. Exome sequencing of a multigenerational human pedigree.

    PubMed

    Hedges, Dale J; Hedges, Dale; Burges, Dan; Powell, Eric; Almonte, Cherylyn; Huang, Jia; Young, Stuart; Boese, Benjamin; Schmidt, Mike; Pericak-Vance, Margaret A; Martin, Eden; Zhang, Xinmin; Harkins, Timothy T; Züchner, Stephan

    2009-12-14

    Over the next few years, the efficient use of next-generation sequencing (NGS) in human genetics research will depend heavily upon the effective mechanisms for the selective enrichment of genomic regions of interest. Recently, comprehensive exome capture arrays have become available for targeting approximately 33 Mb or approximately 180,000 coding exons across the human genome. Selective genomic enrichment of the human exome offers an attractive option for new experimental designs aiming to quickly identify potential disease-associated genetic variants, especially in family-based studies. We have evaluated a 2.1 M feature human exome capture array on eight individuals from a three-generation family pedigree. We were able to cover up to 98% of the targeted bases at a long-read sequence read depth of > or = 3, 86% at a read depth of > or = 10, and over 50% of all targets were covered with > or = 20 reads. We identified up to 14,284 SNPs and small indels per individual exome, with up to 1,679 of these representing putative novel polymorphisms. Applying the conservative genotype calling approach HCDiff, the average rate of detection of a variant allele based on Illumina 1 M BeadChips genotypes was 95.2% at > or = 10x sequence. Further, we propose an advantageous genotype calling strategy for low covered targets that empirically determines cut-off thresholds at a given coverage depth based on existing genotype data. Application of this method was able to detect >99% of SNPs covered > or = 8x. Our results offer guidance for "real-world" applications in human genetics and provide further evidence that microarray-based exome capture is an efficient and reliable method to enrich for chromosomal regions of interest in next-generation sequencing experiments.

  8. Human genetics and genomics a decade after the release of the draft sequence of the human genome.

    PubMed

    Naidoo, Nasheen; Pawitan, Yudi; Soong, Richie; Cooper, David N; Ku, Chee-Seng

    2011-10-01

    Substantial progress has been made in human genetics and genomics research over the past ten years since the publication of the draft sequence of the human genome in 2001. Findings emanating directly from the Human Genome Project, together with those from follow-on studies, have had an enormous impact on our understanding of the architecture and function of the human genome. Major developments have been made in cataloguing genetic variation, the International HapMap Project, and with respect to advances in genotyping technologies. These developments are vital for the emergence of genome-wide association studies in the investigation of complex diseases and traits. In parallel, the advent of high-throughput sequencing technologies has ushered in the 'personal genome sequencing' era for both normal and cancer genomes, and made possible large-scale genome sequencing studies such as the 1000 Genomes Project and the International Cancer Genome Consortium. The high-throughput sequencing and sequence-capture technologies are also providing new opportunities to study Mendelian disorders through exome sequencing and whole-genome sequencing. This paper reviews these major developments in human genetics and genomics over the past decade.

  9. Human genetics and genomics a decade after the release of the draft sequence of the human genome

    PubMed Central

    2011-01-01

    Substantial progress has been made in human genetics and genomics research over the past ten years since the publication of the draft sequence of the human genome in 2001. Findings emanating directly from the Human Genome Project, together with those from follow-on studies, have had an enormous impact on our understanding of the architecture and function of the human genome. Major developments have been made in cataloguing genetic variation, the International HapMap Project, and with respect to advances in genotyping technologies. These developments are vital for the emergence of genome-wide association studies in the investigation of complex diseases and traits. In parallel, the advent of high-throughput sequencing technologies has ushered in the 'personal genome sequencing' era for both normal and cancer genomes, and made possible large-scale genome sequencing studies such as the 1000 Genomes Project and the International Cancer Genome Consortium. The high-throughput sequencing and sequence-capture technologies are also providing new opportunities to study Mendelian disorders through exome sequencing and whole-genome sequencing. This paper reviews these major developments in human genetics and genomics over the past decade. PMID:22155605

  10. Novel pathogenic variant (c.3178G>A) in the SMC1A gene in a family with Cornelia de Lange syndrome identified by exome sequencing.

    PubMed

    Jang, Mi Ae; Lee, Chang Woo; Kim, Jin Kyung; Ki, Chang Seok

    2015-11-01

    Cornelia de Lange syndrome (CdLS) is a clinically and genetically heterogeneous congenital anomaly. Mutations in the NIPBL gene account for a half of the affected individuals. We describe a family with CdLS carrying a novel pathogenic variant of the SMC1A gene identified by exome sequencing. The proband was a 3-yr-old boy presenting with a developmental delay. He had distinctive facial features without major structural anomalies and tested negative for the NIPBL gene. His younger sister, mother, and maternal grandmother presented with mild mental retardation. By exome sequencing of the proband, a novel SMC1A variant, c.3178G>A, was identified, which was expected to cause an amino acid substitution (p.Glu1060Lys) in the highly conserved coiled-coil domain of the SMC1A protein. Sanger sequencing confirmed that the three female relatives with mental retardation also carry this variant. Our results reveal that SMC1A gene defects are associated with milder phenotypes of CdLS. Furthermore, we showed that exome sequencing could be a useful tool to identify pathogenic variants in patients with CdLS.

  11. Scalable Open Science Approach for Mutation Calling of Tumor Exomes Using Multiple Genomic Pipelines.

    PubMed

    Ellrott, Kyle; Bailey, Matthew H; Saksena, Gordon; Covington, Kyle R; Kandoth, Cyriac; Stewart, Chip; Hess, Julian; Ma, Singer; Chiotti, Kami E; McLellan, Michael; Sofia, Heidi J; Hutter, Carolyn; Getz, Gad; Wheeler, David; Ding, Li

    2018-03-28

    The Cancer Genome Atlas (TCGA) cancer genomics dataset includes over 10,000 tumor-normal exome pairs across 33 different cancer types, in total >400 TB of raw data files requiring analysis. Here we describe the Multi-Center Mutation Calling in Multiple Cancers project, our effort to generate a comprehensive encyclopedia of somatic mutation calls for the TCGA data to enable robust cross-tumor-type analyses. Our approach accounts for variance and batch effects introduced by the rapid advancement of DNA extraction, hybridization-capture, sequencing, and analysis methods over time. We present best practices for applying an ensemble of seven mutation-calling algorithms with scoring and artifact filtering. The dataset created by this analysis includes 3.5 million somatic variants and forms the basis for PanCan Atlas papers. The results have been made available to the research community along with the methods used to generate them. This project is the result of collaboration from a number of institutes and demonstrates how team science drives extremely large genomics projects. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.

  12. Whole Exome Sequencing of Pediatric Gastric Adenocarcinoma Reveals an Atypical Presentation of Li-Fraumeni Syndrome

    PubMed Central

    Chang, Vivian Y.; Federman, Noah; Martinez-Agosto, Julian; Tatishchev, Sergei F.; Nelson, Stanley F.

    2014-01-01

    Background Gastric adenocarcinoma is a rare diagnosis in childhood. A 14-year old male patient presented with metastatic gastric adenocarcinoma, and a strong family history of colon cancer. Clinical sequencing of CDH1 and APC were negative. Whole exome sequencing was therefore applied to capture the majority of protein-coding regions for the identification of single-nucleotide variants, small insertion/deletions, and copy number abnormalities in the patient’s germline as well as primary tumor. Materials and Methods DNA was extracted from the patient’s blood, primary tumor, and the unaffected mother’s blood. DNA libraries were constructed and sequenced on Illumina HiSeq2000. Data were post-processed using Picard and Samtools, then analyzed with the Genome Analysis Toolkit. Variants were annotated using an in-house Ensembl-based program. Copy number was assessed using ExomeCNV. Results Each sample was sequenced to a mean depth of coverage of greater than 120×. A rare non-synonymous coding SNV in TP53 was identified in the germline. There were 10 somatic cancer protein-damaging variants that were not observed in the unaffected mother genome. ExomeCNV comparing tumor to the patient’s germline, identified abnormal copy number, spanning 6,946 genes. Conclusion We present an unusual case of Li-Fraumeni detected by whole exome sequencing. There were also likely driver somatic mutations in the gastric adenocarcinoma. These results highlight the need for more thorough and broad scale germline and cancer analyses to accurately inform patients of inherited risk to cancer and to identify somatic mutations. PMID:23015295

  13. Whole-exome sequencing identifies novel homozygous mutation in NPAS2 in family with nonobstructive azoospermia.

    PubMed

    Ramasamy, Ranjith; Bakırcıoğlu, M Emre; Cengiz, Cenk; Karaca, Ender; Scovell, Jason; Jhangiani, Shalini N; Akdemir, Zeynep C; Bainbridge, Matthew; Yu, Yao; Huff, Chad; Gibbs, Richard A; Lupski, James R; Lamb, Dolores J

    2015-08-01

    To investigate the genetic cause of nonobstructive azoospermia (NOA) in a consanguineous Turkish family through homozygosity mapping followed by targeted exon/whole-exome sequencing to identify genetic variations. Whole-exome sequencing (WES). Research laboratory. Two siblings in a consanguineous family with NOA. Validating all variants passing filter criteria with Sanger sequencing to confirm familial segregation and absence in the control population. Discovery of a mutation that could potentially cause NOA. A novel nonsynonymous mutation in the neuronal PAS-2 domain (NPAS2) was identified in a consanguineous family from Turkey. This mutation in exon 14 (chr2: 101592000 C>G) of NPAS2 is likely a disease-causing mutation as it is predicted to be damaging, it is a novel variant, and it segregates with the disease. Family segregation of the variants showed the presence of the homozygous mutation in the three brothers with NOA and a heterozygous mutation in the mother as well as one brother and one sister who were both fertile. The mutation is not found in the single-nucleotide polymorphism database, the 1000 Genomes Project, the Baylor College of Medicine cohort of 500 Turkish patients (not a population-specific polymorphism), or the matching 50 fertile controls. With the use of WES we identified a novel homozygous mutation in NPAS2 as a likely disease-causing variant in a Turkish family diagnosed with NOA. Our data reinforce the clinical role of WES in the molecular diagnosis of highly heterogeneous genetic diseases for which conventional genetic approaches have previously failed to find a molecular diagnosis. Copyright © 2015 American Society for Reproductive Medicine. Published by Elsevier Inc. All rights reserved.

  14. Exome Sequencing Reveals Primary Immunodeficiencies in Children with Community-Acquired Pseudomonas aeruginosa Sepsis.

    PubMed

    Asgari, Samira; McLaren, Paul J; Peake, Jane; Wong, Melanie; Wong, Richard; Bartha, Istvan; Francis, Joshua R; Abarca, Katia; Gelderman, Kyra A; Agyeman, Philipp; Aebi, Christoph; Berger, Christoph; Fellay, Jacques; Schlapbach, Luregn J

    2016-01-01

    One out of three pediatric sepsis deaths in high income countries occur in previously healthy children. Primary immunodeficiencies (PIDs) have been postulated to underlie fulminant sepsis, but this concept remains to be confirmed in clinical practice. Pseudomonas aeruginosa ( P. aeruginosa ) is a common bacterium mostly associated with health care-related infections in immunocompromised individuals. However, in rare cases, it can cause sepsis in previously healthy children. We used exome sequencing and bioinformatic analysis to systematically search for genetic factors underpinning severe P. aeruginosa infection in the pediatric population. We collected blood samples from 11 previously healthy children, with no family history of immunodeficiency, who presented with severe sepsis due to community-acquired P. aeruginosa bacteremia. Genomic DNA was extracted from blood or tissue samples obtained intravitam or postmortem. We obtained high-coverage exome sequencing data and searched for rare loss-of-function variants. After rigorous filtrations, 12 potentially causal variants were identified. Two out of eight (25%) fatal cases were found to carry novel pathogenic variants in PID genes, including BTK and DNMT3B . This study demonstrates that exome sequencing allows to identify rare, deleterious human genetic variants responsible for fulminant sepsis in apparently healthy children. Diagnosing PIDs in such patients is of high relevance to survivors and affected families. We propose that unusually severe and fatal sepsis cases in previously healthy children should be considered for exome/genome sequencing to search for underlying PIDs.

  15. Exome Sequencing Reveals Primary Immunodeficiencies in Children with Community-Acquired Pseudomonas aeruginosa Sepsis

    PubMed Central

    Asgari, Samira; McLaren, Paul J.; Peake, Jane; Wong, Melanie; Wong, Richard; Bartha, Istvan; Francis, Joshua R.; Abarca, Katia; Gelderman, Kyra A.; Agyeman, Philipp; Aebi, Christoph; Berger, Christoph; Fellay, Jacques; Schlapbach, Luregn J.; Posfay-Barbe, Klara

    2016-01-01

    One out of three pediatric sepsis deaths in high income countries occur in previously healthy children. Primary immunodeficiencies (PIDs) have been postulated to underlie fulminant sepsis, but this concept remains to be confirmed in clinical practice. Pseudomonas aeruginosa (P. aeruginosa) is a common bacterium mostly associated with health care-related infections in immunocompromised individuals. However, in rare cases, it can cause sepsis in previously healthy children. We used exome sequencing and bioinformatic analysis to systematically search for genetic factors underpinning severe P. aeruginosa infection in the pediatric population. We collected blood samples from 11 previously healthy children, with no family history of immunodeficiency, who presented with severe sepsis due to community-acquired P. aeruginosa bacteremia. Genomic DNA was extracted from blood or tissue samples obtained intravitam or postmortem. We obtained high-coverage exome sequencing data and searched for rare loss-of-function variants. After rigorous filtrations, 12 potentially causal variants were identified. Two out of eight (25%) fatal cases were found to carry novel pathogenic variants in PID genes, including BTK and DNMT3B. This study demonstrates that exome sequencing allows to identify rare, deleterious human genetic variants responsible for fulminant sepsis in apparently healthy children. Diagnosing PIDs in such patients is of high relevance to survivors and affected families. We propose that unusually severe and fatal sepsis cases in previously healthy children should be considered for exome/genome sequencing to search for underlying PIDs. PMID:27703454

  16. Transcriptome-based differentiation of closely-related Miscanthus lines.

    PubMed

    Chouvarine, Philippe; Cooksey, Amanda M; McCarthy, Fiona M; Ray, David A; Baldwin, Brian S; Burgess, Shane C; Peterson, Daniel G

    2012-01-01

    Distinguishing between individuals is critical to those conducting animal/plant breeding, food safety/quality research, diagnostic and clinical testing, and evolutionary biology studies. Classical genetic identification studies are based on marker polymorphisms, but polymorphism-based techniques are time and labor intensive and often cannot distinguish between closely related individuals. Illumina sequencing technologies provide the detailed sequence data required for rapid and efficient differentiation of related species, lines/cultivars, and individuals in a cost-effective manner. Here we describe the use of Illumina high-throughput exome sequencing, coupled with SNP mapping, as a rapid means of distinguishing between related cultivars of the lignocellulosic bioenergy crop giant miscanthus (Miscanthus × giganteus). We provide the first exome sequence database for Miscanthus species complete with Gene Ontology (GO) functional annotations. A SNP comparative analysis of rhizome-derived cDNA sequences was successfully utilized to distinguish three Miscanthus × giganteus cultivars from each other and from other Miscanthus species. Moreover, the resulting phylogenetic tree generated from SNP frequency data parallels the known breeding history of the plants examined. Some of the giant miscanthus plants exhibit considerable sequence divergence. Here we describe an analysis of Miscanthus in which high-throughput exome sequencing was utilized to differentiate between closely related genotypes despite the current lack of a reference genome sequence. We functionally annotated the exome sequences and provide resources to support Miscanthus systems biology. In addition, we demonstrate the use of the commercial high-performance cloud computing to do computational GO annotation.

  17. Molecular Findings Among Patients Referred for Clinical Whole-Exome Sequencing

    PubMed Central

    Yang, Yaping; Muzny, Donna M.; Xia, Fan; Niu, Zhiyv; Person, Richard; Ding, Yan; Ward, Patricia; Braxton, Alicia; Wang, Min; Buhay, Christian; Veeraraghavan, Narayanan; Hawes, Alicia; Chiang, Theodore; Leduc, Magalie; Beuten, Joke; Zhang, Jing; He, Weimin; Scull, Jennifer; Willis, Alecia; Landsverk, Megan; Craigen, William J.; Bekheirnia, Mir Reza; Stray-Pedersen, Asbjorg; Liu, Pengfei; Wen, Shu; Alcaraz, Wendy; Cui, Hong; Walkiewicz, Magdalena; Reid, Jeffrey; Bainbridge, Matthew; Patel, Ankita; Boerwinkle, Eric; Beaudet, Arthur L.; Lupski, James R.; Plon, Sharon E.; Gibbs, Richard A.; Eng, Christine M.

    2015-01-01

    IMPORTANCE Clinical whole-exome sequencing is increasingly used for diagnostic evaluation of patients with suspected genetic disorders. OBJECTIVE To perform clinical whole-exome sequencing and report (1) the rate of molecular diagnosis among phenotypic groups, (2) the spectrum of genetic alterations contributing to disease, and (3) the prevalence of medically actionable incidental findings such as FBN1 mutations causing Marfan syndrome. DESIGN, SETTING, AND PATIENTS Observational study of 2000 consecutive patients with clinical whole-exome sequencing analyzed between June 2012 and August 2014. Whole-exome sequencing tests were performed at a clinical genetics laboratory in the United States. Results were reported by clinical molecular geneticists certified by the American Board of Medical Genetics and Genomics. Tests were ordered by the patient’s physician. The patients were primarily pediatric (1756 [88%]; mean age, 6 years; 888 females [44%], 1101 males [55%], and 11 fetuses [1% gender unknown]), demonstrating diverse clinical manifestations most often including nervous system dysfunction such as developmental delay. MAIN OUTCOMES AND MEASURES Whole-exome sequencing diagnosis rate overall and by phenotypic category, mode of inheritance, spectrum of genetic events, and reporting of incidental findings. RESULTS A molecular diagnosis was reported for 504 patients (25.2%) with 58% of the diagnostic mutations not previously reported. Molecular diagnosis rates for each phenotypic category were 143/526 (27.2%; 95% CI, 23.5%–31.2%) for the neurological group, 282/1147 (24.6%; 95% CI, 22.1%–27.2%) for the neurological plus other organ systems group, 30/83 (36.1%; 95% CI, 26.1%–47.5%) for the specific neurological group, and 49/244 (20.1%; 95% CI, 15.6%–25.8%) for the nonneurological group. The Mendelian disease patterns of the 527 molecular diagnoses included 280 (53.1%) autosomal dominant, 181 (34.3%) autosomal recessive (including 5 with uniparental disomy), 65 (12.3%) X-linked, and 1 (0.2%) mitochondrial. Of 504 patients with a molecular diagnosis, 23 (4.6%) had blended phenotypes resulting from 2 single gene defects. About 30% of the positive cases harbored mutations in disease genes reported since 2011. There were 95 medically actionable incidental findings in genes unrelated to the phenotype but with immediate implications for management in 92 patients (4.6%), including 59 patients (3%) with mutations in genes recommended for reporting by the American College of Medical Genetics and Genomics. CONCLUSIONS AND RELEVANCE Whole-exome sequencing provided a potential molecular diagnosis for 25% of a large cohort of patients referred for evaluation of suspected genetic conditions, including detection of rare genetic events and new mutations contributing to disease. The yield of whole-exome sequencing may offer advantages over traditional molecular diagnostic approaches in certain patients. PMID:25326635

  18. An Observational Study of Children's Involvement in Informed Consent for Exome Sequencing Research.

    PubMed

    Miller, Victoria A; Werner-Lin, Allison; Walser, Sarah A; Biswas, Sawona; Bernhardt, Barbara A

    2017-02-01

    The goal of this study was to examine children's involvement in consent sessions for exome sequencing research and associations of involvement with provider and parent communication. Participants included 44 children (8-17 years) from five cohorts who were offered participation in an exome sequencing study. The consent sessions were audiotaped, transcribed, and coded. Providers attempted to facilitate the child's involvement in the majority (73%) of sessions, and most (75%) children also verbally participated. Provider facilitation was strongly associated with likelihood of child participation. These findings underscore that strategies such as asking for children's opinions and soliciting their questions show respect for children and may increase the likelihood that they are engaged and involved in decisions about research participation.

  19. Thiamine pyrophosphokinase deficiency causes a Leigh Disease like phenotype in a sibling pair: identification through whole exome sequencing and management strategies.

    PubMed

    Fraser, Jamie L; Vanderver, Adeline; Yang, Sandra; Chang, Taeun; Cramp, Laura; Vezina, Gilbert; Lichter-Konecki, Uta; Cusmano-Ozog, Kristina P; Smpokou, Patroula; Chapman, Kimberly A; Zand, Dina J

    2014-01-01

    We present a sibling pair with Leigh-like disease, progressive hypotonia, regression, and chronic encephalopathy. Whole exome sequencing in the younger sibling demonstrated a homozygous thiamine pyrophosphokinase (TPK) mutation. Initiation of high dose thiamine, niacin, biotin, α-lipoic acid and ketogenic diet in this child demonstrated improvement in neurologic function and re-attainment of previously lost milestones. The diagnosis of TPK deficiency was difficult due to inconsistent biochemical and diagnostic parameters, rapidity of clinical demise and would not have been made in a timely manner without the use of whole exome sequencing. Molecular diagnosis allowed for attempt at dietary modification with cofactor supplementation which resulted in an improved clinical course.

  20. mirVAFC: A Web Server for Prioritizations of Pathogenic Sequence Variants from Exome Sequencing Data via Classifications.

    PubMed

    Li, Zhongshan; Liu, Zhenwei; Jiang, Yi; Chen, Denghui; Ran, Xia; Sun, Zhong Sheng; Wu, Jinyu

    2017-01-01

    Exome sequencing has been widely used to identify the genetic variants underlying human genetic disorders for clinical diagnoses, but the identification of pathogenic sequence variants among the huge amounts of benign ones is complicated and challenging. Here, we describe a new Web server named mirVAFC for pathogenic sequence variants prioritizations from clinical exome sequencing (CES) variant data of single individual or family. The mirVAFC is able to comprehensively annotate sequence variants, filter out most irrelevant variants using custom criteria, classify variants into different categories as for estimated pathogenicity, and lastly provide pathogenic variants prioritizations based on classifications and mutation effects. Case studies using different types of datasets for different diseases from publication and our in-house data have revealed that mirVAFC can efficiently identify the right pathogenic candidates as in original work in each case. Overall, the Web server mirVAFC is specifically developed for pathogenic sequence variant identifications from family-based CES variants using classification-based prioritizations. The mirVAFC Web server is freely accessible at https://www.wzgenomics.cn/mirVAFC/. © 2016 WILEY PERIODICALS, INC.

  1. Actionable exomic incidental findings in 6503 participants: challenges of variant classification

    PubMed Central

    Amendola, Laura M.; Dorschner, Michael O.; Robertson, Peggy D.; Salama, Joseph S.; Hart, Ragan; Shirts, Brian H.; Murray, Mitzi L.; Tokita, Mari J.; Gallego, Carlos J.; Kim, Daniel Seung; Bennett, James T.; Crosslin, David R.; Ranchalis, Jane; Jones, Kelly L.; Rosenthal, Elisabeth A.; Jarvik, Ella R.; Itsara, Andy; Turner, Emily H.; Herman, Daniel S.; Schleit, Jennifer; Burt, Amber; Jamal, Seema M.; Abrudan, Jenica L.; Johnson, Andrew D.; Conlin, Laura K.; Dulik, Matthew C.; Santani, Avni; Metterville, Danielle R.; Kelly, Melissa; Foreman, Ann Katherine M.; Lee, Kristy; Taylor, Kent D.; Guo, Xiuqing; Crooks, Kristy; Kiedrowski, Lesli A.; Raffel, Leslie J.; Gordon, Ora; Machini, Kalotina; Desnick, Robert J.; Biesecker, Leslie G.; Lubitz, Steven A.; Mulchandani, Surabhi; Cooper, Greg M.; Joffe, Steven; Richards, C. Sue; Yang, Yaoping; Rotter, Jerome I.; Rich, Stephen S.; O’Donnell, Christopher J.; Berg, Jonathan S.; Spinner, Nancy B.; Evans, James P.; Fullerton, Stephanie M.; Leppig, Kathleen A.; Bennett, Robin L.; Bird, Thomas; Sybert, Virginia P.; Grady, William M.; Tabor, Holly K.; Kim, Jerry H.; Bamshad, Michael J.; Wilfond, Benjamin; Motulsky, Arno G.; Scott, C. Ronald; Pritchard, Colin C.; Walsh, Tom D.; Burke, Wylie; Raskind, Wendy H.; Byers, Peter; Hisama, Fuki M.; Rehm, Heidi; Nickerson, Debbie A.; Jarvik, Gail P.

    2015-01-01

    Recommendations for laboratories to report incidental findings from genomic tests have stimulated interest in such results. In order to investigate the criteria and processes for assigning the pathogenicity of specific variants and to estimate the frequency of such incidental findings in patients of European and African ancestry, we classified potentially actionable pathogenic single-nucleotide variants (SNVs) in all 4300 European- and 2203 African-ancestry participants sequenced by the NHLBI Exome Sequencing Project (ESP). We considered 112 gene-disease pairs selected by an expert panel as associated with medically actionable genetic disorders that may be undiagnosed in adults. The resulting classifications were compared to classifications from other clinical and research genetic testing laboratories, as well as with in silico pathogenicity scores. Among European-ancestry participants, 30 of 4300 (0.7%) had a pathogenic SNV and six (0.1%) had a disruptive variant that was expected to be pathogenic, whereas 52 (1.2%) had likely pathogenic SNVs. For African-ancestry participants, six of 2203 (0.3%) had a pathogenic SNV and six (0.3%) had an expected pathogenic disruptive variant, whereas 13 (0.6%) had likely pathogenic SNVs. Genomic Evolutionary Rate Profiling mammalian conservation score and the Combined Annotation Dependent Depletion summary score of conservation, substitution, regulation, and other evidence were compared across pathogenicity assignments and appear to have utility in variant classification. This work provides a refined estimate of the burden of adult onset, medically actionable incidental findings expected from exome sequencing, highlights challenges in variant classification, and demonstrates the need for a better curated variant interpretation knowledge base. PMID:25637381

  2. A global reference for human genetic variation

    PubMed Central

    2016-01-01

    The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. Here we report completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping. We characterized a broad spectrum of genetic variation, in total over 88 million variants (84.7 million single nucleotide polymorphisms (SNPs), 3.6 million short insertions/deletions (indels), and 60,000 structural variants), all phased onto high-quality haplotypes. This resource includes >99% of SNP variants with a frequency of >1% for a variety of ancestries. We describe the distribution of genetic variation across the global sample, and discuss the implications for common disease studies. PMID:26432245

  3. Exome Sequence Analysis of 14 Families With High Myopia.

    PubMed

    Kloss, Bethany A; Tompson, Stuart W; Whisenhunt, Kristina N; Quow, Krystina L; Huang, Samuel J; Pavelec, Derek M; Rosenberg, Thomas; Young, Terri L

    2017-04-01

    To identify causal gene mutations in 14 families with autosomal dominant (AD) high myopia using exome sequencing. Select individuals from 14 large Caucasian families with high myopia were exome sequenced. Gene variants were filtered to identify potential pathogenic changes. Sanger sequencing was used to confirm variants in original DNA, and to test for disease cosegregation in additional family members. Candidate genes and chromosomal loci previously associated with myopic refractive error and its endophenotypes were comprehensively screened. In 14 high myopia families, we identified 73 rare and 31 novel gene variants as candidates for pathogenicity. In seven of these families, two of the novel and eight of the rare variants were within known myopia loci. A total of 104 heterozygous nonsynonymous rare variants in 104 genes were identified in 10 out of 14 probands. Each variant cosegregated with affection status. No rare variants were identified in genes known to cause myopia or in genes closest to published genome-wide association study association signals for refractive error or its endophenotypes. Whole exome sequencing was performed to determine gene variants implicated in the pathogenesis of AD high myopia. This study provides new genes for consideration in the pathogenesis of high myopia, and may aid in the development of genetic profiling of those at greatest risk for attendant ocular morbidities of this disorder.

  4. Allele-specific copy-number discovery from whole-genome and whole-exome sequencing.

    PubMed

    Wang, WeiBo; Wang, Wei; Sun, Wei; Crowley, James J; Szatkiewicz, Jin P

    2015-08-18

    Copy-number variants (CNVs) are a major form of genetic variation and a risk factor for various human diseases, so it is crucial to accurately detect and characterize them. It is conceivable that allele-specific reads from high-throughput sequencing data could be leveraged to both enhance CNV detection and produce allele-specific copy number (ASCN) calls. Although statistical methods have been developed to detect CNVs using whole-genome sequence (WGS) and/or whole-exome sequence (WES) data, information from allele-specific read counts has not yet been adequately exploited. In this paper, we develop an integrated method, called AS-GENSENG, which incorporates allele-specific read counts in CNV detection and estimates ASCN using either WGS or WES data. To evaluate the performance of AS-GENSENG, we conducted extensive simulations, generated empirical data using existing WGS and WES data sets and validated predicted CNVs using an independent methodology. We conclude that AS-GENSENG not only predicts accurate ASCN calls but also improves the accuracy of total copy number calls, owing to its unique ability to exploit information from both total and allele-specific read counts while accounting for various experimental biases in sequence data. Our novel, user-friendly and computationally efficient method and a complete analytic protocol is freely available at https://sourceforge.net/projects/asgenseng/. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  5. Sequenza: allele-specific copy number and mutation profiles from tumor sequencing data.

    PubMed

    Favero, F; Joshi, T; Marquard, A M; Birkbak, N J; Krzystanek, M; Li, Q; Szallasi, Z; Eklund, A C

    2015-01-01

    Exome or whole-genome deep sequencing of tumor DNA along with paired normal DNA can potentially provide a detailed picture of the somatic mutations that characterize the tumor. However, analysis of such sequence data can be complicated by the presence of normal cells in the tumor specimen, by intratumor heterogeneity, and by the sheer size of the raw data. In particular, determination of copy number variations from exome sequencing data alone has proven difficult; thus, single nucleotide polymorphism (SNP) arrays have often been used for this task. Recently, algorithms to estimate absolute, but not allele-specific, copy number profiles from tumor sequencing data have been described. We developed Sequenza, a software package that uses paired tumor-normal DNA sequencing data to estimate tumor cellularity and ploidy, and to calculate allele-specific copy number profiles and mutation profiles. We applied Sequenza, as well as two previously published algorithms, to exome sequence data from 30 tumors from The Cancer Genome Atlas. We assessed the performance of these algorithms by comparing their results with those generated using matched SNP arrays and processed by the allele-specific copy number analysis of tumors (ASCAT) algorithm. Comparison between Sequenza/exome and SNP/ASCAT revealed strong correlation in cellularity (Pearson's r = 0.90) and ploidy estimates (r = 0.42, or r = 0.94 after manual inspecting alternative solutions). This performance was noticeably superior to previously published algorithms. In addition, in artificial data simulating normal-tumor admixtures, Sequenza detected the correct ploidy in samples with tumor content as low as 30%. The agreement between Sequenza and SNP array-based copy number profiles suggests that exome sequencing alone is sufficient not only for identifying small scale mutations but also for estimating cellularity and inferring DNA copy number aberrations. © The Author 2014. Published by Oxford University Press on behalf of the European Society for Medical Oncology.

  6. X-Linked Glomerulopathy Due to COL4A5 Founder Variant.

    PubMed

    Barua, Moumita; John, Rohan; Stella, Lorenzo; Li, Weili; Roslin, Nicole M; Sharif, Bedra; Hack, Saidah; Lajoie-Starkell, Ginette; Schwaderer, Andrew L; Becknell, Brian; Wuttke, Matthias; Köttgen, Anna; Cattran, Daniel; Paterson, Andrew D; Pei, York

    2018-03-01

    Alport syndrome is a rare hereditary disorder caused by rare variants in 1 of 3 genes encoding for type IV collagen. Rare variants in COL4A5 on chromosome Xq22 cause X-linked Alport syndrome, which accounts for ∼80% of the cases. Alport syndrome has a variable clinical presentation, including progressive kidney failure, hearing loss, and ocular defects. Exome sequencing performed in 2 affected related males with an undefined X-linked glomerulopathy characterized by global and segmental glomerulosclerosis, mesangial hypercellularity, and vague basement membrane immune complex deposition revealed a COL4A5 sequence variant, a substitution of a thymine by a guanine at nucleotide 665 (c.T665G; rs281874761) of the coding DNA predicted to lead to a cysteine to phenylalanine substitution at amino acid 222, which was not seen in databases cataloguing natural human genetic variation, including dbSNP138, 1000 Genomes Project release version 01-11-2004, Exome Sequencing Project 21-06-2014, or ExAC 01-11-2014. Review of the literature identified 2 additional families with the same COL4A5 variant leading to similar atypical histopathologic features, suggesting a unique pathologic mechanism initiated by this specific rare variant. Homology modeling suggests that the substitution alters the structural and dynamic properties of the type IV collagen trimer. Genetic analysis comparing members of the 3 families indicated a distant relationship with a shared haplotype, implying a founder effect. Crown Copyright © 2017. Published by Elsevier Inc. All rights reserved.

  7. A survey of tools for variant analysis of next-generation genome sequencing data

    PubMed Central

    Pabinger, Stephan; Dander, Andreas; Fischer, Maria; Snajder, Rene; Sperk, Michael; Efremova, Mirjana; Krabichler, Birgit; Speicher, Michael R.; Zschocke, Johannes

    2014-01-01

    Recent advances in genome sequencing technologies provide unprecedented opportunities to characterize individual genomic landscapes and identify mutations relevant for diagnosis and therapy. Specifically, whole-exome sequencing using next-generation sequencing (NGS) technologies is gaining popularity in the human genetics community due to the moderate costs, manageable data amounts and straightforward interpretation of analysis results. While whole-exome and, in the near future, whole-genome sequencing are becoming commodities, data analysis still poses significant challenges and led to the development of a plethora of tools supporting specific parts of the analysis workflow or providing a complete solution. Here, we surveyed 205 tools for whole-genome/whole-exome sequencing data analysis supporting five distinct analytical steps: quality assessment, alignment, variant identification, variant annotation and visualization. We report an overview of the functionality, features and specific requirements of the individual tools. We then selected 32 programs for variant identification, variant annotation and visualization, which were subjected to hands-on evaluation using four data sets: one set of exome data from two patients with a rare disease for testing identification of germline mutations, two cancer data sets for testing variant callers for somatic mutations, copy number variations and structural variations, and one semi-synthetic data set for testing identification of copy number variations. Our comprehensive survey and evaluation of NGS tools provides a valuable guideline for human geneticists working on Mendelian disorders, complex diseases and cancers. PMID:23341494

  8. Comprehensive comparison of three commercial human whole-exome capture platforms.

    PubMed

    Asan; Xu, Yu; Jiang, Hui; Tyler-Smith, Chris; Xue, Yali; Jiang, Tao; Wang, Jiawei; Wu, Mingzhi; Liu, Xiao; Tian, Geng; Wang, Jun; Wang, Jian; Yang, Huangming; Zhang, Xiuqing

    2011-09-28

    Exome sequencing, which allows the global analysis of protein coding sequences in the human genome, has become an effective and affordable approach to detecting causative genetic mutations in diseases. Currently, there are several commercial human exome capture platforms; however, the relative performances of these have not been characterized sufficiently to know which is best for a particular study. We comprehensively compared three platforms: NimbleGen's Sequence Capture Array and SeqCap EZ, and Agilent's SureSelect. We assessed their performance in a variety of ways, including number of genes covered and capture efficacy. Differences that may impact on the choice of platform were that Agilent SureSelect covered approximately 1,100 more genes, while NimbleGen provided better flanking sequence capture. Although all three platforms achieved similar capture specificity of targeted regions, the NimbleGen platforms showed better uniformity of coverage and greater genotype sensitivity at 30- to 100-fold sequencing depth. All three platforms showed similar power in exome SNP calling, including medically relevant SNPs. Compared with genotyping and whole-genome sequencing data, the three platforms achieved a similar accuracy of genotype assignment and SNP detection. Importantly, all three platforms showed similar levels of reproducibility, GC bias and reference allele bias. We demonstrate key differences between the three platforms, particularly advantages of solutions over array capture and the importance of a large gene target set.

  9. Comparison and evaluation of two exome capture kits and sequencing platforms for variant calling.

    PubMed

    Zhang, Guoqiang; Wang, Jianfeng; Yang, Jin; Li, Wenjie; Deng, Yutian; Li, Jing; Huang, Jun; Hu, Songnian; Zhang, Bing

    2015-08-05

    To promote the clinical application of next-generation sequencing, it is important to obtain accurate and consistent variants of target genomic regions at low cost. Ion Proton, the latest updated semiconductor-based sequencing instrument from Life Technologies, is designed to provide investigators with an inexpensive platform for human whole exome sequencing that achieves a rapid turnaround time. However, few studies have comprehensively compared and evaluated the accuracy of variant calling between Ion Proton and Illumina sequencing platforms such as HiSeq 2000, which is the most popular sequencing platform for the human genome. The Ion Proton sequencer combined with the Ion TargetSeq Exome Enrichment Kit together make up TargetSeq-Proton, whereas SureSelect-Hiseq is based on the Agilent SureSelect Human All Exon v4 Kit and the HiSeq 2000 sequencer. Here, we sequenced exonic DNA from four human blood samples using both TargetSeq-Proton and SureSelect-HiSeq. We then called variants in the exonic regions that overlapped between the two exome capture kits (33.6 Mb). The rates of shared variant loci called by two sequencing platforms were from 68.0 to 75.3% in four samples, whereas the concordance of co-detected variant loci reached 99%. Sanger sequencing validation revealed that the validated rate of concordant single nucleotide polymorphisms (SNPs) (91.5%) was higher than the SNPs specific to TargetSeq-Proton (60.0%) or specific to SureSelect-HiSeq (88.3%). With regard to 1-bp small insertions and deletions (InDels), the Sanger sequencing validated rates of concordant variants (100.0%) and SureSelect-HiSeq-specific (89.6%) were higher than those of TargetSeq-Proton-specific (15.8%). In the sequencing of exonic regions, a combination of using of two sequencing strategies (SureSelect-HiSeq and TargetSeq-Proton) increased the variant calling specificity for concordant variant loci and the sensitivity for variant loci called by any one platform. However, for the sequencing of platform-specific variants, the accuracy of variant calling by HiSeq 2000 was higher than that of Ion Proton, specifically for the InDel detection. Moreover, the variant calling software also influences the detection of SNPs and, specifically, InDels in Ion Proton exome sequencing.

  10. The Genomic Evolution of Prostate Cancer

    DTIC Science & Technology

    2017-06-01

    management and grant writing skills. 15. SUBJECT TERMS Cancer genetics , tumor evolution, tumor heterogeneity, prostate cancer, exome sequencing 16...aggressive disease, it is unclear if the genetic alterations more common in late disease are present early on, but at low frequency, or if they only...from localized to metastatic prostate cancer. 2. KEYWORDS: Cancer genetics , tumor evolution, tumor heterogeneity, prostate cancer, exome sequencing

  11. Exome Sequencing Identifies Potential Risk Variants for Mendelian Disorders at High Prevalence in Qatar

    PubMed Central

    Rodriguez-Flores, Juan L.; Fakhro, Khalid; Hackett, Neil R.; Salit, Jacqueline; Fuller, Jennifer; Agosto-Perez, Francisco; Gharbiah, Maey; Malek, Joel A.; Zirie, Mahmoud; Jayyousi, Amin; Badii, Ramin; Al-Marri, Ajayeb Al-Nabet; Chouchane, Lotfi; Stadler, Dora J.; Hunter-Zinck, Haley; Mezey, Jason G.; Crystal, Ronald G.

    2013-01-01

    Exome sequencing of families of related individuals has been highly successful in identifying genetic polymorphisms responsible for Mendelian disorders. Here, we demonstrate the value of the reverse approach, where we use exome sequencing of a sample of unrelated individuals to analyze allele frequencies of known causal mutations for Mendelian diseases. We sequenced the exomes of 100 individuals representing the three major genetic subgroups of the Qatari population (Q1 Bedouin, Q2 Persian-South Asian, Q3 African) and identified 37 variants in 33 genes with effects on 36 clinically significant Mendelian diseases. These include variants not present in 1000 Genomes and variants at high frequency when compared to 1000 Genomes populations. Several of these Mendelian variants were only segregating in one Qatari subpopulation, where the observed subpopulation specificity trends were confirmed in an independent population of 386 Qataris. Pre-marital genetic screening in Qatar tests for only 4 out of the 37, such that this study provides a set of Mendelian disease variants with potential impact on the epidemiological profile of the population that could be incorporated into the testing program if further experimental and clinical characterization confirms high penetrance. PMID:24123366

  12. Novel DDR2 mutation identified by whole exome sequencing in a Moroccan patient with spondylo-meta-epiphyseal dysplasia, short limb-abnormal calcification type.

    PubMed

    Mansouri, Maria; Kayserili, Hülya; Elalaoui, Siham Chafai; Nishimura, Gen; Iida, Aritoshi; Lyahyai, Jaber; Miyake, Noriko; Matsumoto, Naomichi; Sefiani, Abdelaziz; Ikegawa, Shiro

    2016-02-01

    Spondylo-meta-epiphyseal dysplasia (SMED), short limb-abnormal calcification type (SMED, SL-AC), is a very rare autosomal recessive disorder with various skeletal changes characterized by premature calcification leading to severe disproportionate short stature. Twenty-two patients have been reported until now, but only five mutations (four missense and one splice-site) in the conserved sequence encoding the tyrosine kinase domain of the DDR2 gene has been identified. We report here a novel DDR2 missense mutation, c.370C > T (p.Arg124Trp) in a Moroccan girl with SMED, SL-AC, identified by whole exome sequencing. Our study has expanded the mutational spectrum of this rare disease and it has shown that exome sequencing is a powerful and cost-effective tool for the diagnosis of clinically heterogeneous disorders such as SMED. © 2015 Wiley Periodicals, Inc.

  13. Rare Compound Heterozygous Frameshift Mutations in ALMS1 Gene Identified Through Exome Sequencing in a Taiwanese Patient With Alström Syndrome.

    PubMed

    Tsai, Meng-Che; Yu, Hui-Wen; Liu, Tsunglin; Chou, Yen-Yin; Chiou, Yuan-Yow; Chen, Peng-Chieh

    2018-01-01

    Alström syndrome (AS) is a rare autosomal recessive disorder that shares clinical features with other ciliopathy-related diseases. Genetic mutation analysis is often required in making differential diagnosis but usually costly in time and effort using conventional Sanger sequencing. Herein we describe a Taiwanese patient presenting cone-rod dystrophy and early-onset obesity that progressed to diabetes mellitus with marked insulin resistance during adolescence. Whole exome sequencing of the patient's genomic DNA identified a novel frameshift mutation in exons 15 (c.10290_10291delTA, p.Lys3431Serfs * 10) and a rare mutation in 16 (c.10823_10824delAG, p.Arg3609Alafs * 6) of ALMS1 gene. The compound heterozygous mutations were predicted to render truncated proteins. This report highlighted the clinical utility of exome sequencing and extended the knowledge of mutation spectrum in AS patients.

  14. Genome-Wide Linkage, Exome Sequencing and Functional Analyses Identify ABCB6 as the Pathogenic Gene of Dyschromatosis Universalis Hereditaria

    PubMed Central

    Wang, Na; Wang, Chuan; Chen, Xuechao; Sheng, Donglai; Fu, Xi’an; See, Kelvin; Foo, Jia Nee; Low, Huiqi; Liany, Herty; Irwan, Ishak Darryl; Liu, Jian; Yang, Baoqi; Chen, Mingfei; Yu, Yongxiang; Yu, Gongqi; Niu, Guiye; You, Jiabao; Zhou, Yan; Ma, Shanshan; Wang, Ting; Yan, Xiaoxiao; Goh, Boon Kee; Common, John E. A.; Lane, Birgitte E.; Sun, Yonghu; Zhou, Guizhi; Lu, Xianmei; Wang, Zhenhua; Tian, Hongqing; Cao, Yuanhua; Chen, Shumin; Liu, Qiji; Liu, Jianjun; Zhang, Furen

    2014-01-01

    Background As a genetic disorder of abnormal pigmentation, the molecular basis of dyschromatosis universalis hereditaria (DUH) had remained unclear until recently when ABCB6 was reported as a causative gene of DUH. Methodology We performed genome-wide linkage scan using Illumina Human 660W-Quad BeadChip and exome sequencing analyses using Agilent SureSelect Human All Exon Kits in a multiplex Chinese DUH family to identify the pathogenic mutations and verified the candidate mutations using Sanger sequencing. Quantitative RT-PCR and Immunohistochemistry was performed to verify the expression of the pathogenic gene, Zebrafish was also used to confirm the functional role of ABCB6 in melanocytes and pigmentation. Results Genome-wide linkage (assuming autosomal dominant inheritance mode) and exome sequencing analyses identified ABCB6 as the disease candidate gene by discovering a coding mutation (c.1358C>T; p.Ala453Val) that co-segregates with the disease phenotype. Further mutation analysis of ABCB6 in four other DUH families and two sporadic cases by Sanger sequencing confirmed the mutation (c.1358C>T; p.Ala453Val) and discovered a second, co-segregating coding mutation (c.964A>C; p.Ser322Lys) in one of the four families. Both mutations were heterozygous in DUH patients and not present in the 1000 Genome Project and dbSNP database as well as 1,516 unrelated Chinese healthy controls. Expression analysis in human skin and mutagenesis interrogation in zebrafish confirmed the functional role of ABCB6 in melanocytes and pigmentation. Given the involvement of ABCB6 mutations in coloboma, we performed ophthalmological examination of the DUH carriers of ABCB6 mutations and found ocular abnormalities in them. Conclusion Our study has advanced our understanding of DUH pathogenesis and revealed the shared pathological mechanism between pigmentary DUH and ocular coloboma. PMID:24498303

  15. Detection of clinically relevant copy-number variants by exome sequencing in a large cohort of genetic disorders

    PubMed Central

    Pfundt, Rolph; del Rosario, Marisol; Vissers, Lisenka E.L.M.; Kwint, Michael P.; Janssen, Irene M.; de Leeuw, Nicole; Yntema, Helger G.; Nelen, Marcel R.; Lugtenberg, Dorien; Kamsteeg, Erik-Jan; Wieskamp, Nienke; Stegmann, Alexander P.A.; Stevens, Servi J.C.; Rodenburg, Richard J.T.; Simons, Annet; Mensenkamp, Arjen R.; Rinne, Tuula; Gilissen, Christian; Scheffer, Hans; Veltman, Joris A.; Hehir-Kwa, Jayne Y.

    2017-01-01

    Purpose: Copy-number variation is a common source of genomic variation and an important genetic cause of disease. Microarray-based analysis of copy-number variants (CNVs) has become a first-tier diagnostic test for patients with neurodevelopmental disorders, with a diagnostic yield of 10–20%. However, for most other genetic disorders, the role of CNVs is less clear and most diagnostic genetic studies are generally limited to the study of single-nucleotide variants (SNVs) and other small variants. With the introduction of exome and genome sequencing, it is now possible to detect both SNVs and CNVs using an exome- or genome-wide approach with a single test. Methods: We performed exome-based read-depth CNV screening on data from 2,603 patients affected by a range of genetic disorders for which exome sequencing was performed in a diagnostic setting. Results: In total, 123 clinically relevant CNVs ranging in size from 727 bp to 15.3 Mb were detected, which resulted in 51 conclusive diagnoses and an overall increase in diagnostic yield of ~2% (ranging from 0 to –5.8% per disorder). Conclusions: This study shows that CNVs play an important role in a broad range of genetic disorders and that detection via exome-based CNV profiling results in an increase in the diagnostic yield without additional testing, bringing us closer to single-test genomics. Genet Med advance online publication 27 October 2016 PMID:28574513

  16. Whole exome sequencing: a state-of-the-art approach for defining (and exploring!) genetic landscapes in pediatric nephrology.

    PubMed

    Gulati, Ashima; Somlo, Stefan

    2018-05-01

    The genesis of whole exome sequencing as a powerful tool for detailing the protein coding sequence of the human genome was conceptualized based on the availability of next-generation sequencing technology and knowledge of the human reference genome. The field of pediatric nephrology enriched with molecularly unsolved phenotypes is allowing the clinical and research application of whole exome sequencing to enable novel gene discovery and provide amendment of phenotypic misclassification. Recent studies in the field have informed us that newer high-throughput sequencing techniques are likely to be of high yield when applied in conjunction with conventional genomic approaches such as linkage analysis and other strategies used to focus subsequent analysis. They have also emphasized the need for the validation of novel genetic findings in large collaborative cohorts and the production of robust corroborative biological data. The well-structured application of comprehensive genomic testing in clinical and research arenas will hopefully continue to advance patient care and precision medicine, but does call for attention to be paid to its integrated challenges.

  17. Exome sequencing-driven discovery of coding polymorphisms associated with common metabolic phenotypes.

    PubMed

    Albrechtsen, A; Grarup, N; Li, Y; Sparsø, T; Tian, G; Cao, H; Jiang, T; Kim, S Y; Korneliussen, T; Li, Q; Nie, C; Wu, R; Skotte, L; Morris, A P; Ladenvall, C; Cauchi, S; Stančáková, A; Andersen, G; Astrup, A; Banasik, K; Bennett, A J; Bolund, L; Charpentier, G; Chen, Y; Dekker, J M; Doney, A S F; Dorkhan, M; Forsen, T; Frayling, T M; Groves, C J; Gui, Y; Hallmans, G; Hattersley, A T; He, K; Hitman, G A; Holmkvist, J; Huang, S; Jiang, H; Jin, X; Justesen, J M; Kristiansen, K; Kuusisto, J; Lajer, M; Lantieri, O; Li, W; Liang, H; Liao, Q; Liu, X; Ma, T; Ma, X; Manijak, M P; Marre, M; Mokrosiński, J; Morris, A D; Mu, B; Nielsen, A A; Nijpels, G; Nilsson, P; Palmer, C N A; Rayner, N W; Renström, F; Ribel-Madsen, R; Robertson, N; Rolandsson, O; Rossing, P; Schwartz, T W; Slagboom, P E; Sterner, M; Tang, M; Tarnow, L; Tuomi, T; van't Riet, E; van Leeuwen, N; Varga, T V; Vestmar, M A; Walker, M; Wang, B; Wang, Y; Wu, H; Xi, F; Yengo, L; Yu, C; Zhang, X; Zhang, J; Zhang, Q; Zhang, W; Zheng, H; Zhou, Y; Altshuler, D; 't Hart, L M; Franks, P W; Balkau, B; Froguel, P; McCarthy, M I; Laakso, M; Groop, L; Christensen, C; Brandslund, I; Lauritzen, T; Witte, D R; Linneberg, A; Jørgensen, T; Hansen, T; Wang, J; Nielsen, R; Pedersen, O

    2013-02-01

    Human complex metabolic traits are in part regulated by genetic determinants. Here we applied exome sequencing to identify novel associations of coding polymorphisms at minor allele frequencies (MAFs) >1% with common metabolic phenotypes. The study comprised three stages. We performed medium-depth (8×) whole exome sequencing in 1,000 cases with type 2 diabetes, BMI >27.5 kg/m(2) and hypertension and in 1,000 controls (stage 1). We selected 16,192 polymorphisms nominally associated (p < 0.05) with case-control status, from four selected annotation categories or from loci reported to associate with metabolic traits. These variants were genotyped in 15,989 Danes to search for association with 12 metabolic phenotypes (stage 2). In stage 3, polymorphisms showing potential associations were genotyped in a further 63,896 Europeans. Exome sequencing identified 70,182 polymorphisms with MAF >1%. In stage 2 we identified 51 potential associations with one or more of eight metabolic phenotypes covered by 45 unique polymorphisms. In meta-analyses of stage 2 and stage 3 results, we demonstrated robust associations for coding polymorphisms in CD300LG (fasting HDL-cholesterol: MAF 3.5%, p = 8.5 × 10(-14)), COBLL1 (type 2 diabetes: MAF 12.5%, OR 0.88, p = 1.2 × 10(-11)) and MACF1 (type 2 diabetes: MAF 23.4%, OR 1.10, p = 8.2 × 10(-10)). We applied exome sequencing as a basis for finding genetic determinants of metabolic traits and show the existence of low-frequency and common coding polymorphisms with impact on common metabolic traits. Based on our study, coding polymorphisms with MAF above 1% do not seem to have particularly high effect sizes on the measured metabolic traits.

  18. Whole Exome Sequencing Reveals a Monogenic Cause of Disease in ≈43% of 35 Families With Midaortic Syndrome.

    PubMed

    Warejko, Jillian K; Schueler, Markus; Vivante, Asaf; Tan, Weizhen; Daga, Ankana; Lawson, Jennifer A; Braun, Daniela A; Shril, Shirlee; Amann, Kassaundra; Somers, Michael J G; Rodig, Nancy M; Baum, Michelle A; Daouk, Ghaleb; Traum, Avram Z; Kim, Heung Bae; Vakili, Khashayar; Porras, Diego; Lock, James; Rivkin, Michael J; Chaudry, Gulraiz; Smoot, Leslie B; Singh, Michael N; Smith, Edward R; Mane, Shrikant M; Lifton, Richard P; Stein, Deborah R; Ferguson, Michael A; Hildebrandt, Friedhelm

    2018-04-01

    Midaortic syndrome (MAS) is a rare cause of severe childhood hypertension characterized by narrowing of the abdominal aorta in children and is associated with extensive vascular disease. It may occur as part of a genetic syndrome, such as neurofibromatosis, or as consequence of a pathological inflammatory disease. However, most cases are considered idiopathic. We hypothesized that in a high percentage of these patients, a monogenic cause of disease may be detected by evaluating whole exome sequencing data for mutations in 1 of 38 candidate genes previously described to cause vasculopathy. We studied a cohort of 36 individuals from 35 different families with MAS by exome sequencing. In 15 of 35 families (42.9%), we detected likely causal dominant mutations. In 15 of 35 (42.9%) families with MAS, whole exome sequencing revealed a mutation in one of the genes previously associated with vascular disease ( NF1 , JAG1 , ELN , GATA6 , and RNF213 ). Ten of the 15 mutations have not previously been reported. This is the first report of ELN , RNF213 , or GATA6 mutations in individuals with MAS. Mutations were detected in NF1 (6/15 families), JAG1 (4/15 families), ELN (3/15 families), and one family each for GATA6 and RNF213 Eight individuals had syndromic disease and 7 individuals had isolated MAS. Whole exome sequencing can provide conclusive molecular genetic diagnosis in a high fraction of individuals with syndromic or isolated MAS. Establishing an etiologic diagnosis may reveal genotype/phenotype correlations for MAS in the future and should, therefore, be performed routinely in MAS. © 2018 American Heart Association, Inc.

  19. Diagnostic application of clinical exome sequencing in Leber congenital amaurosis.

    PubMed

    Han, Jinu; Rim, John Hoon; Hwang, In Sik; Kim, Jieun; Shin, Saeam; Lee, Seung-Tae; Choi, Jong Rak

    2017-01-01

    Leber congenital amaurosis (LCA) is a hereditary retinal dystrophy with wide genetic heterogeneity. Next-generation sequencing (NGS) targeting multiple genes can be a good option for the diagnosis of LCA, and we tested a clinical exome panel in patients with LCA. A total of nine unrelated Korean patients with LCA were sequenced using the Illumina TruSight One panel, which targets 4,813 clinically associated genes, followed by confirmation using Sanger sequencing. Patients' clinical information and familial study results were obtained and used for comprehensive interpretation. In all nine patients, we identified pathogenic variations in LCA-associated genes: NMNAT1 (n=3), GUCY2D (n=2), RPGRIP1 (n=2), CRX (n=1), and CEP290 or SPATA7 . Six patients had one or two mutations in accordance with inheritance patterns, all consistent with clinical phenotypes. Two patients had only one pathogenic mutation in recessive genes ( NMNAT1 and RPGRIP1 ), and the clinical features were specific to disorders associated with those genes. Six patients were solved for genetic causes, and it remains unclear for three patients with the clinical exome panel. With subsequent targeted panel sequencing with 113 genes associated with infantile nystagmus syndrome, a likely pathogenic allele in CEP290 was detected in one patient. Interestingly, one pathogenic variant (p.Arg237Cys) in NMNAT1 was present in three patients, and it had a high allele frequency (0.24%) in the general Korean population, suggesting that NMNAT1 could be a major gene responsible for LCA in Koreans. We confirmed that a commercial clinical exome panel can be effectively used in the diagnosis of LCA. Careful interpretation and clinical correlation could promote the successful implementation of clinical exome panels in routine diagnoses of retinal dystrophies, including LCA.

  20. Exome Sequencing and the Management of Neurometabolic Disorders.

    PubMed

    Tarailo-Graovac, Maja; Shyr, Casper; Ross, Colin J; Horvath, Gabriella A; Salvarinova, Ramona; Ye, Xin C; Zhang, Lin-Hua; Bhavsar, Amit P; Lee, Jessica J Y; Drögemöller, Britt I; Abdelsayed, Mena; Alfadhel, Majid; Armstrong, Linlea; Baumgartner, Matthias R; Burda, Patricie; Connolly, Mary B; Cameron, Jessie; Demos, Michelle; Dewan, Tammie; Dionne, Janis; Evans, A Mark; Friedman, Jan M; Garber, Ian; Lewis, Suzanne; Ling, Jiqiang; Mandal, Rupasri; Mattman, Andre; McKinnon, Margaret; Michoulas, Aspasia; Metzger, Daniel; Ogunbayo, Oluseye A; Rakic, Bojana; Rozmus, Jacob; Ruben, Peter; Sayson, Bryan; Santra, Saikat; Schultz, Kirk R; Selby, Kathryn; Shekel, Paul; Sirrs, Sandra; Skrypnyk, Cristina; Superti-Furga, Andrea; Turvey, Stuart E; Van Allen, Margot I; Wishart, David; Wu, Jiang; Wu, John; Zafeiriou, Dimitrios; Kluijtmans, Leo; Wevers, Ron A; Eydoux, Patrice; Lehman, Anna M; Vallance, Hilary; Stockler-Ipsiroglu, Sylvia; Sinclair, Graham; Wasserman, Wyeth W; van Karnebeek, Clara D

    2016-06-09

    Whole-exome sequencing has transformed gene discovery and diagnosis in rare diseases. Translation into disease-modifying treatments is challenging, particularly for intellectual developmental disorder. However, the exception is inborn errors of metabolism, since many of these disorders are responsive to therapy that targets pathophysiological features at the molecular or cellular level. To uncover the genetic basis of potentially treatable inborn errors of metabolism, we combined deep clinical phenotyping (the comprehensive characterization of the discrete components of a patient's clinical and biochemical phenotype) with whole-exome sequencing analysis through a semiautomated bioinformatics pipeline in consecutively enrolled patients with intellectual developmental disorder and unexplained metabolic phenotypes. We performed whole-exome sequencing on samples obtained from 47 probands. Of these patients, 6 were excluded, including 1 who withdrew from the study. The remaining 41 probands had been born to predominantly nonconsanguineous parents of European descent. In 37 probands, we identified variants in 2 genes newly implicated in disease, 9 candidate genes, 22 known genes with newly identified phenotypes, and 9 genes with expected phenotypes; in most of the genes, the variants were classified as either pathogenic or probably pathogenic. Complex phenotypes of patients in five families were explained by coexisting monogenic conditions. We obtained a diagnosis in 28 of 41 probands (68%) who were evaluated. A test of a targeted intervention was performed in 18 patients (44%). Deep phenotyping and whole-exome sequencing in 41 probands with intellectual developmental disorder and unexplained metabolic abnormalities led to a diagnosis in 68%, the identification of 11 candidate genes newly implicated in neurometabolic disease, and a change in treatment beyond genetic counseling in 44%. (Funded by BC Children's Hospital Foundation and others.).

  1. Exome Sequencing and the Management of Neurometabolic Disorders

    PubMed Central

    Tarailo-Graovac, M.; Shyr, C.; Ross, C.J.; Horvath, G.A.; Salvarinova, R.; Ye, X.C.; Zhang, L.-H.; Bhavsar, A.P.; Lee, J.J.Y.; Drögemöller, B.I.; Abdelsayed, M.; Alfadhel, M.; Armstrong, L.; Baumgartner, M.R.; Burda, P.; Connolly, M.B.; Cameron, J.; Demos, M.; Dewan, T.; Dionne, J.; Evans, A.M.; Friedman, J.M.; Garber, I.; Lewis, S.; Ling, J.; Mandal, R.; Mattman, A.; McKinnon, M.; Michoulas, A.; Metzger, D.; Ogunbayo, O.A.; Rakic, B.; Rozmus, J.; Ruben, P.; Sayson, B.; Santra, S.; Schultz, K.R.; Selby, K.; Shekel, P.; Sirrs, S.; Skrypnyk, C.; Superti-Furga, A.; Turvey, S.E.; Van Allen, M.I.; Wishart, D.; Wu, J.; Wu, J.; Zafeiriou, D.; Kluijtmans, L.; Wevers, R.A.; Eydoux, P.; Lehman, A.M.; Vallance, H.; Stockler-Ipsiroglu, S.; Sinclair, G.; Wasserman, W.W.; van Karnebeek, C.D.

    2016-01-01

    BACKGROUND Whole-exome sequencing has transformed gene discovery and diagnosis in rare diseases. Translation into disease-modifying treatments is challenging, particularly for intellectual developmental disorder. However, the exception is inborn errors of metabolism, since many of these disorders are responsive to therapy that targets pathophysiological features at the molecular or cellular level. METHODS To uncover the genetic basis of potentially treatable inborn errors of metabolism, we combined deep clinical phenotyping (the comprehensive characterization of the discrete components of a patient’s clinical and biochemical phenotype) with whole-exome sequencing analysis through a semiautomated bioinformatics pipeline in consecutively enrolled patients with intellectual developmental disorder and unexplained metabolic phenotypes. RESULTS We performed whole-exome sequencing on samples obtained from 47 probands. Of these patients, 6 were excluded, including 1 who withdrew from the study. The remaining 41 probands had been born to predominantly nonconsanguineous parents of European descent. In 37 probands, we identified variants in 2 genes newly implicated in disease, 9 candidate genes, 22 known genes with newly identified phenotypes, and 9 genes with expected phenotypes; in most of the genes, the variants were classified as either pathogenic or probably pathogenic. Complex phenotypes of patients in five families were explained by coexisting monogenic conditions. We obtained a diagnosis in 28 of 41 probands (68%) who were evaluated. A test of a targeted intervention was performed in 18 patients (44%). CONCLUSIONS Deep phenotyping and whole-exome sequencing in 41 probands with intellectual developmental disorder and unexplained metabolic abnormalities led to a diagnosis in 68%, the identification of 11 candidate genes newly implicated in neurometabolic disease, and a change in treatment beyond genetic counseling in 44%. (Funded by BC Children’s Hospital Foundation and others.) PMID:27276562

  2. Exome sequencing reveals a de novo POLD1 mutation causing phenotypic variability in mandibular hypoplasia, deafness, progeroid features, and lipodystrophy syndrome (MDPL).

    PubMed

    Elouej, Sahar; Beleza-Meireles, Ana; Caswell, Richard; Colclough, Kevin; Ellard, Sian; Desvignes, Jean Pierre; Béroud, Christophe; Lévy, Nicolas; Mohammed, Shehla; De Sandre-Giovannoli, Annachiara

    2017-06-01

    Mandibular hypoplasia, deafness, progeroid features, and lipodystrophy syndrome (MDPL) is an autosomal dominant systemic disorder characterized by prominent loss of subcutaneous fat, a characteristic facial appearance and metabolic abnormalities. This syndrome is caused by heterozygous de novo mutations in the POLD1 gene. To date, 19 patients with MDPL have been reported in the literature and among them 14 patients have been characterized at the molecular level. Twelve unrelated patients carried a recurrent in-frame deletion of a single codon (p.Ser605del) and two other patients carried a novel heterozygous mutation in exon 13 (p.Arg507Cys). Additionally and interestingly, germline mutations of the same gene have been involved in familial polyposis and colorectal cancer (CRC) predisposition. We describe a male and a female patient with MDPL respectively affected with mild and severe phenotypes. Both of them showed mandibular hypoplasia, a beaked nose with bird-like facies, prominent eyes, a small mouth, growth retardation, muscle and skin atrophy, but the female patient showed such a severe and early phenotype that a first working diagnosis of Hutchinson-Gilford Progeria was made. The exploration was performed by direct sequencing of POLD1 gene exon 15 in the male patient with a classical MDPL phenotype and by whole exome sequencing in the female patient and her unaffected parents. Exome sequencing identified in the latter patient a de novo heterozygous undescribed mutation in the POLD1 gene (NM_002691.3: c.3209T>A), predicted to cause the missense change p.Ile1070Asn in the ZnF2 (Zinc Finger 2) domain of the protein. This mutation was not reported in the 1000 Genome Project, dbSNP and Exome sequencing databases. Furthermore, the Isoleucine1070 residue of POLD1 is highly conserved among various species, suggesting that this substitution may cause a major impairment of POLD1 activity. For the second patient, affected with a typical MDPL phenotype, direct sequencing of POLD1 exon 15 revealed the recurrent in-frame deletion (c.1812_1814del, p.S605del). Our work highlights that mutations in different POLD1 domains can lead to phenotypic variability, ranging from dominantly inherited cancer predisposition syndromes, to mild MDPL phenotypes without lifespan reduction, to very severe MDPL syndromes with major premature aging features. These results also suggest that POLD1 gene testing should be considered in patients presenting with severe progeroid features. Copyright © 2017 Elsevier Inc. All rights reserved.

  3. Molecular Diagnosis of Usher Syndrome: Application of Two Different Next Generation Sequencing-Based Procedures

    PubMed Central

    Licastro, Danilo; Mutarelli, Margherita; Peluso, Ivana; Neveling, Kornelia; Wieskamp, Nienke; Rispoli, Rossella; Vozzi, Diego; Athanasakis, Emmanouil; D'Eustacchio, Angela; Pizzo, Mariateresa; D'Amico, Francesca; Ziviello, Carmela; Simonelli, Francesca; Fabretto, Antonella; Scheffer, Hans; Gasparini, Paolo; Banfi, Sandro; Nigro, Vincenzo

    2012-01-01

    Usher syndrome (USH) is a clinically and genetically heterogeneous disorder characterized by visual and hearing impairments. Clinically, it is subdivided into three subclasses with nine genes identified so far. In the present study, we investigated whether the currently available Next Generation Sequencing (NGS) technologies are already suitable for molecular diagnostics of USH. We analyzed a total of 12 patients, most of which were negative for previously described mutations in known USH genes upon primer extension-based microarray genotyping. We enriched the NGS template either by whole exome capture or by Long-PCR of the known USH genes. The main NGS sequencing platforms were used: SOLiD for whole exome sequencing, Illumina (Genome Analyzer II) and Roche 454 (GS FLX) for the Long-PCR sequencing. Long-PCR targeting was more efficient with up to 94% of USH gene regions displaying an overall coverage higher than 25×, whereas whole exome sequencing yielded a similar coverage for only 50% of those regions. Overall this integrated analysis led to the identification of 11 novel sequence variations in USH genes (2 homozygous and 9 heterozygous) out of 18 detected. However, at least two cases were not genetically solved. Our result highlights the current limitations in the diagnostic use of NGS for USH patients. The limit for whole exome sequencing is linked to the need of a strong coverage and to the correct interpretation of sequence variations with a non obvious, pathogenic role, whereas the targeted approach suffers from the high genetic heterogeneity of USH that may be also caused by the presence of additional causative genes yet to be identified. PMID:22952768

  4. Exome sequencing and arrayCGH detection of gene sequence and copy number variation between ILS and ISS mouse strains.

    PubMed

    Dumas, Laura; Dickens, C Michael; Anderson, Nathan; Davis, Jonathan; Bennett, Beth; Radcliffe, Richard A; Sikela, James M

    2014-06-01

    It has been well documented that genetic factors can influence predisposition to develop alcoholism. While the underlying genomic changes may be of several types, two of the most common and disease associated are copy number variations (CNVs) and sequence alterations of protein coding regions. The goal of this study was to identify CNVs and single-nucleotide polymorphisms that occur in gene coding regions that may play a role in influencing the risk of an individual developing alcoholism. Toward this end, two mouse strains were used that have been selectively bred based on their differential sensitivity to alcohol: the Inbred long sleep (ILS) and Inbred short sleep (ISS) mouse strains. Differences in initial response to alcohol have been linked to risk for alcoholism, and the ILS/ISS strains are used to investigate the genetics of initial sensitivity to alcohol. Array comparative genomic hybridization (arrayCGH) and exome sequencing were conducted to identify CNVs and gene coding sequence differences, respectively, between ILS and ISS mice. Mouse arrayCGH was performed using catalog Agilent 1 × 244 k mouse arrays. Subsequently, exome sequencing was carried out using an Illumina HiSeq 2000 instrument. ArrayCGH detected 74 CNVs that were strain-specific (38 ILS/36 ISS), including several ISS-specific deletions that contained genes implicated in brain function and neurotransmitter release. Among several interesting coding variations detected by exome sequencing was the gain of a premature stop codon in the alpha-amylase 2B (AMY2B) gene specifically in the ILS strain. In total, exome sequencing detected 2,597 and 1,768 strain-specific exonic gene variants in the ILS and ISS mice, respectively. This study represents the most comprehensive and detailed genomic comparison of ILS and ISS mouse strains to date. The two complementary genome-wide approaches identified strain-specific CNVs and gene coding sequence variations that should provide strong candidates to contribute to the alcohol-related phenotypic differences associated with these strains.

  5. Rapid prenatal diagnosis using targeted exome sequencing: a cohort study to assess feasibility and potential impact on prenatal counseling and pregnancy management.

    PubMed

    Chandler, Natalie; Best, Sunayna; Hayward, Jane; Faravelli, Francesca; Mansour, Sahar; Kivuva, Emma; Tapon, Dagmar; Male, Alison; DeVile, Catherine; Chitty, Lyn S

    2018-03-29

    PurposeUnexpected fetal abnormalities occur in 2-5% of pregnancies. While traditional cytogenetic and microarray approaches achieve diagnosis in around 40% of cases, lack of diagnosis in others impedes parental counseling, informed decision making, and pregnancy management. Postnatally exome sequencing yields high diagnostic rates, but relies on careful phenotyping to interpret genotype results. Here we used a multidisciplinary approach to explore the utility of rapid fetal exome sequencing for prenatal diagnosis using skeletal dysplasias as an exemplar.MethodsParents in pregnancies undergoing invasive testing because of sonographic fetal abnormalities, where multidisciplinary review considered skeletal dysplasia a likely etiology, were consented for exome trio sequencing (both parents and fetus). Variant interpretation focused on a virtual panel of 240 genes known to cause skeletal dysplasias.ResultsDefinitive molecular diagnosis was made in 13/16 (81%) cases. In some cases, fetal ultrasound findings alone were of sufficient severity for parents to opt for termination. In others, molecular diagnosis informed accurate prediction of outcome, improved parental counseling, and enabled parents to terminate or continue the pregnancy with certainty.ConclusionTrio sequencing with expert multidisciplinary review for case selection and data interpretation yields timely, high diagnostic rates in fetuses presenting with unexpected skeletal abnormalities. This improves parental counseling and pregnancy management.Genetics in Medicine advance online publication, 29 March 2018; doi:10.1038/gim.2018.30.

  6. Whole-Exome Sequencing to Decipher the Genetic Heterogeneity of Hearing Loss in a Chinese Family with Deaf by Deaf Mating

    PubMed Central

    Qing, Jie; Yan, Denise; Zhou, Yuan; Liu, Qiong; Wu, Weijing; Xiao, Zian; Liu, Yuyuan; Liu, Jia; Du, Lilin; Xie, Dinghua; Liu, Xue Zhong

    2014-01-01

    Inherited deafness has been shown to have high genetic heterogeneity. For many decades, linkage analysis and candidate gene approaches have been the main tools to elucidate the genetics of hearing loss. However, this associated study design is costly, time-consuming, and unsuitable for small families. This is mainly due to the inadequate numbers of available affected individuals, locus heterogeneity, and assortative mating. Exome sequencing has now become technically feasible and a cost-effective method for detection of disease variants underlying Mendelian disorders due to the recent advances in next-generation sequencing (NGS) technologies. In the present study, we have combined both the Deafness Gene Mutation Detection Array and exome sequencing to identify deafness causative variants in a large Chinese composite family with deaf by deaf mating. The simultaneous screening of the 9 common deafness mutations using the allele-specific PCR based universal array, resulted in the identification of the 1555A>G in the mitochondrial DNA (mtDNA) 12S rRNA in affected individuals in one branch of the family. We then subjected the mutation-negative cases to exome sequencing and identified novel causative variants in the MYH14 and WFS1 genes. This report confirms the effective use of a NGS technique to detect pathogenic mutations in affected individuals who were not candidates for classical genetic studies. PMID:25289672

  7. Growth hormone deficiency with advanced bone age: phenotypic interaction between GHRH receptor and CYP21A2 mutations diagnosed by sanger and whole exome sequencing.

    PubMed

    Correa, Fernanda A; França, Marcela M; Fang, Qing; Ma, Qianyi; Bachega, Tania A; Rodrigues, Andresa; Ozel, Bilge A; Li, Jun Z; Mendonca, Berenice B; Jorge, Alexander A L; Carvalho, Luciani R; Camper, Sally A; Arnhold, Ivo J P

    2017-12-01

    Isolated growth hormone deficiency (IGHD) is the most common pituitary hormone deficiency and, clinically, patients have delayed bone age. High sequence similarity between CYP21A2 gene and CYP21A1P pseudogene poses difficulties for exome sequencing interpretation. A 7.5 year-old boy born to second-degree cousins presented with severe short stature (height SDS -3.7) and bone age of 6 years. Clonidine and combined pituitary stimulation tests revealed GH deficiency. Pituitary MRI was normal. The patient was successfully treated with rGH. Surprisingly, at 10.8 years, his bone age had advanced to 13 years, but physical exam, LH and testosterone levels remained prepubertal. An ACTH stimulation test disclosed a non-classic congenital adrenal hyperplasia due to 21-hydroxylase deficiency explaining the bone age advancement and, therefore, treatment with cortisone acetate was added. The genetic diagnosis of a homozygous mutation in GHRHR (p.Leu144His), a homozygous CYP21A2 mutation (p.Val282Leu) and CYP21A1P pseudogene duplication was established by Sanger sequencing, MLPA and whole-exome sequencing. We report the unusual clinical presentation of a patient born to consanguineous parents with two recessive endocrine diseases: non-classic congenital adrenal hyperplasia modifying the classical GH deficiency phenotype. We used a method of paired read mapping aided by neighbouring mis-matches to overcome the challenges of exome-sequencing in the presence of a pseudogene.

  8. Whole-exome sequencing identifies recurrent SF3B1 R625 mutation and comutation of NF1 and KIT in mucosal melanoma.

    PubMed

    Hintzsche, Jennifer D; Gorden, Nicholas T; Amato, Carol M; Kim, Jihye; Wuensch, Kelsey E; Robinson, Steven E; Applegate, Allison J; Couts, Kasey L; Medina, Theresa M; Wells, Keith R; Wisell, Joshua A; McCarter, Martin D; Box, Neil F; Shellman, Yiqun G; Gonzalez, Rene C; Lewis, Karl D; Tentler, John J; Tan, Aik Choon; Robinson, William A

    2017-06-01

    Mucosal melanomas are a rare subtype of melanoma, arising in mucosal tissues, which have a very poor prognosis due to the lack of effective targeted therapies. This study aimed to better understand the molecular landscape of these cancers and find potential new therapeutic targets. Whole-exome sequencing was performed on mucosal melanomas from 19 patients and 135 sun-exposed cutaneous melanomas, with matched peripheral blood samples when available. Mutational profiles were compared between mucosal subgroups and sun-exposed cutaneous melanomas. Comparisons of molecular profiles identified 161 genes enriched in mucosal melanoma (P<0.05). KIT and NF1 were frequently comutated (32%) in the mucosal subgroup, with a significantly higher incidence than that in cutaneous melanoma (4%). Recurrent SF3B1 R625H/S/C mutations were identified and validated in 7 of 19 (37%) mucosal melanoma patients. Mutations in the spliceosome pathway were found to be enriched in mucosal melanomas when compared with cutaneous melanomas. Alternative splicing in four genes were observed in SF3B1-mutant samples compared with the wild-type samples. This study identified potential new therapeutic targets for mucosal melanoma, including comutation of NF1 and KIT, and recurrent R625 mutations in SF3B1. This is the first report of SF3B1 R625 mutations in vulvovaginal mucosal melanoma, with the largest whole-exome sequencing project of mucosal melanomas to date. The results here also indicated that the mutations in SF3B1 lead to alternative splicing in multiple genes. These findings expand our knowledge of this rare disease.

  9. Whole-exome sequencing identifies recurrent SF3B1 R625 mutation and comutation of NF1 and KIT in mucosal melanoma

    PubMed Central

    Hintzsche, Jennifer D.; Gorden, Nicholas T.; Amato, Carol M.; Kim, Jihye; Wuensch, Kelsey E.; Robinson, Steven E.; Applegate, Allison J.; Couts, Kasey L.; Medina, Theresa M.; Wells, Keith R.; Wisell, Joshua A.; McCarter, Martin D.; Box, Neil F.; Shellman, Yiqun G.; Gonzalez, Rene C.; Lewis, Karl D.; Tentler, John J.

    2017-01-01

    Mucosal melanomas are a rare subtype of melanoma, arising in mucosal tissues, which have a very poor prognosis due to the lack of effective targeted therapies. This study aimed to better understand the molecular landscape of these cancers and find potential new therapeutic targets. Whole-exome sequencing was performed on mucosal melanomas from 19 patients and 135 sun-exposed cutaneous melanomas, with matched peripheral blood samples when available. Mutational profiles were compared between mucosal subgroups and sun-exposed cutaneous melanomas. Comparisons of molecular profiles identified 161 genes enriched in mucosal melanoma (P<0.05). KIT and NF1 were frequently comutated (32%) in the mucosal subgroup, with a significantly higher incidence than that in cutaneous melanoma (4%). Recurrent SF3B1 R625H/S/C mutations were identified and validated in 7 of 19 (37%) mucosal melanoma patients. Mutations in the spliceosome pathway were found to be enriched in mucosal melanomas when compared with cutaneous melanomas. Alternative splicing in four genes were observed in SF3B1-mutant samples compared with the wild-type samples. This study identified potential new therapeutic targets for mucosal melanoma, including comutation of NF1 and KIT, and recurrent R625 mutations in SF3B1. This is the first report of SF3B1 R625 mutations in vulvovaginal mucosal melanoma, with the largest whole-exome sequencing project of mucosal melanomas to date. The results here also indicated that the mutations in SF3B1 lead to alternative splicing in multiple genes. These findings expand our knowledge of this rare disease. PMID:28296713

  10. Actionable exomic incidental findings in 6503 participants: challenges of variant classification.

    PubMed

    Amendola, Laura M; Dorschner, Michael O; Robertson, Peggy D; Salama, Joseph S; Hart, Ragan; Shirts, Brian H; Murray, Mitzi L; Tokita, Mari J; Gallego, Carlos J; Kim, Daniel Seung; Bennett, James T; Crosslin, David R; Ranchalis, Jane; Jones, Kelly L; Rosenthal, Elisabeth A; Jarvik, Ella R; Itsara, Andy; Turner, Emily H; Herman, Daniel S; Schleit, Jennifer; Burt, Amber; Jamal, Seema M; Abrudan, Jenica L; Johnson, Andrew D; Conlin, Laura K; Dulik, Matthew C; Santani, Avni; Metterville, Danielle R; Kelly, Melissa; Foreman, Ann Katherine M; Lee, Kristy; Taylor, Kent D; Guo, Xiuqing; Crooks, Kristy; Kiedrowski, Lesli A; Raffel, Leslie J; Gordon, Ora; Machini, Kalotina; Desnick, Robert J; Biesecker, Leslie G; Lubitz, Steven A; Mulchandani, Surabhi; Cooper, Greg M; Joffe, Steven; Richards, C Sue; Yang, Yaoping; Rotter, Jerome I; Rich, Stephen S; O'Donnell, Christopher J; Berg, Jonathan S; Spinner, Nancy B; Evans, James P; Fullerton, Stephanie M; Leppig, Kathleen A; Bennett, Robin L; Bird, Thomas; Sybert, Virginia P; Grady, William M; Tabor, Holly K; Kim, Jerry H; Bamshad, Michael J; Wilfond, Benjamin; Motulsky, Arno G; Scott, C Ronald; Pritchard, Colin C; Walsh, Tom D; Burke, Wylie; Raskind, Wendy H; Byers, Peter; Hisama, Fuki M; Rehm, Heidi; Nickerson, Debbie A; Jarvik, Gail P

    2015-03-01

    Recommendations for laboratories to report incidental findings from genomic tests have stimulated interest in such results. In order to investigate the criteria and processes for assigning the pathogenicity of specific variants and to estimate the frequency of such incidental findings in patients of European and African ancestry, we classified potentially actionable pathogenic single-nucleotide variants (SNVs) in all 4300 European- and 2203 African-ancestry participants sequenced by the NHLBI Exome Sequencing Project (ESP). We considered 112 gene-disease pairs selected by an expert panel as associated with medically actionable genetic disorders that may be undiagnosed in adults. The resulting classifications were compared to classifications from other clinical and research genetic testing laboratories, as well as with in silico pathogenicity scores. Among European-ancestry participants, 30 of 4300 (0.7%) had a pathogenic SNV and six (0.1%) had a disruptive variant that was expected to be pathogenic, whereas 52 (1.2%) had likely pathogenic SNVs. For African-ancestry participants, six of 2203 (0.3%) had a pathogenic SNV and six (0.3%) had an expected pathogenic disruptive variant, whereas 13 (0.6%) had likely pathogenic SNVs. Genomic Evolutionary Rate Profiling mammalian conservation score and the Combined Annotation Dependent Depletion summary score of conservation, substitution, regulation, and other evidence were compared across pathogenicity assignments and appear to have utility in variant classification. This work provides a refined estimate of the burden of adult onset, medically actionable incidental findings expected from exome sequencing, highlights challenges in variant classification, and demonstrates the need for a better curated variant interpretation knowledge base. © 2015 Amendola et al.; Published by Cold Spring Harbor Laboratory Press.

  11. Joint linkage and association analysis with exome sequence data implicates SLC25A40 in hypertriglyceridemia.

    PubMed

    Rosenthal, Elisabeth A; Ranchalis, Jane; Crosslin, David R; Burt, Amber; Brunzell, John D; Motulsky, Arno G; Nickerson, Deborah A; Wijsman, Ellen M; Jarvik, Gail P

    2013-12-05

    Hypertriglyceridemia (HTG) is a heritable risk factor for cardiovascular disease. Investigating the genetics of HTG may identify new drug targets. There are ~35 known single-nucleotide variants (SNVs) that explain only ~10% of variation in triglyceride (TG) level. Because of the genetic heterogeneity of HTG, a family study design is optimal for identification of rare genetic variants with large effect size because the same mutation can be observed in many relatives and cosegregation with TG can be tested. We considered HTG in a five-generation family of European American descent (n = 121), ascertained for familial combined hyperlipidemia. By using Bayesian Markov chain Monte Carlo joint oligogenic linkage and association analysis, we detected linkage to chromosomes 7 and 17. Whole-exome sequence data revealed shared, highly conserved, private missense SNVs in both SLC25A40 on chr7 and PLD2 on chr17. Jointly, these SNVs explained 49% of the genetic variance in TG; however, only the SLC25A40 SNV was significantly associated with TG (p = 0.0001). This SNV, c.374A>G, causes a highly disruptive p.Tyr125Cys substitution just outside the second helical transmembrane region of the SLC25A40 inner mitochondrial membrane transport protein. Whole-gene testing in subjects from the Exome Sequencing Project confirmed the association between TG and SLC25A40 rare, highly conserved, coding variants (p = 0.03). These results suggest a previously undescribed pathway for HTG and illustrate the power of large pedigrees in the search for rare, causal variants. Copyright © 2013 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.

  12. Whole-exome sequencing for mutation detection in pediatric disorders of insulin secretion: Maturity onset diabetes of the young and congenital hyperinsulinism.

    PubMed

    Johnson, S R; Leo, P J; McInerney-Leo, A M; Anderson, L K; Marshall, M; McGown, I; Newell, F; Brown, M A; Conwell, L S; Harris, M; Duncan, E L

    2018-06-01

    To assess the utility of whole-exome sequencing (WES) for mutation detection in maturity-onset diabetes of the young (MODY) and congenital hyperinsulinism (CHI). MODY and CHI are the two commonest monogenic disorders of glucose-regulated insulin secretion in childhood, with 13 causative genes known for MODY and 10 causative genes identified for CHI. The large number of potential genes makes comprehensive screening using traditional methods expensive and time-consuming. Ten subjects with MODY and five with CHI with known mutations underwent WES using two different exome capture kits (Nimblegen SeqCap EZ Human v3.0 Exome Enrichment Kit, Nextera Rapid Capture Exome Kit). Analysis was blinded to previously identified mutations, and included assessment for large deletions. The target capture of five exome capture technologies was also analyzed using sequencing data from >2800 unrelated samples. Four of five MODY mutations were identified using Nimblegen (including a large deletion in HNF1B). Although targeted, one mutation (in INS) had insufficient coverage for detection. Eleven of eleven mutations (six MODY, five CHI) were identified using Nextera Rapid (including the previously missed mutation). On reconciliation, all mutations concorded with previous data and no additional variants in MODY genes were detected. There were marked differences in the performance of the capture technologies. WES can be useful for screening for MODY/CHI mutations, detecting both point mutations and large deletions. However, capture technologies require careful selection. © 2018 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  13. Analyzing Somatic Genome Rearrangements in Human Cancers by Using Whole-Exome Sequencing | Office of Cancer Genomics

    Cancer.gov

    Although exome sequencing data are generated primarily to detect single-nucleotide variants and indels, they can also be used to identify a subset of genomic rearrangements whose breakpoints are located in or near exons. Using >4,600 tumor and normal pairs across 15 cancer types, we identified over 9,000 high confidence somatic rearrangements, including a large number of gene fusions.

  14. Clinical and genetic characterization of leukoencephalopathies in adults.

    PubMed

    Lynch, David S; Rodrigues Brandão de Paiva, Anderson; Zhang, Wei Jia; Bugiardini, Enrico; Freua, Fernando; Tavares Lucato, Leandro; Macedo-Souza, Lucia Inês; Lakshmanan, Rahul; Kinsella, Justin A; Merwick, Aine; Rossor, Alexander M; Bajaj, Nin; Herron, Brian; McMonagle, Paul; Morrison, Patrick J; Hughes, Deborah; Pittman, Alan; Laurà, Matilde; Reilly, Mary M; Warren, Jason D; Mummery, Catherine J; Schott, Jonathan M; Adams, Matthew; Fox, Nick C; Murphy, Elaine; Davagnanam, Indran; Kok, Fernando; Chataway, Jeremy; Houlden, Henry

    2017-05-01

    Leukodystrophies and genetic leukoencephalopathies are a rare group of disorders leading to progressive degeneration of cerebral white matter. They are associated with a spectrum of clinical phenotypes dominated by dementia, psychiatric changes, movement disorders and upper motor neuron signs. Mutations in at least 60 genes can lead to leukoencephalopathy with often overlapping clinical and radiological presentations. For these reasons, patients with genetic leukoencephalopathies often endure a long diagnostic odyssey before receiving a definitive diagnosis or may receive no diagnosis at all. In this study, we used focused and whole exome sequencing to evaluate a cohort of undiagnosed adult patients referred to a specialist leukoencephalopathy service. In total, 100 patients were evaluated using focused exome sequencing of 6100 genes. We detected pathogenic or likely pathogenic variants in 26 cases. The most frequently mutated genes were NOTCH3, EIF2B5, AARS2 and CSF1R. We then carried out whole exome sequencing on the remaining negative cases including four family trios, but could not identify any further potentially disease-causing mutations, confirming the equivalence of focused and whole exome sequencing in the diagnosis of genetic leukoencephalopathies. Here we provide an overview of the clinical and genetic features of these disorders in adults. © The Author (2017). Published by Oxford University Press on behalf of the Guarantors of Brain.

  15. Usability study of clinical exome analysis software: top lessons learned and recommendations.

    PubMed

    Shyr, Casper; Kushniruk, Andre; Wasserman, Wyeth W

    2014-10-01

    New DNA sequencing technologies have revolutionized the search for genetic disruptions. Targeted sequencing of all protein coding regions of the genome, called exome analysis, is actively used in research-oriented genetics clinics, with the transition to exomes as a standard procedure underway. This transition is challenging; identification of potentially causal mutation(s) amongst ∼10(6) variants requires specialized computation in combination with expert assessment. This study analyzes the usability of user interfaces for clinical exome analysis software. There are two study objectives: (1) To ascertain the key features of successful user interfaces for clinical exome analysis software based on the perspective of expert clinical geneticists, (2) To assess user-system interactions in order to reveal strengths and weaknesses of existing software, inform future design, and accelerate the clinical uptake of exome analysis. Surveys, interviews, and cognitive task analysis were performed for the assessment of two next-generation exome sequence analysis software packages. The subjects included ten clinical geneticists who interacted with the software packages using the "think aloud" method. Subjects' interactions with the software were recorded in their clinical office within an urban research and teaching hospital. All major user interface events (from the user interactions with the packages) were time-stamped and annotated with coding categories to identify usability issues in order to characterize desired features and deficiencies in the user experience. We detected 193 usability issues, the majority of which concern interface layout and navigation, and the resolution of reports. Our study highlights gaps in specific software features typical within exome analysis. The clinicians perform best when the flow of the system is structured into well-defined yet customizable layers for incorporation within the clinical workflow. The results highlight opportunities to dramatically accelerate clinician analysis and interpretation of patient genomic data. We present the first application of usability methods to evaluate software interfaces in the context of exome analysis. Our results highlight how the study of user responses can lead to identification of usability issues and challenges and reveal software reengineering opportunities for improving clinical next-generation sequencing analysis. While the evaluation focused on two distinctive software tools, the results are general and should inform active and future software development for genome analysis software. As large-scale genome analysis becomes increasingly common in healthcare, it is critical that efficient and effective software interfaces are provided to accelerate clinical adoption of the technology. Implications for improved design of such applications are discussed. Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.

  16. Common and rare von Willebrand factor (VWF) coding variants, VWF levels, and factor VIII levels in African Americans: the NHLBI Exome Sequencing Project.

    PubMed

    Johnsen, Jill M; Auer, Paul L; Morrison, Alanna C; Jiao, Shuo; Wei, Peng; Haessler, Jeffrey; Fox, Keolu; McGee, Sean R; Smith, Joshua D; Carlson, Christopher S; Smith, Nicholas; Boerwinkle, Eric; Kooperberg, Charles; Nickerson, Deborah A; Rich, Stephen S; Green, David; Peters, Ulrike; Cushman, Mary; Reiner, Alex P

    2013-07-25

    Several rare European von Willebrand disease missense variants of VWF (including p.Arg2185Gln and p.His817Gln) were recently reported to be common in apparently healthy African Americans (AAs). Using data from the NHLBI Exome Sequencing Project, we assessed the association of these and other VWF coding variants with von Willebrand factor (VWF) and factor VIII (FVIII) levels in 4468 AAs. Of 30 nonsynonymous VWF variants, 6 were significantly and independently associated (P < .001) with levels of VWF and/or FVIII. Each additional copy of the common VWF variants encoding p.Thr789Ala or p.Asp1472His was associated with 6 to 8 IU/dL higher VWF levels. The VWF variant encoding p.Arg2185Gln was associated with 7 to 13 IU/dL lower VWF and FVIII levels. The type 2N-related VWF variant encoding p.His817Gln was associated with 17 IU/dL lower FVIII level but normal VWF level. A novel, rare missense VWF variant that predicts disruption of an O-glycosylation site (p.Ser1486Leu) and a rare variant encoding p.Arg2287Trp were each associated with 30 to 40 IU/dL lower VWF level (P < .001). In summary, several common and rare VWF missense variants contribute to phenotypic differences in VWF and FVIII among AAs.

  17. Efficient Semiparametric Inference Under Two-Phase Sampling, With Applications to Genetic Association Studies.

    PubMed

    Tao, Ran; Zeng, Donglin; Lin, Dan-Yu

    2017-01-01

    In modern epidemiological and clinical studies, the covariates of interest may involve genome sequencing, biomarker assay, or medical imaging and thus are prohibitively expensive to measure on a large number of subjects. A cost-effective solution is the two-phase design, under which the outcome and inexpensive covariates are observed for all subjects during the first phase and that information is used to select subjects for measurements of expensive covariates during the second phase. For example, subjects with extreme values of quantitative traits were selected for whole-exome sequencing in the National Heart, Lung, and Blood Institute (NHLBI) Exome Sequencing Project (ESP). Herein, we consider general two-phase designs, where the outcome can be continuous or discrete, and inexpensive covariates can be continuous and correlated with expensive covariates. We propose a semiparametric approach to regression analysis by approximating the conditional density functions of expensive covariates given inexpensive covariates with B-spline sieves. We devise a computationally efficient and numerically stable EM-algorithm to maximize the sieve likelihood. In addition, we establish the consistency, asymptotic normality, and asymptotic efficiency of the estimators. Furthermore, we demonstrate the superiority of the proposed methods over existing ones through extensive simulation studies. Finally, we present applications to the aforementioned NHLBI ESP.

  18. Bamgineer: Introduction of simulated allele-specific copy number variants into exome and targeted sequence data sets.

    PubMed

    Samadian, Soroush; Bruce, Jeff P; Pugh, Trevor J

    2018-03-01

    Somatic copy number variations (CNVs) play a crucial role in development of many human cancers. The broad availability of next-generation sequencing data has enabled the development of algorithms to computationally infer CNV profiles from a variety of data types including exome and targeted sequence data; currently the most prevalent types of cancer genomics data. However, systemic evaluation and comparison of these tools remains challenging due to a lack of ground truth reference sets. To address this need, we have developed Bamgineer, a tool written in Python to introduce user-defined haplotype-phased allele-specific copy number events into an existing Binary Alignment Mapping (BAM) file, with a focus on targeted and exome sequencing experiments. As input, this tool requires a read alignment file (BAM format), lists of non-overlapping genome coordinates for introduction of gains and losses (bed file), and an optional file defining known haplotypes (vcf format). To improve runtime performance, Bamgineer introduces the desired CNVs in parallel using queuing and parallel processing on a local machine or on a high-performance computing cluster. As proof-of-principle, we applied Bamgineer to a single high-coverage (mean: 220X) exome sequence file from a blood sample to simulate copy number profiles of 3 exemplar tumors from each of 10 tumor types at 5 tumor cellularity levels (20-100%, 150 BAM files in total). To demonstrate feasibility beyond exome data, we introduced read alignments to a targeted 5-gene cell-free DNA sequencing library to simulate EGFR amplifications at frequencies consistent with circulating tumor DNA (10, 1, 0.1 and 0.01%) while retaining the multimodal insert size distribution of the original data. We expect Bamgineer to be of use for development and systematic benchmarking of CNV calling algorithms by users using locally-generated data for a variety of applications. The source code is freely available at http://github.com/pughlab/bamgineer.

  19. Whole Exome Sequencing of Patients with Steroid-Resistant Nephrotic Syndrome.

    PubMed

    Warejko, Jillian K; Tan, Weizhen; Daga, Ankana; Schapiro, David; Lawson, Jennifer A; Shril, Shirlee; Lovric, Svjetlana; Ashraf, Shazia; Rao, Jia; Hermle, Tobias; Jobst-Schwan, Tilman; Widmeier, Eugen; Majmundar, Amar J; Schneider, Ronen; Gee, Heon Yung; Schmidt, J Magdalena; Vivante, Asaf; van der Ven, Amelie T; Ityel, Hadas; Chen, Jing; Sadowski, Carolin E; Kohl, Stefan; Pabst, Werner L; Nakayama, Makiko; Somers, Michael J G; Rodig, Nancy M; Daouk, Ghaleb; Baum, Michelle; Stein, Deborah R; Ferguson, Michael A; Traum, Avram Z; Soliman, Neveen A; Kari, Jameela A; El Desoky, Sherif; Fathy, Hanan; Zenker, Martin; Bakkaloglu, Sevcan A; Müller, Dominik; Noyan, Aytul; Ozaltin, Fatih; Cadnapaphornchai, Melissa A; Hashmi, Seema; Hopcian, Jeffrey; Kopp, Jeffrey B; Benador, Nadine; Bockenhauer, Detlef; Bogdanovic, Radovan; Stajić, Nataša; Chernin, Gil; Ettenger, Robert; Fehrenbach, Henry; Kemper, Markus; Munarriz, Reyner Loza; Podracka, Ludmila; Büscher, Rainer; Serdaroglu, Erkin; Tasic, Velibor; Mane, Shrikant; Lifton, Richard P; Braun, Daniela A; Hildebrandt, Friedhelm

    2018-01-06

    Steroid-resistant nephrotic syndrome overwhelmingly progresses to ESRD. More than 30 monogenic genes have been identified to cause steroid-resistant nephrotic syndrome. We previously detected causative mutations using targeted panel sequencing in 30% of patients with steroid-resistant nephrotic syndrome. Panel sequencing has a number of limitations when compared with whole exome sequencing. We employed whole exome sequencing to detect monogenic causes of steroid-resistant nephrotic syndrome in an international cohort of 300 families. Three hundred thirty-five individuals with steroid-resistant nephrotic syndrome from 300 families were recruited from April of 1998 to June of 2016. Age of onset was restricted to <25 years of age. Exome data were evaluated for 33 known monogenic steroid-resistant nephrotic syndrome genes. In 74 of 300 families (25%), we identified a causative mutation in one of 20 genes known to cause steroid-resistant nephrotic syndrome. In 11 families (3.7%), we detected a mutation in a gene that causes a phenocopy of steroid-resistant nephrotic syndrome. This is consistent with our previously published identification of mutations using a panel approach. We detected a causative mutation in a known steroid-resistant nephrotic syndrome gene in 38% of consanguineous families and in 13% of nonconsanguineous families, and 48% of children with congenital nephrotic syndrome. A total of 68 different mutations were detected in 20 of 33 steroid-resistant nephrotic syndrome genes. Fifteen of these mutations were novel. NPHS1 , PLCE1 , NPHS2 , and SMARCAL1 were the most common genes in which we detected a mutation. In another 28% of families, we detected mutations in one or more candidate genes for steroid-resistant nephrotic syndrome. Whole exome sequencing is a sensitive approach toward diagnosis of monogenic causes of steroid-resistant nephrotic syndrome. A molecular genetic diagnosis of steroid-resistant nephrotic syndrome may have important consequences for the management of treatment and kidney transplantation in steroid-resistant nephrotic syndrome. Copyright © 2018 by the American Society of Nephrology.

  20. MuSE: accounting for tumor heterogeneity using a sample-specific error model improves sensitivity and specificity in mutation calling from sequencing data.

    PubMed

    Fan, Yu; Xi, Liu; Hughes, Daniel S T; Zhang, Jianjun; Zhang, Jianhua; Futreal, P Andrew; Wheeler, David A; Wang, Wenyi

    2016-08-24

    Subclonal mutations reveal important features of the genetic architecture of tumors. However, accurate detection of mutations in genetically heterogeneous tumor cell populations using next-generation sequencing remains challenging. We develop MuSE ( http://bioinformatics.mdanderson.org/main/MuSE ), Mutation calling using a Markov Substitution model for Evolution, a novel approach for modeling the evolution of the allelic composition of the tumor and normal tissue at each reference base. MuSE adopts a sample-specific error model that reflects the underlying tumor heterogeneity to greatly improve the overall accuracy. We demonstrate the accuracy of MuSE in calling subclonal mutations in the context of large-scale tumor sequencing projects using whole exome and whole genome sequencing.

  1. Whole-exome sequencing identifies USH2A mutations in a pseudo-dominant Usher syndrome family.

    PubMed

    Zheng, Sui-Lian; Zhang, Hong-Liang; Lin, Zhen-Lang; Kang, Qian-Yan

    2015-10-01

    Usher syndrome (USH) is an autosomal recessive (AR) multi-sensory degenerative disorder leading to deaf-blindness. USH is clinically subdivided into three subclasses, and 10 genes have been identified thus far. Clinical and genetic heterogeneities in USH make a precise diagnosis difficult. A dominant‑like USH family in successive generations was identified, and the present study aimed to determine the genetic predisposition of this family. Whole‑exome sequencing was performed in two affected patients and an unaffected relative. Systematic data were analyzed by bioinformatic analysis to remove the candidate mutations via step‑wise filtering. Direct Sanger sequencing and co‑segregation analysis were performed in the pedigree. One novel and two known mutations in the USH2A gene were identified, and were further confirmed by direct sequencing and co‑segregation analysis. The affected mother carried compound mutations in the USH2A gene, while the unaffected father carried a heterozygous mutation. The present study demonstrates that whole‑exome sequencing is a robust approach for the molecular diagnosis of disorders with high levels of genetic heterogeneity.

  2. Data Interoperability of Whole Exome Sequencing (WES) Based Mutational Burden Estimates from Different Laboratories

    PubMed Central

    Qiu, Ping; Pang, Ling; Arreaza, Gladys; Maguire, Maureen; Chang, Ken C. N.; Marton, Matthew J.; Levitan, Diane

    2016-01-01

    Immune checkpoint inhibitors, which unleash a patient’s own T cells to kill tumors, are revolutionizing cancer treatment. Several independent studies suggest that higher non-synonymous mutational burden assessed by whole exome sequencing (WES) in tumors is associated with improved objective response, durable clinical benefit, and progression-free survival in immune checkpoint inhibitors treatment. Next-generation sequencing (NGS) is a promising technology being used in the clinic to direct patient treatment. Cancer genome WES poses a unique challenge due to tumor heterogeneity and sequencing artifacts introduced by formalin-fixed, paraffin-embedded (FFPE) tissue. In order to evaluate the data interoperability of WES data from different sources to survey tumor mutational landscape, we compared WES data of several tumor/normal matched samples from five commercial vendors. A large data discrepancy was observed from vendors’ self-reported data. Independent data analysis from vendors’ raw NGS data shows that whole exome sequencing data from qualified vendors can be combined and analyzed uniformly to derive comparable quantitative estimates of tumor mutational burden. PMID:27136543

  3. Germline whole exome sequencing and large-scale replication identifies FANCM as a likely high grade serous ovarian cancer susceptibility gene.

    PubMed

    Dicks, Ed; Song, Honglin; Ramus, Susan J; Oudenhove, Elke Van; Tyrer, Jonathan P; Intermaggio, Maria P; Kar, Siddhartha; Harrington, Patricia; Bowtell, David D; Group, Aocs Study; Cicek, Mine S; Cunningham, Julie M; Fridley, Brooke L; Alsop, Jennifer; Jimenez-Linan, Mercedes; Piskorz, Anna; Goranova, Teodora; Kent, Emma; Siddiqui, Nadeem; Paul, James; Crawford, Robin; Poblete, Samantha; Lele, Shashi; Sucheston-Campbell, Lara; Moysich, Kirsten B; Sieh, Weiva; McGuire, Valerie; Lester, Jenny; Odunsi, Kunle; Whittemore, Alice S; Bogdanova, Natalia; Dürst, Matthias; Hillemanns, Peter; Karlan, Beth Y; Gentry-Maharaj, Aleksandra; Menon, Usha; Tischkowitz, Marc; Levine, Douglas; Brenton, James D; Dörk, Thilo; Goode, Ellen L; Gayther, Simon A; Pharoah, D P Paul

    2017-08-01

    We analyzed whole exome sequencing data in germline DNA from 412 high grade serous ovarian cancer (HGSOC) cases from The Cancer Genome Atlas Project and identified 5,517 genes harboring a predicted deleterious germline coding mutation in at least one HGSOC case. Gene-set enrichment analysis showed enrichment for genes involved in DNA repair (p = 1.8×10 -3 ). Twelve DNA repair genes - APEX1, APLF, ATX, EME1, FANCL, FANCM, MAD2L2, PARP2, PARP3, POLN, RAD54L and SMUG1 - were prioritized for targeted sequencing in up to 3,107 HGSOC cases, 1,491 cases of other epithelial ovarian cancer (EOC) subtypes and 3,368 unaffected controls of European origin. We estimated mutation prevalence for each gene and tested for associations with disease risk. Mutations were identified in both cases and controls in all genes except MAD2L2 , where we found no evidence of mutations in controls. In FANCM we observed a higher mutation frequency in HGSOC cases compared to controls (29/3,107 cases, 0.96 percent; 13/3,368 controls, 0.38 percent; P=0.008) with little evidence for association with other subtypes (6/1,491, 0.40 percent; P=0.82). The relative risk of HGSOC associated with deleterious FANCM mutations was estimated to be 2.5 (95% CI 1.3 - 5.0; P=0.006). In summary, whole exome sequencing of EOC cases with large-scale replication in case-control studies has identified FANCM as a likely novel susceptibility gene for HGSOC, with mutations associated with a moderate increase in risk. These data may have clinical implications for risk prediction and prevention approaches for high-grade serous ovarian cancer in the future and a significant impact on reducing disease mortality.

  4. Molecular defects identified by whole exome sequencing in a child with Fanconi anemia.

    PubMed

    Zheng, Zhaojing; Geng, Juan; Yao, Ru-En; Li, Caihua; Ying, Daming; Shen, Yongnian; Ying, Lei; Yu, Yongguo; Fu, Qihua

    2013-11-10

    Fanconi anemia is a rare genetic disease characterized by bone marrow failure, multiple congenital malformations, and an increased susceptibility to malignancy. At least 15 genes have been identified that are involved in the pathogenesis of Fanconi anemia. However, it is still a challenge to assign the complementation group and to characterize the molecular defects in patients with Fanconi anemia. In the current study, whole exome sequencing was used to identify the affected gene(s) in a boy with Fanconi anemia. A recurring, non-synonymous mutation was found (c.3971C>T, p.P1324L) as well as a novel frameshift mutation (c.989_995del, p.H330LfsX2) in FANCA gene. Our results indicate that whole exome sequencing may be useful in clinical settings for rapid identification of disease-causing mutations in rare genetic disorders such as Fanconi anemia. © 2013 Elsevier B.V. All rights reserved.

  5. Novel ZBTB24 Mutation Associated with Immunodeficiency, Centromere Instability, and Facial Anomalies Type-2 Syndrome Identified in a Patient with Very Early Onset Inflammatory Bowel Disease.

    PubMed

    Conrad, Máire A; Dawany, Noor; Sullivan, Kathleen E; Devoto, Marcella; Kelsen, Judith R

    2017-12-01

    Very early onset inflammatory bowel disease, diagnosed in children ≤5 years old, can be the initial presentation of some primary immunodeficiencies. In this study, we describe a 17-month-old boy with recurrent infections, growth failure, facial anomalies, and inflammatory bowel disease. Immune evaluation, whole-exome sequencing, karyotyping, and methylation array were performed to evaluate the child's constellation of symptoms and examination findings. Whole-exome sequencing revealed that the child was homozygous for a novel variant in ZBTB24, the gene associated with immunodeficiency, centromere instability, and facial anomalies type-2 syndrome. This describes the first case of inflammatory bowel disease associated with immunodeficiency, centromere instability, and facial anomalies type-2 syndrome in a child with a novel disease-causing mutation in ZBTB24 found on whole-exome sequencing.

  6. Diagnosis of mitochondrial disorders by concomitant next-generation sequencing of the exome and mitochondrial genome

    PubMed Central

    Dinwiddie, Darrell L.; Smith, Laurie D.; Miller, Neil A.; Atherton, Andrea M.; Farrow, Emily G.; Strenk, Meghan E.; Soden, Sarah E.; Saunders, Carol J.; Kingsmore, Stephen F.

    2015-01-01

    Mitochondrial diseases are notoriously difficult to diagnose due to extreme locus and allelic heterogeneity, with both nuclear and mitochondrial genomes potentially liable. Using exome sequencing we demonstrate the ability to rapidly and cost effectively evaluate both the nuclear and mitochondrial genomes to obtain a molecular diagnosis for four patients with three distinct mitochondrial disorders. One patient was found to have Leigh syndrome due to a mutation in MT-ATP6, two affected siblings were discovered to be compound heterozygous for mutations in the NDUFV1 gene, which causes mitochondrial complex I deficiency, and one patient was found to have coenzyme Q10 deficiency due to compound heterozygous mutations in COQ2. In all cases conventional diagnostic testing failed to identify a molecular diagnosis. We suggest that additional studies should be conducted to evaluate exome sequencing as a primary diagnostic test for mitochondrial diseases, including those due to mtDNA mutations. PMID:23631824

  7. Exome sequencing identifies a DNAJB6 mutation in a family with dominantly-inherited limb-girdle muscular dystrophy.

    PubMed

    Couthouis, Julien; Raphael, Alya R; Siskind, Carly; Findlay, Andrew R; Buenrostro, Jason D; Greenleaf, William J; Vogel, Hannes; Day, John W; Flanigan, Kevin M; Gitler, Aaron D

    2014-05-01

    Limb-girdle muscular dystrophy primarily affects the muscles of the hips and shoulders (the "limb-girdle" muscles), although it is a heterogeneous disorder that can present with varying symptoms. There is currently no cure. We sought to identify the genetic basis of limb-girdle muscular dystrophy type 1 in an American family of Northern European descent using exome sequencing. Exome sequencing was performed on DNA samples from two affected siblings and one unaffected sibling and resulted in the identification of eleven candidate mutations that co-segregated with the disease. Notably, this list included a previously reported mutation in DNAJB6, p.Phe89Ile, which was recently identified as a cause of limb-girdle muscular dystrophy type 1D. Additional family members were Sanger sequenced and the mutation in DNAJB6 was only found in affected individuals. Subsequent haplotype analysis indicated that this DNAJB6 p.Phe89Ile mutation likely arose independently of the previously reported mutation. Since other published mutations are located close by in the G/F domain of DNAJB6, this suggests that the area may represent a mutational hotspot. Exome sequencing provided an unbiased and effective method for identifying the genetic etiology of limb-girdle muscular dystrophy type 1 in a previously genetically uncharacterized family. This work further confirms the causative role of DNAJB6 mutations in limb-girdle muscular dystrophy type 1D. Copyright © 2014 Elsevier B.V. All rights reserved.

  8. Atypical case of Wolfram syndrome revealed through targeted exome sequencing in a patient with suspected mitochondrial disease

    PubMed Central

    2012-01-01

    Background Mitochondrial diseases comprise a diverse set of clinical disorders that affect multiple organ systems with varying severity and age of onset. Due to their clinical and genetic heterogeneity, these diseases are difficult to diagnose. We have developed a targeted exome sequencing approach to improve our ability to properly diagnose mitochondrial diseases and apply it here to an individual patient. Our method targets mitochondrial DNA (mtDNA) and the exons of 1,600 nuclear genes involved in mitochondrial biology or Mendelian disorders with multi-system phenotypes, thereby allowing for simultaneous evaluation of multiple disease loci. Case Presentation Targeted exome sequencing was performed on a patient initially suspected to have a mitochondrial disorder. The patient presented with diabetes mellitus, diffuse brain atrophy, autonomic neuropathy, optic nerve atrophy, and a severe amnestic syndrome. Further work-up revealed multiple heteroplasmic mtDNA deletions as well as profound thiamine deficiency without a clear nutritional cause. Targeted exome sequencing revealed a homozygous c.1672C > T (p.R558C) missense mutation in exon 8 of WFS1 that has previously been reported in a patient with Wolfram syndrome. Conclusion This case demonstrates how clinical application of next-generation sequencing technology can enhance the diagnosis of patients suspected to have rare genetic disorders. Furthermore, the finding of unexplained thiamine deficiency in a patient with Wolfram syndrome suggests a potential link between WFS1 biology and thiamine metabolism that has implications for the clinical management of Wolfram syndrome patients. PMID:22226368

  9. Whole exome sequencing is an efficient, sensitive and specific method of mutation detection in osteogenesis imperfecta and Marfan syndrome

    PubMed Central

    McInerney-Leo, Aideen M; Marshall, Mhairi S; Gardiner, Brooke; Coucke, Paul J; Van Laer, Lut; Loeys, Bart L; Summers, Kim M; Symoens, Sofie; West, Jennifer A; West, Malcolm J; Paul Wordsworth, B; Zankl, Andreas; Leo, Paul J; Brown, Matthew A; Duncan, Emma L

    2013-01-01

    Osteogenesis imperfecta (OI) and Marfan syndrome (MFS) are common Mendelian disorders. Both conditions are usually diagnosed clinically, as genetic testing is expensive due to the size and number of potentially causative genes and mutations. However, genetic testing may benefit patients, at-risk family members and individuals with borderline phenotypes, as well as improving genetic counseling and allowing critical differential diagnoses. We assessed whether whole exome sequencing (WES) is a sensitive method for mutation detection in OI and MFS. WES was performed on genomic DNA from 13 participants with OI and 10 participants with MFS who had known mutations, with exome capture followed by massive parallel sequencing of multiplexed samples. Single nucleotide polymorphisms (SNPs) and small indels were called using Genome Analysis Toolkit (GATK) and annotated with ANNOVAR. CREST, exomeCopy and exomeDepth were used for large deletion detection. Results were compared with the previous data. Specificity was calculated by screening WES data from a control population of 487 individuals for mutations in COL1A1, COL1A2 and FBN1. The target capture of five exome capture platforms was compared. All 13 mutations in the OI cohort and 9/10 in the MFS cohort were detected (sensitivity=95.6%) including non-synonymous SNPs, small indels (<10 bp), and a large UTR5/exon 1 deletion. One mutation was not detected by GATK due to strand bias. Specificity was 99.5%. Capture platforms and analysis programs differed considerably in their ability to detect mutations. Consumable costs for WES were low. WES is an efficient, sensitive, specific and cost-effective method for mutation detection in patients with OI and MFS. Careful selection of platform and analysis programs is necessary to maximize success. PMID:24501682

  10. SEXCMD: Development and validation of sex marker sequences for whole-exome/genome and RNA sequencing.

    PubMed

    Jeong, Seongmun; Kim, Jiwoong; Park, Won; Jeon, Hongmin; Kim, Namshin

    2017-01-01

    Over the last decade, a large number of nucleotide sequences have been generated by next-generation sequencing technologies and deposited to public databases. However, most of these datasets do not specify the sex of individuals sampled because researchers typically ignore or hide this information. Male and female genomes in many species have distinctive sex chromosomes, XX/XY and ZW/ZZ, and expression levels of many sex-related genes differ between the sexes. Herein, we describe how to develop sex marker sequences from syntenic regions of sex chromosomes and use them to quickly identify the sex of individuals being analyzed. Array-based technologies routinely use either known sex markers or the B-allele frequency of X or Z chromosomes to deduce the sex of an individual. The same strategy has been used with whole-exome/genome sequence data; however, all reads must be aligned onto a reference genome to determine the B-allele frequency of the X or Z chromosomes. SEXCMD is a pipeline that can extract sex marker sequences from reference sex chromosomes and rapidly identify the sex of individuals from whole-exome/genome and RNA sequencing after training with a known dataset through a simple machine learning approach. The pipeline counts total numbers of hits from sex-specific marker sequences and identifies the sex of the individuals sampled based on the fact that XX/ZZ samples do not have Y or W chromosome hits. We have successfully validated our pipeline with mammalian (Homo sapiens; XY) and avian (Gallus gallus; ZW) genomes. Typical calculation time when applying SEXCMD to human whole-exome or RNA sequencing datasets is a few minutes, and analyzing human whole-genome datasets takes about 10 minutes. Another important application of SEXCMD is as a quality control measure to avoid mixing samples before bioinformatics analysis. SEXCMD comprises simple Python and R scripts and is freely available at https://github.com/lovemun/SEXCMD.

  11. Exome sequencing in 32 patients with anophthalmia/microphthalmia and developmental eye defects.

    PubMed

    Slavotinek, A M; Garcia, S T; Chandratillake, G; Bardakjian, T; Ullah, E; Wu, D; Umeda, K; Lao, R; Tang, P L-F; Wan, E; Madireddy, L; Lyalina, S; Mendelsohn, B A; Dugan, S; Tirch, J; Tischler, R; Harris, J; Clark, M J; Chervitz, S; Patwardhan, A; West, J M; Ursell, P; de Alba Campomanes, A; Schneider, A; Kwok, P-Y; Baranzini, S; Chen, R O

    2015-11-01

    Anophthalmia/microphthalmia (A/M) is a genetically heterogeneous birth defect for which the etiology is unknown in more than 50% of patients. We used exome sequencing with the ACE Exome(TM) (Personalis, Inc; 18 cases) and UCSF Genomics Core (21 cases) to sequence 28 patients with A/M and four patients with varied developmental eye defects. In the 28 patients with A/M, we identified de novo mutations in three patients (OTX2, p.(Gln91His), RARB, p.Arg387Cys and GDF6, p.Ala249Glu) and inherited mutations in STRA6 in two patients. In patients with developmental eye defects, a female with cataracts and cardiomyopathy had a de novo COL4A1 mutation, p.(Gly773Arg), expanding the phenotype associated with COL4A1 to include cardiomyopathy. A male with a chorioretinal defect, microcephaly, seizures and sensorineural deafness had two PNPT1 mutations, p.(Ala507Ser) and c.401-1G>A, and we describe eye defects associated with this gene for the first time. Exome sequencing was efficient for identifying mutations in pathogenic genes for which there is no clinical testing available and for identifying cases that expand phenotypic spectra, such as the PNPT1 and COL4A1-associated disorders described here. © 2014 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  12. Identification of co-occurrence in a patient with Dent's disease and ADA2-deficiency by exome sequencing.

    PubMed

    Günthner, Roman; Wagner, Matias; Thurm, Tobias; Ponsel, Sabine; Höfele, Julia; Lange-Sperandio, Bärbel

    2018-04-05

    Patients with co-occurrence of two independent pathologies pose a challenge for clinicians as the phenotype often presents as an unclear syndrome. In these cases, exome sequencing serves as a powerful instrument to determine the underlying genetic causes. Here, we present the case of a 4-year old boy with proteinuria, microhematuria, hypercalciuria, nephrocalcinosis, livedo-like rash, recurrent abdominal pain, anemia and continuously elevated CRP. Single exome sequencing revealed the pathogenic nonsense mutation p.(Arg98*) in the CLCN5 gene causing the X-linked inherited, renal tubular disorder Dent's disease. Furthermore, the two pathogenic and compound heterozygous missense variants p.(Gly47Ala) and p.(Pro251Leu) in the CECR1 gene could be identified. Mutations in the CECR1 gene are associated with a hereditary form of polyarteritis nodosa, called ADA2-deficiency. Both parents were carriers of a single heterozygous variant in CECR1 and the mother was carrier of the CLCN5 variant. This case evidently demonstrates the advantage of whole exome sequencing compared to single gene testing as the pathology in the CECR1 gene might have only been diagnosed after the occurrence of signs of systemic vasculitis like strokes or hemorrhages. Therefore, treatment and prevention can now start early to improve the outcome of these patients. Copyright © 2018 Elsevier B.V. All rights reserved.

  13. Clinical germline diagnostic exome sequencing for hereditary cancer: Findings within novel candidate genes are prevalent.

    PubMed

    Powis, Zöe; Espenschied, Carin R; LaDuca, Holly; Hagman, Kelly D; Paudyal, Tripti; Li, Shuwei; Inaba, Hiroto; Mauer, Ann; Nathanson, Katherine L; Knost, James; Chao, Elizabeth C; Tang, Sha

    2018-08-01

    Clinical diagnostic exome sequencing (DES) has been effective in diagnosing individuals with suspected genetic conditions; nevertheless little has been described regarding its clinical utility in individuals with a personal and family history of cancer. This study aimed to assess diagnostic yield and clinical characteristics of pediatric and adult patients undergoing germline DES for hereditary cancer. We retrospectively reviewed 2171 patients referred for DES; cases with a personal and/or family history of cancer were further studied. Of 39 cancer patients, relevant alterations were found in eight individuals (21%), including one (3%) positive pathogenic alteration within a characterized gene, two (5%) uncertain findings in characterized genes, and five (13%) alterations in novel candidate genes. Two of the 5 pediatric patients, undergoing testing, (40%) had findings in novel candidate genes, with the remainder being negative. We include brief case studies to illustrate the variety of challenging issues related to these patients. Our observations demonstrate utility of family-based exome sequencing in patients for suspected hereditary cancer, including familial co-segregation analysis, and comprehensive medical review. DES may be particularly useful when traditional approaches do not result in a diagnosis or in families with unique phenotypes. This work also highlights the importance and complexity of analysis of uncharacterized genes in exome sequencing for hereditary cancer. Copyright © 2018 Elsevier Inc. All rights reserved.

  14. Exome sequencing of bilateral testicular germ cell tumors suggests independent development lineages.

    PubMed

    Brabrand, Sigmund; Johannessen, Bjarne; Axcrona, Ulrika; Kraggerud, Sigrid M; Berg, Kaja G; Bakken, Anne C; Bruun, Jarle; Fosså, Sophie D; Lothe, Ragnhild A; Lehne, Gustav; Skotheim, Rolf I

    2015-02-01

    Intratubular germ cell neoplasia, the precursor of testicular germ cell tumors (TGCTs), is hypothesized to arise during embryogenesis from developmentally arrested primordial germ cells (PGCs) or gonocytes. In early embryonal life, the PGCs migrate from the yolk sac to the dorsal body wall where the cell population separates before colonizing the genital ridges. However, whether the malignant transformation takes place before or after this separation is controversial. We have explored the somatic exome-wide mutational spectra of bilateral TGCT to provide novel insight into the in utero critical time frame of malignant transformation and TGCT pathogenesis. Exome sequencing was performed in five patients with bilateral TGCT (eight tumors), of these three patients in whom both tumors were available (six tumors) and two patients each with only one available tumor (two tumors). Selected loci were explored by Sanger sequencing in 71 patients with bilateral TGCT. From the exome-wide mutational spectra, no identical mutations in any of the three bilateral tumor pairs were identified. Exome sequencing of all eight tumors revealed 87 somatic non-synonymous mutations (median 10 per tumor; range 5-21), some in already known cancer genes such as CIITA, NEB, platelet-derived growth factor receptor α (PDGFRA), and WHSC1. SUPT6H was found recurrently mutated in two tumors. We suggest independent development lineages of bilateral TGCT. Thus, malignant transformation into intratubular germ cell neoplasia is likely to occur after the migration of PGCs. We reveal possible drivers of TGCT pathogenesis, such as mutated PDGFRA, potentially with therapeutic implications for TGCT patients. Copyright © 2014 Neoplasia Press, Inc. Published by Elsevier Inc. All rights reserved.

  15. Sequence data and association statistics from 12,940 type 2 diabetes cases and controls.

    PubMed

    Flannick, Jason; Fuchsberger, Christian; Mahajan, Anubha; Teslovich, Tanya M; Agarwala, Vineeta; Gaulton, Kyle J; Caulkins, Lizz; Koesterer, Ryan; Ma, Clement; Moutsianas, Loukas; McCarthy, Davis J; Rivas, Manuel A; Perry, John R B; Sim, Xueling; Blackwell, Thomas W; Robertson, Neil R; Rayner, N William; Cingolani, Pablo; Locke, Adam E; Tajes, Juan Fernandez; Highland, Heather M; Dupuis, Josee; Chines, Peter S; Lindgren, Cecilia M; Hartl, Christopher; Jackson, Anne U; Chen, Han; Huyghe, Jeroen R; van de Bunt, Martijn; Pearson, Richard D; Kumar, Ashish; Müller-Nurasyid, Martina; Grarup, Niels; Stringham, Heather M; Gamazon, Eric R; Lee, Jaehoon; Chen, Yuhui; Scott, Robert A; Below, Jennifer E; Chen, Peng; Huang, Jinyan; Go, Min Jin; Stitzel, Michael L; Pasko, Dorota; Parker, Stephen C J; Varga, Tibor V; Green, Todd; Beer, Nicola L; Day-Williams, Aaron G; Ferreira, Teresa; Fingerlin, Tasha; Horikoshi, Momoko; Hu, Cheng; Huh, Iksoo; Ikram, Mohammad Kamran; Kim, Bong-Jo; Kim, Yongkang; Kim, Young Jin; Kwon, Min-Seok; Lee, Juyoung; Lee, Selyeong; Lin, Keng-Han; Maxwell, Taylor J; Nagai, Yoshihiko; Wang, Xu; Welch, Ryan P; Yoon, Joon; Zhang, Weihua; Barzilai, Nir; Voight, Benjamin F; Han, Bok-Ghee; Jenkinson, Christopher P; Kuulasmaa, Teemu; Kuusisto, Johanna; Manning, Alisa; Ng, Maggie C Y; Palmer, Nicholette D; Balkau, Beverley; Stančáková, Alena; Abboud, Hanna E; Boeing, Heiner; Giedraitis, Vilmantas; Prabhakaran, Dorairaj; Gottesman, Omri; Scott, James; Carey, Jason; Kwan, Phoenix; Grant, George; Smith, Joshua D; Neale, Benjamin M; Purcell, Shaun; Butterworth, Adam S; Howson, Joanna M M; Lee, Heung Man; Lu, Yingchang; Kwak, Soo-Heon; Zhao, Wei; Danesh, John; Lam, Vincent K L; Park, Kyong Soo; Saleheen, Danish; So, Wing Yee; Tam, Claudia H T; Afzal, Uzma; Aguilar, David; Arya, Rector; Aung, Tin; Chan, Edmund; Navarro, Carmen; Cheng, Ching-Yu; Palli, Domenico; Correa, Adolfo; Curran, Joanne E; Rybin, Dennis; Farook, Vidya S; Fowler, Sharon P; Freedman, Barry I; Griswold, Michael; Hale, Daniel Esten; Hicks, Pamela J; Khor, Chiea-Chuen; Kumar, Satish; Lehne, Benjamin; Thuillier, Dorothée; Lim, Wei Yen; Liu, Jianjun; Loh, Marie; Musani, Solomon K; Puppala, Sobha; Scott, William R; Yengo, Loïc; Tan, Sian-Tsung; Taylor, Herman A; Thameem, Farook; Wilson, Gregory; Wong, Tien Yin; Njølstad, Pål Rasmus; Levy, Jonathan C; Mangino, Massimo; Bonnycastle, Lori L; Schwarzmayr, Thomas; Fadista, João; Surdulescu, Gabriela L; Herder, Christian; Groves, Christopher J; Wieland, Thomas; Bork-Jensen, Jette; Brandslund, Ivan; Christensen, Cramer; Koistinen, Heikki A; Doney, Alex S F; Kinnunen, Leena; Esko, Tõnu; Farmer, Andrew J; Hakaste, Liisa; Hodgkiss, Dylan; Kravic, Jasmina; Lyssenko, Valeri; Hollensted, Mette; Jørgensen, Marit E; Jørgensen, Torben; Ladenvall, Claes; Justesen, Johanne Marie; Käräjämäki, Annemari; Kriebel, Jennifer; Rathmann, Wolfgang; Lannfelt, Lars; Lauritzen, Torsten; Narisu, Narisu; Linneberg, Allan; Melander, Olle; Milani, Lili; Neville, Matt; Orho-Melander, Marju; Qi, Lu; Qi, Qibin; Roden, Michael; Rolandsson, Olov; Swift, Amy; Rosengren, Anders H; Stirrups, Kathleen; Wood, Andrew R; Mihailov, Evelin; Blancher, Christine; Carneiro, Mauricio O; Maguire, Jared; Poplin, Ryan; Shakir, Khalid; Fennell, Timothy; DePristo, Mark; de Angelis, Martin Hrabé; Deloukas, Panos; Gjesing, Anette P; Jun, Goo; Nilsson, Peter; Murphy, Jacquelyn; Onofrio, Robert; Thorand, Barbara; Hansen, Torben; Meisinger, Christa; Hu, Frank B; Isomaa, Bo; Karpe, Fredrik; Liang, Liming; Peters, Annette; Huth, Cornelia; O'Rahilly, Stephen P; Palmer, Colin N A; Pedersen, Oluf; Rauramaa, Rainer; Tuomilehto, Jaakko; Salomaa, Veikko; Watanabe, Richard M; Syvänen, Ann-Christine; Bergman, Richard N; Bharadwaj, Dwaipayan; Bottinger, Erwin P; Cho, Yoon Shin; Chandak, Giriraj R; Chan, Juliana Cn; Chia, Kee Seng; Daly, Mark J; Ebrahim, Shah B; Langenberg, Claudia; Elliott, Paul; Jablonski, Kathleen A; Lehman, Donna M; Jia, Weiping; Ma, Ronald C W; Pollin, Toni I; Sandhu, Manjinder; Tandon, Nikhil; Froguel, Philippe; Barroso, Inês; Teo, Yik Ying; Zeggini, Eleftheria; Loos, Ruth J F; Small, Kerrin S; Ried, Janina S; DeFronzo, Ralph A; Grallert, Harald; Glaser, Benjamin; Metspalu, Andres; Wareham, Nicholas J; Walker, Mark; Banks, Eric; Gieger, Christian; Ingelsson, Erik; Im, Hae Kyung; Illig, Thomas; Franks, Paul W; Buck, Gemma; Trakalo, Joseph; Buck, David; Prokopenko, Inga; Mägi, Reedik; Lind, Lars; Farjoun, Yossi; Owen, Katharine R; Gloyn, Anna L; Strauch, Konstantin; Tuomi, Tiinamaija; Kooner, Jaspal Singh; Lee, Jong-Young; Park, Taesung; Donnelly, Peter; Morris, Andrew D; Hattersley, Andrew T; Bowden, Donald W; Collins, Francis S; Atzmon, Gil; Chambers, John C; Spector, Timothy D; Laakso, Markku; Strom, Tim M; Bell, Graeme I; Blangero, John; Duggirala, Ravindranath; Tai, E Shyong; McVean, Gilean; Hanis, Craig L; Wilson, James G; Seielstad, Mark; Frayling, Timothy M; Meigs, James B; Cox, Nancy J; Sladek, Rob; Lander, Eric S; Gabriel, Stacey; Mohlke, Karen L; Meitinger, Thomas; Groop, Leif; Abecasis, Goncalo; Scott, Laura J; Morris, Andrew P; Kang, Hyun Min; Altshuler, David; Burtt, Noël P; Florez, Jose C; Boehnke, Michael; McCarthy, Mark I

    2017-12-19

    To investigate the genetic basis of type 2 diabetes (T2D) to high resolution, the GoT2D and T2D-GENES consortia catalogued variation from whole-genome sequencing of 2,657 European individuals and exome sequencing of 12,940 individuals of multiple ancestries. Over 27M SNPs, indels, and structural variants were identified, including 99% of low-frequency (minor allele frequency [MAF] 0.1-5%) non-coding variants in the whole-genome sequenced individuals and 99.7% of low-frequency coding variants in the whole-exome sequenced individuals. Each variant was tested for association with T2D in the sequenced individuals, and, to increase power, most were tested in larger numbers of individuals (>80% of low-frequency coding variants in ~82 K Europeans via the exome chip, and ~90% of low-frequency non-coding variants in ~44 K Europeans via genotype imputation). The variants, genotypes, and association statistics from these analyses provide the largest reference to date of human genetic information relevant to T2D, for use in activities such as T2D-focused genotype imputation, functional characterization of variants or genes, and other novel analyses to detect associations between sequence variation and T2D.

  16. Sequence data and association statistics from 12,940 type 2 diabetes cases and controls

    PubMed Central

    Jason, Flannick; Fuchsberger, Christian; Mahajan, Anubha; Teslovich, Tanya M.; Agarwala, Vineeta; Gaulton, Kyle J.; Caulkins, Lizz; Koesterer, Ryan; Ma, Clement; Moutsianas, Loukas; McCarthy, Davis J.; Rivas, Manuel A.; Perry, John R. B.; Sim, Xueling; Blackwell, Thomas W.; Robertson, Neil R.; Rayner, N William; Cingolani, Pablo; Locke, Adam E.; Tajes, Juan Fernandez; Highland, Heather M.; Dupuis, Josee; Chines, Peter S.; Lindgren, Cecilia M.; Hartl, Christopher; Jackson, Anne U.; Chen, Han; Huyghe, Jeroen R.; van de Bunt, Martijn; Pearson, Richard D.; Kumar, Ashish; Müller-Nurasyid, Martina; Grarup, Niels; Stringham, Heather M.; Gamazon, Eric R.; Lee, Jaehoon; Chen, Yuhui; Scott, Robert A.; Below, Jennifer E.; Chen, Peng; Huang, Jinyan; Go, Min Jin; Stitzel, Michael L.; Pasko, Dorota; Parker, Stephen C. J.; Varga, Tibor V.; Green, Todd; Beer, Nicola L.; Day-Williams, Aaron G.; Ferreira, Teresa; Fingerlin, Tasha; Horikoshi, Momoko; Hu, Cheng; Huh, Iksoo; Ikram, Mohammad Kamran; Kim, Bong-Jo; Kim, Yongkang; Kim, Young Jin; Kwon, Min-Seok; Lee, Juyoung; Lee, Selyeong; Lin, Keng-Han; Maxwell, Taylor J.; Nagai, Yoshihiko; Wang, Xu; Welch, Ryan P.; Yoon, Joon; Zhang, Weihua; Barzilai, Nir; Voight, Benjamin F.; Han, Bok-Ghee; Jenkinson, Christopher P.; Kuulasmaa, Teemu; Kuusisto, Johanna; Manning, Alisa; Ng, Maggie C. Y.; Palmer, Nicholette D.; Balkau, Beverley; Stančáková, Alena; Abboud, Hanna E.; Boeing, Heiner; Giedraitis, Vilmantas; Prabhakaran, Dorairaj; Gottesman, Omri; Scott, James; Carey, Jason; Kwan, Phoenix; Grant, George; Smith, Joshua D.; Neale, Benjamin M.; Purcell, Shaun; Butterworth, Adam S.; Howson, Joanna M. M.; Lee, Heung Man; Lu, Yingchang; Kwak, Soo-Heon; Zhao, Wei; Danesh, John; Lam, Vincent K. L.; Park, Kyong Soo; Saleheen, Danish; So, Wing Yee; Tam, Claudia H. T.; Afzal, Uzma; Aguilar, David; Arya, Rector; Aung, Tin; Chan, Edmund; Navarro, Carmen; Cheng, Ching-Yu; Palli, Domenico; Correa, Adolfo; Curran, Joanne E.; Rybin, Dennis; Farook, Vidya S.; Fowler, Sharon P.; Freedman, Barry I.; Griswold, Michael; Hale, Daniel Esten; Hicks, Pamela J.; Khor, Chiea-Chuen; Kumar, Satish; Lehne, Benjamin; Thuillier, Dorothée; Lim, Wei Yen; Liu, Jianjun; Loh, Marie; Musani, Solomon K.; Puppala, Sobha; Scott, William R.; Yengo, Loïc; Tan, Sian-Tsung; Taylor, Herman A.; Thameem, Farook; Wilson, Gregory; Wong, Tien Yin; Njølstad, Pål Rasmus; Levy, Jonathan C.; Mangino, Massimo; Bonnycastle, Lori L.; Schwarzmayr, Thomas; Fadista, João; Surdulescu, Gabriela L.; Herder, Christian; Groves, Christopher J.; Wieland, Thomas; Bork-Jensen, Jette; Brandslund, Ivan; Christensen, Cramer; Koistinen, Heikki A.; Doney, Alex S. F.; Kinnunen, Leena; Esko, Tõnu; Farmer, Andrew J.; Hakaste, Liisa; Hodgkiss, Dylan; Kravic, Jasmina; Lyssenko, Valeri; Hollensted, Mette; Jørgensen, Marit E.; Jørgensen, Torben; Ladenvall, Claes; Justesen, Johanne Marie; Käräjämäki, Annemari; Kriebel, Jennifer; Rathmann, Wolfgang; Lannfelt, Lars; Lauritzen, Torsten; Narisu, Narisu; Linneberg, Allan; Melander, Olle; Milani, Lili; Neville, Matt; Orho-Melander, Marju; Qi, Lu; Qi, Qibin; Roden, Michael; Rolandsson, Olov; Swift, Amy; Rosengren, Anders H.; Stirrups, Kathleen; Wood, Andrew R.; Mihailov, Evelin; Blancher, Christine; Carneiro, Mauricio O.; Maguire, Jared; Poplin, Ryan; Shakir, Khalid; Fennell, Timothy; DePristo, Mark; de Angelis, Martin Hrabé; Deloukas, Panos; Gjesing, Anette P.; Jun, Goo; Nilsson, Peter; Murphy, Jacquelyn; Onofrio, Robert; Thorand, Barbara; Hansen, Torben; Meisinger, Christa; Hu, Frank B.; Isomaa, Bo; Karpe, Fredrik; Liang, Liming; Peters, Annette; Huth, Cornelia; O'Rahilly, Stephen P; Palmer, Colin N. A.; Pedersen, Oluf; Rauramaa, Rainer; Tuomilehto, Jaakko; Salomaa, Veikko; Watanabe, Richard M.; Syvänen, Ann-Christine; Bergman, Richard N.; Bharadwaj, Dwaipayan; Bottinger, Erwin P.; Cho, Yoon Shin; Chandak, Giriraj R.; Chan, Juliana CN; Chia, Kee Seng; Daly, Mark J.; Ebrahim, Shah B.; Langenberg, Claudia; Elliott, Paul; Jablonski, Kathleen A.; Lehman, Donna M.; Jia, Weiping; Ma, Ronald C. W.; Pollin, Toni I.; Sandhu, Manjinder; Tandon, Nikhil; Froguel, Philippe; Barroso, Inês; Teo, Yik Ying; Zeggini, Eleftheria; Loos, Ruth J. F.; Small, Kerrin S.; Ried, Janina S.; DeFronzo, Ralph A.; Grallert, Harald; Glaser, Benjamin; Metspalu, Andres; Wareham, Nicholas J.; Walker, Mark; Banks, Eric; Gieger, Christian; Ingelsson, Erik; Im, Hae Kyung; Illig, Thomas; Franks, Paul W.; Buck, Gemma; Trakalo, Joseph; Buck, David; Prokopenko, Inga; Mägi, Reedik; Lind, Lars; Farjoun, Yossi; Owen, Katharine R.; Gloyn, Anna L.; Strauch, Konstantin; Tuomi, Tiinamaija; Kooner, Jaspal Singh; Lee, Jong-Young; Park, Taesung; Donnelly, Peter; Morris, Andrew D.; Hattersley, Andrew T.; Bowden, Donald W.; Collins, Francis S.; Atzmon, Gil; Chambers, John C.; Spector, Timothy D.; Laakso, Markku; Strom, Tim M.; Bell, Graeme I.; Blangero, John; Duggirala, Ravindranath; Tai, E. Shyong; McVean, Gilean; Hanis, Craig L.; Wilson, James G.; Seielstad, Mark; Frayling, Timothy M.; Meigs, James B.; Cox, Nancy J.; Sladek, Rob; Lander, Eric S.; Gabriel, Stacey; Mohlke, Karen L.; Meitinger, Thomas; Groop, Leif; Abecasis, Goncalo; Scott, Laura J.; Morris, Andrew P.; Kang, Hyun Min; Altshuler, David; Burtt, Noël P.; Florez, Jose C.; Boehnke, Michael; McCarthy, Mark I.

    2017-01-01

    To investigate the genetic basis of type 2 diabetes (T2D) to high resolution, the GoT2D and T2D-GENES consortia catalogued variation from whole-genome sequencing of 2,657 European individuals and exome sequencing of 12,940 individuals of multiple ancestries. Over 27M SNPs, indels, and structural variants were identified, including 99% of low-frequency (minor allele frequency [MAF] 0.1–5%) non-coding variants in the whole-genome sequenced individuals and 99.7% of low-frequency coding variants in the whole-exome sequenced individuals. Each variant was tested for association with T2D in the sequenced individuals, and, to increase power, most were tested in larger numbers of individuals (>80% of low-frequency coding variants in ~82 K Europeans via the exome chip, and ~90% of low-frequency non-coding variants in ~44 K Europeans via genotype imputation). The variants, genotypes, and association statistics from these analyses provide the largest reference to date of human genetic information relevant to T2D, for use in activities such as T2D-focused genotype imputation, functional characterization of variants or genes, and other novel analyses to detect associations between sequence variation and T2D. PMID:29257133

  17. Exome sequences of multiplex, multigenerational families reveal schizophrenia risk loci with potential implications for neurocognitive performance.

    PubMed

    Kos, Mark Z; Carless, Melanie A; Peralta, Juan; Curran, Joanne E; Quillen, Ellen E; Almeida, Marcio; Blackburn, August; Blondell, Lucy; Roalf, David R; Pogue-Geile, Michael F; Gur, Ruben C; Göring, Harald H H; Nimgaonkar, Vishwajit L; Gur, Raquel E; Almasy, Laura

    2017-12-01

    Schizophrenia is a serious mental illness, involving disruptions in thought and behavior, with a worldwide prevalence of about one percent. Although highly heritable, much of the genetic liability of schizophrenia is yet to be explained. We searched for susceptibility loci in multiplex, multigenerational families affected by schizophrenia, targeting protein-altering variation with in silico predicted functional effects. Exome sequencing was performed on 136 samples from eight European-American families, including 23 individuals diagnosed with schizophrenia or schizoaffective disorder. In total, 11,878 non-synonymous variants from 6,396 genes were tested for their association with schizophrenia spectrum disorders. Pathway enrichment analyses were conducted on gene-based test results, protein-protein interaction (PPI) networks, and epistatic effects. Using a significance threshold of FDR < 0.1, association was detected for rs10941112 (p = 2.1 × 10 -5 ; q-value = 0.073) in AMACR, a gene involved in fatty acid metabolism and previously implicated in schizophrenia, with significant cis effects on gene expression (p = 5.5 × 10 -4 ), including brain tissue data from the Genotype-Tissue Expression project (minimum p = 6.0 × 10 -5 ). A second SNP, rs10378 located in TMEM176A, also shows risk effects in the exome data (p = 2.8 × 10 -5 ; q-value = 0.073). PPIs among our top gene-based association results (p < 0.05; n = 359 genes) reveal significant enrichment of genes involved in NCAM-mediated neurite outgrowth (p = 3.0 × 10 -5 ), while exome-wide SNP-SNP interaction effects for rs10941112 and rs10378 indicate a potential role for kinase-mediated signaling involved in memory and learning. In conclusion, these association results implicate AMACR and TMEM176A in schizophrenia risk, whose effects may be modulated by genes involved in synaptic plasticity and neurocognitive performance. © 2017 Wiley Periodicals, Inc.

  18. Bovine exome sequence analysis and targeted SNP genotyping of recessive fertility defects BH1, HH2, and HH3 reveal a putative causative mutation in SMC2 for HH3.

    PubMed

    McClure, Matthew C; Bickhart, Derek; Null, Dan; Vanraden, Paul; Xu, Lingyang; Wiggans, George; Liu, George; Schroeder, Steve; Glasscock, Jarret; Armstrong, Jon; Cole, John B; Van Tassell, Curtis P; Sonstegard, Tad S

    2014-01-01

    The recent discovery of bovine haplotypes with negative effects on fertility in the Brown Swiss, Holstein, and Jersey breeds has allowed producers to identify carrier animals using commercial single nucleotide polymorphism (SNP) genotyping assays. This study was devised to identify the causative mutations underlying defective bovine embryo development contained within three of these haplotypes (Brown Swiss haplotype 1 and Holstein haplotypes 2 and 3) by combining exome capture with next generation sequencing. Of the 68,476,640 sequence variations (SV) identified, only 1,311 genome-wide SNP were concordant with the haplotype status of 21 sequenced carriers. Validation genotyping of 36 candidate SNP identified only 1 variant that was concordant to Holstein haplotype 3 (HH3), while no variants located within the refined intervals for HH2 or BH1 were concordant. The variant strictly associated with HH3 is a non-synonymous SNP (T/C) within exon 24 of the Structural Maintenance of Chromosomes 2 (SMC2) on Chromosome 8 at position 95,410,507 (UMD3.1). This polymorphism changes amino acid 1135 from phenylalanine to serine and causes a non-neutral, non-tolerated, and evolutionarily unlikely substitution within the NTPase domain of the encoded protein. Because only exome capture sequencing was used, we could not rule out the possibility that the true causative mutation for HH3 might lie in a non-exonic genomic location. Given the essential role of SMC2 in DNA repair, chromosome condensation and segregation during cell division, our findings strongly support the non-synonymous SNP (T/C) in SMC2 as the likely causative mutation. The absence of concordant variations for HH2 or BH1 suggests either the underlying causative mutations lie within a non-exomic region or in exome regions not covered by the capture array.

  19. Bovine Exome Sequence Analysis and Targeted SNP Genotyping of Recessive Fertility Defects BH1, HH2, and HH3 Reveal a Putative Causative Mutation in SMC2 for HH3

    PubMed Central

    McClure, Matthew C.; Bickhart, Derek; Null, Dan; VanRaden, Paul; Xu, Lingyang; Wiggans, George; Liu, George; Schroeder, Steve; Glasscock, Jarret; Armstrong, Jon; Cole, John B.; Van Tassell, Curtis P.; Sonstegard, Tad S.

    2014-01-01

    The recent discovery of bovine haplotypes with negative effects on fertility in the Brown Swiss, Holstein, and Jersey breeds has allowed producers to identify carrier animals using commercial single nucleotide polymorphism (SNP) genotyping assays. This study was devised to identify the causative mutations underlying defective bovine embryo development contained within three of these haplotypes (Brown Swiss haplotype 1 and Holstein haplotypes 2 and 3) by combining exome capture with next generation sequencing. Of the 68,476,640 sequence variations (SV) identified, only 1,311 genome-wide SNP were concordant with the haplotype status of 21 sequenced carriers. Validation genotyping of 36 candidate SNP identified only 1 variant that was concordant to Holstein haplotype 3 (HH3), while no variants located within the refined intervals for HH2 or BH1 were concordant. The variant strictly associated with HH3 is a non-synonymous SNP (T/C) within exon 24 of the Structural Maintenance of Chromosomes 2 (SMC2) on Chromosome 8 at position 95,410,507 (UMD3.1). This polymorphism changes amino acid 1135 from phenylalanine to serine and causes a non-neutral, non-tolerated, and evolutionarily unlikely substitution within the NTPase domain of the encoded protein. Because only exome capture sequencing was used, we could not rule out the possibility that the true causative mutation for HH3 might lie in a non-exonic genomic location. Given the essential role of SMC2 in DNA repair, chromosome condensation and segregation during cell division, our findings strongly support the non-synonymous SNP (T/C) in SMC2 as the likely causative mutation. The absence of concordant variations for HH2 or BH1 suggests either the underlying causative mutations lie within a non-exomic region or in exome regions not covered by the capture array. PMID:24667746

  20. Performance comparison of two commercial human whole-exome capture systems on formalin-fixed paraffin-embedded lung adenocarcinoma samples.

    PubMed

    Bonfiglio, Silvia; Vanni, Irene; Rossella, Valeria; Truini, Anna; Lazarevic, Dejan; Dal Bello, Maria Giovanna; Alama, Angela; Mora, Marco; Rijavec, Erika; Genova, Carlo; Cittaro, Davide; Grossi, Francesco; Coco, Simona

    2016-08-30

    Next Generation Sequencing (NGS) has become a valuable tool for molecular landscape characterization of cancer genomes, leading to a better understanding of tumor onset and progression, and opening new avenues in translational oncology. Formalin-fixed paraffin-embedded (FFPE) tissue is the method of choice for storage of clinical samples, however low quality of FFPE genomic DNA (gDNA) can limit its use for downstream applications. To investigate the FFPE specimen suitability for NGS analysis and to establish the performance of two solution-based exome capture technologies, we compared the whole-exome sequencing (WES) data of gDNA extracted from 5 fresh frozen (FF) and 5 matched FFPE lung adenocarcinoma tissues using: SeqCap EZ Human Exome v.3.0 (Roche NimbleGen) and SureSelect XT Human All Exon v.5 (Agilent Technologies). Sequencing metrics on Illumina HiSeq were optimal for both exome systems and comparable among FFPE and FF samples, with a slight increase of PCR duplicates in FFPE, mainly in Roche NimbleGen libraries. Comparison of single nucleotide variants (SNVs) between FFPE-FF pairs reached overlapping values >90 % in both systems. Both WES showed high concordance with target re-sequencing data by Ion PGM™ in 22 lung-cancer genes, regardless the source of samples. Exon coverage of 623 cancer-related genes revealed high coverage efficiency of both kits, proposing WES as a valid alternative to target re-sequencing. High-quality and reliable data can be successfully obtained from WES of FFPE samples starting from a relatively low amount of input gDNA, suggesting the inclusion of NGS-based tests into clinical contest. In conclusion, our analysis suggests that the WES approach could be extended to a translational research context as well as to the clinic (e.g. to study rare malignancies), where the simultaneous analysis of the whole coding region of the genome may help in the detection of cancer-linked variants.

  1. Evaluation of Nine Somatic Variant Callers for Detection of Somatic Mutations in Exome and Targeted Deep Sequencing Data.

    PubMed

    Krøigård, Anne Bruun; Thomassen, Mads; Lænkholm, Anne-Vibeke; Kruse, Torben A; Larsen, Martin Jakob

    2016-01-01

    Next generation sequencing is extensively applied to catalogue somatic mutations in cancer, in research settings and increasingly in clinical settings for molecular diagnostics, guiding therapy decisions. Somatic variant callers perform paired comparisons of sequencing data from cancer tissue and matched normal tissue in order to detect somatic mutations. The advent of many new somatic variant callers creates a need for comparison and validation of the tools, as no de facto standard for detection of somatic mutations exists and only limited comparisons have been reported. We have performed a comprehensive evaluation using exome sequencing and targeted deep sequencing data of paired tumor-normal samples from five breast cancer patients to evaluate the performance of nine publicly available somatic variant callers: EBCall, Mutect, Seurat, Shimmer, Indelocator, Somatic Sniper, Strelka, VarScan 2 and Virmid for the detection of single nucleotide mutations and small deletions and insertions. We report a large variation in the number of calls from the nine somatic variant callers on the same sequencing data and highly variable agreement. Sequencing depth had markedly diverse impact on individual callers, as for some callers, increased sequencing depth highly improved sensitivity. For SNV calling, we report EBCall, Mutect, Virmid and Strelka to be the most reliable somatic variant callers for both exome sequencing and targeted deep sequencing. For indel calling, EBCall is superior due to high sensitivity and robustness to changes in sequencing depths.

  2. Evaluation of Nine Somatic Variant Callers for Detection of Somatic Mutations in Exome and Targeted Deep Sequencing Data

    PubMed Central

    Krøigård, Anne Bruun; Thomassen, Mads; Lænkholm, Anne-Vibeke; Kruse, Torben A.; Larsen, Martin Jakob

    2016-01-01

    Next generation sequencing is extensively applied to catalogue somatic mutations in cancer, in research settings and increasingly in clinical settings for molecular diagnostics, guiding therapy decisions. Somatic variant callers perform paired comparisons of sequencing data from cancer tissue and matched normal tissue in order to detect somatic mutations. The advent of many new somatic variant callers creates a need for comparison and validation of the tools, as no de facto standard for detection of somatic mutations exists and only limited comparisons have been reported. We have performed a comprehensive evaluation using exome sequencing and targeted deep sequencing data of paired tumor-normal samples from five breast cancer patients to evaluate the performance of nine publicly available somatic variant callers: EBCall, Mutect, Seurat, Shimmer, Indelocator, Somatic Sniper, Strelka, VarScan 2 and Virmid for the detection of single nucleotide mutations and small deletions and insertions. We report a large variation in the number of calls from the nine somatic variant callers on the same sequencing data and highly variable agreement. Sequencing depth had markedly diverse impact on individual callers, as for some callers, increased sequencing depth highly improved sensitivity. For SNV calling, we report EBCall, Mutect, Virmid and Strelka to be the most reliable somatic variant callers for both exome sequencing and targeted deep sequencing. For indel calling, EBCall is superior due to high sensitivity and robustness to changes in sequencing depths. PMID:27002637

  3. What are Whole Exome Sequencing and Whole Genome Sequencing?

    MedlinePlus

    ... the future. For more information about DNA sequencing technologies and their use: Genetics Home Reference discusses whether ... University in St. Louis describes the different sequencing technologies and what the new technologies have meant for ...

  4. Identification of a novel PHEX mutation in a Chinese family with X-linked hypophosphatemic rickets using exome sequencing.

    PubMed

    Yuan, Lamei; Wu, Song; Xu, Hongbo; Xiao, Jingjing; Yang, Zhijian; Xia, Hong; Liu, An; Hu, Pengzhi; Lu, Anjie; Chen, Yulan; Xu, Fengping; Deng, Hao

    2015-01-01

    Familial hypophosphatemic rickets (HR), the most common inherited form of rickets, is a group of inherited renal phosphate wasting disorders characterized by growth retardation, rickets with bone deformities, osteomalacia, poor dental development, and hypophosphatemia. The purpose of this study was to identify the genetic defect responsible for familial HR in a four-generation Chinese Han pedigree by exome sequencing and Sanger sequencing. Clinical features include skeletal deformities, teeth abnormalities, hearing impairments and variable serum phosphate level in patients of this family. A novel deletion mutation, c.1553delT (p.F518Sfs*4), was identified in the X-linked phosphate regulating endopeptidase homolog gene (PHEX). The mutation is predicted to result in prematurely truncated and loss-of-function PHEX protein. Our data suggest that exome sequencing is a powerful tool to discover mutation(s) in HR, a disorder with genetic and clinical heterogeneity. The findings may also provide new insights into the cause and diagnosis of HR, and have implications for genetic counseling and clinical management.

  5. Rapid Detection of Rare Deleterious Variants by Next Generation Sequencing with Optional Microarray SNP Genotype Data

    PubMed Central

    Watson, Christopher M.; Crinnion, Laura A.; Gurgel‐Gianetti, Juliana; Harrison, Sally M.; Daly, Catherine; Antanavicuite, Agne; Lascelles, Carolina; Markham, Alexander F.; Pena, Sergio D. J.; Bonthron, David T.

    2015-01-01

    ABSTRACT Autozygosity mapping is a powerful technique for the identification of rare, autosomal recessive, disease‐causing genes. The ease with which this category of disease gene can be identified has greatly increased through the availability of genome‐wide SNP genotyping microarrays and subsequently of exome sequencing. Although these methods have simplified the generation of experimental data, its analysis, particularly when disparate data types must be integrated, remains time consuming. Moreover, the huge volume of sequence variant data generated from next generation sequencing experiments opens up the possibility of using these data instead of microarray genotype data to identify disease loci. To allow these two types of data to be used in an integrated fashion, we have developed AgileVCFMapper, a program that performs both the mapping of disease loci by SNP genotyping and the analysis of potentially deleterious variants using exome sequence variant data, in a single step. This method does not require microarray SNP genotype data, although analysis with a combination of microarray and exome genotype data enables more precise delineation of disease loci, due to superior marker density and distribution. PMID:26037133

  6. Whole-exome sequencing analysis of Waardenburg syndrome in a Chinese family.

    PubMed

    Chen, Dezhong; Zhao, Na; Wang, Jing; Li, Zhuoyu; Wu, Changxin; Fu, Jie; Xiao, Han

    2017-01-01

    Waardenburg syndrome (WS) is a dominantly inherited, genetically heterogeneous auditory-pigmentary syndrome characterized by non-progressive sensorineural hearing loss and iris discoloration. By whole-exome sequencing (WES), we identified a nonsense mutation (c.598C>T) in PAX3 gene, predicted to be disease causing by in silico analysis. This is the first report of genetically diagnosed case of WS PAX3 c.598C>T nonsense mutation in Chinese ethnic origin by WES and in silico functional prediction methods.

  7. Whole-exome sequencing analysis of Waardenburg syndrome in a Chinese family

    PubMed Central

    Chen, Dezhong; Zhao, Na; Wang, Jing; Li, Zhuoyu; Wu, Changxin; Fu, Jie; Xiao, Han

    2017-01-01

    Waardenburg syndrome (WS) is a dominantly inherited, genetically heterogeneous auditory-pigmentary syndrome characterized by non-progressive sensorineural hearing loss and iris discoloration. By whole-exome sequencing (WES), we identified a nonsense mutation (c.598C>T) in PAX3 gene, predicted to be disease causing by in silico analysis. This is the first report of genetically diagnosed case of WS PAX3 c.598C>T nonsense mutation in Chinese ethnic origin by WES and in silico functional prediction methods. PMID:28690861

  8. Genome-wide gene–gene interaction analysis for next-generation sequencing

    PubMed Central

    Zhao, Jinying; Zhu, Yun; Xiong, Momiao

    2016-01-01

    The critical barrier in interaction analysis for next-generation sequencing (NGS) data is that the traditional pairwise interaction analysis that is suitable for common variants is difficult to apply to rare variants because of their prohibitive computational time, large number of tests and low power. The great challenges for successful detection of interactions with NGS data are (1) the demands in the paradigm of changes in interaction analysis; (2) severe multiple testing; and (3) heavy computations. To meet these challenges, we shift the paradigm of interaction analysis between two SNPs to interaction analysis between two genomic regions. In other words, we take a gene as a unit of analysis and use functional data analysis techniques as dimensional reduction tools to develop a novel statistic to collectively test interaction between all possible pairs of SNPs within two genome regions. By intensive simulations, we demonstrate that the functional logistic regression for interaction analysis has the correct type 1 error rates and higher power to detect interaction than the currently used methods. The proposed method was applied to a coronary artery disease dataset from the Wellcome Trust Case Control Consortium (WTCCC) study and the Framingham Heart Study (FHS) dataset, and the early-onset myocardial infarction (EOMI) exome sequence datasets with European origin from the NHLBI's Exome Sequencing Project. We discovered that 6 of 27 pairs of significantly interacted genes in the FHS were replicated in the independent WTCCC study and 24 pairs of significantly interacted genes after applying Bonferroni correction in the EOMI study. PMID:26173972

  9. Molecular diagnosis of putative Stargardt disease probands by exome sequencing

    PubMed Central

    2012-01-01

    Background The commonest genetic form of juvenile or early adult onset macular degeneration is Stargardt Disease (STGD) caused by recessive mutations in the gene ABCA4. However, high phenotypic and allelic heterogeneity and a small but non-trivial amount of locus heterogeneity currently impede conclusive molecular diagnosis in a significant proportion of cases. Methods We performed whole exome sequencing (WES) of nine putative Stargardt Disease probands and searched for potentially disease-causing genetic variants in previously identified retinal or macular dystrophy genes. Follow-up dideoxy sequencing was performed for confirmation and to screen for mutations in an additional set of affected individuals lacking a definitive molecular diagnosis. Results Whole exome sequencing revealed seven likely disease-causing variants across four genes, providing a confident genetic diagnosis in six previously uncharacterized participants. We identified four previously missed mutations in ABCA4 across three individuals. Likely disease-causing mutations in RDS/PRPH2, ELOVL, and CRB1 were also identified. Conclusions Our findings highlight the enormous potential of whole exome sequencing in Stargardt Disease molecular diagnosis and research. WES adequately assayed all coding sequences and canonical splice sites of ABCA4 in this study. Additionally, WES enables the identification of disease-related alleles in other genes. This work highlights the importance of collecting parental genetic material for WES testing as the current knowledge of human genome variation limits the determination of causality between identified variants and disease. While larger sample sizes are required to establish the precision and accuracy of this type of testing, this study supports WES for inherited early onset macular degeneration disorders as an alternative to standard mutation screening techniques. PMID:22863181

  10. Whole exome sequencing frequently detects a monogenic cause in early onset nephrolithiasis and nephrocalcinosis.

    PubMed

    Daga, Ankana; Majmundar, Amar J; Braun, Daniela A; Gee, Heon Yung; Lawson, Jennifer A; Shril, Shirlee; Jobst-Schwan, Tilman; Vivante, Asaf; Schapiro, David; Tan, Weizhen; Warejko, Jillian K; Widmeier, Eugen; Nelson, Caleb P; Fathy, Hanan M; Gucev, Zoran; Soliman, Neveen A; Hashmi, Seema; Halbritter, Jan; Halty, Margarita; Kari, Jameela A; El-Desoky, Sherif; Ferguson, Michael A; Somers, Michael J G; Traum, Avram Z; Stein, Deborah R; Daouk, Ghaleb H; Rodig, Nancy M; Katz, Avi; Hanna, Christian; Schwaderer, Andrew L; Sayer, John A; Wassner, Ari J; Mane, Shrikant; Lifton, Richard P; Milosevic, Danko; Tasic, Velibor; Baum, Michelle A; Hildebrandt, Friedhelm

    2018-01-01

    The incidence of nephrolithiasis continues to rise. Previously, we showed that a monogenic cause could be detected in 11.4% of individuals with adult-onset nephrolithiasis or nephrocalcinosis and in 16.7-20.8% of individuals with onset before 18 years of age, using gene panel sequencing of 30 genes known to cause nephrolithiasis/nephrocalcinosis. To overcome the limitations of panel sequencing, we utilized whole exome sequencing in 51 families, who presented before age 25 years with at least one renal stone or with a renal ultrasound finding of nephrocalcinosis to identify the underlying molecular genetic cause of disease. In 15 of 51 families, we detected a monogenic causative mutation by whole exome sequencing. A mutation in seven recessive genes (AGXT, ATP6V1B1, CLDN16, CLDN19, GRHPR, SLC3A1, SLC12A1), in one dominant gene (SLC9A3R1), and in one gene (SLC34A1) with both recessive and dominant inheritance was detected. Seven of the 19 different mutations were not previously described as disease-causing. In one family, a causative mutation in one of 117 genes that may represent phenocopies of nephrolithiasis-causing genes was detected. In nine of 15 families, the genetic diagnosis may have specific implications for stone management and prevention. Several factors that correlated with the higher detection rate in our cohort were younger age at onset of nephrolithiasis/nephrocalcinosis, presence of multiple affected members in a family, and presence of consanguinity. Thus, we established whole exome sequencing as an efficient approach toward a molecular genetic diagnosis in individuals with nephrolithiasis/nephrocalcinosis who manifest before age 25 years. Copyright © 2017 International Society of Nephrology. Published by Elsevier Inc. All rights reserved.

  11. Exome sequencing results in successful riboflavin treatment of a rapidly progressive neurological condition

    PubMed Central

    Petrovski, Slavé; Shashi, Vandana; Petrou, Steven; Schoch, Kelly; McSweeney, Keisha Melodi; Dhindsa, Ryan S.; Krueger, Brian; Crimian, Rebecca; Case, Laura E.; Khalid, Roha; El-Dairi, Maysantoine A.; Jiang, Yong-Hui; Mikati, Mohamad A.; Goldstein, David B.

    2015-01-01

    Genetically targeted therapies for rare Mendelian conditions are improving patient outcomes. Here, we present the case of a 20-mo-old female suffering from a rapidly progressing neurological disorder. Although diagnosed initially with a possible autoimmune condition, analysis of the child's exome resulted in a diagnosis of Brown–Vialetto–Van Laere syndrome 2 (BVVLS2). This new diagnosis led to a change in the therapy plan from steroids and precautionary chemotherapy to high-dose riboflavin. Improvements were reported quickly, including in motor strength after 1 mo. In this case, the correct diagnosis and appropriate treatment would have been unlikely in the absence of exome sequencing and careful interpretation. This experience adds to a growing list of examples that emphasize the importance of early genome-wide diagnostics. PMID:27148561

  12. Genetic alterations in seborrheic keratoses

    PubMed Central

    Heidenreich, Barbara; Denisova, Evygenia; Rachakonda, Sivaramakrishna; Sanmartin, Onofre; Dereani, Timo; Hosen, Ismail; Nagore, Eduardo; Kumar, Rajiv

    2017-01-01

    Seborrheic keratoses are common benign epidermal lesions that are associated with increased age and sun-exposure. Those lesions despite harboring multiple somatic alterations in contrast to malignant tumors appear to be genetically stable. In order to investigate and characterize the presence of recurrent mutations, we performed exome sequencing on DNA from one seborrheic keratosis lesion and corresponding blood cells from the same patients with follow up investigation of alterations identified by exome sequencing in 24 additional lesions from as many patients. In addition we investigated alterations in all lesions at specific genes loci that included FGFR3, PIK3CA, HRAS, BRAF, CDKN2A and TERT and DHPH3 promoters. The exome sequencing data indicated three mutations per Mb of the targeted sequence. The mutational pattern depicted typical UV signature with majority of alterations being C>T and CC>TT base changes at dipyrimidinic sites. The FGFR3 mutations were the most frequent, detected in 12 of 25 (48%) lesions, followed by the PIK3CA (32%), TERT promoter (24%) and DPH3 promoter mutations (24%). TERT promoter mutations associated with increased age and were present mainly in the lesions excised from head and neck. Three lesions also carried alterations in CDKN2A. FGFR3, TERT and DPH3 expression did not correlate with mutations in the respective genes and promoters; however, increased FGFR3 transcript levels were associated with increased FOXN1 levels, a suggested positive feedback loop that stalls malignant progression. Thus, in this study we report overall mutation rate through exome sequencing and show the most frequent mutations seborrheic keratosis. PMID:28410231

  13. Rare deleterious mutations are associated with disease in bipolar disorder families.

    PubMed

    Rao, A R; Yourshaw, M; Christensen, B; Nelson, S F; Kerner, B

    2017-07-01

    Bipolar disorder (BD) is a common, complex and heritable psychiatric disorder characterized by episodes of severe mood swings. The identification of rare, damaging genomic mutations in families with BD could inform about disease mechanisms and lead to new therapeutic interventions. To determine whether rare, damaging mutations shared identity-by-descent in families with BD could be associated with disease, exome sequencing was performed in multigenerational families of the NIMH BD Family Study followed by in silico functional prediction. Disease association and disease specificity was determined using 5090 exomes from the Sweden-Schizophrenia (SZ) Population-Based Case-Control Exome Sequencing study. We identified 14 rare and likely deleterious mutations in 14 genes that were shared identity-by-descent among affected family members. The variants were associated with BD (P<0.05 after Bonferroni's correction) and disease specificity was supported by the absence of the mutations in patients with SZ. In addition, we found rare, functional mutations in known causal genes for neuropsychiatric disorders including holoprosencephaly and epilepsy. Our results demonstrate that exome sequencing in multigenerational families with BD is effective in identifying rare genomic variants of potential clinical relevance and also disease modifiers related to coexisting medical conditions. Replication of our results and experimental validation are required before disease causation could be assumed.

  14. CanvasDB: a local database infrastructure for analysis of targeted- and whole genome re-sequencing projects

    PubMed Central

    Ameur, Adam; Bunikis, Ignas; Enroth, Stefan; Gyllensten, Ulf

    2014-01-01

    CanvasDB is an infrastructure for management and analysis of genetic variants from massively parallel sequencing (MPS) projects. The system stores SNP and indel calls in a local database, designed to handle very large datasets, to allow for rapid analysis using simple commands in R. Functional annotations are included in the system, making it suitable for direct identification of disease-causing mutations in human exome- (WES) or whole-genome sequencing (WGS) projects. The system has a built-in filtering function implemented to simultaneously take into account variant calls from all individual samples. This enables advanced comparative analysis of variant distribution between groups of samples, including detection of candidate causative mutations within family structures and genome-wide association by sequencing. In most cases, these analyses are executed within just a matter of seconds, even when there are several hundreds of samples and millions of variants in the database. We demonstrate the scalability of canvasDB by importing the individual variant calls from all 1092 individuals present in the 1000 Genomes Project into the system, over 4.4 billion SNPs and indels in total. Our results show that canvasDB makes it possible to perform advanced analyses of large-scale WGS projects on a local server. Database URL: https://github.com/UppsalaGenomeCenter/CanvasDB PMID:25281234

  15. CanvasDB: a local database infrastructure for analysis of targeted- and whole genome re-sequencing projects.

    PubMed

    Ameur, Adam; Bunikis, Ignas; Enroth, Stefan; Gyllensten, Ulf

    2014-01-01

    CanvasDB is an infrastructure for management and analysis of genetic variants from massively parallel sequencing (MPS) projects. The system stores SNP and indel calls in a local database, designed to handle very large datasets, to allow for rapid analysis using simple commands in R. Functional annotations are included in the system, making it suitable for direct identification of disease-causing mutations in human exome- (WES) or whole-genome sequencing (WGS) projects. The system has a built-in filtering function implemented to simultaneously take into account variant calls from all individual samples. This enables advanced comparative analysis of variant distribution between groups of samples, including detection of candidate causative mutations within family structures and genome-wide association by sequencing. In most cases, these analyses are executed within just a matter of seconds, even when there are several hundreds of samples and millions of variants in the database. We demonstrate the scalability of canvasDB by importing the individual variant calls from all 1092 individuals present in the 1000 Genomes Project into the system, over 4.4 billion SNPs and indels in total. Our results show that canvasDB makes it possible to perform advanced analyses of large-scale WGS projects on a local server. Database URL: https://github.com/UppsalaGenomeCenter/CanvasDB. © The Author(s) 2014. Published by Oxford University Press.

  16. Analysis of protein-coding genetic variation in 60,706 humans.

    PubMed

    Lek, Monkol; Karczewski, Konrad J; Minikel, Eric V; Samocha, Kaitlin E; Banks, Eric; Fennell, Timothy; O'Donnell-Luria, Anne H; Ware, James S; Hill, Andrew J; Cummings, Beryl B; Tukiainen, Taru; Birnbaum, Daniel P; Kosmicki, Jack A; Duncan, Laramie E; Estrada, Karol; Zhao, Fengmei; Zou, James; Pierce-Hoffman, Emma; Berghout, Joanne; Cooper, David N; Deflaux, Nicole; DePristo, Mark; Do, Ron; Flannick, Jason; Fromer, Menachem; Gauthier, Laura; Goldstein, Jackie; Gupta, Namrata; Howrigan, Daniel; Kiezun, Adam; Kurki, Mitja I; Moonshine, Ami Levy; Natarajan, Pradeep; Orozco, Lorena; Peloso, Gina M; Poplin, Ryan; Rivas, Manuel A; Ruano-Rubio, Valentin; Rose, Samuel A; Ruderfer, Douglas M; Shakir, Khalid; Stenson, Peter D; Stevens, Christine; Thomas, Brett P; Tiao, Grace; Tusie-Luna, Maria T; Weisburd, Ben; Won, Hong-Hee; Yu, Dongmei; Altshuler, David M; Ardissino, Diego; Boehnke, Michael; Danesh, John; Donnelly, Stacey; Elosua, Roberto; Florez, Jose C; Gabriel, Stacey B; Getz, Gad; Glatt, Stephen J; Hultman, Christina M; Kathiresan, Sekar; Laakso, Markku; McCarroll, Steven; McCarthy, Mark I; McGovern, Dermot; McPherson, Ruth; Neale, Benjamin M; Palotie, Aarno; Purcell, Shaun M; Saleheen, Danish; Scharf, Jeremiah M; Sklar, Pamela; Sullivan, Patrick F; Tuomilehto, Jaakko; Tsuang, Ming T; Watkins, Hugh C; Wilson, James G; Daly, Mark J; MacArthur, Daniel G

    2016-08-18

    Large-scale reference data sets of human genetic variation are critical for the medical and functional interpretation of DNA sequence changes. Here we describe the aggregation and analysis of high-quality exome (protein-coding region) DNA sequence data for 60,706 individuals of diverse ancestries generated as part of the Exome Aggregation Consortium (ExAC). This catalogue of human genetic diversity contains an average of one variant every eight bases of the exome, and provides direct evidence for the presence of widespread mutational recurrence. We have used this catalogue to calculate objective metrics of pathogenicity for sequence variants, and to identify genes subject to strong selection against various classes of mutation; identifying 3,230 genes with near-complete depletion of predicted protein-truncating variants, with 72% of these genes having no currently established human disease phenotype. Finally, we demonstrate that these data can be used for the efficient filtering of candidate disease-causing variants, and for the discovery of human 'knockout' variants in protein-coding genes.

  17. Myopathic mtDNA Depletion Syndrome Due to Mutation in TK2 Gene.

    PubMed

    Martín-Hernández, Elena; García-Silva, María Teresa; Quijada-Fraile, Pilar; Rodríguez-García, María Elena; Rivera, Henry; Hernández-Laín, Aurelio; Coca-Robinot, David; Fernández-Toral, Joaquín; Arenas, Joaquín; Martín, Miguel A; Martínez-Azorín, Francisco

    2017-01-01

    Whole-exome sequencing was used to identify the disease gene(s) in a Spanish girl with failure to thrive, muscle weakness, mild facial weakness, elevated creatine kinase, deficiency of mitochondrial complex III and depletion of mtDNA. With whole-exome sequencing data, it was possible to get the whole mtDNA sequencing and discard any pathogenic variant in this genome. The analysis of whole exome uncovered a homozygous pathogenic mutation in thymidine kinase 2 gene ( TK2; NM_004614.4:c.323 C>T, p.T108M). TK2 mutations have been identified mainly in patients with the myopathic form of mtDNA depletion syndromes. This patient presents an atypical TK2-related myopathic form of mtDNA depletion syndromes, because despite having a very low content of mtDNA (<20%), she presents a slower and less severe evolution of the disease. In conclusion, our data confirm the role of TK2 gene in mtDNA depletion syndromes and expanded the phenotypic spectrum.

  18. Genetic screening and diagnosis in epilepsy?

    PubMed

    Sisodiya, Sanjay M

    2015-04-01

    Genetic discovery has been extremely rapid over the last year, with many new discoveries illuminating novel mechanisms and pathways. In particular, the application of whole exome and whole genome sequencing has identified many new genetic causes of the epilepsies. As such methods become increasingly available, it will be critical for practicing neurologists to be acquainted with them. This review surveys some important developments over the last year. The range of tests available to the clinician is wide, and likely soon to be dominated by whole exome and whole genome sequencing. Both whole exome and whole genome sequencing have usually proven to be more powerful than most existing tests. Many new genes have been implicated in the epilepsies, with emerging evidence of the involvement of particular multigene pathways. For the practicing clinician, it will be important to appreciate progress in the field, and to prepare for the application of novel genetic testing in clinical practice, as genetic data are likely to contribute importantly for many people with epilepsy.

  19. Exome sequencing establishes a gelsolin mutation as the cause of inherited bulbar-onset neuropathy.

    PubMed

    Caress, James B; Johnson, Janel O; Abramzon, Yevgeniya A; Hawkins, Gregory A; Gibbs, J Raphael; Sullivan, Elizabeth A; Chahal, Chamanpreet S; Traynor, Bryan J

    2017-11-01

    Progressive bulbar motor neuropathy is primarily caused by bulbar-onset ALS. Hereditary amyloidosis type IV also presents with a bulbar neuropathy that mimics motor neuron disease. The disease is prevalent in Finland only and is not commonly included in the differential diagnosis of ALS. We studied 18 members of a family in which some had bulbar motor neuropathy, and we performed exome sequencing. Five affected family members were found to have a D187Y substitution in the GSN gene known to cause hereditary amyloidosis type IV. This American family presented with progressive bulbar neuropathy due to a gelsolin mutation not found in Finland. Hereditary amyloidosis type IV presents with bulbar motor neuropathy and not with peripheral neuropathy as occurs with common forms of amyloidosis. This report demonstrates the power of exome sequencing to determine the cause of rare hereditary diseases with incomplete or atypical phenotypes. Muscle Nerve 56: 1001-1005, 2017. © 2016 Wiley Periodicals, Inc.

  20. Evaluating information content of SNPs for sample-tagging in re-sequencing projects.

    PubMed

    Hu, Hao; Liu, Xiang; Jin, Wenfei; Hilger Ropers, H; Wienker, Thomas F

    2015-05-15

    Sample-tagging is designed for identification of accidental sample mix-up, which is a major issue in re-sequencing studies. In this work, we develop a model to measure the information content of SNPs, so that we can optimize a panel of SNPs that approach the maximal information for discrimination. The analysis shows that as low as 60 optimized SNPs can differentiate the individuals in a population as large as the present world, and only 30 optimized SNPs are in practice sufficient in labeling up to 100 thousand individuals. In the simulated populations of 100 thousand individuals, the average Hamming distances, generated by the optimized set of 30 SNPs are larger than 18, and the duality frequency, is lower than 1 in 10 thousand. This strategy of sample discrimination is proved robust in large sample size and different datasets. The optimized sets of SNPs are designed for Whole Exome Sequencing, and a program is provided for SNP selection, allowing for customized SNP numbers and interested genes. The sample-tagging plan based on this framework will improve re-sequencing projects in terms of reliability and cost-effectiveness.

  1. Whole-exome sequencing of 228 patients with sporadic Parkinson's disease.

    PubMed

    Sandor, Cynthia; Honti, Frantisek; Haerty, Wilfried; Szewczyk-Krolikowski, Konrad; Tomlinson, Paul; Evetts, Sam; Millin, Stephanie; Keane, Thomas; McCarthy, Shane A; Durbin, Richard; Talbot, Kevin; Hu, Michele; Webber, Caleb; Ponting, Chris P; Wade-Martins, Richard

    2017-01-24

    Parkinson's disease (PD) is the most common neurodegenerative movement disorder, affecting 1% of the population over 65 years characterized clinically by both motor and non-motor symptoms accompanied by the preferential loss of dopamine neurons in the substantia nigra pars compacta. Here, we sequenced the exomes of 244 Parkinson's patients selected from the Oxford Parkinson's Disease Centre Discovery Cohort and, after quality control, 228 exomes were available for analyses. The PD patient exomes were compared to 884 control exomes selected from the UK10K datasets. No single non-synonymous (NS) single nucleotide variant (SNV) nor any gene carrying a higher burden of NS SNVs was significantly associated with PD status after multiple-testing correction. However, significant enrichments of genes whose proteins have roles in the extracellular matrix were amongst the top 300 genes with the most significantly associated NS SNVs, while regions associated with PD by a recent Genome Wide Association (GWA) study were enriched in genes containing PD-associated NS SNVs. By examining genes within GWA regions possessing rare PD-associated SNVs, we identified RAD51B. The protein-product of RAD51B interacts with that of its paralogue RAD51, which is associated with congenital mirror movements phenotypes, a phenotype also comorbid with PD.

  2. Challenges imposed by minor reference alleles on the identification and reporting of clinical variants from exome data.

    PubMed

    Koko, Mahmoud; Abdallah, Mohammed O E; Amin, Mutaz; Ibrahim, Muntaser

    2018-01-15

    The conventional variant calling of pathogenic alleles in exome and genome sequencing requires the presence of the non-pathogenic alleles as genome references. This hinders the correct identification of variants with minor and/or pathogenic reference alleles warranting additional approaches for variant calling. More than 26,000 Exome Aggregation Consortium (ExAC) variants have a minor reference allele including variants with known ClinVar disease alleles. For instance, in a number of variants related to clotting disorders, the phenotype-associated allele is a human genome reference allele (rs6025, rs6003, rs1799983, and rs2227564 using the assembly hg19). We highlighted how the current variant calling standards miss homozygous reference disease variants in these sites and provided a bioinformatic panel that can be used to screen these variants using commonly available variant callers. We present exome sequencing results from an individual with venous thrombosis to emphasize how pathogenic alleles in clinically relevant variants escape variant calling while non-pathogenic alleles are detected. This article highlights the importance of specialized variant calling strategies in clinical variants with minor reference alleles especially in the context of personal genomes and exomes. We provide here a simple strategy to screen potential disease-causing variants when present in homozygous reference state.

  3. Exome Pool-Seq in neurodevelopmental disorders.

    PubMed

    Popp, Bernt; Ekici, Arif B; Thiel, Christian T; Hoyer, Juliane; Wiesener, Antje; Kraus, Cornelia; Reis, André; Zweier, Christiane

    2017-12-01

    High throughput sequencing has greatly advanced disease gene identification, especially in heterogeneous entities. Despite falling costs this is still an expensive and laborious technique, particularly when studying large cohorts. To address this problem we applied Exome Pool-Seq as an economic and fast screening technology in neurodevelopmental disorders (NDDs). Sequencing of 96 individuals can be performed in eight pools of 12 samples on less than one Illumina sequencer lane. In a pilot study with 96 cases we identified 27 variants, likely or possibly affecting function. Twenty five of these were identified in 923 established NDD genes (based on SysID database, status November 2016) (ACTB, AHDC1, ANKRD11, ATP6V1B2, ATRX, CASK, CHD8, GNAS, IFIH1, KCNQ2, KMT2A, KRAS, MAOA, MED12, MED13L, RIT1, SETD5, SIN3A, TCF4, TRAPPC11, TUBA1A, WAC, ZBTB18, ZMYND11), two in 543 (SysID) candidate genes (ZNF292, BPTF), and additionally a de novo loss-of-function variant in LRRC7, not previously implicated in NDDs. Most of them were confirmed to be de novo, but we also identified X-linked or autosomal-dominantly or autosomal-recessively inherited variants. With a detection rate of 28%, Exome Pool-Seq achieves comparable results to individual exome analyses but reduces costs by >85%. Compared with other large scale approaches using Molecular Inversion Probes (MIP) or gene panels, it allows flexible re-analysis of data. Exome Pool-Seq is thus well suited for large-scale, cost-efficient and flexible screening in characterized but heterogeneous entities like NDDs.

  4. Molecular diagnosis of glycogen storage disease and disorders with overlapping clinical symptoms by massive parallel sequencing.

    PubMed

    Vega, Ana I; Medrano, Celia; Navarrete, Rosa; Desviat, Lourdes R; Merinero, Begoña; Rodríguez-Pombo, Pilar; Vitoria, Isidro; Ugarte, Magdalena; Pérez-Cerdá, Celia; Pérez, Belen

    2016-10-01

    Glycogen storage disease (GSD) is an umbrella term for a group of genetic disorders that involve the abnormal metabolism of glycogen; to date, 23 types of GSD have been identified. The nonspecific clinical presentation of GSD and the lack of specific biomarkers mean that Sanger sequencing is now widely relied on for making a diagnosis. However, this gene-by-gene sequencing technique is both laborious and costly, which is a consequence of the number of genes to be sequenced and the large size of some genes. This work reports the use of massive parallel sequencing to diagnose patients at our laboratory in Spain using either a customized gene panel (targeted exome sequencing) or the Illumina Clinical-Exome TruSight One Gene Panel (clinical exome sequencing (CES)). Sequence variants were matched against biochemical and clinical hallmarks. Pathogenic mutations were detected in 23 patients. Twenty-two mutations were recognized (mostly loss-of-function mutations), including 11 that were novel in GSD-associated genes. In addition, CES detected five patients with mutations in ALDOB, LIPA, NKX2-5, CPT2, or ANO5. Although these genes are not involved in GSD, they are associated with overlapping phenotypic characteristics such as hepatic, muscular, and cardiac dysfunction. These results show that next-generation sequencing, in combination with the detection of biochemical and clinical hallmarks, provides an accurate, high-throughput means of making genetic diagnoses of GSD and related diseases.Genet Med 18 10, 1037-1043.

  5. A population of deletion mutants and an integrated mapping and Exome-seq pipeline for gene discovery in maize

    USDA-ARS?s Scientific Manuscript database

    To better understand maize endosperm filling and maturation, we developed a novel functional genomics platform that combined Bulked Segregant RNA and Exome sequencing (BSREx-seq) to map causative mutations and identify candidate genes within mapping intervals. Using gamma-irradiation of B73 maize to...

  6. High-depth, high-accuracy microsatellite genotyping enables precision lung cancer risk classification

    PubMed Central

    Velmurugan, K R; Varghese, R T; Fonville, N C; Garner, H R

    2017-01-01

    There remains a large discrepancy between the known genetic contributions to cancer and that which can be explained by genomic variants, both inherited and somatic. Recently, understudied repetitive DNA regions called microsatellites have been identified as genetic risk markers for a number of diseases including various cancers (breast, ovarian and brain). In this study, we demonstrate an integrated process for identifying and further evaluating microsatellite-based risk markers for lung cancer using data from the cancer genome atlas and the 1000 genomes project. Comparing whole-exome germline sequencing data from 488 TCGA lung cancer samples to germline exome data from 390 control samples from the 1000 genomes project, we identified 119 potentially informative microsatellite loci. These loci were found to be able to distinguish between cancer and control samples with sensitivity and specificity ratios over 0.8. Then these loci, supplemented with additional loci from other cancers and controls, were evaluated using a target enrichment kit and sample-multiplexed nextgen sequencing. Thirteen of the 119 risk markers were found to be informative in a well powered study (>0.99 for a 0.95 confidence interval) using high-depth (579x±315) nextgen sequencing of 30 lung cancer and 89 control samples, resulting in sensitivity and specificity ratios of 0.90 and 0.94, respectively. When 8 loci harvested from the bioinformatic analysis of other cancers are added to the classifier, then the sensitivity and specificity rise to 0.93 and 0.97, respectively. Analysis of the genes harboring these loci revealed two genes (ARID1B and REL) and two significantly enriched pathways (chromatin organization and cellular stress response) suggesting that the process of lung carcinogenesis is linked to chromatin remodeling, inflammation, and tumor microenvironment restructuring. We illustrate that high-depth sequencing enables a high-precision microsatellite-based risk classifier analysis approach. This microsatellite-based platform confirms the potential to create clinically actionable diagnostics for lung cancer. PMID:28759038

  7. High-depth, high-accuracy microsatellite genotyping enables precision lung cancer risk classification.

    PubMed

    Velmurugan, K R; Varghese, R T; Fonville, N C; Garner, H R

    2017-11-16

    There remains a large discrepancy between the known genetic contributions to cancer and that which can be explained by genomic variants, both inherited and somatic. Recently, understudied repetitive DNA regions called microsatellites have been identified as genetic risk markers for a number of diseases including various cancers (breast, ovarian and brain). In this study, we demonstrate an integrated process for identifying and further evaluating microsatellite-based risk markers for lung cancer using data from the cancer genome atlas and the 1000 genomes project. Comparing whole-exome germline sequencing data from 488 TCGA lung cancer samples to germline exome data from 390 control samples from the 1000 genomes project, we identified 119 potentially informative microsatellite loci. These loci were found to be able to distinguish between cancer and control samples with sensitivity and specificity ratios over 0.8. Then these loci, supplemented with additional loci from other cancers and controls, were evaluated using a target enrichment kit and sample-multiplexed nextgen sequencing. Thirteen of the 119 risk markers were found to be informative in a well powered study (>0.99 for a 0.95 confidence interval) using high-depth (579x±315) nextgen sequencing of 30 lung cancer and 89 control samples, resulting in sensitivity and specificity ratios of 0.90 and 0.94, respectively. When 8 loci harvested from the bioinformatic analysis of other cancers are added to the classifier, then the sensitivity and specificity rise to 0.93 and 0.97, respectively. Analysis of the genes harboring these loci revealed two genes (ARID1B and REL) and two significantly enriched pathways (chromatin organization and cellular stress response) suggesting that the process of lung carcinogenesis is linked to chromatin remodeling, inflammation, and tumor microenvironment restructuring. We illustrate that high-depth sequencing enables a high-precision microsatellite-based risk classifier analysis approach. This microsatellite-based platform confirms the potential to create clinically actionable diagnostics for lung cancer.

  8. Targeted exome sequencing of suspected mitochondrial disorders

    PubMed Central

    Lieber, Daniel S.; Calvo, Sarah E.; Shanahan, Kristy; Slate, Nancy G.; Liu, Shangtao; Hershman, Steven G.; Gold, Nina B.; Chapman, Brad A.; Thorburn, David R.; Berry, Gerard T.; Schmahmann, Jeremy D.; Borowsky, Mark L.; Mueller, David M.; Sims, Katherine B.

    2013-01-01

    Objective: To evaluate the utility of targeted exome sequencing for the molecular diagnosis of mitochondrial disorders, which exhibit marked phenotypic and genetic heterogeneity. Methods: We considered a diverse set of 102 patients with suspected mitochondrial disorders based on clinical, biochemical, and/or molecular findings, and whose disease ranged from mild to severe, with varying age at onset. We sequenced the mitochondrial genome (mtDNA) and the exons of 1,598 nuclear-encoded genes implicated in mitochondrial biology, mitochondrial disease, or monogenic disorders with phenotypic overlap. We prioritized variants likely to underlie disease and established molecular diagnoses in accordance with current clinical genetic guidelines. Results: Targeted exome sequencing yielded molecular diagnoses in established disease loci in 22% of cases, including 17 of 18 (94%) with prior molecular diagnoses and 5 of 84 (6%) without. The 5 new diagnoses implicated 2 genes associated with canonical mitochondrial disorders (NDUFV1, POLG2), and 3 genes known to underlie other neurologic disorders (DPYD, KARS, WFS1), underscoring the phenotypic and biochemical overlap with other inborn errors. We prioritized variants in an additional 26 patients, including recessive, X-linked, and mtDNA variants that were enriched 2-fold over background and await further support of pathogenicity. In one case, we modeled patient mutations in yeast to provide evidence that recessive mutations in ATP5A1 can underlie combined respiratory chain deficiency. Conclusion: The results demonstrate that targeted exome sequencing is an effective alternative to the sequential testing of mtDNA and individual nuclear genes as part of the investigation of mitochondrial disease. Our study underscores the ongoing challenge of variant interpretation in the clinical setting. PMID:23596069

  9. Molecular Genetics of the Usher Syndrome in Lebanon: Identification of 11 Novel Protein Truncating Mutations by Whole Exome Sequencing

    PubMed Central

    Reddy, Ramesh; Fahiminiya, Somayyeh; El Zir, Elie; Mansour, Ahmad; Megarbane, Andre; Majewski, Jacek; Slim, Rima

    2014-01-01

    Background Usher syndrome (USH) is a genetically heterogeneous condition with ten disease-causing genes. The spectrum of genes and mutations causing USH in the Lebanese and Middle Eastern populations has not been described. Consequently, diagnostic approaches designed to screen for previously reported mutations were unlikely to identify the mutations in 11 unrelated families, eight of Lebanese and three of Middle Eastern origins. In addition, six of the ten USH genes consist of more than 20 exons, each, which made mutational analysis by Sanger sequencing of PCR-amplified exons from genomic DNA tedious and costly. The study was aimed at the identification of USH causing genes and mutations in 11 unrelated families with USH type I or II. Methods Whole exome sequencing followed by expanded familial validation by Sanger sequencing. Results We identified disease-causing mutations in all the analyzed patients in four USH genes, MYO7A, USH2A, GPR98 and CDH23. Eleven of the mutations were novel and protein truncating, including a complex rearrangement in GPR98. Conclusion Our data highlight the genetic diversity of Usher syndrome in the Lebanese population and the time and cost-effectiveness of whole exome sequencing approach for mutation analysis of genetically heterogeneous conditions caused by large genes. PMID:25211151

  10. Molecular genetics of the Usher syndrome in Lebanon: identification of 11 novel protein truncating mutations by whole exome sequencing.

    PubMed

    Reddy, Ramesh; Fahiminiya, Somayyeh; El Zir, Elie; Mansour, Ahmad; Megarbane, Andre; Majewski, Jacek; Slim, Rima

    2014-01-01

    Usher syndrome (USH) is a genetically heterogeneous condition with ten disease-causing genes. The spectrum of genes and mutations causing USH in the Lebanese and Middle Eastern populations has not been described. Consequently, diagnostic approaches designed to screen for previously reported mutations were unlikely to identify the mutations in 11 unrelated families, eight of Lebanese and three of Middle Eastern origins. In addition, six of the ten USH genes consist of more than 20 exons, each, which made mutational analysis by Sanger sequencing of PCR-amplified exons from genomic DNA tedious and costly. The study was aimed at the identification of USH causing genes and mutations in 11 unrelated families with USH type I or II. Whole exome sequencing followed by expanded familial validation by Sanger sequencing. We identified disease-causing mutations in all the analyzed patients in four USH genes, MYO7A, USH2A, GPR98 and CDH23. Eleven of the mutations were novel and protein truncating, including a complex rearrangement in GPR98. Our data highlight the genetic diversity of Usher syndrome in the Lebanese population and the time and cost-effectiveness of whole exome sequencing approach for mutation analysis of genetically heterogeneous conditions caused by large genes.

  11. Exome Sequencing Fails to Identify the Genetic Cause of Aicardi Syndrome.

    PubMed

    Lund, Caroline; Striano, Pasquale; Sorte, Hanne Sørmo; Parisi, Pasquale; Iacomino, Michele; Sheng, Ying; Vigeland, Magnus D; Øye, Anne-Marte; Møller, Rikke Steensbjerre; Selmer, Kaja K; Zara, Federico

    2016-09-01

    Aicardi syndrome (AS) is a well-characterized neurodevelopmental disorder with an unknown etiology. In this study, we performed whole-exome sequencing in 11 female patients with the diagnosis of AS, in order to identify the disease-causing gene. In particular, we focused on detecting variants in the X chromosome, including the analysis of variants with a low number of sequencing reads, in case of somatic mosaicism. For 2 of the patients, we also sequenced the exome of the parents to search for de novo mutations. We did not identify any genetic variants likely to be damaging. Only one single missense variant was identified by the de novo analyses of the 2 trios, and this was considered benign. The failure to identify a disease gene in this study may be due to technical limitations of our study design, including the possibility that the genetic aberration leading to AS is situated in a non-exonic region or that the mutation is somatic and not detectable by our approach. Alternatively, it is possible that AS is genetically heterogeneous and that 11 patients are not sufficient to reveal the causative genes. Future studies of AS should consider designs where also non-exonic regions are explored and apply a sequencing depth so that also low-grade somatic mosaicism can be detected.

  12. A prospective evaluation of whole-exome sequencing as a first-tier molecular test in infants with suspected monogenic disorders.

    PubMed

    Stark, Zornitza; Tan, Tiong Y; Chong, Belinda; Brett, Gemma R; Yap, Patrick; Walsh, Maie; Yeung, Alison; Peters, Heidi; Mordaunt, Dylan; Cowie, Shannon; Amor, David J; Savarirayan, Ravi; McGillivray, George; Downie, Lilian; Ekert, Paul G; Theda, Christiane; James, Paul A; Yaplito-Lee, Joy; Ryan, Monique M; Leventer, Richard J; Creed, Emma; Macciocca, Ivan; Bell, Katrina M; Oshlack, Alicia; Sadedin, Simon; Georgeson, Peter; Anderson, Charlotte; Thorne, Natalie; Melbourne Genomics Health Alliance; Gaff, Clara; White, Susan M

    2016-11-01

    To prospectively evaluate the diagnostic and clinical utility of singleton whole-exome sequencing (WES) as a first-tier test in infants with suspected monogenic disease. Singleton WES was performed as a first-tier sequencing test in infants recruited from a single pediatric tertiary center. This occurred in parallel with standard investigations, including single- or multigene panel sequencing when clinically indicated. The diagnosis rate, clinical utility, and impact on management of singleton WES were evaluated. Of 80 enrolled infants, 46 received a molecular genetic diagnosis through singleton WES (57.5%) compared with 11 (13.75%) who underwent standard investigations in the same patient group. Clinical management changed following exome diagnosis in 15 of 46 diagnosed participants (32.6%). Twelve relatives received a genetic diagnosis following cascade testing, and 28 couples were identified as being at high risk of recurrence in future pregnancies. This prospective study provides strong evidence for increased diagnostic and clinical utility of singleton WES as a first-tier sequencing test for infants with a suspected monogenic disorder. Singleton WES outperformed standard care in terms of diagnosis rate and the benefits of a diagnosis, namely, impact on management of the child and clarification of reproductive risks for the extended family in a timely manner.Genet Med 18 11, 1090-1096.

  13. Whole-exome sequencing reveals genetic variants associated with chronic kidney disease characterized by tubulointerstitial damages in North Central Region, Sri Lanka.

    PubMed

    Nanayakkara, Shanika; Senevirathna, S T M L D; Parahitiyawa, Nipuna B; Abeysekera, Tilak; Chandrajith, Rohana; Ratnatunga, Neelakanthi; Hitomi, Toshiaki; Kobayashi, Hatasu; Harada, Kouji H; Koizumi, Akio

    2015-09-01

    The familial clustering observed in chronic kidney disease of uncertain etiology (CKDu) characterized by tubulointerstitial damages in the North Central Region of Sri Lanka strongly suggests the involvement of genetic factors in its pathogenesis. The objective of the present study is to use whole-exome sequencing to identify the genetic variants associated with CKDu. Whole-exome sequencing of eight CKDu cases and eight controls was performed, followed by direct sequencing of candidate loci in 301 CKDu cases and 276 controls. Association study revealed rs34970857 (c.658G > A/p.V220M) located in the KCNA10 gene encoding a voltage-gated K channel as the most promising SNP with the highest odds ratio of 1.74. Four rare variants were identified in gene encoding Laminin beta2 (LAMB2) which is known to cause congenital nephrotic syndrome. Three out of four variants in LAMB2 were novel variants found exclusively in cases. Genetic investigations provide strong evidence on the presence of genetic susceptibility for CKDu. Possibility of presence of several rare variants associated with CKDu in this population is also suggested.

  14. Exome capture sequencing reveals new insights into hepatitis B virus-induced hepatocellular carcinoma at the early stage of tumorigenesis.

    PubMed

    Chen, Yong; Wang, Lijuan; Xu, Hexiang; Liu, Xingxiang; Zhao, Yingren

    2013-10-01

    Hepatocellular carcinoma (HCC), the most common type of liver cancer, is the third primary cause of cancer-related mortality worldwide. The molecular mechanisms underlying the initiation and formation of HCC remain obscure. In the present study, we performed exome sequencing using tumor and normal tissues from 3 hepatitis B virus (HBV)-positive BCLC stage A HCC patients. Bioinformatic analysis was performed to find candidate protein-altering somatic mutations. Eighty damaging mutations were validated and 59 genes were reported to be mutated in HBV-related HCCs for the first time here. Further analysis using whole genome sequencing (WGS) data of 88 HBV-related HCC patients from the European Genome-phenome Archive database showed that mutations in 33 of the 59 genes were also detected in other samples. Variants of two newly found genes, ZNF717 and PARP4, were detected in more than 10% of the WGS samples. Several other genes, such as FLNA and CNTN2, are also noteworthy. Thus, the exome sequencing analysis of three BCLC stage A patients provides new insights into the molecular events governing the early steps of HBV-induced HCC tumorigenesis.

  15. Exome Sequencing Analysis Reveals Variants in Primary Immunodeficiency Genes in Patients With Very Early Onset Inflammatory Bowel Disease

    PubMed Central

    Kelsen, Judith R.; Dawany, Noor; Moran, Christopher J.; Petersen, Britt-Sabina; Sarmady, Mahdi; Sasson, Ariella; Pauly-Hubbard, Helen; Martinez, Alejandro; Maurer, Kelly; Soong, Joanne; Rappaport, Eric; Franke, Andre; Keller, Andreas; Winter, Harland S.; Mamula, Petar; Piccoli, David; Artis, David; Sonnenberg, Gregory F.; Daly, Mark; Sullivan, Kathleen E.; Baldassano, Robert N.; Devoto, Marcella

    2016-01-01

    Background & Aims Very early onset inflammatory bowel disease (VEO-IBD), IBD diagnosed ≤5 y of age, frequently presents with a different and more severe phenotype than older-onset IBD. We investigated whether patients with VEO-IBD carry rare or novel variants in genes associated with immunodeficiencies that might contribute to disease development. Methods Patients with VEO-IBD and parents (when available) were recruited from the Children's Hospital of Philadelphia from March 2013 through July 2014. We analyzed DNA from 125 patients with VEO-IBD (ages 3 weeks to 4 y) and 19 parents, 4 of whom also had IBD. Exome capture was performed by Agilent SureSelect V4, and sequencing was performed using the Illumina HiSeq platform. Alignment to human genome GRCh37 was achieved followed by post-processing and variant calling. Following functional annotation, candidate variants were analyzed for change in protein function, minor allele frequency <0.1%, and scaled combined annotation dependent depletion scores ≤10. We focused on genes associated with primary immunodeficiencies and related pathways. An additional 210 exome samples from patients with pediatric IBD (n=45) or adult-onset Crohn's disease (n=20) and healthy individuals (controls, n=145) were obtained from the University of Kiel, Germany and used as control groups. Results Four-hundred genes and regions associated with primary immunodeficiency, covering approximately 6500 coding exons totaling > 1 Mbp of coding sequence, were selected from the whole exome data. Our analysis revealed novel and rare variants within these genes that could contribute to the development of VEO-IBD, including rare heterozygous missense variants in IL10RA and previously unidentified variants in MSH5 and CD19. Conclusions In an exome sequence analysis of patients with VEO-IBD and their parents, we identified variants in genes that regulate B- and T-cell functions and could contribute to pathogenesis. Our analysis could lead to the identification of previously unidentified IBD-associated variants. PMID:26193622

  16. Identification of a novel mutation in a Chinese family with Nance-Horan syndrome by whole exome sequencing.

    PubMed

    Hong, Nan; Chen, Yan-hua; Xie, Chen; Xu, Bai-sheng; Huang, Hui; Li, Xin; Yang, Yue-qing; Huang, Ying-ping; Deng, Jian-lian; Qi, Ming; Gu, Yang-shun

    2014-08-01

    Nance-Horan syndrome (NHS) is a rare X-linked disorder characterized by congenital nuclear cataracts, dental anomalies, and craniofacial dysmorphisms. Mental retardation was present in about 30% of the reported cases. The purpose of this study was to investigate the genetic and clinical features of NHS in a Chinese family. Whole exome sequencing analysis was performed on DNA from an affected male to scan for candidate mutations on the X-chromosome. Sanger sequencing was used to verify these candidate mutations in the whole family. Clinical and ophthalmological examinations were performed on all members of the family. A combination of exome sequencing and Sanger sequencing revealed a nonsense mutation c.322G>T (E108X) in exon 1 of NHS gene, co-segregating with the disease in the family. The nonsense mutation led to the conversion of glutamic acid to a stop codon (E108X), resulting in truncation of the NHS protein. Multiple sequence alignments showed that codon 108, where the mutation (c.322G>T) occurred, was located within a phylogenetically conserved region. The clinical features in all affected males and female carriers are described in detail. We report a nonsense mutation c.322G>T (E108X) in a Chinese family with NHS. Our findings broaden the spectrum of NHS mutations and provide molecular insight into future NHS clinical genetic diagnosis.

  17. Cryopyrin-associated Periodic Syndrome Caused by a Myeloid-Restricted Somatic NLRP3 Mutation

    PubMed Central

    Zhou, Qing; Aksentijevich, Ivona; Wood, Geryl M.; Walts, Avram D.; Hoffmann, Patrycja; Remmers, Elaine F.; Kastner, Daniel L.; Ombrello, Amanda K.

    2015-01-01

    Objective To identify the cause of disease in an adult patient presenting with recent onset fevers, chills, urticaria, fatigue, and profound myalgia, who was negative for cryopyrin-associated periodic syndrome (CAPS) NLRP3 mutations by conventional Sanger DNA sequencing. Methods We performed whole-exome sequencing and targeted deep sequencing using DNA from the patient’s whole blood to identify a possible NLRP3 somatic mutation. We then screened for this mutation in subcloned NLRP3 amplicons from fibroblasts, buccal cells, granulocytes, negatively-selected monocytes, and T and B lymphocytes and further confirmed the somatic mutation by targeted sequencing of exon 3. Results We identified a previously reported CAPS-associated mutation, p.Tyr570Cys, with a mutant allele frequency of 15% based on exome data. Targeted sequencing and subcloning of NLRP3 amplicons confirmed the presence of the somatic mutation in whole blood at a ratio similar to the exome data. The mutant allele frequency was in the range of 13.3%–16.8% in monocytes and 15.2%–18% in granulocytes; Notably, this mutation was either absent or present at a very low frequency in B and T lymphocytes, buccal cells, and in the patient’s cultured fibroblasts. Conclusion These data document the possibility of myeloid-restricted somatic mosaicism in the pathogenesis of CAPS, underscoring the emerging role of massively-parallel sequencing in clinical diagnosis. PMID:25988971

  18. Regularized rare variant enrichment analysis for case-control exome sequencing data.

    PubMed

    Larson, Nicholas B; Schaid, Daniel J

    2014-02-01

    Rare variants have recently garnered an immense amount of attention in genetic association analysis. However, unlike methods traditionally used for single marker analysis in GWAS, rare variant analysis often requires some method of aggregation, since single marker approaches are poorly powered for typical sequencing study sample sizes. Advancements in sequencing technologies have rendered next-generation sequencing platforms a realistic alternative to traditional genotyping arrays. Exome sequencing in particular not only provides base-level resolution of genetic coding regions, but also a natural paradigm for aggregation via genes and exons. Here, we propose the use of penalized regression in combination with variant aggregation measures to identify rare variant enrichment in exome sequencing data. In contrast to marginal gene-level testing, we simultaneously evaluate the effects of rare variants in multiple genes, focusing on gene-based least absolute shrinkage and selection operator (LASSO) and exon-based sparse group LASSO models. By using gene membership as a grouping variable, the sparse group LASSO can be used as a gene-centric analysis of rare variants while also providing a penalized approach toward identifying specific regions of interest. We apply extensive simulations to evaluate the performance of these approaches with respect to specificity and sensitivity, comparing these results to multiple competing marginal testing methods. Finally, we discuss our findings and outline future research. © 2013 WILEY PERIODICALS, INC.

  19. Barley whole exome capture: a tool for genomic research in the genus Hordeum and beyond

    PubMed Central

    Mascher, Martin; Richmond, Todd A; Gerhardt, Daniel J; Himmelbach, Axel; Clissold, Leah; Sampath, Dharanya; Ayling, Sarah; Steuernagel, Burkhard; Pfeifer, Matthias; D'Ascenzo, Mark; Akhunov, Eduard D; Hedley, Pete E; Gonzales, Ana M; Morrell, Peter L; Kilian, Benjamin; Blattner, Frank R; Scholz, Uwe; Mayer, Klaus FX; Flavell, Andrew J; Muehlbauer, Gary J; Waugh, Robbie; Jeddeloh, Jeffrey A; Stein, Nils

    2013-01-01

    Advanced resources for genome-assisted research in barley (Hordeum vulgare) including a whole-genome shotgun assembly and an integrated physical map have recently become available. These have made possible studies that aim to assess genetic diversity or to isolate single genes by whole-genome resequencing and in silico variant detection. However such an approach remains expensive given the 5 Gb size of the barley genome. Targeted sequencing of the mRNA-coding exome reduces barley genomic complexity more than 50-fold, thus dramatically reducing this heavy sequencing and analysis load. We have developed and employed an in-solution hybridization-based sequence capture platform to selectively enrich for a 61.6 megabase coding sequence target that includes predicted genes from the genome assembly of the cultivar Morex as well as publicly available full-length cDNAs and de novo assembled RNA-Seq consensus sequence contigs. The platform provides a highly specific capture with substantial and reproducible enrichment of targeted exons, both for cultivated barley and related species. We show that this exome capture platform provides a clear path towards a broader and deeper understanding of the natural variation residing in the mRNA-coding part of the barley genome and will thus constitute a valuable resource for applications such as mapping-by-sequencing and genetic diversity analyzes. PMID:23889683

  20. Pathogenic variants for Mendelian and complex traits in exomes of 6,517 European and African Americans: implications for the return of incidental results.

    PubMed

    Tabor, Holly K; Auer, Paul L; Jamal, Seema M; Chong, Jessica X; Yu, Joon-Ho; Gordon, Adam S; Graubert, Timothy A; O'Donnell, Christopher J; Rich, Stephen S; Nickerson, Deborah A; Bamshad, Michael J

    2014-08-07

    Exome sequencing (ES) is rapidly being deployed for use in clinical settings despite limited empirical data about the number and types of incidental results (with potential clinical utility) that could be offered for return to an individual. We analyzed deidentified ES data from 6,517 participants (2,204 African Americans and 4,313 European Americans) from the National Heart, Lung, and Blood Institute Exome Sequencing Project. We characterized the frequencies of pathogenic alleles in genes underlying Mendelian conditions commonly assessed by newborn-screening (NBS, n = 39) programs, genes associated with age-related macular degeneration (ARMD, n = 17), and genes known to influence drug response (PGx, n = 14). From these 70 genes, we identified 10,789 variants and curated them by manual review of OMIM, HGMD, locus-specific databases, or primary literature to a total of 399 validated pathogenic variants. The mean number of risk alleles per individual was 15.3. Every individual had at least five known PGx alleles, 99% of individuals had at least one ARMD risk allele, and 45% of individuals were carriers for at least one pathogenic NBS allele. The carrier burden for severe recessive childhood disorders was 0.57. Our results demonstrate that risk alleles of potential clinical utility for both Mendelian and complex traits are detectable in every individual. These findings highlight the necessity of developing guidelines and policies that consider the return of results to all individuals and underscore the need to develop innovative approaches and tools that enable individuals to exercise their choice about the return of incidental results. Copyright © 2014 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.

  1. Whole-exome sequencing identifies novel compound heterozygous mutations in USH2A in Spanish patients with autosomal recessive retinitis pigmentosa.

    PubMed

    Méndez-Vidal, Cristina; González-Del Pozo, María; Vela-Boza, Alicia; Santoyo-López, Javier; López-Domingo, Francisco J; Vázquez-Marouschek, Carmen; Dopazo, Joaquin; Borrego, Salud; Antiñolo, Guillermo

    2013-01-01

    Retinitis pigmentosa (RP) is an inherited retinal dystrophy characterized by extreme genetic and clinical heterogeneity. Thus, the diagnosis is not always easily performed due to phenotypic and genetic overlap. Current clinical practices have focused on the systematic evaluation of a set of known genes for each phenotype, but this approach may fail in patients with inaccurate diagnosis or infrequent genetic cause. In the present study, we investigated the genetic cause of autosomal recessive RP (arRP) in a Spanish family in which the causal mutation has not yet been identified with primer extension technology and resequencing. We designed a whole-exome sequencing (WES)-based approach using NimbleGen SeqCap EZ Exome V3 sample preparation kit and the SOLiD 5500×l next-generation sequencing platform. We sequenced the exomes of both unaffected parents and two affected siblings. Exome analysis resulted in the identification of 43,204 variants in the index patient. All variants passing filter criteria were validated with Sanger sequencing to confirm familial segregation and absence in the control population. In silico prediction tools were used to determine mutational impact on protein function and the structure of the identified variants. Novel Usher syndrome type 2A (USH2A) compound heterozygous mutations, c.4325T>C (p.F1442S) and c.15188T>G (p.L5063R), located in exons 20 and 70, respectively, were identified as probable causative mutations for RP in this family. Family segregation of the variants showed the presence of both mutations in all affected members and in two siblings who were apparently asymptomatic at the time of family ascertainment. Clinical reassessment confirmed the diagnosis of RP in these patients. Using WES, we identified two heterozygous novel mutations in USH2A as the most likely disease-causing variants in a Spanish family diagnosed with arRP in which the cause of the disease had not yet been identified with commonly used techniques. Our data reinforce the clinical role of WES in the molecular diagnosis of highly heterogeneous genetic diseases where conventional genetic approaches have previously failed in achieving a proper diagnosis.

  2. Somatic mutation profiles of clear cell endometrial tumors revealed by whole exome and targeted gene sequencing.

    PubMed

    Le Gallo, Matthieu; Rudd, Meghan L; Urick, Mary Ellen; Hansen, Nancy F; Zhang, Suiyuan; Lozy, Fred; Sgroi, Dennis C; Vidal Bel, August; Matias-Guiu, Xavier; Broaddus, Russell R; Lu, Karen H; Levine, Douglas A; Mutch, David G; Goodfellow, Paul J; Salvesen, Helga B; Mullikin, James C; Bell, Daphne W

    2017-09-01

    The molecular pathogenesis of clear cell endometrial cancer (CCEC), a tumor type with a relatively unfavorable prognosis, is not well defined. We searched exome-wide for novel somatically mutated genes in CCEC and assessed the mutational spectrum of known and candidate driver genes in a large cohort of cases. We conducted whole exome sequencing of paired tumor-normal DNAs from 16 cases of CCEC (12 CCECs and the CCEC components of 4 mixed histology tumors). Twenty-two genes-of-interest were Sanger-sequenced from another 47 cases of CCEC. Microsatellite instability (MSI) and microsatellite stability (MSS) were determined by genotyping 5 mononucleotide repeats. Two tumor exomes had relatively high mutational loads and MSI. The other 14 tumor exomes were MSS and had 236 validated nonsynonymous or splice junction somatic mutations among 222 protein-encoding genes. Among the 63 cases of CCEC in this study, we identified frequent somatic mutations in TP53 (39.7%), PIK3CA (23.8%), PIK3R1 (15.9%), ARID1A (15.9%), PPP2R1A (15.9%), SPOP (14.3%), and TAF1 (9.5%), as well as MSI (11.3%). Five of 8 mutations in TAF1, a gene with no known role in CCEC, localized to the putative histone acetyltransferase domain and included 2 recurrently mutated residues. Based on patterns of MSI and mutations in 7 genes, CCEC subsets molecularly resembled serous endometrial cancer (SEC) or endometrioid endometrial cancer (EEC). Our findings demonstrate molecular similarities between CCEC and SEC and EEC and implicate TAF1 as a novel candidate CCEC driver gene. Cancer 2017;123:3261-8. © 2017 American Cancer Society. © 2017 American Cancer Society.

  3. Transcriptome-Based Differentiation of Closely-Related Miscanthus Lines

    DOE PAGES

    Chouvarine, Philippe; Cooksey, Amanda M.; McCarthy, Fiona M.; ...

    2012-01-10

    Distinguishing between individuals is critical to those conducting animal/plant breeding, food safety/quality research, diagnostic and clinical testing, and evolutionary biology studies. Classical genetic identification studies are based on marker polymorphisms, but polymorphism-based techniques are time and labor intensive and often cannot distinguish between closely related individuals. Illumina sequencing technologies provide the detailed sequence data required for rapid and efficient differentiation of related species, lines/cultivars, and individuals in a cost-effective manner. Here we describe the use of Illumina high-throughput exome sequencing, coupled with SNP mapping, as a rapid means of distinguishing between related cultivars of the lignocellulosic bioenergy crop giant miscanthusmore » (Miscanthus6giganteus). We provide the first exome sequence database for Miscanthus species complete with Gene Ontology (GO) functional annotations."« less

  4. Utility of whole exome sequencing in the diagnosis of Usher syndrome: Report of novel compound heterozygous MYO7A mutations.

    PubMed

    Ramzan, Khushnooda; Al-Owain, Mohammed; Huma, Rozeena; Al-Hazzaa, Selwa A F; Al-Ageel, Sarah; Imtiaz, Faiqa; Al-Sayed, Moeenaldeen

    2018-05-01

    Next generation sequencing (NGS), such as targeted panel sequencing, whole-exome sequencing and whole-genome sequencing has led to an exponential increase of elucidated genetic causes in both rare diseases, and common but heterogeneous disorders. NGS is applied in both research and clinical settings, and the clinical exome sequencing (CES), which provides not only the sequence variation data but also clinical interpretation, aids in reaching a final conclusion with regards to a genetic diagnosis. Usher syndrome is a group of disorders, characterized by bilateral sensorineural hearing loss, with or without vestibular dysfunction and retinitis pigmentosa. The index patient, a 2-year-old child was initially diagnosed with nonsyndromic hearing impairment. Homozygosity mapping followed by CES was utilized as a diagnostic tool to identify the genetic basis of his hearing loss. A paternally inherited novel insertion, c.198_199insA (p.Val67Serfs*73) and a maternally inherited novel deletion, c.1219_1226del (p.Phe407Aspfs*33) in gene MYO7A were found in compound heterozygous state in the index patient. The result expands the mutational spectrum of MYO7A. In addition it helped in early diagnosis of the syndrome, for planning and adjustments for the patient, and as well as for future family planning. This study highlights the clinical effectiveness of CES for Usher syndrome diagnosis in a child presented with congenital hearing loss. Copyright © 2018. Published by Elsevier B.V.

  5. Whole-exome sequencing for diagnosis of hereditary ichthyosis.

    PubMed

    Sitek, J C; Kulseth, M A; Rypdal, K B; Skodje, T; Sheng, Y; Retterstøl, L

    2018-02-14

    Hereditary ichthyosis constitutes a diverse group of cornification disorders. Identification of the molecular cause facilitates optimal patient care. We wanted to estimate the diagnostic yield of applying whole-exome sequencing (WES) in the routine genetic workup of inherited ichthyosis. During a 3-year-period, all ichthyosis patients, except X-linked and mild vulgar ichthyosis, consecutively admitted to a university hospital clinic were offered WES with subsequent analysis of ichthyosis-related genes as a first-line genetic investigation. Clinical and molecular data have been collected retrospectively. Genetic variants causative for the ichthyosis were identified in 27 of 34 investigated patients (79.4%). In all, 31 causative mutations across 13 genes were disclosed, including 12 novel variants. TGM1 was the most frequently mutated gene, accounting for 43.7% of patients suffering from autosomal recessive congenital ichthyosis (ARCI). Whole-exome sequencing appears an effective tool in disclosing the molecular cause of patients with hereditary ichthyosis seen in clinical practice and should be considered a first-tier genetic test in these patients. © 2018 European Academy of Dermatology and Venereology.

  6. Whole Exome Sequencing Identifies Rare Protein-Coding Variants in Behçet's Disease.

    PubMed

    Ognenovski, Mikhail; Renauer, Paul; Gensterblum, Elizabeth; Kötter, Ina; Xenitidis, Theodoros; Henes, Jörg C; Casali, Bruno; Salvarani, Carlo; Direskeneli, Haner; Kaufman, Kenneth M; Sawalha, Amr H

    2016-05-01

    Behçet's disease (BD) is a systemic inflammatory disease with an incompletely understood etiology. Despite the identification of multiple common genetic variants associated with BD, rare genetic variants have been less explored. We undertook this study to investigate the role of rare variants in BD by performing whole exome sequencing in BD patients of European descent. Whole exome sequencing was performed in a discovery set comprising 14 German BD patients of European descent. For replication and validation, Sanger sequencing and Sequenom genotyping were performed in the discovery set and in 2 additional independent sets of 49 German BD patients and 129 Italian BD patients of European descent. Genetic association analysis was then performed in BD patients and 503 controls of European descent. Functional effects of associated genetic variants were assessed using bioinformatic approaches. Using whole exome sequencing, we identified 77 rare variants (in 74 genes) with predicted protein-damaging effects in BD. These variants were genotyped in 2 additional patient sets and then analyzed to reveal significant associations with BD at 2 genetic variants detected in all 3 patient sets that remained significant after Bonferroni correction. We detected genetic association between BD and LIMK2 (rs149034313), involved in regulating cytoskeletal reorganization, and between BD and NEIL1 (rs5745908), involved in base excision DNA repair (P = 3.22 × 10(-4) and P = 5.16 × 10(-4) , respectively). The LIMK2 association is a missense variant with predicted protein damage that may influence functional interactions with proteins involved in cytoskeletal regulation by Rho GTPase, inflammation mediated by chemokine and cytokine signaling pathways, T cell activation, and angiogenesis (Bonferroni-corrected P = 5.63 × 10(-14) , P = 7.29 × 10(-6) , P = 1.15 × 10(-5) , and P = 6.40 × 10(-3) , respectively). The genetic association in NEIL1 is a predicted splice donor variant that may introduce a deleterious intron retention and result in a noncoding transcript variant. We used whole exome sequencing in BD for the first time and identified 2 rare putative protein-damaging genetic variants associated with this disease. These genetic variants might influence cytoskeletal regulation and DNA repair mechanisms in BD and might provide further insight into increased leukocyte tissue infiltration and the role of oxidative stress in BD. © 2016, American College of Rheumatology.

  7. Rare variants in FBN1 and FBN2 are associated with severe adolescent idiopathic scoliosis

    PubMed Central

    Buchan, Jillian G.; Alvarado, David M.; Haller, Gabe E.; Cruchaga, Carlos; Harms, Matthew B.; Zhang, Tianxiao; Willing, Marcia C.; Grange, Dorothy K.; Braverman, Alan C.; Miller, Nancy H.; Morcuende, Jose A.; Tang, Nelson Leung-Sang; Lam, Tsz-Ping; Ng, Bobby Kin-Wah; Cheng, Jack Chun-Yiu; Dobbs, Matthew B.; Gurnett, Christina A.

    2014-01-01

    Adolescent idiopathic scoliosis (AIS) causes spinal deformity in 3% of children. Despite a strong genetic basis, few genes have been associated with AIS and the pathogenesis remains poorly understood. In a genome-wide rare variant burden analysis using exome sequence data, we identified fibrillin-1 (FBN1) as the most significantly associated gene with AIS. Based on these results, FBN1 and a related gene, fibrillin-2 (FBN2), were sequenced in a total of 852 AIS cases and 669 controls. In individuals of European ancestry, rare variants in FBN1 and FBN2 were enriched in severely affected AIS cases (7.6%) compared with in-house controls (2.4%) (OR = 3.5, P = 5.46 × 10−4) and Exome Sequencing Project controls (2.3%) (OR = 3.5, P = 1.48 × 10−6). Scoliosis severity in AIS cases was associated with FBN1 and FBN2 rare variants (P = 0.0012) and replicated in an independent Han Chinese cohort (P = 0.0376), suggesting that rare variants may be useful as predictors of curve progression. Clinical evaluations revealed that the majority of AIS cases with rare FBN1 variants do not meet diagnostic criteria for Marfan syndrome, though variants are associated with tall stature (P = 0.0035) and upregulation of the transforming growth factor beta pathway. Overall, these results expand our definition of fibrillin-related disorders to include AIS and open up new strategies for diagnosing and treating severe AIS. PMID:24833718

  8. Analyzing Somatic Genome Rearrangements in Human Cancers by Using Whole-Exome Sequencing

    PubMed Central

    Yang, Lixing; Lee, Mi-Sook; Lu, Hengyu; Oh, Doo-Yi; Kim, Yeon Jeong; Park, Donghyun; Park, Gahee; Ren, Xiaojia; Bristow, Christopher A.; Haseley, Psalm S.; Lee, Soohyun; Pantazi, Angeliki; Kucherlapati, Raju; Park, Woong-Yang; Scott, Kenneth L.; Choi, Yoon-La; Park, Peter J.

    2016-01-01

    Although exome sequencing data are generated primarily to detect single-nucleotide variants and indels, they can also be used to identify a subset of genomic rearrangements whose breakpoints are located in or near exons. Using >4,600 tumor and normal pairs across 15 cancer types, we identified over 9,000 high confidence somatic rearrangements, including a large number of gene fusions. We find that the 5′ fusion partners of functional fusions are often housekeeping genes, whereas the 3′ fusion partners are enriched in tyrosine kinases. We establish the oncogenic potential of ROR1-DNAJC6 and CEP85L-ROS1 fusions by showing that they can promote cell proliferation in vitro and tumor formation in vivo. Furthermore, we found that ∼4% of the samples have massively rearranged chromosomes, many of which are associated with upregulation of oncogenes such as ERBB2 and TERT. Although the sensitivity of detecting structural alterations from exomes is considerably lower than that from whole genomes, this approach will be fruitful for the multitude of exomes that have been and will be generated, both in cancer and in other diseases. PMID:27153396

  9. [Exome sequencing revealed Allan-Herndon-Dudley syndrome underlying multiple disabilities].

    PubMed

    Arvio, Maria; Philips, Anju K; Ahvenainen, Minna; Somer, Mirja; Kalscheuer, Vera; Järvelä, Irma

    2014-01-01

    Normal function of the thyroid gland is the cornerstone of a child's mental development and physical growth. We describe a Finnish family, in which the diagnosis of three brothers became clear after investigations that lasted for more than 30 years. Two of the sons have already died. DNA analysis of the third one, a 16-year-old boy, revealed in exome sequencing of the complete X chromosome a mutation in the SLC16A2 gene, i.e. MCT8, coding for a thyroid hormone transport protein. Allan-Herndon-Dudley syndrome was thus shown to be the cause of multiple disabilities.

  10. Case Report: Application of whole exome sequencing for accurate diagnosis of rare syndromes of mineralocorticoid excess

    PubMed Central

    Narayanan, Ranjit; Karuthedath Vellarikkal, Shamsudheen; Jayarajan, Rijith; Verma, Ankit; Dixit, Vishal; Scaria, Vinod; Sivasubbu, Sridhar

    2017-01-01

    Syndromes of mineralocorticoid excess (SME) are closely related clinical manifestations occurring within a specific set of diseases. Overlapping clinical manifestations of such syndromes often create a dilemma in accurate diagnosis, which is crucial for disease surveillance and management especially in rare genetic disorders. Here we demonstrate the use of whole exome sequencing (WES) for accurate diagnosis of rare SME and report that p.R337C variation in the HSD11B2 gene causes progressive apparent mineralocorticoid excess (AME) syndrome in a South Indian family of Mappila origin. PMID:29067160

  11. Whole exome sequencing is an efficient, sensitive and specific method for determining the genetic cause of short-rib thoracic dystrophies.

    PubMed

    McInerney-Leo, A M; Harris, J E; Leo, P J; Marshall, M S; Gardiner, B; Kinning, E; Leong, H Y; McKenzie, F; Ong, W P; Vodopiutz, J; Wicking, C; Brown, M A; Zankl, A; Duncan, E L

    2015-12-01

    Short-rib thoracic dystrophies (SRTDs) are congenital disorders due to defects in primary cilium function. SRTDs are recessively inherited with mutations identified in 14 genes to date (comprising 398 exons). Conventional mutation detection (usually by iterative Sanger sequencing) is inefficient and expensive, and often not undertaken. Whole exome massive parallel sequencing has been used to identify new genes for SRTD (WDR34, WDR60 and IFT172); however, the clinical utility of whole exome sequencing (WES) has not been established. WES was performed in 11 individuals with SRTDs. Compound heterozygous or homozygous mutations were identified in six confirmed SRTD genes in 10 individuals (IFT172, DYNC2H1, TTC21B, WDR60, WDR34 and NEK1), giving overall sensitivity of 90.9%. WES data from 993 unaffected individuals sequenced using similar technology showed two individuals with rare (minor allele frequency <0.005) compound heterozygous variants of unknown significance in SRTD genes (specificity >99%). Costs for consumables, laboratory processing and bioinformatic analysis were

  12. Whole-Exome Sequencing Identifies Rare and Low-Frequency Coding Variants Associated with LDL Cholesterol

    PubMed Central

    Lange, Leslie A.; Hu, Youna; Zhang, He; Xue, Chenyi; Schmidt, Ellen M.; Tang, Zheng-Zheng; Bizon, Chris; Lange, Ethan M.; Smith, Joshua D.; Turner, Emily H.; Jun, Goo; Kang, Hyun Min; Peloso, Gina; Auer, Paul; Li, Kuo-ping; Flannick, Jason; Zhang, Ji; Fuchsberger, Christian; Gaulton, Kyle; Lindgren, Cecilia; Locke, Adam; Manning, Alisa; Sim, Xueling; Rivas, Manuel A.; Holmen, Oddgeir L.; Gottesman, Omri; Lu, Yingchang; Ruderfer, Douglas; Stahl, Eli A.; Duan, Qing; Li, Yun; Durda, Peter; Jiao, Shuo; Isaacs, Aaron; Hofman, Albert; Bis, Joshua C.; Correa, Adolfo; Griswold, Michael E.; Jakobsdottir, Johanna; Smith, Albert V.; Schreiner, Pamela J.; Feitosa, Mary F.; Zhang, Qunyuan; Huffman, Jennifer E.; Crosby, Jacy; Wassel, Christina L.; Do, Ron; Franceschini, Nora; Martin, Lisa W.; Robinson, Jennifer G.; Assimes, Themistocles L.; Crosslin, David R.; Rosenthal, Elisabeth A.; Tsai, Michael; Rieder, Mark J.; Farlow, Deborah N.; Folsom, Aaron R.; Lumley, Thomas; Fox, Ervin R.; Carlson, Christopher S.; Peters, Ulrike; Jackson, Rebecca D.; van Duijn, Cornelia M.; Uitterlinden, André G.; Levy, Daniel; Rotter, Jerome I.; Taylor, Herman A.; Gudnason, Vilmundur; Siscovick, David S.; Fornage, Myriam; Borecki, Ingrid B.; Hayward, Caroline; Rudan, Igor; Chen, Y. Eugene; Bottinger, Erwin P.; Loos, Ruth J.F.; Sætrom, Pål; Hveem, Kristian; Boehnke, Michael; Groop, Leif; McCarthy, Mark; Meitinger, Thomas; Ballantyne, Christie M.; Gabriel, Stacey B.; O’Donnell, Christopher J.; Post, Wendy S.; North, Kari E.; Reiner, Alexander P.; Boerwinkle, Eric; Psaty, Bruce M.; Altshuler, David; Kathiresan, Sekar; Lin, Dan-Yu; Jarvik, Gail P.; Cupples, L. Adrienne; Kooperberg, Charles; Wilson, James G.; Nickerson, Deborah A.; Abecasis, Goncalo R.; Rich, Stephen S.; Tracy, Russell P.; Willer, Cristen J.; Gabriel, Stacey B.; Altshuler, David M.; Abecasis, Gonçalo R.; Allayee, Hooman; Cresci, Sharon; Daly, Mark J.; de Bakker, Paul I.W.; DePristo, Mark A.; Do, Ron; Donnelly, Peter; Farlow, Deborah N.; Fennell, Tim; Garimella, Kiran; Hazen, Stanley L.; Hu, Youna; Jordan, Daniel M.; Jun, Goo; Kathiresan, Sekar; Kang, Hyun Min; Kiezun, Adam; Lettre, Guillaume; Li, Bingshan; Li, Mingyao; Newton-Cheh, Christopher H.; Padmanabhan, Sandosh; Peloso, Gina; Pulit, Sara; Rader, Daniel J.; Reich, David; Reilly, Muredach P.; Rivas, Manuel A.; Schwartz, Steve; Scott, Laura; Siscovick, David S.; Spertus, John A.; Stitziel, Nathaniel O.; Stoletzki, Nina; Sunyaev, Shamil R.; Voight, Benjamin F.; Willer, Cristen J.; Rich, Stephen S.; Akylbekova, Ermeg; Atwood, Larry D.; Ballantyne, Christie M.; Barbalic, Maja; Barr, R. Graham; Benjamin, Emelia J.; Bis, Joshua; Boerwinkle, Eric; Bowden, Donald W.; Brody, Jennifer; Budoff, Matthew; Burke, Greg; Buxbaum, Sarah; Carr, Jeff; Chen, Donna T.; Chen, Ida Y.; Chen, Wei-Min; Concannon, Pat; Crosby, Jacy; Cupples, L. Adrienne; D’Agostino, Ralph; DeStefano, Anita L.; Dreisbach, Albert; Dupuis, Josée; Durda, J. Peter; Ellis, Jaclyn; Folsom, Aaron R.; Fornage, Myriam; Fox, Caroline S.; Fox, Ervin; Funari, Vincent; Ganesh, Santhi K.; Gardin, Julius; Goff, David; Gordon, Ora; Grody, Wayne; Gross, Myron; Guo, Xiuqing; Hall, Ira M.; Heard-Costa, Nancy L.; Heckbert, Susan R.; Heintz, Nicholas; Herrington, David M.; Hickson, DeMarc; Huang, Jie; Hwang, Shih-Jen; Jacobs, David R.; Jenny, Nancy S.; Johnson, Andrew D.; Johnson, Craig W.; Kawut, Steven; Kronmal, Richard; Kurz, Raluca; Lange, Ethan M.; Lange, Leslie A.; Larson, Martin G.; Lawson, Mark; Lewis, Cora E.; Levy, Daniel; Li, Dalin; Lin, Honghuang; Liu, Chunyu; Liu, Jiankang; Liu, Kiang; Liu, Xiaoming; Liu, Yongmei; Longstreth, William T.; Loria, Cay; Lumley, Thomas; Lunetta, Kathryn; Mackey, Aaron J.; Mackey, Rachel; Manichaikul, Ani; Maxwell, Taylor; McKnight, Barbara; Meigs, James B.; Morrison, Alanna C.; Musani, Solomon K.; Mychaleckyj, Josyf C.; Nettleton, Jennifer A.; North, Kari; O’Donnell, Christopher J.; O’Leary, Daniel; Ong, Frank; Palmas, Walter; Pankow, James S.; Pankratz, Nathan D.; Paul, Shom; Perez, Marco; Person, Sharina D.; Polak, Joseph; Post, Wendy S.; Psaty, Bruce M.; Quinlan, Aaron R.; Raffel, Leslie J.; Ramachandran, Vasan S.; Reiner, Alexander P.; Rice, Kenneth; Rotter, Jerome I.; Sanders, Jill P.; Schreiner, Pamela; Seshadri, Sudha; Shea, Steve; Sidney, Stephen; Silverstein, Kevin; Smith, Nicholas L.; Sotoodehnia, Nona; Srinivasan, Asoke; Taylor, Herman A.; Taylor, Kent; Thomas, Fridtjof; Tracy, Russell P.; Tsai, Michael Y.; Volcik, Kelly A.; Wassel, Chrstina L.; Watson, Karol; Wei, Gina; White, Wendy; Wiggins, Kerri L.; Wilk, Jemma B.; Williams, O. Dale; Wilson, Gregory; Wilson, James G.; Wolf, Phillip; Zakai, Neil A.; Hardy, John; Meschia, James F.; Nalls, Michael; Singleton, Andrew; Worrall, Brad; Bamshad, Michael J.; Barnes, Kathleen C.; Abdulhamid, Ibrahim; Accurso, Frank; Anbar, Ran; Beaty, Terri; Bigham, Abigail; Black, Phillip; Bleecker, Eugene; Buckingham, Kati; Cairns, Anne Marie; Caplan, Daniel; Chatfield, Barbara; Chidekel, Aaron; Cho, Michael; Christiani, David C.; Crapo, James D.; Crouch, Julia; Daley, Denise; Dang, Anthony; Dang, Hong; De Paula, Alicia; DeCelie-Germana, Joan; Drumm, Allen DozorMitch; Dyson, Maynard; Emerson, Julia; Emond, Mary J.; Ferkol, Thomas; Fink, Robert; Foster, Cassandra; Froh, Deborah; Gao, Li; Gershan, William; Gibson, Ronald L.; Godwin, Elizabeth; Gondor, Magdalen; Gutierrez, Hector; Hansel, Nadia N.; Hassoun, Paul M.; Hiatt, Peter; Hokanson, John E.; Howenstine, Michelle; Hummer, Laura K.; Kanga, Jamshed; Kim, Yoonhee; Knowles, Michael R.; Konstan, Michael; Lahiri, Thomas; Laird, Nan; Lange, Christoph; Lin, Lin; Lin, Xihong; Louie, Tin L.; Lynch, David; Make, Barry; Martin, Thomas R.; Mathai, Steve C.; Mathias, Rasika A.; McNamara, John; McNamara, Sharon; Meyers, Deborah; Millard, Susan; Mogayzel, Peter; Moss, Richard; Murray, Tanda; Nielson, Dennis; Noyes, Blakeslee; O’Neal, Wanda; Orenstein, David; O’Sullivan, Brian; Pace, Rhonda; Pare, Peter; Parker, H. Worth; Passero, Mary Ann; Perkett, Elizabeth; Prestridge, Adrienne; Rafaels, Nicholas M.; Ramsey, Bonnie; Regan, Elizabeth; Ren, Clement; Retsch-Bogart, George; Rock, Michael; Rosen, Antony; Rosenfeld, Margaret; Ruczinski, Ingo; Sanford, Andrew; Schaeffer, David; Sell, Cindy; Sheehan, Daniel; Silverman, Edwin K.; Sin, Don; Spencer, Terry; Stonebraker, Jackie; Tabor, Holly K.; Varlotta, Laurie; Vergara, Candelaria I.; Weiss, Robert; Wigley, Fred; Wise, Robert A.; Wright, Fred A.; Wurfel, Mark M.; Zanni, Robert; Zou, Fei; Nickerson, Deborah A.; Rieder, Mark J.; Green, Phil; Shendure, Jay; Akey, Joshua M.; Bustamante, Carlos D.; Crosslin, David R.; Eichler, Evan E.; Fox, P. Keolu; Fu, Wenqing; Gordon, Adam; Gravel, Simon; Jarvik, Gail P.; Johnsen, Jill M.; Kan, Mengyuan; Kenny, Eimear E.; Kidd, Jeffrey M.; Lara-Garduno, Fremiet; Leal, Suzanne M.; Liu, Dajiang J.; McGee, Sean; O’Connor, Timothy D.; Paeper, Bryan; Robertson, Peggy D.; Smith, Joshua D.; Staples, Jeffrey C.; Tennessen, Jacob A.; Turner, Emily H.; Wang, Gao; Yi, Qian; Jackson, Rebecca; Peters, Ulrike; Carlson, Christopher S.; Anderson, Garnet; Anton-Culver, Hoda; Assimes, Themistocles L.; Auer, Paul L.; Beresford, Shirley; Bizon, Chris; Black, Henry; Brunner, Robert; Brzyski, Robert; Burwen, Dale; Caan, Bette; Carty, Cara L.; Chlebowski, Rowan; Cummings, Steven; Curb, J. David; Eaton, Charles B.; Ford, Leslie; Franceschini, Nora; Fullerton, Stephanie M.; Gass, Margery; Geller, Nancy; Heiss, Gerardo; Howard, Barbara V.; Hsu, Li; Hutter, Carolyn M.; Ioannidis, John; Jiao, Shuo; Johnson, Karen C.; Kooperberg, Charles; Kuller, Lewis; LaCroix, Andrea; Lakshminarayan, Kamakshi; Lane, Dorothy; Lasser, Norman; LeBlanc, Erin; Li, Kuo-Ping; Limacher, Marian; Lin, Dan-Yu; Logsdon, Benjamin A.; Ludlam, Shari; Manson, JoAnn E.; Margolis, Karen; Martin, Lisa; McGowan, Joan; Monda, Keri L.; Kotchen, Jane Morley; Nathan, Lauren; Ockene, Judith; O’Sullivan, Mary Jo; Phillips, Lawrence S.; Prentice, Ross L.; Robbins, John; Robinson, Jennifer G.; Rossouw, Jacques E.; Sangi-Haghpeykar, Haleh; Sarto, Gloria E.; Shumaker, Sally; Simon, Michael S.; Stefanick, Marcia L.; Stein, Evan; Tang, Hua; Taylor, Kira C.; Thomson, Cynthia A.; Thornton, Timothy A.; Van Horn, Linda; Vitolins, Mara; Wactawski-Wende, Jean; Wallace, Robert; Wassertheil-Smoller, Sylvia; Zeng, Donglin; Applebaum-Bowden, Deborah; Feolo, Michael; Gan, Weiniu; Paltoo, Dina N.; Sholinsky, Phyliss; Sturcke, Anne

    2014-01-01

    Elevated low-density lipoprotein cholesterol (LDL-C) is a treatable, heritable risk factor for cardiovascular disease. Genome-wide association studies (GWASs) have identified 157 variants associated with lipid levels but are not well suited to assess the impact of rare and low-frequency variants. To determine whether rare or low-frequency coding variants are associated with LDL-C, we exome sequenced 2,005 individuals, including 554 individuals selected for extreme LDL-C (>98th or <2nd percentile). Follow-up analyses included sequencing of 1,302 additional individuals and genotype-based analysis of 52,221 individuals. We observed significant evidence of association between LDL-C and the burden of rare or low-frequency variants in PNPLA5, encoding a phospholipase-domain-containing protein, and both known and previously unidentified variants in PCSK9, LDLR and APOB, three known lipid-related genes. The effect sizes for the burden of rare variants for each associated gene were substantially higher than those observed for individual SNPs identified from GWASs. We replicated the PNPLA5 signal in an independent large-scale sequencing study of 2,084 individuals. In conclusion, this large whole-exome-sequencing study for LDL-C identified a gene not known to be implicated in LDL-C and provides unique insight into the design and analysis of similar experiments. PMID:24507775

  13. Whole-exome sequencing identifies rare and low-frequency coding variants associated with LDL cholesterol.

    PubMed

    Lange, Leslie A; Hu, Youna; Zhang, He; Xue, Chenyi; Schmidt, Ellen M; Tang, Zheng-Zheng; Bizon, Chris; Lange, Ethan M; Smith, Joshua D; Turner, Emily H; Jun, Goo; Kang, Hyun Min; Peloso, Gina; Auer, Paul; Li, Kuo-Ping; Flannick, Jason; Zhang, Ji; Fuchsberger, Christian; Gaulton, Kyle; Lindgren, Cecilia; Locke, Adam; Manning, Alisa; Sim, Xueling; Rivas, Manuel A; Holmen, Oddgeir L; Gottesman, Omri; Lu, Yingchang; Ruderfer, Douglas; Stahl, Eli A; Duan, Qing; Li, Yun; Durda, Peter; Jiao, Shuo; Isaacs, Aaron; Hofman, Albert; Bis, Joshua C; Correa, Adolfo; Griswold, Michael E; Jakobsdottir, Johanna; Smith, Albert V; Schreiner, Pamela J; Feitosa, Mary F; Zhang, Qunyuan; Huffman, Jennifer E; Crosby, Jacy; Wassel, Christina L; Do, Ron; Franceschini, Nora; Martin, Lisa W; Robinson, Jennifer G; Assimes, Themistocles L; Crosslin, David R; Rosenthal, Elisabeth A; Tsai, Michael; Rieder, Mark J; Farlow, Deborah N; Folsom, Aaron R; Lumley, Thomas; Fox, Ervin R; Carlson, Christopher S; Peters, Ulrike; Jackson, Rebecca D; van Duijn, Cornelia M; Uitterlinden, André G; Levy, Daniel; Rotter, Jerome I; Taylor, Herman A; Gudnason, Vilmundur; Siscovick, David S; Fornage, Myriam; Borecki, Ingrid B; Hayward, Caroline; Rudan, Igor; Chen, Y Eugene; Bottinger, Erwin P; Loos, Ruth J F; Sætrom, Pål; Hveem, Kristian; Boehnke, Michael; Groop, Leif; McCarthy, Mark; Meitinger, Thomas; Ballantyne, Christie M; Gabriel, Stacey B; O'Donnell, Christopher J; Post, Wendy S; North, Kari E; Reiner, Alexander P; Boerwinkle, Eric; Psaty, Bruce M; Altshuler, David; Kathiresan, Sekar; Lin, Dan-Yu; Jarvik, Gail P; Cupples, L Adrienne; Kooperberg, Charles; Wilson, James G; Nickerson, Deborah A; Abecasis, Goncalo R; Rich, Stephen S; Tracy, Russell P; Willer, Cristen J

    2014-02-06

    Elevated low-density lipoprotein cholesterol (LDL-C) is a treatable, heritable risk factor for cardiovascular disease. Genome-wide association studies (GWASs) have identified 157 variants associated with lipid levels but are not well suited to assess the impact of rare and low-frequency variants. To determine whether rare or low-frequency coding variants are associated with LDL-C, we exome sequenced 2,005 individuals, including 554 individuals selected for extreme LDL-C (>98(th) or <2(nd) percentile). Follow-up analyses included sequencing of 1,302 additional individuals and genotype-based analysis of 52,221 individuals. We observed significant evidence of association between LDL-C and the burden of rare or low-frequency variants in PNPLA5, encoding a phospholipase-domain-containing protein, and both known and previously unidentified variants in PCSK9, LDLR and APOB, three known lipid-related genes. The effect sizes for the burden of rare variants for each associated gene were substantially higher than those observed for individual SNPs identified from GWASs. We replicated the PNPLA5 signal in an independent large-scale sequencing study of 2,084 individuals. In conclusion, this large whole-exome-sequencing study for LDL-C identified a gene not known to be implicated in LDL-C and provides unique insight into the design and analysis of similar experiments. Copyright © 2014 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.

  14. Rare variants in RTEL1 are associated with familial interstitial pneumonia.

    PubMed

    Cogan, Joy D; Kropski, Jonathan A; Zhao, Min; Mitchell, Daphne B; Rives, Lynette; Markin, Cheryl; Garnett, Errine T; Montgomery, Keri H; Mason, Wendi R; McKean, David F; Powers, Julia; Murphy, Elissa; Olson, Lana M; Choi, Leena; Cheng, Dong-Sheng; Blue, Elizabeth Marchani; Young, Lisa R; Lancaster, Lisa H; Steele, Mark P; Brown, Kevin K; Schwarz, Marvin I; Fingerlin, Tasha E; Schwartz, David A; Lawson, William E; Loyd, James E; Zhao, Zhongming; Phillips, John A; Blackwell, Timothy S

    2015-03-15

    Up to 20% of cases of idiopathic interstitial pneumonia cluster in families, comprising the syndrome of familial interstitial pneumonia (FIP); however, the genetic basis of FIP remains uncertain in most families. To determine if new disease-causing rare genetic variants could be identified using whole-exome sequencing of affected members from FIP families, providing additional insights into disease pathogenesis. Affected subjects from 25 kindreds were selected from an ongoing FIP registry for whole-exome sequencing from genomic DNA. Candidate rare variants were confirmed by Sanger sequencing, and cosegregation analysis was performed in families, followed by additional sequencing of affected individuals from another 163 kindreds. We identified a potentially damaging rare variant in the gene encoding for regulator of telomere elongation helicase 1 (RTEL1) that segregated with disease and was associated with very short telomeres in peripheral blood mononuclear cells in 1 of 25 families in our original whole-exome sequencing cohort. Evaluation of affected individuals in 163 additional kindreds revealed another eight families (4.7%) with heterozygous rare variants in RTEL1 that segregated with clinical FIP. Probands and unaffected carriers of these rare variants had short telomeres (<10% for age) in peripheral blood mononuclear cells and increased T-circle formation, suggesting impaired RTEL1 function. Rare loss-of-function variants in RTEL1 represent a newly defined genetic predisposition for FIP, supporting the importance of telomere-related pathways in pulmonary fibrosis.

  15. Identification of a novel mutation in a Chinese family with Nance-Horan syndrome by whole exome sequencing*

    PubMed Central

    Hong, Nan; Chen, Yan-hua; Xie, Chen; Xu, Bai-sheng; Huang, Hui; Li, Xin; Yang, Yue-qing; Huang, Ying-ping; Deng, Jian-lian; Qi, Ming; Gu, Yang-shun

    2014-01-01

    Objective: Nance-Horan syndrome (NHS) is a rare X-linked disorder characterized by congenital nuclear cataracts, dental anomalies, and craniofacial dysmorphisms. Mental retardation was present in about 30% of the reported cases. The purpose of this study was to investigate the genetic and clinical features of NHS in a Chinese family. Methods: Whole exome sequencing analysis was performed on DNA from an affected male to scan for candidate mutations on the X-chromosome. Sanger sequencing was used to verify these candidate mutations in the whole family. Clinical and ophthalmological examinations were performed on all members of the family. Results: A combination of exome sequencing and Sanger sequencing revealed a nonsense mutation c.322G>T (E108X) in exon 1 of NHS gene, co-segregating with the disease in the family. The nonsense mutation led to the conversion of glutamic acid to a stop codon (E108X), resulting in truncation of the NHS protein. Multiple sequence alignments showed that codon 108, where the mutation (c.322G>T) occurred, was located within a phylogenetically conserved region. The clinical features in all affected males and female carriers are described in detail. Conclusions: We report a nonsense mutation c.322G>T (E108X) in a Chinese family with NHS. Our findings broaden the spectrum of NHS mutations and provide molecular insight into future NHS clinical genetic diagnosis. PMID:25091991

  16. Brief Report: Cryopyrin-Associated Periodic Syndrome Caused by a Myeloid-Restricted Somatic NLRP3 Mutation.

    PubMed

    Zhou, Qing; Aksentijevich, Ivona; Wood, Geryl M; Walts, Avram D; Hoffmann, Patrycja; Remmers, Elaine F; Kastner, Daniel L; Ombrello, Amanda K

    2015-09-01

    To identify the cause of disease in an adult patient presenting with recent-onset fevers, chills, urticaria, fatigue, and profound myalgia, who was found to be negative for cryopyrin-associated periodic syndrome (CAPS) NLRP3 mutations by conventional Sanger DNA sequencing. We performed whole-exome sequencing and targeted deep sequencing using DNA from the patient's whole blood to identify a possible NLRP3 somatic mutation. We then screened for this mutation in subcloned NLRP3 amplicons from fibroblasts, buccal cells, granulocytes, negatively selected monocytes, and T and B lymphocytes and further confirmed the somatic mutation by targeted sequencing of exon 3. We identified a previously reported CAPS-associated mutation, p.Tyr570Cys, with a mutant allele frequency of 15% based on exome data. Targeted sequencing and subcloning of NLRP3 amplicons confirmed the presence of the somatic mutation in whole blood at a ratio similar to the exome data. The mutant allele frequency was in the range of 13.3-16.8% in monocytes and 15.2-18% in granulocytes. Notably, this mutation was either absent or present at a very low frequency in B and T lymphocytes, in buccal cells, and in the patient's cultured fibroblasts. Our findings indicate the possibility of myeloid-restricted somatic mosaicism in the pathogenesis of CAPS, underscoring the emerging role of massively parallel sequencing in clinical diagnosis. Published 2015. This article is a U.S. Government work and is in the public domain in the USA.

  17. Targeted exome sequencing reveals novel USH2A mutations in Chinese patients with simplex Usher syndrome.

    PubMed

    Shu, Hai-Rong; Bi, Huai; Pan, Yang-Chun; Xu, Hang-Yu; Song, Jian-Xin; Hu, Jie

    2015-09-16

    Usher syndrome (USH) is an autosomal recessive disorder characterized by hearing impairment and vision dysfunction due to retinitis pigmentosa. Phenotypic and genetic heterogeneities of this disease make it impractical to obtain a genetic diagnosis by conventional Sanger sequencing. In this study, we applied a next-generation sequencing approach to detect genetic abnormalities in patients with USH. Two unrelated Chinese families were recruited, consisting of two USH afflicted patients and four unaffected relatives. We selected 199 genes related to inherited retinal diseases as targets for deep exome sequencing. Through systematic data analysis using an established bioinformatics pipeline, all variants that passed filter criteria were validated by Sanger sequencing and co-segregation analysis. A homozygous frameshift mutation (c.4382delA, p.T1462Lfs*2) was revealed in exon20 of gene USH2A in the F1 family. Two compound heterozygous mutations, IVS47 + 1G > A and c.13156A > T (p.I4386F), located in intron 48 and exon 63 respectively, of USH2A, were identified as causative mutations for the F2 family. Of note, the missense mutation c.13156A > T has not been reported so far. In conclusion, targeted exome sequencing precisely and rapidly identified the genetic defects in two Chinese USH families and this technique can be applied as a routine examination for these disorders with significant clinical and genetic heterogeneity.

  18. Exome Sequencing in 32 Patients with Anophthalmia/Microphthalmia and Developmental Eye Defects

    PubMed Central

    Slavotinek, Anne M.; Garcia, Sarah T.; Chandratillake, Gemma; Bardakjian, Tanya; Ullah, Ehsan; Wu, Di; Umeda, Kyle; Lao, Richard; Tang, Paul Ling-Fung; Wan, Eunice; Madireddy, Lohith; Lyalina, Svetlana; Mendelsohn, Bryce A.; Dugan, Sarah; Tirch, Jean; Tischler, Reana; Harris, Jason; Clark, Michael J.; Chervitz, Stephen; Patwardhan, Anil; West, John M.; Ursell, Phillip; de Alba Campomanes, Alejandra; Schneider, Adele; Kwok, Pui-yan; Baranzini, Sergio; Chen, Richard O.

    2014-01-01

    Anophthalmia/microphthalmia (A/M) is a genetically heterogeneous birth defect for which the etiology is unknown in more than 50% of patients. We used exome sequencing with the ACE Exome™ (Personalis, Inc; 18 cases) and UCSF Genomics Core (21 cases) to sequence 28 patients with A/M and four patients with varied developmental eye defects. In the 28 patients with A/M, we identified de novo mutations in three patients (OTX2, p.(Gln91His), RARB, p.Arg387Cys and GDF6, p.Ala249Glu) and inherited mutations in STRA6 in two patients. In patients with developmental eye defects, a female with cataracts and cardiomyopathy had a de novo COL4A1 mutation, p.(Gly773Arg), expanding the phenotype associated with COL4A1 to include cardiomyopathy. A male with a chorioretinal defect, microcephaly, seizures and sensorineural deafness had two PNPT1 mutations, p.(Ala507Ser) and c.401-1G>A, and we describe eye defects associated with this gene for the first time. Exome sequencing was efficient for identifying mutations in pathogenic genes for which there is no clinical testing available and for identifying cases that expand phenotypic spectra, such as the PNPT1 and COL4A1-associated disorders described here. PMID:25457163

  19. Promises, pitfalls and practicalities of prenatal whole exome sequencing.

    PubMed

    Best, Sunayna; Wou, Karen; Vora, Neeta; Van der Veyver, Ignatia B; Wapner, Ronald; Chitty, Lyn S

    2018-01-01

    Prenatal genetic diagnosis provides information for pregnancy and perinatal decision-making and management. In several small series, prenatal whole exome sequencing (WES) approaches have identified genetic diagnoses when conventional tests (karyotype and microarray) were not diagnostic. Here, we review published prenatal WES studies and recent conference abstracts. Thirty-one studies were identified, with diagnostic rates in series of five or more fetuses varying between 6.2% and 80%. Differences in inclusion criteria and trio versus singleton approaches to sequencing largely account for the wide range of diagnostic rates. The data suggest that diagnostic yields will be greater in fetuses with multiple anomalies or in cases preselected following genetic review. Beyond its ability to improve diagnostic rates, we explore the potential of WES to improve understanding of prenatal presentations of genetic disorders and lethal fetal syndromes. We discuss prenatal phenotyping limitations, counselling challenges regarding variants of uncertain significance, incidental and secondary findings, and technical problems in WES. We review the practical, ethical, social and economic issues that must be considered before prenatal WES could become part of routine testing. Finally, we reflect upon the potential future of prenatal genetic diagnosis, including a move towards whole genome sequencing and non-invasive whole exome and whole genome testing. © 2017 John Wiley & Sons, Ltd. © 2017 John Wiley & Sons, Ltd.

  20. Application of exome sequencing in the search for genetic causes of rare disorders of copper metabolism.

    PubMed

    Fuchs, Sabine A; Harakalova, Magdalena; van Haaften, Gijs; van Hasselt, Peter M; Cuppen, Edwin; Houwen, Roderick H J

    2012-07-01

    The genetic defect in a number of rare disorders of metal metabolism remains elusive. The limited number of patients with these disorders impedes the identification of the causative gene through positional cloning, which requires numerous families with multiple affected individuals. However, with next-generation sequencing all coding DNA (exomes) or whole genomes of patients can be sequenced to identify genes that are consistently mutated in patients. With this strategy only a limited number of patients and/or pedigrees is needed, bringing the elucidation of the genetic cause of even very rare diseases within reach. The main challenge associated with whole exome sequencing is the identification of the disease-causing mutation(s) among abundant genetic candidate variants. We describe several strategies to manage this data wealth, including comparison with control databases, increasing the number of patients and controls, and reducing the genomic region under investigation through homozygosity mapping. In this review we introduce a number of rare disorders of copper metabolism, with a suspected but yet unknown monogenetic cause, as an attractive target for this strategy. We anticipate that use of these novel techniques will identify the basic defect in the disorders described in this review, as well as in other genetic disorders of metal metabolism, in the next few years.

  1. Clinical diagnostic exome evaluation for an infant with a lethal disorder: genetic diagnosis of TARP syndrome and expansion of the phenotype in a patient with a newly reported RBM10 alteration.

    PubMed

    Powis, Zöe; Hart, Alexa; Cherny, Sara; Petrik, Igor; Palmaer, Erika; Tang, Sha; Jones, Carolyn

    2017-06-02

    Diagnostic Exome Sequencing (DES) has been shown to be an effective tool for diagnosis individuals with suspected genetic conditions. We report a male infant born with multiple anomalies including bilateral dysplastic kidneys, cleft palate, bilateral talipes, and bilateral absence of thumbs and first toes. Prenatal testing including chromosome analysis and microarray did not identify a cause for the multiple congenital anomalies. Postnatal diagnostic exome studies (DES) were utilized to find a molecular diagnosis for the patient. Exome sequencing of the proband, mother, and father showed a previously unreported maternally inherited RNA binding motif protein 10 (RBM10) c.1352_1353delAG (p.E451Vfs*66) alteration. Mutations in RBM10 are associated with TARP syndrome, an X-linked recessive disorder originally described with cardinal features of talipes equinovarus, atrial septal defect, Robin sequence, and persistent left superior vena cava. DES established a molecular genetic diagnosis of TARP syndrome for a neonatal patient with a poor prognosis in whom traditional testing methods were uninformative and allowed for efficient diagnosis and future reproductive options for the parents. Other reported cases of TARP syndrome demonstrate significant variability in clinical phenotype. The reported features in this infant including multiple hemivertebrae, imperforate anus, aplasia of thumbs and first toes have not been reported in previous patients, thus expanding the clinical phenotype for this rare disorder.

  2. RTTN Mutations Cause Primary Microcephaly and Primordial Dwarfism in Humans

    PubMed Central

    Shamseldin, Hanan; Alazami, Anas M.; Manning, Melanie; Hashem, Amal; Caluseiu, Oana; Tabarki, Brahim; Esplin, Edward; Schelley, Susan; Innes, A. Micheil; Parboosingh, Jillian S.; Lamont, Ryan; Majewski, Jacek; Bernier, Francois P.; Alkuraya, Fowzan S.

    2015-01-01

    Primary microcephaly is a developmental brain anomaly that results from defective proliferation of neuroprogenitors in the germinal periventricular zone. More than a dozen genes are known to be mutated in autosomal-recessive primary microcephaly in isolation or in association with a more generalized growth deficiency (microcephalic primordial dwarfism), but the genetic heterogeneity is probably more extensive. In a research protocol involving autozygome mapping and exome sequencing, we recruited a multiplex consanguineous family who is affected by severe microcephalic primordial dwarfism and tested negative on clinical exome sequencing. Two candidate autozygous intervals were identified, and the second round of exome sequencing revealed a single intronic variant therein (c.2885+8A>G [p.Ser963∗] in RTTN exon 23). RT-PCR confirmed that this change creates a cryptic splice donor and thus causes retention of the intervening 7 bp of the intron and leads to premature truncation. On the basis of this finding, we reanalyzed the exome file of a second consanguineous family affected by a similar phenotype and identified another homozygous change in RTTN as the likely causal mutation. Combined linkage analysis of the two families confirmed that RTTN maps to the only significant linkage peak. Finally, through international collaboration, a Canadian multiplex family affected by microcephalic primordial dwarfism and biallelic mutation of RTTN was identified. Our results expand the phenotype of RTTN-related disorders, hitherto limited to polymicrogyria, to include microcephalic primordial dwarfism with a complex brain phenotype involving simplified gyration. PMID:26608784

  3. Clinical exome sequencing reports: current informatics practice and future opportunities.

    PubMed

    Swaminathan, Rajeswari; Huang, Yungui; Astbury, Caroline; Fitzgerald-Butt, Sara; Miller, Katherine; Cole, Justin; Bartlett, Christopher; Lin, Simon

    2017-11-01

    The increased adoption of clinical whole exome sequencing (WES) has improved the diagnostic yield for patients with complex genetic conditions. However, the informatics practice for handling information contained in whole exome reports is still in its infancy, as evidenced by the lack of a common vocabulary within clinical sequencing reports generated across genetic laboratories. Genetic testing results are mostly transmitted using portable document format, which can make secondary analysis and data extraction challenging. This paper reviews a sample of clinical exome reports generated by Clinical Laboratory Improvement Amendments-certified genetic testing laboratories at tertiary-care facilities to assess and identify common data elements. Like structured radiology reports, which enable faster information retrieval and reuse, structuring genetic information within clinical WES reports would help facilitate integration of genetic information into electronic health records and enable retrospective research on the clinical utility of WES. We identify elements listed as mandatory according to practice guidelines but are currently missing from some of the clinical reports, which might help to organize the data when stored within structured databases. We also highlight elements, such as patient consent, that, although they do not appear within any of the current reports, may help in interpreting some of the information within the reports. Integrating genetic and clinical information would assist the adoption of personalized medicine for improved patient care and outcomes. © The Author 2017. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  4. Amplicon-based semiconductor sequencing of human exomes: performance evaluation and optimization strategies.

    PubMed

    Damiati, E; Borsani, G; Giacopuzzi, Edoardo

    2016-05-01

    The Ion Proton platform allows to perform whole exome sequencing (WES) at low cost, providing rapid turnaround time and great flexibility. Products for WES on Ion Proton system include the AmpliSeq Exome kit and the recently introduced HiQ sequencing chemistry. Here, we used gold standard variants from GIAB consortium to assess the performances in variants identification, characterize the erroneous calls and develop a filtering strategy to reduce false positives. The AmpliSeq Exome kit captures a large fraction of bases (>94 %) in human CDS, ClinVar genes and ACMG genes, but with 2,041 (7 %), 449 (13 %) and 11 (19 %) genes not fully represented, respectively. Overall, 515 protein coding genes contain hard-to-sequence regions, including 90 genes from ClinVar. Performance in variants detection was maximum at mean coverage >120×, while at 90× and 70× we measured a loss of variants of 3.2 and 4.5 %, respectively. WES using HiQ chemistry showed ~71/97.5 % sensitivity, ~37/2 % FDR and ~0.66/0.98 F1 score for indels and SNPs, respectively. The proposed low, medium or high-stringency filters reduced the amount of false positives by 10.2, 21.2 and 40.4 % for indels and 21.2, 41.9 and 68.2 % for SNP, respectively. Amplicon-based WES on Ion Proton platform using HiQ chemistry emerged as a competitive approach, with improved accuracy in variants identification. False-positive variants remain an issue for the Ion Torrent technology, but our filtering strategy can be applied to reduce erroneous variants.

  5. Novel candidate genes may be possible predisposing factors revealed by whole exome sequencing in familial esophageal squamous cell carcinoma.

    PubMed

    Forouzanfar, Narjes; Baranova, Ancha; Milanizadeh, Saman; Heravi-Moussavi, Alireza; Jebelli, Amir; Abbaszadegan, Mohammad Reza

    2017-05-01

    Esophageal squamous cell carcinoma is one of the deadliest of all the cancers. Its metastatic properties portend poor prognosis and high rate of recurrence. A more advanced method to identify new molecular biomarkers predicting disease prognosis can be whole exome sequencing. Here, we report the most effective genetic variants of the Notch signaling pathway in esophageal squamous cell carcinoma susceptibility by whole exome sequencing. We analyzed nine probands in unrelated familial esophageal squamous cell carcinoma pedigrees to identify candidate genes. Genomic DNA was extracted and whole exome sequencing performed to generate information about genetic variants in the coding regions. Bioinformatics software applications were utilized to exploit statistical algorithms to demonstrate protein structure and variants conservation. Polymorphic regions were excluded by false-positive investigations. Gene-gene interactions were analyzed for Notch signaling pathway candidates. We identified novel and damaging variants of the Notch signaling pathway through extensive pathway-oriented filtering and functional predictions, which led to the study of 27 candidate novel mutations in all nine patients. Detection of the trinucleotide repeat containing 6B gene mutation (a slice site alteration) in five of the nine probands, but not in any of the healthy samples, suggested that it may be a susceptibility factor for familial esophageal squamous cell carcinoma. Noticeably, 8 of 27 novel candidate gene mutations (e.g. epidermal growth factor, signal transducer and activator of transcription 3, MET) act in a cascade leading to cell survival and proliferation. Our results suggest that the trinucleotide repeat containing 6B mutation may be a candidate predisposing gene in esophageal squamous cell carcinoma. In addition, some of the Notch signaling pathway genetic mutations may act as key contributors to esophageal squamous cell carcinoma.

  6. Exome Sequencing in the Clinical Diagnosis of Sporadic or Familial Cerebellar Ataxia

    PubMed Central

    Fogel, Brent L.; Lee, Hane; Deignan, Joshua L.; Strom, Samuel P.; Kantarci, Sibel; Wang, Xizhe; Quintero-Rivera, Fabiola; Vilain, Eric; Grody, Wayne W.; Perlman, Susan; Geschwind, Daniel H.; Nelson, Stanley F.

    2015-01-01

    IMPORTANCE Cerebellar ataxias are a diverse collection of neurologic disorders with causes ranging from common acquired etiologies to rare genetic conditions. Numerous genetic disorders have been associated with chronic progressive ataxia and this consequently presents a diagnostic challenge for the clinician regarding how to approach and prioritize genetic testing in patients with such clinically heterogeneous phenotypes. Additionally, while the value of genetic testing in early-onset and/or familial cases seems clear, many patients with ataxia present sporadically with adult onset of symptoms and the contribution of genetic variation to the phenotype of these patients has not yet been established. OBJECTIVE To investigate the contribution of genetic disease in a population of patients with predominantly adult- and sporadic-onset cerebellar ataxia. DESIGN, SETTING, AND PARTICIPANTS We examined a consecutive series of 76 patients presenting to a tertiary referral center for evaluation of chronic progressive cerebellar ataxia. MAIN OUTCOMES AND MEASURES Next-generation exome sequencing coupled with comprehensive bioinformatic analysis, phenotypic analysis, and clinical correlation. RESULTS We identified clinically relevant genetic information in more than 60% of patients studied (n = 46), including diagnostic pathogenic gene variants in 21% (n = 16), a notable yield given the diverse genetics and clinical heterogeneity of the cerebellar ataxias. CONCLUSIONS AND RELEVANCE This study demonstrated that clinical exome sequencing in patients with adult-onset and sporadic presentations of ataxia is a high-yield test, providing a definitive diagnosis in more than one-fifth of patients and suggesting a potential diagnosis in more than one-third to guide additional phenotyping and diagnostic evaluation. Therefore, clinical exome sequencing is an appropriate consideration in the routine genetic evaluation of all patients presenting with chronic progressive cerebellar ataxia. PMID:25133958

  7. Genomic and clinical profiling of a national nephrotic syndrome cohort advocates a precision medicine approach to disease management.

    PubMed

    Bierzynska, Agnieszka; McCarthy, Hugh J; Soderquest, Katrina; Sen, Ethan S; Colby, Elizabeth; Ding, Wen Y; Nabhan, Marwa M; Kerecuk, Larissa; Hegde, Shivram; Hughes, David; Marks, Stephen; Feather, Sally; Jones, Caroline; Webb, Nicholas J A; Ognjanovic, Milos; Christian, Martin; Gilbert, Rodney D; Sinha, Manish D; Lord, Graham M; Simpson, Michael; Koziell, Ania B; Welsh, Gavin I; Saleem, Moin A

    2017-04-01

    Steroid Resistant Nephrotic Syndrome (SRNS) in children and young adults has differing etiologies with monogenic disease accounting for 2.9-30% in selected series. Using whole exome sequencing we sought to stratify a national population of children with SRNS into monogenic and non-monogenic forms, and further define those groups by detailed phenotypic analysis. Pediatric patients with SRNS were identified via a national United Kingdom Renal Registry. Whole exome sequencing was performed on 187 patients, of which 12% have a positive family history with a focus on the 53 genes currently known to be associated with nephrotic syndrome. Genetic findings were correlated with individual case disease characteristics. Disease causing variants were detected in 26.2% of patients. Most often this occurred in the three most common SRNS-associated genes: NPHS1, NPHS2, and WT1 but also in 14 other genes. The genotype did not always correlate with expected phenotype since mutations in OCRL, COL4A3, and DGKE associated with specific syndromes were detected in patients with isolated renal disease. Analysis by primary/presumed compared with secondary steroid resistance found 30.8% monogenic disease in primary compared with none in secondary SRNS permitting further mechanistic stratification. Genetic SRNS progressed faster to end stage renal failure, with no documented disease recurrence post-transplantation within this cohort. Primary steroid resistance in which no gene mutation was identified had a 47.8% risk of recurrence. In this unbiased pediatric population, whole exome sequencing allowed screening of all current candidate genes. Thus, deep phenotyping combined with whole exome sequencing is an effective tool for early identification of SRNS etiology, yielding an evidence-based algorithm for clinical management. Copyright © 2016 International Society of Nephrology. Published by Elsevier Inc. All rights reserved.

  8. Increased frequency of de novo copy number variants in congenital heart disease by integrative analysis of single nucleotide polymorphism array and exome sequence data.

    PubMed

    Glessner, Joseph T; Bick, Alexander G; Ito, Kaoru; Homsy, Jason; Rodriguez-Murillo, Laura; Fromer, Menachem; Mazaika, Erica; Vardarajan, Badri; Italia, Michael; Leipzig, Jeremy; DePalma, Steven R; Golhar, Ryan; Sanders, Stephan J; Yamrom, Boris; Ronemus, Michael; Iossifov, Ivan; Willsey, A Jeremy; State, Matthew W; Kaltman, Jonathan R; White, Peter S; Shen, Yufeng; Warburton, Dorothy; Brueckner, Martina; Seidman, Christine; Goldmuntz, Elizabeth; Gelb, Bruce D; Lifton, Richard; Seidman, Jonathan; Hakonarson, Hakon; Chung, Wendy K

    2014-10-24

    Congenital heart disease (CHD) is among the most common birth defects. Most cases are of unknown pathogenesis. To determine the contribution of de novo copy number variants (CNVs) in the pathogenesis of sporadic CHD. We studied 538 CHD trios using genome-wide dense single nucleotide polymorphism arrays and whole exome sequencing. Results were experimentally validated using digital droplet polymerase chain reaction. We compared validated CNVs in CHD cases with CNVs in 1301 healthy control trios. The 2 complementary high-resolution technologies identified 63 validated de novo CNVs in 51 CHD cases. A significant increase in CNV burden was observed when comparing CHD trios with healthy trios, using either single nucleotide polymorphism array (P=7×10(-5); odds ratio, 4.6) or whole exome sequencing data (P=6×10(-4); odds ratio, 3.5) and remained after removing 16% of de novo CNV loci previously reported as pathogenic (P=0.02; odds ratio, 2.7). We observed recurrent de novo CNVs on 15q11.2 encompassing CYFIP1, NIPA1, and NIPA2 and single de novo CNVs encompassing DUSP1, JUN, JUP, MED15, MED9, PTPRE SREBF1, TOP2A, and ZEB2, genes that interact with established CHD proteins NKX2-5 and GATA4. Integrating de novo variants in whole exome sequencing and CNV data suggests that ETS1 is the pathogenic gene altered by 11q24.2-q25 deletions in Jacobsen syndrome and that CTBP2 is the pathogenic gene in 10q subtelomeric deletions. We demonstrate a significantly increased frequency of rare de novo CNVs in CHD patients compared with healthy controls and suggest several novel genetic loci for CHD. © 2014 American Heart Association, Inc.

  9. Identification of novel point mutations in splicing sites integrating whole-exome and RNA-seq data in myeloproliferative diseases.

    PubMed

    Spinelli, Roberta; Pirola, Alessandra; Redaelli, Sara; Sharma, Nitesh; Raman, Hima; Valletta, Simona; Magistroni, Vera; Piazza, Rocco; Gambacorti-Passerini, Carlo

    2013-11-01

    Point mutations in intronic regions near mRNA splice junctions can affect the splicing process. To identify novel splicing variants from exome sequencing data, we developed a bioinformatics splice-site prediction procedure to analyze next-generation sequencing (NGS) data (SpliceFinder). SpliceFinder integrates two functional annotation tools for NGS, ANNOVAR and MutationTaster and two canonical splice site prediction programs for single mutation analysis, SSPNN and NetGene2. By SpliceFinder, we identified somatic mutations affecting RNA splicing in a colon cancer sample, in eight atypical chronic myeloid leukemia (aCML), and eight CML patients. A novel homozygous splicing mutation was found in APC (NM_000038.4:c.1312+5G>A) and six heterozygous in GNAQ (NM_002072.2:c.735+1C>T), ABCC 3 (NM_003786.3:c.1783-1G>A), KLHDC 1 (NM_172193.1:c.568-2A>G), HOOK 1 (NM_015888.4:c.1662-1G>A), SMAD 9 (NM_001127217.2:c.1004-1C>T), and DNAH 9 (NM_001372.3:c.10242+5G>A). Integrating whole-exome and RNA sequencing in aCML and CML, we assessed the phenotypic effect of mutations on mRNA splicing for GNAQ, ABCC 3, HOOK 1. In ABCC 3 and HOOK 1, RNA-Seq showed the presence of aberrant transcripts with activation of a cryptic splice site or intron retention, validated by the reverse transcription-polymerase chain reaction (RT-PCR) in the case of HOOK 1. In GNAQ, RNA-Seq showed 22% of wild-type transcript and 78% of mRNA skipping exon 5, resulting in a 4-6 frameshift fusion confirmed by RT-PCR. The pipeline can be useful to identify intronic variants affecting RNA sequence by complementing conventional exome analysis.

  10. Identification of Novel Single Nucleotide Polymorphisms Associated with Acute Respiratory Distress Syndrome by Exome-Seq

    PubMed Central

    Shortt, Katherine; Chaudhary, Suman; Grigoryev, Dmitry; Heruth, Daniel P.; Venkitachalam, Lakshmi; Zhang, Li Q.; Ye, Shui Q.

    2014-01-01

    Acute respiratory distress syndrome (ARDS) is a lung condition characterized by impaired gas exchange with systemic release of inflammatory mediators, causing pulmonary inflammation, vascular leak and hypoxemia. Existing biomarkers have limited effectiveness as diagnostic and therapeutic targets. To identify disease-associating variants in ARDS patients, whole-exome sequencing was performed on 96 ARDS patients, detecting 1,382,399 SNPs. By comparing these exome data to those of the 1000 Genomes Project, we identified a number of single nucleotide polymorphisms (SNP) which are potentially associated with ARDS. 50,190SNPs were found in all case subgroups and controls, of which89 SNPs were associated with susceptibility. We validated three SNPs (rs78142040, rs9605146 and rs3848719) in additional ARDS patients to substantiate their associations with susceptibility, severity and outcome of ARDS. rs78142040 (C>T) occurs within a histone mark (intron 6) of the Arylsulfatase D gene. rs9605146 (G>A) causes a deleterious coding change (proline to leucine) in the XK, Kell blood group complex subunit-related family, member 3 gene. rs3848719 (G>A) is a synonymous SNP in the Zinc-Finger/Leucine-Zipper Co-Transducer NIF1 gene. rs78142040, rs9605146, and rs3848719 are associated significantly with susceptibility to ARDS. rs3848719 is associated with APACHE II score quartile. rs78142040 is associated with 60-day mortality in the overall ARDS patient population. Exome-seq is a powerful tool to identify potential new biomarkers for ARDS. We selectively validated three SNPs which have not been previously associated with ARDS and represent potential new genetic biomarkers for ARDS. Additional validation in larger patient populations and further exploration of underlying molecular mechanisms are warranted. PMID:25372662

  11. Genome at Juncture of Early Human Migration: A Systematic Analysis of Two Whole Genomes and Thirteen Exomes from Kuwaiti Population Subgroup of Inferred Saudi Arabian Tribe Ancestry

    PubMed Central

    Alsmadi, Osama; Hebbar, Prashantha; Antony, Dinu; Behbehani, Kazem; Thanaraj, Thangavel Alphonse

    2014-01-01

    Population of the State of Kuwait is composed of three genetic subgroups of inferred Persian, Saudi Arabian tribe and Bedouin ancestry. The Saudi Arabian tribe subgroup traces its origin to the Najd region of Saudi Arabia. By sequencing two whole genomes and thirteen exomes from this subgroup at high coverage (>40X), we identify 4,950,724 Single Nucleotide Polymorphisms (SNPs), 515,802 indels and 39,762 structural variations. Of the identified variants, 10,098 (8.3%) exomic SNPs, 139,923 (2.9%) non-exomic SNPs, 5,256 (54.3%) exomic indels, and 374,959 (74.08%) non-exomic indels are ‘novel’. Up to 8,070 (79.9%) of the reported novel biallelic exomic SNPs are seen in low frequency (minor allele frequency <5%). We observe 5,462 known and 1,004 novel potentially deleterious nonsynonymous SNPs. Allele frequencies of common SNPs from the 15 exomes is significantly correlated with those from genotype data of a larger cohort of 48 individuals (Pearson correlation coefficient, 0.91; p <2.2×10−16). A set of 2,485 SNPs show significantly different allele frequencies when compared to populations from other continents. Two notable variants having risk alleles in high frequencies in this subgroup are: a nonsynonymous deleterious SNP (rs2108622 [19:g.15990431C>T] from CYP4F2 gene [MIM:*604426]) associated with warfarin dosage levels [MIM:#122700] required to elicit normal anticoagulant response; and a 3′ UTR SNP (rs6151429 [22:g.51063477T>C]) from ARSA gene [MIM:*607574]) associated with Metachromatic Leukodystrophy [MIM:#250100]. Hemoglobin Riyadh variant (identified for the first time in a Saudi Arabian woman) is observed in the exome data. The mitochondrial haplogroup profiles of the 15 individuals are consistent with the haplogroup diversity seen in Saudi Arabian natives, who are believed to have received substantial gene flow from Africa and eastern provenance. We present the first genome resource imperative for designing future genetic studies in Saudi Arabian tribe subgroup. The full-length genome sequences and the identified variants are available at ftp://dgr.dasmaninstitute.org and http://dgr.dasmaninstitute.org/DGR/gb.html. PMID:24896259

  12. The UK10K project identifies rare variants in health and disease.

    PubMed

    Walter, Klaudia; Min, Josine L; Huang, Jie; Crooks, Lucy; Memari, Yasin; McCarthy, Shane; Perry, John R B; Xu, ChangJiang; Futema, Marta; Lawson, Daniel; Iotchkova, Valentina; Schiffels, Stephan; Hendricks, Audrey E; Danecek, Petr; Li, Rui; Floyd, James; Wain, Louise V; Barroso, Inês; Humphries, Steve E; Hurles, Matthew E; Zeggini, Eleftheria; Barrett, Jeffrey C; Plagnol, Vincent; Richards, J Brent; Greenwood, Celia M T; Timpson, Nicholas J; Durbin, Richard; Soranzo, Nicole

    2015-10-01

    The contribution of rare and low-frequency variants to human traits is largely unexplored. Here we describe insights from sequencing whole genomes (low read depth, 7×) or exomes (high read depth, 80×) of nearly 10,000 individuals from population-based and disease collections. In extensively phenotyped cohorts we characterize over 24 million novel sequence variants, generate a highly accurate imputation reference panel and identify novel alleles associated with levels of triglycerides (APOB), adiponectin (ADIPOQ) and low-density lipoprotein cholesterol (LDLR and RGAG1) from single-marker and rare variant aggregation tests. We describe population structure and functional annotation of rare and low-frequency variants, use the data to estimate the benefits of sequencing for association studies, and summarize lessons from disease-specific collections. Finally, we make available an extensive resource, including individual-level genetic and phenotypic data and web-based tools to facilitate the exploration of association results.

  13. Chemoresistance Evolution in Triple-Negative Breast Cancer Delineated by Single-Cell Sequencing.

    PubMed

    Kim, Charissa; Gao, Ruli; Sei, Emi; Brandt, Rachel; Hartman, Johan; Hatschek, Thomas; Crosetto, Nicola; Foukakis, Theodoros; Navin, Nicholas E

    2018-05-03

    Triple-negative breast cancer (TNBC) is an aggressive subtype that frequently develops resistance to chemotherapy. An unresolved question is whether resistance is caused by the selection of rare pre-existing clones or alternatively through the acquisition of new genomic aberrations. To investigate this question, we applied single-cell DNA and RNA sequencing in addition to bulk exome sequencing to profile longitudinal samples from 20 TNBC patients during neoadjuvant chemotherapy (NAC). Deep-exome sequencing identified 10 patients in which NAC led to clonal extinction and 10 patients in which clones persisted after treatment. In 8 patients, we performed a more detailed study using single-cell DNA sequencing to analyze 900 cells and single-cell RNA sequencing to analyze 6,862 cells. Our data showed that resistant genotypes were pre-existing and adaptively selected by NAC, while transcriptional profiles were acquired by reprogramming in response to chemotherapy in TNBC patients. Copyright © 2018 Elsevier Inc. All rights reserved.

  14. Coexpression networks implicate human midfetal deep cortical projection neurons in the pathogenesis of autism

    PubMed Central

    Willsey, A. Jeremy; Sanders, Stephan J.; Li, Mingfeng; Dong, Shan; Tebbenkamp, Andrew T.; Muhle, Rebecca A.; Reilly, Steven K.; Lin, Leon; Fertuzinhos, Sofia; Miller, Jeremy A.; Murtha, Michael T.; Bichsel, Candace; Niu, Wei; Cotney, Justin; Ercan-Sencicek, A. Gulhan; Gockley, Jake; Gupta, Abha; Han, Wenqi; He, Xin; Hoffman, Ellen; Klei, Lambertus; Lei, Jing; Liu, Wenzhong; Liu, Li; Lu, Cong; Xu, Xuming; Zhu, Ying; Mane, Shrikant M.; Lein, Edward S.; Wei, Liping; Noonan, James P.; Roeder, Kathryn; Devlin, Bernie; Šestan, Nenad; State, Matthew W.

    2013-01-01

    SUMMARY Autism spectrum disorder (ASD) is a complex developmental syndrome of unknown etiology. Recent studies employing exome- and genome-wide sequencing have identified nine high-confidence ASD (hcASD) genes. Working from the hypothesis that ASD-associated mutations in these biologically pleiotropic genes will disrupt intersecting developmental processes to contribute to a common phenotype, we have attempted to identify time periods, brain regions, and cell types in which these genes converge. We have constructed coexpression networks based on the hcASD “seed” genes, leveraging a rich expression data set encompassing multiple human brain regions across human development and into adulthood. By assessing enrichment of an independent set of probable ASD (pASD) genes, derived from the same sequencing studies, we demonstrate a key point of convergence in midfetal layer 5/6 cortical projection neurons. This approach informs when, where, and in what cell types mutations in these specific genes may be productively studied to clarify ASD pathophysiology. PMID:24267886

  15. Exome sequencing analysis reveals variants in primary immunodeficiency genes in patients with very early onset inflammatory bowel disease.

    PubMed

    Kelsen, Judith R; Dawany, Noor; Moran, Christopher J; Petersen, Britt-Sabina; Sarmady, Mahdi; Sasson, Ariella; Pauly-Hubbard, Helen; Martinez, Alejandro; Maurer, Kelly; Soong, Joanne; Rappaport, Eric; Franke, Andre; Keller, Andreas; Winter, Harland S; Mamula, Petar; Piccoli, David; Artis, David; Sonnenberg, Gregory F; Daly, Mark; Sullivan, Kathleen E; Baldassano, Robert N; Devoto, Marcella

    2015-11-01

    Very early onset inflammatory bowel disease (VEO-IBD), IBD diagnosed at 5 years of age or younger, frequently presents with a different and more severe phenotype than older-onset IBD. We investigated whether patients with VEO-IBD carry rare or novel variants in genes associated with immunodeficiencies that might contribute to disease development. Patients with VEO-IBD and parents (when available) were recruited from the Children's Hospital of Philadelphia from March 2013 through July 2014. We analyzed DNA from 125 patients with VEO-IBD (age, 3 wk to 4 y) and 19 parents, 4 of whom also had IBD. Exome capture was performed by Agilent SureSelect V4, and sequencing was performed using the Illumina HiSeq platform. Alignment to human genome GRCh37 was achieved followed by postprocessing and variant calling. After functional annotation, candidate variants were analyzed for change in protein function, minor allele frequency less than 0.1%, and scaled combined annotation-dependent depletion scores of 10 or less. We focused on genes associated with primary immunodeficiencies and related pathways. An additional 210 exome samples from patients with pediatric IBD (n = 45) or adult-onset Crohn's disease (n = 20) and healthy individuals (controls, n = 145) were obtained from the University of Kiel, Germany, and used as control groups. Four hundred genes and regions associated with primary immunodeficiency, covering approximately 6500 coding exons totaling more than 1 Mbp of coding sequence, were selected from the whole-exome data. Our analysis showed novel and rare variants within these genes that could contribute to the development of VEO-IBD, including rare heterozygous missense variants in IL10RA and previously unidentified variants in MSH5 and CD19. In an exome sequence analysis of patients with VEO-IBD and their parents, we identified variants in genes that regulate B- and T-cell functions and could contribute to pathogenesis. Our analysis could lead to the identification of previously unidentified IBD-associated variants. Copyright © 2015 AGA Institute. Published by Elsevier Inc. All rights reserved.

  16. Whole Exome Sequencing and Heterologous Cellular Electrophysiology Studies Elucidate a Novel Loss-of-Function Mutation in the CACNA1A-Encoded Neuronal P/Q-Type Calcium Channel in a Child With Congenital Hypotonia and Developmental Delay.

    PubMed

    Weyhrauch, Derek L; Ye, Dan; Boczek, Nicole J; Tester, David J; Gavrilova, Ralitza H; Patterson, Marc C; Wieben, Eric D; Ackerman, Michael J

    2016-02-01

    A 4-year-old boy born at 37 weeks' gestation with intrauterine growth retardation presented with developmental delay with pronounced language and gross motor delay, axial hypotonia, and dynamic hypertonia of the extremities. Investigations including the Minnesota Newborn Screen, thyroid stimulating hormone/thyroxin, and inborn errors of metabolism screening were negative. Cerebral magnetic resonance imaging and spectroscopy were normal. Genetic testing was negative for coagulopathy, Smith-Lemli-Opitz, fragile X, and Prader-Willi/Angelman syndromes. Whole genome array analysis was unremarkable. Whole exome sequencing was performed through a commercial testing laboratory to elucidate the underlying etiology for the child's presentation. A de novo mutation was hypothesized. In attempt to establish pathogenicity of our candidate variant, cellular electrophysiologic functional analysis of the putative de novo mutation was performed using patch-clamp technology. Whole exome sequencing revealed a p.P1353L variant in the CACNA1A gene, which encodes for the α1-subunit of the brain-specific P/Q-type calcium channel (CaV2.1). This presynaptic high-voltage-gated channel couples neuronal excitation to the vesicular release of neurotransmitter and is implicated in several neurologic disorders. DNA Sanger sequencing confirmed that the de novo mutation was absent in both parents and present in the child only. Electrophysiologic analysis of P1353L-CACNA1A demonstrated near complete loss of function, with a 95% reduction in peak current density. Whole exome sequencing coupled with cellular electrophysiologic functional analysis of a de novoCACNA1A missense mutation has elucidated the probable underlying pathophysiologic mechanism responsible for the child's phenotype. Genetic testing of CACNA1A in patients with congenital hypotonia and developmental delay may be warranted. Copyright © 2016. Published by Elsevier Inc.

  17. Microcephaly-capillary malformation syndrome: Brothers with a homozygous STAMBP mutation, uncovered by exome sequencing.

    PubMed

    Naseer, Muhammad Imran; Sogaty, Sameera; Rasool, Mahmood; Chaudhary, Adeel G; Abutalib, Yousif Ahmed; Walker, Susan; Marshall, Christian R; Merico, Daniele; Carter, Melissa T; Scherer, Stephen W; Al-Qahtani, Mohammad H; Zarrei, Mehdi

    2016-11-01

    We describe two brothers from a consanguineous family of Egyptian ancestry, presenting with microcephaly, apparent global developmental delay, seizures, spasticity, congenital blindness, and multiple cutaneous capillary malformations. Through exome sequencing, we uncovered a homozygous missense variant in STAMBP (p.K303R) in the two siblings, inherited from heterozygous carrier parents. Mutations in STAMBP are known to cause microcephaly-capillary malformation syndrome (MIC-CAP) and the phenotype in this family is consistent with this diagnosis. We compared the findings in the present brothers with those of earlier reported patients. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.

  18. Systematic comparison of variant calling pipelines using gold standard personal exome variants

    PubMed Central

    Hwang, Sohyun; Kim, Eiru; Lee, Insuk; Marcotte, Edward M.

    2015-01-01

    The success of clinical genomics using next generation sequencing (NGS) requires the accurate and consistent identification of personal genome variants. Assorted variant calling methods have been developed, which show low concordance between their calls. Hence, a systematic comparison of the variant callers could give important guidance to NGS-based clinical genomics. Recently, a set of high-confident variant calls for one individual (NA12878) has been published by the Genome in a Bottle (GIAB) consortium, enabling performance benchmarking of different variant calling pipelines. Based on the gold standard reference variant calls from GIAB, we compared the performance of thirteen variant calling pipelines, testing combinations of three read aligners—BWA-MEM, Bowtie2, and Novoalign—and four variant callers—Genome Analysis Tool Kit HaplotypeCaller (GATK-HC), Samtools mpileup, Freebayes and Ion Proton Variant Caller (TVC), for twelve data sets for the NA12878 genome sequenced by different platforms including Illumina2000, Illumina2500, and Ion Proton, with various exome capture systems and exome coverage. We observed different biases toward specific types of SNP genotyping errors by the different variant callers. The results of our study provide useful guidelines for reliable variant identification from deep sequencing of personal genomes. PMID:26639839

  19. Exome Sequencing and Linkage Analysis Identified Novel Candidate Genes in Recessive Intellectual Disability Associated with Ataxia.

    PubMed

    Jazayeri, Roshanak; Hu, Hao; Fattahi, Zohreh; Musante, Luciana; Abedini, Seyedeh Sedigheh; Hosseini, Masoumeh; Wienker, Thomas F; Ropers, Hans Hilger; Najmabadi, Hossein; Kahrizi, Kimia

    2015-10-01

    Intellectual disability (ID) is a neuro-developmental disorder which causes considerable socio-economic problems. Some ID individuals are also affected by ataxia, and the condition includes different mutations affecting several genes. We used whole exome sequencing (WES) in combination with homozygosity mapping (HM) to identify the genetic defects in five consanguineous families among our cohort study, with two affected children with ID and ataxia as major clinical symptoms. We identified three novel candidate genes, RIPPLY1, MRPL10, SNX14, and a new mutation in known gene SURF1. All are autosomal genes, except RIPPLY1, which is located on the X chromosome. Two are housekeeping genes, implicated in transcription and translation regulation and intracellular trafficking, and two encode mitochondrial proteins. The pathogenesis of these variants was evaluated by mutation classification, bioinformatic methods, review of medical and biological relevance, co-segregation studies in the particular family, and a normal population study. Linkage analysis and exome sequencing of a small number of affected family members is a powerful new technique which can be used to decrease the number of candidate genes in heterogenic disorders such as ID, and may even identify the responsible gene(s).

  20. Prenatal whole exome sequencing: the views of clinicians, scientists, genetic counsellors and patient representatives.

    PubMed

    Quinlan-Jones, Elizabeth; Kilby, Mark D; Greenfield, Sheila; Parker, Michael; McMullan, Dominic; Hurles, Matthew E; Hillman, Sarah C

    2016-10-01

    Focus groups were conducted with individuals involved in prenatal diagnosis to determine their opinions relating to whole exome sequencing in fetuses with structural anomalies. Five representatives of patient groups/charities (PRGs) and eight clinical professionals (CPs) participated. Three focus groups occurred (the two groups separately and then combined). Framework analysis was performed to elicit themes. A thematic coding frame was identified based on emerging themes. Seven main themes (consent, analysis, interpretation/reinterpretation of results, prenatal issues, uncertainty, incidental findings and information access) with subthemes emerged. The main themes were raised by both groups, apart from 'analysis', which was raised by CPs only. Some subthemes were raised by PRGs and CPs (with different perspectives). Others were raised either by PRGs or CPs, showing differences in patient/clinician agendas. Prenatal consent for whole exome sequencing is not a 'perfect' process, but consent takers should be fully educated regarding the test. PRGs highlighted issues involving access to results, feeling that women want to know all information. PRGs also felt that patients want reinterpretation of results over time, whilst CPs felt that interpretation should be performed at the point of testing only. © 2016 John Wiley & Sons, Ltd. © 2016 John Wiley & Sons, Ltd.

  1. Exome sequencing for the differential diagnosis of ciliary chondrodysplasias: Example of a WDR35 mutation case and review of the literature.

    PubMed

    Antony, Dinu; Nampoory, Narayanan; Bacchelli, Chiara; Melhem, Motasem; Wu, Kaman; James, Chela T; Beales, Philip L; Hubank, Mike; Thomas, Daisy; Mashankar, Anant; Behbehani, Kazem; Schmidts, Miriam; Alsmadi, Osama

    2017-12-01

    Exome sequencing is becoming widely popular and affordable, making it one of the most desirable methods for the identification of rare genetic variants for clinical diagnosis. Here, we report the clinical application of whole exome sequencing for the ultimate diagnosis of a ciliary chondrodysplasia case presented with an initial clinical diagnosis of Asphyxiating Thoracic Dystrophy (ATD, Jeune Syndrome). We have identified a novel homozygous missense mutation in WDR35 (c.206G > A), a gene previously associated with Sensenbrenner Syndrome, Ellis-van Creveld syndrome and Short-rib polydactyly syndrome type V. The genetic findings in this family led to the re-evaluation of the initial diagnosis and a differential diagnosis of Sensenbrenner Syndrome was made after cautious re-examination of the patient. Cell culture studies revealed normal subcellular localization of the mutant WDR35 protein in comparison to wildtype protein, pointing towards impaired protein-protein interaction and/or altered cell signaling pathways as a consequence of the mutated allele. This research study highlights the importance of including pathogenic variant identification in the diagnosis pipeline of ciliary chondrodysplasias, especially for clinically not fully defined phenotypes. Copyright © 2017 Elsevier Masson SAS. All rights reserved.

  2. STAG3 truncating variant as the cause of primary ovarian insufficiency

    PubMed Central

    Le Quesne Stabej, Polona; Williams, Hywel J; James, Chela; Tekman, Mehmet; Stanescu, Horia C; Kleta, Robert; Ocaka, Louise; Lescai, Francesco; Storr, Helen L; Bitner-Glindzicz, Maria; Bacchelli, Chiara; Conway, Gerard S

    2016-01-01

    Primary ovarian insufficiency (POI) is a distressing cause of infertility in young women. POI is heterogeneous with only a few causative genes having been discovered so far. Our objective was to determine the genetic cause of POI in a consanguineous Lebanese family with two affected sisters presenting with primary amenorrhoea and an absence of any pubertal development. Multipoint parametric linkage analysis was performed. Whole-exome sequencing was done on the proband. Linkage analysis identified a locus on chromosome 7 where exome sequencing successfully identified a homozygous two base pair duplication (c.1947_48dupCT), leading to a truncated protein p.(Y650Sfs*22) in the STAG3 gene, confirming it as the cause of POI in this family. Exome sequencing combined with linkage analyses offers a powerful tool to efficiently find novel genetic causes of rare, heterogeneous disorders, even in small single families. This is only the second report of a STAG3 variant; the first STAG3 variant was recently described in a phenotypically similar family with extreme POI. Identification of an additional family highlights the importance of STAG3 in POI pathogenesis and suggests it should be evaluated in families affected with POI. PMID:26059840

  3. Identification of a Novel Missense FBN2 Mutation in a Chinese Family with Congenital Contractural Arachnodactyly Using Exome Sequencing

    PubMed Central

    Deng, Hao; Lu, Qian; Xu, Hongbo; Deng, Xiong; Yuan, Lamei; Yang, Zhijian; Guo, Yi; Lin, Qiongfen; Xiao, Jingjing; Guan, Liping; Song, Zhi

    2016-01-01

    Congenital contractural arachnodactyly (CCA, OMIM 121050), also known as Beals-Hecht syndrome, is an autosomal dominant disorder of connective tissue. CCA is characterized by arachnodactyly, dolichostenomelia, pectus deformities, kyphoscoliosis, congenital contractures and a crumpled appearance of the helix of the ear. The aim of this study is to identify the genetic cause of a 4-generation Chinese family of Tujia ethnicity with congenital contractural arachnodactyly by exome sequencing. The clinical features of patients in this family are consistent with CCA. A novel missense mutation, c.3769T>C (p.C1257R), in the fibrillin 2 gene (FBN2) was identified responsible for the genetic cause of our family with CCA. The p.C1257R mutation occurs in the 19th calcium-binding epidermal growth factor-like (cbEGF) domain. The amino acid residue cysteine in this domain is conserved among different species. Our findings suggest that exome sequencing is a powerful tool to discover mutation(s) in CCA. Our results may also provide new insights into the cause and diagnosis of CCA, and may have implications for genetic counseling and clinical management. PMID:27196565

  4. Whole-exome sequencing identifies novel compound heterozygous mutations in USH2A in Spanish patients with autosomal recessive retinitis pigmentosa

    PubMed Central

    Méndez-Vidal, Cristina; González-del Pozo, María; Vela-Boza, Alicia; Santoyo-López, Javier; López-Domingo, Francisco J.; Vázquez-Marouschek, Carmen; Dopazo, Joaquin; Borrego, Salud

    2013-01-01

    Purpose Retinitis pigmentosa (RP) is an inherited retinal dystrophy characterized by extreme genetic and clinical heterogeneity. Thus, the diagnosis is not always easily performed due to phenotypic and genetic overlap. Current clinical practices have focused on the systematic evaluation of a set of known genes for each phenotype, but this approach may fail in patients with inaccurate diagnosis or infrequent genetic cause. In the present study, we investigated the genetic cause of autosomal recessive RP (arRP) in a Spanish family in which the causal mutation has not yet been identified with primer extension technology and resequencing. Methods We designed a whole-exome sequencing (WES)-based approach using NimbleGen SeqCap EZ Exome V3 sample preparation kit and the SOLiD 5500×l next-generation sequencing platform. We sequenced the exomes of both unaffected parents and two affected siblings. Exome analysis resulted in the identification of 43,204 variants in the index patient. All variants passing filter criteria were validated with Sanger sequencing to confirm familial segregation and absence in the control population. In silico prediction tools were used to determine mutational impact on protein function and the structure of the identified variants. Results Novel Usher syndrome type 2A (USH2A) compound heterozygous mutations, c.4325T>C (p.F1442S) and c.15188T>G (p.L5063R), located in exons 20 and 70, respectively, were identified as probable causative mutations for RP in this family. Family segregation of the variants showed the presence of both mutations in all affected members and in two siblings who were apparently asymptomatic at the time of family ascertainment. Clinical reassessment confirmed the diagnosis of RP in these patients. Conclusions Using WES, we identified two heterozygous novel mutations in USH2A as the most likely disease-causing variants in a Spanish family diagnosed with arRP in which the cause of the disease had not yet been identified with commonly used techniques. Our data reinforce the clinical role of WES in the molecular diagnosis of highly heterogeneous genetic diseases where conventional genetic approaches have previously failed in achieving a proper diagnosis. PMID:24227914

  5. Whole-exome analysis of foetal autopsy tissue reveals a frameshift mutation in OBSL1, consistent with a diagnosis of 3-M Syndrome.

    PubMed

    Marshall, Christian R; Farrell, Sandra A; Cushing, Donna; Paton, Tara; Stockley, Tracy L; Stavropoulos, Dimitri J; Ray, Peter N; Szego, Michael; Lau, Lynette; Pereira, Sergio L; Cohn, Ronald D; Wintle, Richard F; Abuzenadah, Adel M; Abu-Elmagd, Muhammad; Scherer, Stephen W

    2015-01-01

    We report a consanguineous couple that has experienced three consecutive pregnancy losses following the foetal ultrasound finding of short limbs. Post-termination examination revealed no skeletal dysplasia, but some subtle proximal limb shortening in two foetuses, and a spectrum of mildly dysmorphic features. Karyotype was normal in all three foetuses (46, XX) and comparative genomic hybridization microarray analysis detected no pathogenic copy number variants. Whole-exome sequencing and genome-wide homozygosity mapping revealed a previously reported frameshift mutation in the OBSL1 gene (c.1273insA p.T425nfsX40), consistent with a diagnosis of 3-M Syndrome 2 (OMIM #612921), which had not been anticipated from the clinical findings. Our study provides novel insight into the early clinical manifestations of this form of 3-M syndrome, and demonstrates the utility of whole exome sequencing as a tool for prenatal diagnosis in particular when there is a family history suggestive of a recurrent set of clinical symptoms.

  6. Identification of two novel pathogenic compound heterozygous MYO7A mutations in Usher syndrome by whole exome sequencing.

    PubMed

    Jia, Ying; Li, Xiaoge; Yang, Dong; Xu, Yi; Guo, Ying; Li, Xin

    2018-01-01

    The current study aims to identify the pathogenic sites in a core pedigree of Usher syndrome (USH). A core pedigree of USH was analyzed by whole exome sequencing (WES). Mutations were verified by polymerase chain reaction (PCR) amplification and Sanger sequencing. Two pathogenic variations (c.849+2T>C and c.5994G>A) in MYO7A were successfully identified and individually separated from parents. One variant (c.849+2T>C) was nonsense mutation, causing the protein terminated in advance, and the other one (c.5994G>A) located near the boundary of exon could cause aberrant splicing. This study provides a meaningful exploration for identification of clinical core genetic pedigrees. Copyright © 2017 Elsevier B.V. All rights reserved.

  7. MToolBox: a highly automated pipeline for heteroplasmy annotation and prioritization analysis of human mitochondrial variants in high-throughput sequencing

    PubMed Central

    Diroma, Maria Angela; Santorsola, Mariangela; Guttà, Cristiano; Gasparre, Giuseppe; Picardi, Ernesto; Pesole, Graziano; Attimonelli, Marcella

    2014-01-01

    Motivation: The increasing availability of mitochondria-targeted and off-target sequencing data in whole-exome and whole-genome sequencing studies (WXS and WGS) has risen the demand of effective pipelines to accurately measure heteroplasmy and to easily recognize the most functionally important mitochondrial variants among a huge number of candidates. To this purpose, we developed MToolBox, a highly automated pipeline to reconstruct and analyze human mitochondrial DNA from high-throughput sequencing data. Results: MToolBox implements an effective computational strategy for mitochondrial genomes assembling and haplogroup assignment also including a prioritization analysis of detected variants. MToolBox provides a Variant Call Format file featuring, for the first time, allele-specific heteroplasmy and annotation files with prioritized variants. MToolBox was tested on simulated samples and applied on 1000 Genomes WXS datasets. Availability and implementation: MToolBox package is available at https://sourceforge.net/projects/mtoolbox/. Contact: marcella.attimonelli@uniba.it Supplementary information: Supplementary data are available at Bioinformatics online. PMID:25028726

  8. Rapid identification of kidney cyst mutations by whole exome sequencing in zebrafish

    PubMed Central

    Ryan, Sean; Willer, Jason; Marjoram, Lindsay; Bagwell, Jennifer; Mankiewicz, Jamie; Leshchiner, Ignaty; Goessling, Wolfram; Bagnat, Michel; Katsanis, Nicholas

    2013-01-01

    Forward genetic approaches in zebrafish have provided invaluable information about developmental processes. However, the relative difficulty of mapping and isolating mutations has limited the number of new genetic screens. Recent improvements in the annotation of the zebrafish genome coupled to a reduction in sequencing costs prompted the development of whole genome and RNA sequencing approaches for gene discovery. Here we describe a whole exome sequencing (WES) approach that allows rapid and cost-effective identification of mutations. We used our WES methodology to isolate four mutations that cause kidney cysts; we identified novel alleles in two ciliary genes as well as two novel mutants. The WES approach described here does not require specialized infrastructure or training and is therefore widely accessible. This methodology should thus help facilitate genetic screens and expedite the identification of mutants that can inform basic biological processes and the causality of genetic disorders in humans. PMID:24130329

  9. Whole Exome Analysis of Early Onset Alzheimer’s Disease

    DTIC Science & Technology

    2013-04-01

    FTD), FTD with Parkinsonism , and early-onset Alzheimer Disease (EOAD)-like presentations. Using whole exome capture with subsequent sequencing, we...dementia. The MAPT R406W mutation is associated with EOAD-like symptoms and Parkinsonism without FTD, as well as distinct cognitive courses. KEY...OUTCOMES: Carney RM, Kohli MA, Kunkle BW, Naj AC, Gilbert JR, Züchner S, PERICAK-VANCE MA, Parkinsonism and distinct dementia patterns in a

  10. Opportunities and challenges for the integration of massively parallel genomic sequencing into clinical practice: lessons from the ClinSeq project.

    PubMed

    Biesecker, Leslie G

    2012-04-01

    The debate surrounding the return of results from high-throughput genomic interrogation encompasses many important issues including ethics, law, economics, and social policy. As well, the debate is also informed by the molecular, genetic, and clinical foundations of the emerging field of clinical genomics, which is based on this new technology. This article outlines the main biomedical considerations of sequencing technologies and demonstrates some of the early clinical experiences with the technology to enable the debate to stay focused on real-world practicalities. These experiences are based on early data from the ClinSeq project, which is a project to pilot the use of massively parallel sequencing in a clinical research context with a major aim to develop modes of returning results to individual subjects. The study has enrolled >900 subjects and generated exome sequence data on 572 subjects. These data are beginning to be interpreted and returned to the subjects, which provides examples of the potential usefulness and pitfalls of clinical genomics. There are numerous genetic results that can be readily derived from a genome including rare, high-penetrance traits, and carrier states. However, much work needs to be done to develop the tools and resources for genomic interpretation. The main lesson learned is that a genome sequence may be better considered as a health-care resource, rather than a test, one that can be interpreted and used over the lifetime of the patient.

  11. VCF-Explorer: filtering and analysing whole genome VCF files.

    PubMed

    Akgün, Mete; Demirci, Hüseyin

    2017-11-01

    The decreasing cost in high-throughput technologies led to a number of sequencing projects consisting of thousands of whole genomes. The paradigm shift from exome to whole genome brings a significant increase in the size of output files. Most of the existing tools which are developed to analyse exome files are not adequate for larger VCF files produced by whole genome studies. In this work we present VCF-Explorer, a variant analysis software capable of handling large files. Memory efficiency and avoiding computationally costly pre-processing step enable to carry out the analysis to be performed with ordinary computers. VCF-Explorer provides an easy to use environment where users can define various types of queries based on variant and sample genotype level annotations. VCF-Explorer can be run in different environments and computational platforms ranging from a standard laptop to a high performance server. VCF-Explorer is freely available at: http://vcfexplorer.sourceforge.net/. mete.akgun@tubitak.gov.tr. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com

  12. Exome Sequencing of 18 Chinese Families with Congenital Cataracts: A New Sight of the NHS Gene

    PubMed Central

    Sun, Wenmin; Xiao, Xueshan; Li, Shiqiang; Guo, Xiangming; Zhang, Qingjiong

    2014-01-01

    Purpose The aim of this study was to investigate the mutation spectrum and frequency of 34 known genes in 18 Chinese families with congenital cataracts. Methods Genomic DNA and clinical data was collected from 18 families with congenital cataracts. Variations in 34 cataract-associated genes were screened by whole exome sequencing and then validated by Sanger sequencing. Results Eleven candidate variants in seven of the 34 genes were detected by exome sequencing and then confirmed by Sanger sequencing, including two variants predicted to be benign and the other pathogenic mutations. The nine mutations were present in 9 of the 18 (50%) families with congenital cataracts. Of the four families with mutations in the X-linked NHS gene, no other abnormalities were recorded except for cataract, in which a pseudo-dominant inheritance form was suggested, as female carriers also had different forms of cataracts. Conclusion This study expands the mutation spectrum and frequency of genes responsible for congenital cataract. Mutation in NHS is a common cause of nonsyndromic congenital cataract with pseudo-autosomal dominant inheritance. Combined with our previous studies, a genetic basis could be identified in 67.6% of families with congenital cataracts in our case series, in which mutations in genes encoding crystallins, genes encoding connexins, and NHS are responsible for 29.4%, 14.7%, and 11.8% of families, respectively. Our results suggest that mutations in NHS are the common cause of congenital cataract, both syndromic and nonsyndromic. PMID:24968223

  13. Rare Variants in RTEL1 Are Associated with Familial Interstitial Pneumonia

    PubMed Central

    Cogan, Joy D.; Zhao, Min; Mitchell, Daphne B.; Rives, Lynette; Markin, Cheryl; Garnett, Errine T.; Montgomery, Keri H.; Mason, Wendi R.; McKean, David F.; Powers, Julia; Murphy, Elissa; Olson, Lana M.; Choi, Leena; Cheng, Dong-Sheng; Blue, Elizabeth Marchani; Young, Lisa R.; Lancaster, Lisa H.; Steele, Mark P.; Brown, Kevin K.; Schwarz, Marvin I.; Fingerlin, Tasha E.; Schwartz, David A.; Lawson, William E.; Loyd, James E.; Zhao, Zhongming; Phillips, John A.; Blackwell, Timothy S.

    2015-01-01

    Rationale: Up to 20% of cases of idiopathic interstitial pneumonia cluster in families, comprising the syndrome of familial interstitial pneumonia (FIP); however, the genetic basis of FIP remains uncertain in most families. Objectives: To determine if new disease-causing rare genetic variants could be identified using whole-exome sequencing of affected members from FIP families, providing additional insights into disease pathogenesis. Methods: Affected subjects from 25 kindreds were selected from an ongoing FIP registry for whole-exome sequencing from genomic DNA. Candidate rare variants were confirmed by Sanger sequencing, and cosegregation analysis was performed in families, followed by additional sequencing of affected individuals from another 163 kindreds. Measurements and Main Results: We identified a potentially damaging rare variant in the gene encoding for regulator of telomere elongation helicase 1 (RTEL1) that segregated with disease and was associated with very short telomeres in peripheral blood mononuclear cells in 1 of 25 families in our original whole-exome sequencing cohort. Evaluation of affected individuals in 163 additional kindreds revealed another eight families (4.7%) with heterozygous rare variants in RTEL1 that segregated with clinical FIP. Probands and unaffected carriers of these rare variants had short telomeres (<10% for age) in peripheral blood mononuclear cells and increased T-circle formation, suggesting impaired RTEL1 function. Conclusions: Rare loss-of-function variants in RTEL1 represent a newly defined genetic predisposition for FIP, supporting the importance of telomere-related pathways in pulmonary fibrosis. PMID:25607374

  14. Customisation of the exome data analysis pipeline using a combinatorial approach.

    PubMed

    Pattnaik, Swetansu; Vaidyanathan, Srividya; Pooja, Durgad G; Deepak, Sa; Panda, Binay

    2012-01-01

    The advent of next generation sequencing (NGS) technologies have revolutionised the way biologists produce, analyse and interpret data. Although NGS platforms provide a cost-effective way to discover genome-wide variants from a single experiment, variants discovered by NGS need follow up validation due to the high error rates associated with various sequencing chemistries. Recently, whole exome sequencing has been proposed as an affordable option compared to whole genome runs but it still requires follow up validation of all the novel exomic variants. Customarily, a consensus approach is used to overcome the systematic errors inherent to the sequencing technology, alignment and post alignment variant detection algorithms. However, the aforementioned approach warrants the use of multiple sequencing chemistry, multiple alignment tools, multiple variant callers which may not be viable in terms of time and money for individual investigators with limited informatics know-how. Biologists often lack the requisite training to deal with the huge amount of data produced by NGS runs and face difficulty in choosing from the list of freely available analytical tools for NGS data analysis. Hence, there is a need to customise the NGS data analysis pipeline to preferentially retain true variants by minimising the incidence of false positives and make the choice of right analytical tools easier. To this end, we have sampled different freely available tools used at the alignment and post alignment stage suggesting the use of the most suitable combination determined by a simple framework of pre-existing metrics to create significant datasets.

  15. Rare variant associations with waist-to-hip ratio in European-American and African-American women from the NHLBI-Exome Sequencing Project

    PubMed Central

    Kan, Mengyuan; Auer, Paul L; Wang, Gao T; Bucasas, Kristine L; Hooker, Stanley; Rodriguez, Alejandra; Li, Biao; Ellis, Jaclyn; Adrienne Cupples, L; Ida Chen, Yii-Der; Dupuis, Josée; Fox, Caroline S; Gross, Myron D; Smith, Joshua D; Heard-Costa, Nancy; Meigs, James B; Pankow, James S; Rotter, Jerome I; Siscovick, David; Wilson, James G; Shendure, Jay; Jackson, Rebecca; Peters, Ulrike; Zhong, Hua; Lin, Danyu; Hsu, Li; Franceschini, Nora; Carlson, Chris; Abecasis, Goncalo; Gabriel, Stacey; Bamshad, Michael J; Altshuler, David; Nickerson, Deborah A; North, Kari E; Lange, Leslie A; Reiner, Alexander P; Leal, Suzanne M

    2016-01-01

    Waist-to-hip ratio (WHR), a relative comparison of waist and hip circumferences, is an easily accessible measurement of body fat distribution, in particular central abdominal fat. A high WHR indicates more intra-abdominal fat deposition and is an established risk factor for cardiovascular disease and type 2 diabetes. Recent genome-wide association studies have identified numerous common genetic loci influencing WHR, but the contributions of rare variants have not been previously reported. We investigated rare variant associations with WHR in 1510 European-American and 1186 African-American women from the National Heart, Lung, and Blood Institute-Exome Sequencing Project. Association analysis was performed on the gene level using several rare variant association methods. The strongest association was observed for rare variants in IKBKB (P=4.0 × 10−8) in European-Americans, where rare variants in this gene are predicted to decrease WHRs. The activation of the IKBKB gene is involved in inflammatory processes and insulin resistance, which may affect normal food intake and body weight and shape. Meanwhile, aggregation of rare variants in COBLL1, previously found to harbor common variants associated with WHR and fasting insulin, were nominally associated (P=2.23 × 10−4) with higher WHR in European-Americans. However, these significant results are not shared between African-Americans and European-Americans that may be due to differences in the allelic architecture of the two populations and the small sample sizes. Our study indicates that the combined effect of rare variants contribute to the inter-individual variation in fat distribution through the regulation of insulin response. PMID:26757982

  16. Rare variant associations with waist-to-hip ratio in European-American and African-American women from the NHLBI-Exome Sequencing Project.

    PubMed

    Kan, Mengyuan; Auer, Paul L; Wang, Gao T; Bucasas, Kristine L; Hooker, Stanley; Rodriguez, Alejandra; Li, Biao; Ellis, Jaclyn; Adrienne Cupples, L; Ida Chen, Yii-Der; Dupuis, Josée; Fox, Caroline S; Gross, Myron D; Smith, Joshua D; Heard-Costa, Nancy; Meigs, James B; Pankow, James S; Rotter, Jerome I; Siscovick, David; Wilson, James G; Shendure, Jay; Jackson, Rebecca; Peters, Ulrike; Zhong, Hua; Lin, Danyu; Hsu, Li; Franceschini, Nora; Carlson, Chris; Abecasis, Goncalo; Gabriel, Stacey; Bamshad, Michael J; Altshuler, David; Nickerson, Deborah A; North, Kari E; Lange, Leslie A; Reiner, Alexander P; Leal, Suzanne M

    2016-08-01

    Waist-to-hip ratio (WHR), a relative comparison of waist and hip circumferences, is an easily accessible measurement of body fat distribution, in particular central abdominal fat. A high WHR indicates more intra-abdominal fat deposition and is an established risk factor for cardiovascular disease and type 2 diabetes. Recent genome-wide association studies have identified numerous common genetic loci influencing WHR, but the contributions of rare variants have not been previously reported. We investigated rare variant associations with WHR in 1510 European-American and 1186 African-American women from the National Heart, Lung, and Blood Institute-Exome Sequencing Project. Association analysis was performed on the gene level using several rare variant association methods. The strongest association was observed for rare variants in IKBKB (P=4.0 × 10(-8)) in European-Americans, where rare variants in this gene are predicted to decrease WHRs. The activation of the IKBKB gene is involved in inflammatory processes and insulin resistance, which may affect normal food intake and body weight and shape. Meanwhile, aggregation of rare variants in COBLL1, previously found to harbor common variants associated with WHR and fasting insulin, were nominally associated (P=2.23 × 10(-4)) with higher WHR in European-Americans. However, these significant results are not shared between African-Americans and European-Americans that may be due to differences in the allelic architecture of the two populations and the small sample sizes. Our study indicates that the combined effect of rare variants contribute to the inter-individual variation in fat distribution through the regulation of insulin response.

  17. Screening for susceptibility genes in hereditary non-polyposis colorectal cancer.

    PubMed

    Yu, Li; Yin, Bo; Qu, Kaiying; Li, Jingjing; Jin, Qiao; Liu, Ling; Liu, Chunlan; Zhu, Yuxing; Wang, Qi; Peng, Xiaowei; Zhou, Jianda; Cao, Peiguo; Cao, Ke

    2018-06-01

    In the present study, hereditary non-polyposis colorectal cancer (HNPCC) susceptibility genes were screened for using whole exome sequencing in 3 HNPCC patients from 1 family and using single nucleotide polymorphism (SNP) genotyping assays in 96 other colorectal cancer and control samples. Peripheral blood was obtained from 3 HNPCC patients from 1 family; the proband and the proband's brother and cousin. High-throughput sequencing was performed using whole exome capture technology. Sequences were aligned against the HAPMAP, dbSNP130 and 1,000 Genome Project databases. Reported common variations and synonymous mutations were filtered out. Non-synonymous single nucleotide variants in the 3 HNPCC patients were integrated and the candidate genes were identified. Finally, SNP genotyping was performed for the genes in 96 peripheral blood samples. In total, 60.4 Gb of data was retrieved from the 3 HNPCC patients using whole exome capture technology. Subsequently, according to certain screening criteria, 15 candidate genes were identified. Among the 96 samples that had been SNP genotyped, 92 were successfully genotyped for 15 gene loci, while genotyping for HTRA1 failed in 4 sporadic colorectal cancer patient samples. In 12 control subjects and 81 sporadic colorectal cancer patients, genotypes at 13 loci were wild-type, namely DDX20, ZFYVE26, PIK3R3, SLC26A8, ZEB2, TP53INP1, SLC11A1, LRBA, CEBPZ, ETAA1, SEMA3G, IFRD2 and FAT1 . The CEP290 genotype was mutant in 1 sporadic colorectal cancer patient and was wild-type in all other subjects. A total of 5 of the 12 control subjects and 30 of the 81 sporadic colorectal cancer patients had a mutant HTRA1 genotype. In all 3 HNPCC patients, the same mutant genotypes were identified at all 15 gene loci. Overall, 13 potential susceptibility genes for HNPCC were identified, namely DDX20, ZFYVE26, PIK3R3, SLC26A8, ZEB2, TP53INP1, SLC11A1, LRBA, CEBPZ, ETAA1, SEMA3G, IFRD2 and FAT1 .

  18. Evaluation of targeted exome sequencing for 28 protein-based blood group systems, including the homologous gene systems, for blood group genotyping.

    PubMed

    Schoeman, Elizna M; Lopez, Genghis H; McGowan, Eunike C; Millard, Glenda M; O'Brien, Helen; Roulis, Eileen V; Liew, Yew-Wah; Martin, Jacqueline R; McGrath, Kelli A; Powley, Tanya; Flower, Robert L; Hyland, Catherine A

    2017-04-01

    Blood group single nucleotide polymorphism genotyping probes for a limited range of polymorphisms. This study investigated whether massively parallel sequencing (also known as next-generation sequencing), with a targeted exome strategy, provides an extended blood group genotype and the extent to which massively parallel sequencing correctly genotypes in homologous gene systems, such as RH and MNS. Donor samples (n = 28) that were extensively phenotyped and genotyped using single nucleotide polymorphism typing, were analyzed using the TruSight One Sequencing Panel and MiSeq platform. Genes for 28 protein-based blood group systems, GATA1, and KLF1 were analyzed. Copy number variation analysis was used to characterize complex structural variants in the GYPC and RH systems. The average sequencing depth per target region was 66.2 ± 39.8. Each sample harbored on average 43 ± 9 variants, of which 10 ± 3 were used for genotyping. For the 28 samples, massively parallel sequencing variant sequences correctly matched expected sequences based on single nucleotide polymorphism genotyping data. Copy number variation analysis defined the Rh C/c alleles and complex RHD hybrids. Hybrid RHD*D-CE-D variants were correctly identified, but copy number variation analysis did not confidently distinguish between D and CE exon deletion versus rearrangement. The targeted exome sequencing strategy employed extended the range of blood group genotypes detected compared with single nucleotide polymorphism typing. This single-test format included detection of complex MNS hybrid cases and, with copy number variation analysis, defined RH hybrid genes along with the RHCE*C allele hitherto difficult to resolve by variant detection. The approach is economical compared with whole-genome sequencing and is suitable for a red blood cell reference laboratory setting. © 2017 AABB.

  19. RTTN Mutations Cause Primary Microcephaly and Primordial Dwarfism in Humans.

    PubMed

    Shamseldin, Hanan; Alazami, Anas M; Manning, Melanie; Hashem, Amal; Caluseiu, Oana; Tabarki, Brahim; Esplin, Edward; Schelley, Susan; Innes, A Micheil; Parboosingh, Jillian S; Lamont, Ryan; Majewski, Jacek; Bernier, Francois P; Alkuraya, Fowzan S

    2015-12-03

    Primary microcephaly is a developmental brain anomaly that results from defective proliferation of neuroprogenitors in the germinal periventricular zone. More than a dozen genes are known to be mutated in autosomal-recessive primary microcephaly in isolation or in association with a more generalized growth deficiency (microcephalic primordial dwarfism), but the genetic heterogeneity is probably more extensive. In a research protocol involving autozygome mapping and exome sequencing, we recruited a multiplex consanguineous family who is affected by severe microcephalic primordial dwarfism and tested negative on clinical exome sequencing. Two candidate autozygous intervals were identified, and the second round of exome sequencing revealed a single intronic variant therein (c.2885+8A>G [p.Ser963(∗)] in RTTN exon 23). RT-PCR confirmed that this change creates a cryptic splice donor and thus causes retention of the intervening 7 bp of the intron and leads to premature truncation. On the basis of this finding, we reanalyzed the exome file of a second consanguineous family affected by a similar phenotype and identified another homozygous change in RTTN as the likely causal mutation. Combined linkage analysis of the two families confirmed that RTTN maps to the only significant linkage peak. Finally, through international collaboration, a Canadian multiplex family affected by microcephalic primordial dwarfism and biallelic mutation of RTTN was identified. Our results expand the phenotype of RTTN-related disorders, hitherto limited to polymicrogyria, to include microcephalic primordial dwarfism with a complex brain phenotype involving simplified gyration. Copyright © 2015 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.

  20. Exome sequencing coupled with mRNA analysis identifies NDUFAF6 as a Leigh gene.

    PubMed

    Bianciardi, Laura; Imperatore, Valentina; Fernandez-Vizarra, Erika; Lopomo, Angela; Falabella, Micol; Furini, Simone; Galluzzi, Paolo; Grosso, Salvatore; Zeviani, Massimo; Renieri, Alessandra; Mari, Francesca; Frullanti, Elisa

    2016-11-01

    We report here the case of a young male who started to show verbal fluency disturbance, clumsiness and gait anomalies at the age of 3.5years and presented bilateral striatal necrosis. Clinically, the diagnosis was compatible with Leigh syndrome but the underlying molecular defect remained elusive even after exome analysis using autosomal/X-linked recessive or de novo models. Dosage of respiratory chain activity on fibroblasts, but not in muscle, underlined a deficit in complex I. Re-analysis of heterozygous probably pathogenic variants, inherited from one healthy parent, identified the p.Ala178Pro in NDUFAF6, a complex I assembly factor. RNA analysis showed an almost mono-allelic expression of the mutated allele in blood and fibroblasts and puromycin treatment on cultured fibroblasts did not lead to the rescue of the maternal allele expression, not supporting the involvement of nonsense-mediated RNA decay mechanism. Complementation assay underlined a recovery of complex I activity after transduction of the wild-type gene. Since the second mutation was not detected and promoter methylation analysis resulted normal, we hypothesized a non-exonic event in the maternal allele affecting a regulatory element that, in conjunction with the paternal mutation, leads to the autosomal recessive disorder and the different allele expression in various tissues. This paper confirms NDUFAF6 as a genuine morbid gene and proposes the coupling of exome sequencing with mRNA analysis as a method useful for enhancing the exome sequencing detection rate when the simple application of classical inheritance models fails. Copyright © 2016 Elsevier Inc. All rights reserved.

  1. Sporadic autism exomes reveal a highly interconnected protein network of de novo mutations.

    PubMed

    O'Roak, Brian J; Vives, Laura; Girirajan, Santhosh; Karakoc, Emre; Krumm, Niklas; Coe, Bradley P; Levy, Roie; Ko, Arthur; Lee, Choli; Smith, Joshua D; Turner, Emily H; Stanaway, Ian B; Vernot, Benjamin; Malig, Maika; Baker, Carl; Reilly, Beau; Akey, Joshua M; Borenstein, Elhanan; Rieder, Mark J; Nickerson, Deborah A; Bernier, Raphael; Shendure, Jay; Eichler, Evan E

    2012-04-04

    It is well established that autism spectrum disorders (ASD) have a strong genetic component; however, for at least 70% of cases, the underlying genetic cause is unknown. Under the hypothesis that de novo mutations underlie a substantial fraction of the risk for developing ASD in families with no previous history of ASD or related phenotypes--so-called sporadic or simplex families--we sequenced all coding regions of the genome (the exome) for parent-child trios exhibiting sporadic ASD, including 189 new trios and 20 that were previously reported. Additionally, we also sequenced the exomes of 50 unaffected siblings corresponding to these new (n = 31) and previously reported trios (n = 19), for a total of 677 individual exomes from 209 families. Here we show that de novo point mutations are overwhelmingly paternal in origin (4:1 bias) and positively correlated with paternal age, consistent with the modest increased risk for children of older fathers to develop ASD. Moreover, 39% (49 of 126) of the most severe or disruptive de novo mutations map to a highly interconnected β-catenin/chromatin remodelling protein network ranked significantly for autism candidate genes. In proband exomes, recurrent protein-altering mutations were observed in two genes: CHD8 and NTNG1. Mutation screening of six candidate genes in 1,703 ASD probands identified additional de novo, protein-altering mutations in GRIN2B, LAMC3 and SCN1A. Combined with copy number variant (CNV) data, these results indicate extreme locus heterogeneity but also provide a target for future discovery, diagnostics and therapeutics.

  2. Identification of Small Exonic CNV from Whole-Exome Sequence Data and Application to Autism Spectrum Disorder

    PubMed Central

    Poultney, Christopher S.; Goldberg, Arthur P.; Drapeau, Elodie; Kou, Yan; Harony-Nicolas, Hala; Kajiwara, Yuji; De Rubeis, Silvia; Durand, Simon; Stevens, Christine; Rehnström, Karola; Palotie, Aarno; Daly, Mark J.; Ma’ayan, Avi; Fromer, Menachem; Buxbaum, Joseph D.

    2013-01-01

    Copy number variation (CNV) is an important determinant of human diversity and plays important roles in susceptibility to disease. Most studies of CNV carried out to date have made use of chromosome microarray and have had a lower size limit for detection of about 30 kilobases (kb). With the emergence of whole-exome sequencing studies, we asked whether such data could be used to reliably call rare exonic CNV in the size range of 1–30 kilobases (kb), making use of the eXome Hidden Markov Model (XHMM) program. By using both transmission information and validation by molecular methods, we confirmed that small CNV encompassing as few as three exons can be reliably called from whole-exome data. We applied this approach to an autism case-control sample (n = 811, mean per-target read depth = 161) and observed a significant increase in the burden of rare (MAF ≤1%) 1–30 kb CNV, 1–30 kb deletions, and 1–10 kb deletions in ASD. CNV in the 1–30 kb range frequently hit just a single gene, and we were therefore able to carry out enrichment and pathway analyses, where we observed enrichment for disruption of genes in cytoskeletal and autophagy pathways in ASD. In summary, our results showed that XHMM provided an effective means to assess small exonic CNV from whole-exome data, indicated that rare 1–30 kb exonic deletions could contribute to risk in up to 7% of individuals with ASD, and implicated a candidate pathway in developmental delay syndromes. PMID:24094742

  3. Advanced Development of TIES-Enhancing Access to Tissue for Cancer Research | Informatics Technology for Cancer Research (ITCR)

    Cancer.gov

    Archived human tissues are an essential resource for translational research. Formalin-fixed, paraffin embedded (FFPE) tissues from cancer patients are used in a wide range of assays, including RT-PCR, SNP profiling, multiplex biomarkers, imaging biomarkers, targeted exome, whole exome, and whole genome sequencing. Remainder FFPE tissues generated during patient care are ‘retrospective'; use of these tissues under specific conditions does not require consent.

  4. Exome sequencing in amyotrophic lateral sclerosis identifies risk genes and pathways.

    PubMed

    Cirulli, Elizabeth T; Lasseigne, Brittany N; Petrovski, Slavé; Sapp, Peter C; Dion, Patrick A; Leblond, Claire S; Couthouis, Julien; Lu, Yi-Fan; Wang, Quanli; Krueger, Brian J; Ren, Zhong; Keebler, Jonathan; Han, Yujun; Levy, Shawn E; Boone, Braden E; Wimbish, Jack R; Waite, Lindsay L; Jones, Angela L; Carulli, John P; Day-Williams, Aaron G; Staropoli, John F; Xin, Winnie W; Chesi, Alessandra; Raphael, Alya R; McKenna-Yasek, Diane; Cady, Janet; Vianney de Jong, J M B; Kenna, Kevin P; Smith, Bradley N; Topp, Simon; Miller, Jack; Gkazi, Athina; Al-Chalabi, Ammar; van den Berg, Leonard H; Veldink, Jan; Silani, Vincenzo; Ticozzi, Nicola; Shaw, Christopher E; Baloh, Robert H; Appel, Stanley; Simpson, Ericka; Lagier-Tourenne, Clotilde; Pulst, Stefan M; Gibson, Summer; Trojanowski, John Q; Elman, Lauren; McCluskey, Leo; Grossman, Murray; Shneider, Neil A; Chung, Wendy K; Ravits, John M; Glass, Jonathan D; Sims, Katherine B; Van Deerlin, Vivianna M; Maniatis, Tom; Hayes, Sebastian D; Ordureau, Alban; Swarup, Sharan; Landers, John; Baas, Frank; Allen, Andrew S; Bedlack, Richard S; Harper, J Wade; Gitler, Aaron D; Rouleau, Guy A; Brown, Robert; Harms, Matthew B; Cooper, Gregory M; Harris, Tim; Myers, Richard M; Goldstein, David B

    2015-03-27

    Amyotrophic lateral sclerosis (ALS) is a devastating neurological disease with no effective treatment. We report the results of a moderate-scale sequencing study aimed at increasing the number of genes known to contribute to predisposition for ALS. We performed whole-exome sequencing of 2869 ALS patients and 6405 controls. Several known ALS genes were found to be associated, and TBK1 (the gene encoding TANK-binding kinase 1) was identified as an ALS gene. TBK1 is known to bind to and phosphorylate a number of proteins involved in innate immunity and autophagy, including optineurin (OPTN) and p62 (SQSTM1/sequestosome), both of which have also been implicated in ALS. These observations reveal a key role of the autophagic pathway in ALS and suggest specific targets for therapeutic intervention. Copyright © 2015, American Association for the Advancement of Science.

  5. Whole Genome Sequencing of High-Risk Families to Identify New Mutational Mechanisms of Breast Cancer Predisposition

    DTIC Science & Technology

    2014-10-01

    INTRODUCTION: Despite tremendous advances in mutation detection with gene panels and exome sequencing the majority of high risk breast...2a. Align reads to the reference sequence (months 4-10) 2b. Identify SNPs, indels, CNVs and rearrangements by bioinformatic tools (months 4-10) 2c

  6. Splice-site mutations identified in PDE6A responsible for retinitis pigmentosa in consanguineous Pakistani families

    PubMed Central

    Khan, Shahid Y.; Ali, Shahbaz; Naeem, Muhammad Asif; Khan, Shaheen N.; Husnain, Tayyab; Butt, Nadeem H.; Qazi, Zaheeruddin A.; Akram, Javed; Riazuddin, Sheikh; Ayyagari, Radha; Hejtmancik, J. Fielding

    2015-01-01

    Purpose This study was conducted to localize and identify causal mutations associated with autosomal recessive retinitis pigmentosa (RP) in consanguineous familial cases of Pakistani origin. Methods Ophthalmic examinations that included funduscopy and electroretinography (ERG) were performed to confirm the affectation status. Blood samples were collected from all participating individuals, and genomic DNA was extracted. A genome-wide scan was performed, and two-point logarithm of odds (LOD) scores were calculated. Sanger sequencing was performed to identify the causative variants. Subsequently, we performed whole exome sequencing to rule out the possibility of a second causal variant within the linkage interval. Sequence conservation was performed with alignment analyses of PDE6A orthologs, and in silico splicing analysis was completed with Human Splicing Finder version 2.4.1. Results A large multigenerational consanguineous family diagnosed with early-onset RP was ascertained. An ophthalmic clinical examination consisting of fundus photography and electroretinography confirmed the diagnosis of RP. A genome-wide scan was performed, and suggestive two-point LOD scores were observed with markers on chromosome 5q. Haplotype analyses identified the region; however, the region did not segregate with the disease phenotype in the family. Subsequently, we performed a second genome-wide scan that excluded the entire genome except the chromosome 5q region harboring PDE6A. Next-generation whole exome sequencing identified a splice acceptor site mutation in intron 16: c.2028–1G>A, which was completely conserved in PDE6A orthologs and was absent in ethnically matched 350 control chromosomes, the 1000 Genomes database, and the NHLBI Exome Sequencing Project. Subsequently, we investigated our entire cohort of RP familial cases and identified a second family who harbored a splice acceptor site mutation in intron 10: c.1408–2A>G. In silico analysis suggested that these mutations will result in the elimination of wild-type splice acceptor sites that would result in either skipping of the respective exon or the creation of a new cryptic splice acceptor site; both possibilities would result in retinal photoreceptor cells that lack PDE6A wild-type protein. Conclusions we report two splice acceptor site variations in PDE6A in consanguineous Pakistani families who manifested cardinal symptoms of RP. Taken together with our previously published work, our data suggest that mutations in PDE6A account for about 2% of the total genetic load of RP in our cohort and possibly in the Pakistani population as well. PMID:26321862

  7. Exome sequence analysis and follow up genotyping implicates rare ULK1 variants to be involved in susceptibility to schizophrenia

    PubMed Central

    Al Eissa, Mariam M.; Fiorentino, Alessia; Sharp, Sally I.; O'Brien, Niamh L.; Wolfe, Kate; Giaroli, Giovanni; Curtis, David; Bass, Nicholas J.

    2017-01-01

    Summary Schizophrenia (SCZ) is a severe, highly heritable psychiatric disorder. Elucidation of the genetic architecture of the disorder will facilitate greater understanding of the altered underlying neurobiological mechanisms. The aim of this study was to identify likely aetiological variants in subjects affected with SCZ. Exome sequence data from a SCZ cas–control sample from Sweden was analysed for likely aetiological variants using a weighted burden test. Suggestive evidence implicated the UNC‐51‐like kinase (ULK1) gene, and it was observed that four rare variants that were more common in the Swedish SCZ cases were also more common in UK10K SCZ cases, as compared to obesity cases. These three missense variants and one intronic variant were genotyped in the University College London cohort of 1304 SCZ cases and 1348 ethnically matched controls. All four variants were more common in the SCZ cases than controls and combining them produced a result significant at P = 0.02. The results presented here demonstrate the importance of following up exome sequencing studies using additional datasets. The roles of ULK1 in autophagy and mTOR signalling strengthen the case that these pathways may be important in the pathophysiology of SCZ. The findings reported here await independent replication. PMID:29148569

  8. Somatic Genetic Variation in Solid Pseudopapillary Tumor of the Pancreas by Whole Exome Sequencing

    PubMed Central

    Guo, Meng; Luo, Guopei; Jin, Kaizhou; Long, Jiang; Cheng, He; Lu, Yu; Wang, Zhengshi; Yang, Chao; Xu, Jin; Ni, Quanxing; Yu, Xianjun; Liu, Chen

    2017-01-01

    Solid pseudopapillary tumor of the pancreas (SPT) is a rare pancreatic disease with a unique clinical manifestation. Although CTNNB1 gene mutations had been universally reported, genetic variation profiles of SPT are largely unidentified. We conducted whole exome sequencing in nine SPT patients to probe the SPT-specific insertions and deletions (indels) and single nucleotide polymorphisms (SNPs). In total, 54 SNPs and 41 indels of prominent variations were demonstrated through parallel exome sequencing. We detected that CTNNB1 mutations presented throughout all patients studied (100%), and a higher count of SNPs was particularly detected in patients with older age, larger tumor, and metastatic disease. By aggregating 95 detected variation events and viewing the interconnections among each of the genes with variations, CTNNB1 was identified as the core portion in the network, which might collaborate with other events such as variations of USP9X, EP400, HTT, MED12, and PKD1 to regulate tumorigenesis. Pathway analysis showed that the events involved in other cancers had the potential to influence the progression of the SNPs count. Our study revealed an insight into the variation of the gene encoding region underlying solid-pseudopapillary neoplasm tumorigenesis. The detection of these variations might partly reflect the potential molecular mechanism. PMID:28054945

  9. Molecular Diagnostic Experience of Whole-Exome Sequencing in Adult Patients

    PubMed Central

    Posey, Jennifer E.; Rosenfeld, Jill A.; James, Regis A.; Bainbridge, Matthew; Niu, Zhiyv; Wang, Xia; Dhar, Shweta; Wiszniewski, Wojciech; Akdemir, Zeynep H.C.; Gambin, Tomasz; Xia, Fan; Person, Richard E.; Walkiewicz, Magdalena; Shaw, Chad A.; Sutton, V. Reid; Beaudet, Arthur L.; Muzny, Donna; Eng, Christine M.; Yang, Yaping; Gibbs, Richard A.; Lupski, James R.; Boerwinkle, Eric; Plon, Sharon E.

    2015-01-01

    Purpose Whole exome sequencing (WES) is increasingly used as a diagnostic tool in medicine, but prior reports focus on predominantly pediatric cohorts with neurologic or developmental disorders. We describe the diagnostic yield and characteristics of whole exome sequencing in adults. Methods We performed a retrospective analysis of consecutive WES reports for adults from a diagnostic laboratory. Phenotype composition was determined using Human Phenotype Ontology terms. Results Molecular diagnoses were reported for 17.5% (85/486) of adults, lower than a primarily pediatric population (25.2%; p=0.0003); the diagnostic rate was higher (23.9%) in those 18–30 years of age compared to patients over 30 years (10.4%; p=0.0001). Dual Mendelian diagnoses contributed to 7% of diagnoses, revealing blended phenotypes. Diagnoses were more frequent among individuals with abnormalities of the nervous system, skeletal system, head/neck, and growth. Diagnostic rate was independent of family history information, and de novo mutations contributed to 61.4% of autosomal dominant diagnoses. Conclusion Early WES experience in adults demonstrates molecular diagnoses in a substantial proportion of patients, informing clinical management, recurrence risk and recommendations for relatives. A positive family history was not predictive, consistent with molecular diagnoses often revealed by de novo events, informing the Mendelian basis of genetic disease in adults. PMID:26633545

  10. Development and validation of a whole-exome sequencing test for simultaneous detection of point mutations, indels and copy-number alterations for precision cancer care

    PubMed Central

    Rennert, Hanna; Eng, Kenneth; Zhang, Tuo; Tan, Adrian; Xiang, Jenny; Romanel, Alessandro; Kim, Robert; Tam, Wayne; Liu, Yen-Chun; Bhinder, Bhavneet; Cyrta, Joanna; Beltran, Himisha; Robinson, Brian; Mosquera, Juan Miguel; Fernandes, Helen; Demichelis, Francesca; Sboner, Andrea; Kluk, Michael; Rubin, Mark A; Elemento, Olivier

    2016-01-01

    We describe Exome Cancer Test v1.0 (EXaCT-1), the first New York State-Department of Health-approved whole-exome sequencing (WES)-based test for precision cancer care. EXaCT-1 uses HaloPlex (Agilent) target enrichment followed by next-generation sequencing (Illumina) of tumour and matched constitutional control DNA. We present a detailed clinical development and validation pipeline suitable for simultaneous detection of somatic point/indel mutations and copy-number alterations (CNAs). A computational framework for data analysis, reporting and sign-out is also presented. For the validation, we tested EXaCT-1 on 57 tumours covering five distinct clinically relevant mutations. Results demonstrated elevated and uniform coverage compatible with clinical testing as well as complete concordance in variant quality metrics between formalin-fixed paraffin embedded and fresh-frozen tumours. Extensive sensitivity studies identified limits of detection threshold for point/indel mutations and CNAs. Prospective analysis of 337 cancer cases revealed mutations in clinically relevant genes in 82% of tumours, demonstrating that EXaCT-1 is an accurate and sensitive method for identifying actionable mutations, with reasonable costs and time, greatly expanding its utility for advanced cancer care. PMID:28781886

  11. Panel-based Genetic Diagnostic Testing for Inherited Eye Diseases is Highly Accurate and Reproducible and More Sensitive for Variant Detection Than Exome Sequencing

    PubMed Central

    Bujakowska, Kinga M.; Sousa, Maria E.; Fonseca-Kelly, Zoë D.; Taub, Daniel G.; Janessian, Maria; Wang, Dan Yi; Au, Elizabeth D.; Sims, Katherine B.; Sweetser, David A.; Fulton, Anne B.; Liu, Qin; Wiggs, Janey L.; Gai, Xiaowu; Pierce, Eric A.

    2015-01-01

    Purpose Next-generation sequencing (NGS) based methods are being adopted broadly for genetic diagnostic testing, but the performance characteristics of these techniques have not been fully defined with regard to test accuracy and reproducibility. Methods We developed a targeted enrichment and NGS approach for genetic diagnostic testing of patients with inherited eye disorders, including inherited retinal degenerations, optic atrophy and glaucoma. In preparation for providing this Genetic Eye Disease (GEDi) test on a CLIA-certified basis, we performed experiments to measure the sensitivity, specificity, reproducibility as well as the clinical sensitivity of the test. Results The GEDi test is highly reproducible and accurate, with sensitivity and specificity for single nucleotide variant detection of 97.9% and 100%, respectively. The sensitivity for variant detection was notably better than the 88.3% achieved by whole exome sequencing (WES) using the same metrics, due to better coverage of targeted genes in the GEDi test compared to commercially available exome capture sets. Prospective testing of 192 patients with IRDs indicated that the clinical sensitivity of the GEDi test is high, with a diagnostic rate of 51%. Conclusion The data suggest that based on quantified performance metrics, selective targeted enrichment is preferable to WES for genetic diagnostic testing. PMID:25412400

  12. Exome-wide Sequencing Shows Low Mutation Rates and Identifies Novel Mutated Genes in Seminomas.

    PubMed

    Cutcutache, Ioana; Suzuki, Yuka; Tan, Iain Beehuat; Ramgopal, Subhashini; Zhang, Shenli; Ramnarayanan, Kalpana; Gan, Anna; Lee, Heng Hong; Tay, Su Ting; Ooi, Aikseng; Ong, Choon Kiat; Bolthouse, Jonathan T; Lane, Brian R; Anema, John G; Kahnoski, Richard J; Tan, Patrick; Teh, Bin Tean; Rozen, Steven G

    2015-07-01

    Testicular germ cell tumors are the most common cancer diagnosed in young men, and seminomas are the most common type of these cancers. There have been no exome-wide examinations of genes mutated in seminomas or of overall rates of nonsilent somatic mutations in these tumors. The objective was to analyze somatic mutations in seminomas to determine which genes are affected and to determine rates of nonsilent mutations. Eight seminomas and matched normal samples were surgically obtained from eight patients. DNA was extracted from tissue samples and exome sequenced on massively parallel Illumina DNA sequencers. Single-nucleotide polymorphism chip-based copy number analysis was also performed to assess copy number alterations. The DNA sequencing read data were analyzed to detect somatic mutations including single-nucleotide substitutions and short insertions and deletions. The detected mutations were validated by independent sequencing and further checked for subclonality. The rate of nonsynonymous somatic mutations averaged 0.31 mutations/Mb. We detected nonsilent somatic mutations in 96 genes that were not previously known to be mutated in seminomas, of which some may be driver mutations. Many of the mutations appear to have been present in subclonal populations. In addition, two genes, KIT and KRAS, were affected in two tumors each with mutations that were previously observed in other cancers and are presumably oncogenic. Our study, the first report on exome sequencing of seminomas, detected somatic mutations in 96 new genes, several of which may be targetable drivers. Furthermore, our results show that seminoma mutation rates are five times higher than previously thought, but are nevertheless low compared to other common cancers. Similar low rates are seen in other cancers that also have excellent rates of remission achieved with chemotherapy. We examined the DNA sequences of seminomas, the most common type of testicular germ cell cancer. Our study identified 96 new genes in which mutations occurred during seminoma development, some of which might contribute to cancer development or progression. The study also showed that the rates of DNA mutations during seminoma development are higher than previously thought, but still lower than for other common solid-organ cancers. Such low rates are also observed among other cancers that, like seminomas, show excellent rates of disease remission after chemotherapy. Copyright © 2015 European Association of Urology. Published by Elsevier B.V. All rights reserved.

  13. Assessing the Power of Exome Chips.

    PubMed

    Page, Christian Magnus; Baranzini, Sergio E; Mevik, Bjørn-Helge; Bos, Steffan Daniel; Harbo, Hanne F; Andreassen, Bettina Kulle

    2015-01-01

    Genotyping chips for rare and low-frequent variants have recently gained popularity with the introduction of exome chips, but the utility of these chips remains unclear. These chips were designed using exome sequencing data from mainly American-European individuals, enriched for a narrow set of common diseases. In addition, it is well-known that the statistical power of detecting associations with rare and low-frequent variants is much lower compared to studies exclusively involving common variants. We developed a simulation program adaptable to any exome chip design to empirically evaluate the power of the exome chips. We implemented the main properties of the Illumina HumanExome BeadChip array. The simulated data sets were used to assess the power of exome chip based studies for varying effect sizes and causal variant scenarios. We applied two widely-used statistical approaches for rare and low-frequency variants, which collapse the variants into genetic regions or genes. Under optimal conditions, we found that a sample size between 20,000 to 30,000 individuals were needed in order to detect modest effect sizes (0.5% < PAR > 1%) with 80% power. For small effect sizes (PAR <0.5%), 60,000-100,000 individuals were needed in the presence of non-causal variants. In conclusion, we found that at least tens of thousands of individuals are necessary to detect modest effects under optimal conditions. In addition, when using rare variant chips on cohorts or diseases they were not originally designed for, the identification of associated variants or genes will be even more challenging.

  14. Whole-Exome Sequencing in Adults With Chronic Kidney Disease: A Pilot Study.

    PubMed

    Lata, Sneh; Marasa, Maddalena; Li, Yifu; Fasel, David A; Groopman, Emily; Jobanputra, Vaidehi; Rasouly, Hila; Mitrotti, Adele; Westland, Rik; Verbitsky, Miguel; Nestor, Jordan; Slater, Lindsey M; D'Agati, Vivette; Zaniew, Marcin; Materna-Kiryluk, Anna; Lugani, Francesca; Caridi, Gianluca; Rampoldi, Luca; Mattoo, Aditya; Newton, Chad A; Rao, Maya K; Radhakrishnan, Jai; Ahn, Wooin; Canetta, Pietro A; Bomback, Andrew S; Appel, Gerald B; Antignac, Corinne; Markowitz, Glen S; Garcia, Christine K; Kiryluk, Krzysztof; Sanna-Cherchi, Simone; Gharavi, Ali G

    2018-01-16

    The utility of whole-exome sequencing (WES) for the diagnosis and management of adult-onset constitutional disorders has not been adequately studied. Genetic diagnostics may be advantageous in adults with chronic kidney disease (CKD), in whom the cause of kidney failure often remains unknown. To study the diagnostic utility of WES in a selected referral population of adults with CKD. Observational cohort. A major academic medical center. 92 adults with CKD of unknown cause or familial nephropathy or hypertension. The diagnostic yield of WES and its potential effect on clinical management. Whole-exome sequencing provided a diagnosis in 22 of 92 patients (24%), including 9 probands with CKD of unknown cause and encompassing 13 distinct genetic disorders. Among these, loss-of-function mutations were identified in PARN in 2 probands with tubulointerstitial fibrosis. PARN mutations have been implicated in a short telomere syndrome characterized by lung, bone marrow, and liver fibrosis; these findings extend the phenotype of PARN mutations to renal fibrosis. In addition, review of the American College of Medical Genetics actionable genes identified a pathogenic BRCA2 mutation in a proband who was diagnosed with breast cancer on follow-up. The results affected clinical management in most identified cases, including initiation of targeted surveillance, familial screening to guide donor selection for transplantation, and changes in therapy. The small sample size and recruitment at a tertiary care academic center limit generalizability of findings among the broader CKD population. Whole-exome sequencing identified diagnostic mutations in a substantial number of adults with CKD of many causes. Further study of the utility of WES in the evaluation and care of patients with CKD in additional settings is warranted. New York State Empire Clinical Research Investigator Program, Renal Research Institute, and National Human Genome Research Institute of the National Institutes of Health.

  15. Whole exome sequencing of rare variants in EIF4G1 and VPS35 in Parkinson disease

    PubMed Central

    Nuytemans, Karen; Bademci, Guney; Inchausti, Vanessa; Dressen, Amy; Kinnamon, Daniel D.; Mehta, Arpit; Wang, Liyong; Züchner, Stephan; Beecham, Gary W.; Martin, Eden R.; Scott, William K.

    2013-01-01

    Objective: Recently, vacuolar protein sorting 35 (VPS35) and eukaryotic translation initiation factor 4 gamma 1 (EIF4G1) have been identified as 2 causal Parkinson disease (PD) genes. We used whole exome sequencing for rapid, parallel analysis of variations in these 2 genes. Methods: We performed whole exome sequencing in 213 patients with PD and 272 control individuals. Those rare variants (RVs) with <5% frequency in the exome variant server database and our own control data were considered for analysis. We performed joint gene-based tests for association using RVASSOC and SKAT (Sequence Kernel Association Test) as well as single-variant test statistics. Results: We identified 3 novel VPS35 variations that changed the coded amino acid (nonsynonymous) in 3 cases. Two variations were in multiplex families and neither segregated with PD. In EIF4G1, we identified 11 (9 nonsynonymous and 2 small indels) RVs including the reported pathogenic mutation p.R1205H, which segregated in all affected members of a large family, but also in 1 unaffected 86-year-old family member. Two additional RVs were found in isolated patients only. Whereas initial association studies suggested an association (p = 0.04) with all RVs in EIF4G1, subsequent testing in a second dataset for the driving variant (p.F1461) suggested no association between RVs in the gene and PD. Conclusions: We confirm that the specific EIF4G1 variation p.R1205H seems to be a strong PD risk factor, but is nonpenetrant in at least one 86-year-old. A few other select RVs in both genes could not be ruled out as causal. However, there was no evidence for an overall contribution of genetic variability in VPS35 or EIF4G1 to PD development in our dataset. PMID:23408866

  16. Identification of novel point mutations in splicing sites integrating whole-exome and RNA-seq data in myeloproliferative diseases

    PubMed Central

    Spinelli, Roberta; Pirola, Alessandra; Redaelli, Sara; Sharma, Nitesh; Raman, Hima; Valletta, Simona; Magistroni, Vera; Piazza, Rocco; Gambacorti-Passerini, Carlo

    2013-01-01

    Point mutations in intronic regions near mRNA splice junctions can affect the splicing process. To identify novel splicing variants from exome sequencing data, we developed a bioinformatics splice-site prediction procedure to analyze next-generation sequencing (NGS) data (SpliceFinder). SpliceFinder integrates two functional annotation tools for NGS, ANNOVAR and MutationTaster and two canonical splice site prediction programs for single mutation analysis, SSPNN and NetGene2. By SpliceFinder, we identified somatic mutations affecting RNA splicing in a colon cancer sample, in eight atypical chronic myeloid leukemia (aCML), and eight CML patients. A novel homozygous splicing mutation was found in APC (NM_000038.4:c.1312+5G>A) and six heterozygous in GNAQ (NM_002072.2:c.735+1C>T), ABCC3 (NM_003786.3:c.1783-1G>A), KLHDC1 (NM_172193.1:c.568-2A>G), HOOK1 (NM_015888.4:c.1662-1G>A), SMAD9 (NM_001127217.2:c.1004-1C>T), and DNAH9 (NM_001372.3:c.10242+5G>A). Integrating whole-exome and RNA sequencing in aCML and CML, we assessed the phenotypic effect of mutations on mRNA splicing for GNAQ, ABCC3, HOOK1. In ABCC3 and HOOK1, RNA-Seq showed the presence of aberrant transcripts with activation of a cryptic splice site or intron retention, validated by the reverse transcription-polymerase chain reaction (RT-PCR) in the case of HOOK1. In GNAQ, RNA-Seq showed 22% of wild-type transcript and 78% of mRNA skipping exon 5, resulting in a 4–6 frameshift fusion confirmed by RT-PCR. The pipeline can be useful to identify intronic variants affecting RNA sequence by complementing conventional exome analysis. PMID:24498620

  17. Actionable mutations in canine hemangiosarcoma

    PubMed Central

    Wang, Guannan; Wu, Ming; Maloneyhuss, Martha A.; Wojcik, John; Durham, Amy C.; Mason, Nicola J.

    2017-01-01

    Background Angiosarcomas (AS) are rare in humans, but they are a deadly subtype of soft tissue sarcoma. Discovery sequencing in AS, especially the visceral form, is hampered by the rarity of cases. Most diagnostic material exists as archival formalin fixed, paraffin embedded tissue which serves as a poor source of high quality DNA for genome-wide sequencing. We approached this problem through comparative genomics. We hypothesized that exome sequencing a histologically similar tumor, hemangiosarcoma (HSA), that occurs in approximately 50,000 dogs per year, may lead to the identification of potential oncogenic drivers and druggable targets that could also occur in angiosarcoma. Methods Splenic hemangiosarcomas are common in dogs, which allowed us to collect a cohort of archived matched tumor and normal tissue samples suitable for whole exome sequencing. Mapping of the reads to the latest canine reference genome (Canfam3) demonstrated that >99% of the targeted exomal regions were covered, with >80% at 20X coverage and >90% at 10X coverage. Results and conclusions Sequence analysis of 20 samples identified somatic mutations in PIK3CA, TP53, PTEN, and PLCG1, all of which correspond to well-known tumor drivers in human cancer, in more than half of the cases. In one case, we identified a mutation in PLCG1 identical to a mutation observed previously in this gene in human visceral AS. Activating PIK3CA mutations present novel therapeutic targets, and clinical trials of targeted inhibitors are underway in human cancers. Our results lay a foundation for similar clinical trials in canine HSA, enabling a precision medicine approach to this disease. PMID:29190660

  18. Actionable mutations in canine hemangiosarcoma.

    PubMed

    Wang, Guannan; Wu, Ming; Maloneyhuss, Martha A; Wojcik, John; Durham, Amy C; Mason, Nicola J; Roth, David B

    2017-01-01

    Angiosarcomas (AS) are rare in humans, but they are a deadly subtype of soft tissue sarcoma. Discovery sequencing in AS, especially the visceral form, is hampered by the rarity of cases. Most diagnostic material exists as archival formalin fixed, paraffin embedded tissue which serves as a poor source of high quality DNA for genome-wide sequencing. We approached this problem through comparative genomics. We hypothesized that exome sequencing a histologically similar tumor, hemangiosarcoma (HSA), that occurs in approximately 50,000 dogs per year, may lead to the identification of potential oncogenic drivers and druggable targets that could also occur in angiosarcoma. Splenic hemangiosarcomas are common in dogs, which allowed us to collect a cohort of archived matched tumor and normal tissue samples suitable for whole exome sequencing. Mapping of the reads to the latest canine reference genome (Canfam3) demonstrated that >99% of the targeted exomal regions were covered, with >80% at 20X coverage and >90% at 10X coverage. Sequence analysis of 20 samples identified somatic mutations in PIK3CA, TP53, PTEN, and PLCG1, all of which correspond to well-known tumor drivers in human cancer, in more than half of the cases. In one case, we identified a mutation in PLCG1 identical to a mutation observed previously in this gene in human visceral AS. Activating PIK3CA mutations present novel therapeutic targets, and clinical trials of targeted inhibitors are underway in human cancers. Our results lay a foundation for similar clinical trials in canine HSA, enabling a precision medicine approach to this disease.

  19. Identification of small exonic CNV from whole-exome sequence data and application to autism spectrum disorder.

    PubMed

    Poultney, Christopher S; Goldberg, Arthur P; Drapeau, Elodie; Kou, Yan; Harony-Nicolas, Hala; Kajiwara, Yuji; De Rubeis, Silvia; Durand, Simon; Stevens, Christine; Rehnström, Karola; Palotie, Aarno; Daly, Mark J; Ma'ayan, Avi; Fromer, Menachem; Buxbaum, Joseph D

    2013-10-03

    Copy number variation (CNV) is an important determinant of human diversity and plays important roles in susceptibility to disease. Most studies of CNV carried out to date have made use of chromosome microarray and have had a lower size limit for detection of about 30 kilobases (kb). With the emergence of whole-exome sequencing studies, we asked whether such data could be used to reliably call rare exonic CNV in the size range of 1-30 kilobases (kb), making use of the eXome Hidden Markov Model (XHMM) program. By using both transmission information and validation by molecular methods, we confirmed that small CNV encompassing as few as three exons can be reliably called from whole-exome data. We applied this approach to an autism case-control sample (n = 811, mean per-target read depth = 161) and observed a significant increase in the burden of rare (MAF ≤1%) 1-30 kb CNV, 1-30 kb deletions, and 1-10 kb deletions in ASD. CNV in the 1-30 kb range frequently hit just a single gene, and we were therefore able to carry out enrichment and pathway analyses, where we observed enrichment for disruption of genes in cytoskeletal and autophagy pathways in ASD. In summary, our results showed that XHMM provided an effective means to assess small exonic CNV from whole-exome data, indicated that rare 1-30 kb exonic deletions could contribute to risk in up to 7% of individuals with ASD, and implicated a candidate pathway in developmental delay syndromes. Copyright © 2013 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.

  20. Rare variants and autoimmune disease.

    PubMed

    Massey, Jonathan; Eyre, Steve

    2014-09-01

    The study of rare variants in monogenic forms of autoimmune disease has offered insight into the aetiology of more complex pathologies. Research in complex autoimmune disease initially focused on sequencing candidate genes, with some early successes, notably in uncovering low-frequency variation associated with Type 1 diabetes mellitus. However, other early examples have proved difficult to replicate, and a recent study across six autoimmune diseases, re-sequencing 25 autoimmune disease-associated genes in large sample sizes, failed to find any associated rare variants. The study of rare and low-frequency variation in autoimmune diseases has been made accessible by the inclusion of such variants on custom genotyping arrays (e.g. Immunochip and Exome arrays). Whole-exome sequencing approaches are now also being utilised to uncover the contribution of rare coding variants to disease susceptibility, severity and treatment response. Other sequencing strategies are starting to uncover the role of regulatory rare variation. © The Author 2014. Published by Oxford University Press. All rights reserved. For permissions, please email: journals.permissions@oup.com.

  1. Genetic architecture of retinal and macular degenerative diseases: the promise and challenges of next-generation sequencing

    PubMed Central

    2013-01-01

    Inherited retinal degenerative diseases (RDDs) display wide variation in their mode of inheritance, underlying genetic defects, age of onset, and phenotypic severity. Molecular mechanisms have not been delineated for many retinal diseases, and treatment options are limited. In most instances, genotype-phenotype correlations have not been elucidated because of extensive clinical and genetic heterogeneity. Next-generation sequencing (NGS) methods, including exome, genome, transcriptome and epigenome sequencing, provide novel avenues towards achieving comprehensive understanding of the genetic architecture of RDDs. Whole-exome sequencing (WES) has already revealed several new RDD genes, whereas RNA-Seq and ChIP-Seq analyses are expected to uncover novel aspects of gene regulation and biological networks that are involved in retinal development, aging and disease. In this review, we focus on the genetic characterization of retinal and macular degeneration using NGS technology and discuss the basic framework for further investigations. We also examine the challenges of NGS application in clinical diagnosis and management. PMID:24112618

  2. Dynamical System Modeling to Simulate Donor T Cell Response to Whole Exome Sequencing-Derived Recipient Peptides Demonstrates Different Alloreactivity Potential in HLA-Matched and -Mismatched Donor-Recipient Pairs.

    PubMed

    Abdul Razzaq, Badar; Scalora, Allison; Koparde, Vishal N; Meier, Jeremy; Mahmood, Musa; Salman, Salman; Jameson-Lee, Max; Serrano, Myrna G; Sheth, Nihar; Voelkner, Mark; Kobulnicky, David J; Roberts, Catherine H; Ferreira-Gonzalez, Andrea; Manjili, Masoud H; Buck, Gregory A; Neale, Michael C; Toor, Amir A

    2016-05-01

    Immune reconstitution kinetics and subsequent clinical outcomes in HLA-matched recipients of allogeneic stem cell transplantation (SCT) are variable and difficult to predict. Considering SCT as a dynamical system may allow sequence differences across the exomes of the transplant donors and recipients to be used to simulate an alloreactive T cell response, which may allow better clinical outcome prediction. To accomplish this, whole exome sequencing was performed on 34 HLA-matched SCT donor-recipient pairs (DRPs) and the nucleotide sequence differences translated to peptides. The binding affinity of the peptides to the relevant HLA in each DRP was determined. The resulting array of peptide-HLA binding affinity values in each patient was considered as an operator modifying a hypothetical T cell repertoire vector, in which each T cell clone proliferates in accordance with the logistic equation of growth. Using an iterating system of matrices, each simulated T cell clone's growth was calculated with the steady-state population being proportional to the magnitude of the binding affinity of the driving HLA-peptide complex. Incorporating competition between T cell clones responding to different HLA-peptide complexes reproduces a number of features of clinically observed T cell clonal repertoire in the simulated repertoire, including sigmoidal growth kinetics of individual T cell clones and overall repertoire, Power Law clonal frequency distribution, increase in repertoire complexity over time with increasing clonal diversity, and alteration of clonal dominance when a different antigen array is encountered, such as in SCT. The simulated, alloreactive T cell repertoire was markedly different in HLA-matched DRPs. The patterns were differentiated by rate of growth and steady-state magnitude of the simulated T cell repertoire and demonstrate a possible correlation with survival. In conclusion, exome wide sequence differences in DRPs may allow simulation of donor alloreactive T cell response to recipient antigens and may provide a quantitative basis for refining donor selection and titration of immunosuppression after SCT. Copyright © 2016 American Society for Blood and Marrow Transplantation. Published by Elsevier Inc. All rights reserved.

  3. Whole-Exome Sequencing of 10 Scientists: Evaluation of the Process and Outcomes.

    PubMed

    Lindor, Noralane M; Schahl, Kimberly A; Johnson, Kiley J; Hunt, Katherine S; Mensink, Kara A; Wieben, Eric D; Klee, Eric; Black, John L; Highsmith, W Edward; Thibodeau, Stephen N; Ferber, Matthew J; Aypar, Umut; Ji, Yuan; Graham, Rondell P; Fiksdal, Alexander S; Sarangi, Vivek; Ormond, Kelly E; Riegert-Johnson, Douglas L; McAllister, Tammy M; Farrugia, Gianrico; McCormick, Jennifer B

    2015-10-01

    To understand motivations, educational needs, and concerns of individuals contemplating whole-exome sequencing (WES) and determine what amount of genetic information might be obtained by sequencing a generally healthy cohort so as to more effectively counsel future patients. From 2012 to 2014, 40 medically educated, generally healthy scientists at Mayo Clinic were invited to have WES conducted on a research basis; 26 agreed to be in a drawing from which 10 participants were selected. The study involved pre- and posttest genetic counseling and completion of 4 surveys related to the experience and outcomes. Whole-exome sequencing was conducted on DNA from blood from each person. Most variants (76,305 per person; range, 74,505-77,387) were known benign allelic variants, variants in genes of unknown function, or variants of uncertain significance in genes of known function. The results of suspected pathogenic/pathogenic variants in Mendelian disorders and pharmacogenomic variants were disclosed. The mean number of suspected pathogenic/pathogenic variants was 2.2 per person (range, 1-4). Four pharmacogenomic genes were included for reporting; variants were found in 9 of 10 participants. This study provides data that may be useful in establishing reality-based patient expectations, outlines specific points to cover during counseling, and increases confidence in the feasibility of providing adequate preparation and counseling for WES in generally healthy individuals. Copyright © 2015 Mayo Foundation for Medical Education and Research. Published by Elsevier Inc. All rights reserved.

  4. Germline PTPN11 and somatic PIK3CA variant in a boy with megalencephaly-capillary malformation syndrome (MCAP) - pure coincidence?

    PubMed Central

    Döcker, Dennis; Schubach, Max; Menzel, Moritz; Spaich, Christiane; Gabriel, Heinz-Dieter; Zenker, Martin; Bartholdi, Deborah; Biskup, Saskia

    2015-01-01

    Megalencephaly-capillary malformation (MCAP) syndrome is an overgrowth syndrome that is diagnosed by clinical criteria. Recently, somatic and germline variants in genes that are involved in the PI3K-AKT pathway (AKT3, PIK3R2 and PIK3CA) have been described to be associated with MCAP and/or other related megalencephaly syndromes. We performed trio-exome sequencing in a 6-year-old boy and his healthy parents. Clinical features were macrocephaly, cutis marmorata, angiomata, asymmetric overgrowth, developmental delay, discrete midline facial nevus flammeus, toe syndactyly and postaxial polydactyly—thus, clearly an MCAP phenotype. Exome sequencing revealed a pathogenic de novo germline variant in the PTPN11 gene (c.1529A>G; p.(Gln510Arg)), which has so far been associated with Noonan, as well as LEOPARD syndrome. Whole-exome sequencing (>100 × coverage) did not reveal any alteration in the known megalencephaly genes. However, ultra-deep sequencing results from saliva (>1000 × coverage) revealed a 22% mosaic variant in PIK3CA (c.2740G>A; p.(Gly914Arg)). To our knowledge, this report is the first description of a PTPN11 germline variant in an MCAP patient. Data from experimental studies show a complex interaction of SHP2 (gene product of PTPN11) and the PI3K-AKT pathway. We hypothesize that certain PTPN11 germline variants might drive toward additional second-hit alterations. PMID:24939587

  5. Germline PTPN11 and somatic PIK3CA variant in a boy with megalencephaly-capillary malformation syndrome (MCAP)--pure coincidence?

    PubMed

    Döcker, Dennis; Schubach, Max; Menzel, Moritz; Spaich, Christiane; Gabriel, Heinz-Dieter; Zenker, Martin; Bartholdi, Deborah; Biskup, Saskia

    2015-03-01

    Megalencephaly-capillary malformation (MCAP) syndrome is an overgrowth syndrome that is diagnosed by clinical criteria. Recently, somatic and germline variants in genes that are involved in the PI3K-AKT pathway (AKT3, PIK3R2 and PIK3CA) have been described to be associated with MCAP and/or other related megalencephaly syndromes. We performed trio-exome sequencing in a 6-year-old boy and his healthy parents. Clinical features were macrocephaly, cutis marmorata, angiomata, asymmetric overgrowth, developmental delay, discrete midline facial nevus flammeus, toe syndactyly and postaxial polydactyly--thus, clearly an MCAP phenotype. Exome sequencing revealed a pathogenic de novo germline variant in the PTPN11 gene (c.1529A>G; p.(Gln510Arg)), which has so far been associated with Noonan, as well as LEOPARD syndrome. Whole-exome sequencing (>100 × coverage) did not reveal any alteration in the known megalencephaly genes. However, ultra-deep sequencing results from saliva (>1000 × coverage) revealed a 22% mosaic variant in PIK3CA (c.2740G>A; p.(Gly914Arg)). To our knowledge, this report is the first description of a PTPN11 germline variant in an MCAP patient. Data from experimental studies show a complex interaction of SHP2 (gene product of PTPN11) and the PI3K-AKT pathway. We hypothesize that certain PTPN11 germline variants might drive toward additional second-hit alterations.

  6. Exome Sequencing Finds a Novel PCSK1 Mutation in a Child With Generalized Malabsorptive Diarrhea and Diabetes Insipidus

    PubMed Central

    Yourshaw, Michael; Solorzano-Vargas, R. Sergio; Pickett, Lindsay A.; Lindberg, Iris; Wang, Jiafang; Cortina, Galen; Pawlikowska-Haddal, Anna; Baron, Howard; Venick, Robert S.; Nelson, Stanley F.; Martín, Martín G.

    2014-01-01

    Objectives Congenital diarrhea disorders are a group of genetically diverse and typically autosomal recessive disorders that have yet to be well characterized phenotypically or molecularly. Diagnostic assessments are generally limited to nutritional challenges and histologic evaluation, and many subjects eventually require a prolonged course of intravenous nutrition. Here we describe next-generation sequencing techniques to investigate a child with perplexing congenital malabsorptive diarrhea and other presumably unrelated clinical problems; this method provides an alternative approach to molecular diagnosis. Methods We screened the diploid genome of an affected individual, using exome sequencing, for uncommon variants that have observed protein-coding consequences. We assessed the functional activity of the mutant protein, as well as its lack of expression using immunohistochemistry. Results Among several rare variants detected was a homozygous nonsense mutation in the catalytic domain of the proprotein convertase subtilisin/kexin type 1 gene. The mutation abolishes prohormone convertase 1/3 endoprotease activity as well as expression in the intestine. These primary genetic findings prompted a careful endocrine reevaluation of the child at 4.5 years of age, and multiple significant problems were subsequently identified consistent with the known phenotypic consequences of proprotein convertase subtilisin/kexin type 1 (PCSK1) gene mutations. Based on the molecular diagnosis, alternate medical and dietary management was implemented for diabetes insipidus, polyphagia, and micropenis. Conclusions Whole-exome sequencing provides a powerful diagnostic tool to clinicians managing rare genetic disorders with multiple perplexing clinical manifestations. PMID:24280991

  7. Exome sequencing finds a novel PCSK1 mutation in a child with generalized malabsorptive diarrhea and diabetes insipidus.

    PubMed

    Yourshaw, Michael; Solorzano-Vargas, R Sergio; Pickett, Lindsay A; Lindberg, Iris; Wang, Jiafang; Cortina, Galen; Pawlikowska-Haddal, Anna; Baron, Howard; Venick, Robert S; Nelson, Stanley F; Martín, Martín G

    2013-12-01

    Congenital diarrhea disorders are a group of genetically diverse and typically autosomal recessive disorders that have yet to be well characterized phenotypically or molecularly. Diagnostic assessments are generally limited to nutritional challenges and histologic evaluation, and many subjects eventually require a prolonged course of intravenous nutrition. Here we describe next-generation sequencing techniques to investigate a child with perplexing congenital malabsorptive diarrhea and other presumably unrelated clinical problems; this method provides an alternative approach to molecular diagnosis. We screened the diploid genome of an affected individual, using exome sequencing, for uncommon variants that have observed protein-coding consequences. We assessed the functional activity of the mutant protein, as well as its lack of expression using immunohistochemistry. Among several rare variants detected was a homozygous nonsense mutation in the catalytic domain of the proprotein convertase subtilisin/kexin type 1 gene. The mutation abolishes prohormone convertase 1/3 endoprotease activity as well as expression in the intestine. These primary genetic findings prompted a careful endocrine reevaluation of the child at 4.5 years of age, and multiple significant problems were subsequently identified consistent with the known phenotypic consequences of proprotein convertase subtilisin/kexin type 1 (PCSK1) gene mutations. Based on the molecular diagnosis, alternate medical and dietary management was implemented for diabetes insipidus, polyphagia, and micropenis. Whole-exome sequencing provides a powerful diagnostic tool to clinicians managing rare genetic disorders with multiple perplexing clinical manifestations.

  8. Personalized genomic analyses for cancer mutation discovery and interpretation

    PubMed Central

    Jones, Siân; Anagnostou, Valsamo; Lytle, Karli; Parpart-Li, Sonya; Nesselbush, Monica; Riley, David R.; Shukla, Manish; Chesnick, Bryan; Kadan, Maura; Papp, Eniko; Galens, Kevin G.; Murphy, Derek; Zhang, Theresa; Kann, Lisa; Sausen, Mark; Angiuoli, Samuel V.; Diaz, Luis A.; Velculescu, Victor E.

    2015-01-01

    Massively parallel sequencing approaches are beginning to be used clinically to characterize individual patient tumors and to select therapies based on the identified mutations. A major question in these analyses is the extent to which these methods identify clinically actionable alterations and whether the examination of the tumor tissue alone is sufficient or whether matched normal DNA should also be analyzed to accurately identify tumor-specific (somatic) alterations. To address these issues, we comprehensively evaluated 815 tumor-normal paired samples from patients of 15 tumor types. We identified genomic alterations using next-generation sequencing of whole exomes or 111 targeted genes that were validated with sensitivities >95% and >99%, respectively, and specificities >99.99%. These analyses revealed an average of 140 and 4.3 somatic mutations per exome and targeted analysis, respectively. More than 75% of cases had somatic alterations in genes associated with known therapies or current clinical trials. Analyses of matched normal DNA identified germline alterations in cancer-predisposing genes in 3% of patients with apparently sporadic cancers. In contrast, a tumor-only sequencing approach could not definitively identify germline changes in cancer-predisposing genes and led to additional false-positive findings comprising 31% and 65% of alterations identified in targeted and exome analyses, respectively, including in potentially actionable genes. These data suggest that matched tumor-normal sequencing analyses are essential for precise identification and interpretation of somatic and germline alterations and have important implications for the diagnostic and therapeutic management of cancer patients. PMID:25877891

  9. Homozygosity Mapping and Whole Exome Sequencing to Detect SLC45A2 and G6PC3 Mutations in a Single Patient with Oculocutaneous Albinism and Neutropenia

    PubMed Central

    Cullinane, Andrew R.; Vilboux, Thierry; O’Brien, Kevin; Curry, James A.; Maynard, Dawn M.; Carlson-Donohoe, Hannah; Ciccone, Carla; Markello, Thomas C.; Gunay-Aygun, Meral; Huizing, Marjan; Gahl, William A.

    2011-01-01

    We evaluated a 32 year-old woman whose oculocutaneous albinism, bleeding diathesis, neutropenia, and history of recurrent infections prompted consideration of the diagnosis of Hermansky-Pudlak syndrome type 2 (HPS-2). This was ruled out due to the presence of platelet delta granules and absence of AP3B1 mutations. Since parental consanguinity suggested an autosomal recessive mode of inheritance, we employed homozygosity mapping, followed by whole exome sequencing, to identify two candidate disease-causing genes, SLC45A2 and G6PC3. Conventional di-deoxy sequencing confirmed pathogenic mutations in SLC45A2, associated with oculocutaneous albinism type 4 (OCA-4), and G6PC3, associated with neutropenia. The substantial reduction of SLC45A2 protein in the patient’s melanocytes caused the mis-localization of tyrosinase from melanosomes to the plasma membrane and also led to the incorporation of tyrosinase into exosomes and secretion into the culture medium, explaining the hypopigmentation in OCA-4. Our patient’s G6PC3 mRNA expression level was also reduced, leading to increased apoptosis of her fibroblasts under ER stress. This report describes the first North American patient with OCA-4, the first culture of human OCA-4 melanocytes, and the use of homozygosity mapping followed by whole exome sequencing to identify disease-causing mutations in multiple genes in a single affected individual. PMID:21677667

  10. Exome sequencing reveals FAM20c mutations associated with fibroblast growth factor 23-related hypophosphatemia, dental anomalies, and ectopic calcification.

    PubMed

    Rafaelsen, Silje Hjorth; Raeder, Helge; Fagerheim, Anne Kristine; Knappskog, Per; Carpenter, Thomas O; Johansson, Stefan; Bjerknes, Robert

    2013-06-01

    Fibroblast growth factor 23 (FGF23) plays a crucial role in renal phosphate regulation, exemplified by the causal role of PHEX and DMP1 mutations in X-linked hypophosphatemic rickets and autosomal recessive rickets type 1, respectively. Using whole exome sequencing we identified compound heterozygous mutations in family with sequence similarity 20, member C (FAM20C) in two siblings referred for hypophosphatemia and severe dental demineralization disease. FAM20C mutations were not found in other undiagnosed probands of a national Norwegian population of familial hypophosphatemia. Our results demonstrate that mutations in FAM20C provide a putative new mechanism in human subjects leading to dysregulated FGF23 levels, hypophosphatemia, hyperphosphaturia, dental anomalies, intracerebral calcifications and osteosclerosis of the long bones in the absence of rickets. Copyright © 2013 American Society for Bone and Mineral Research.

  11. Exome sequencing identifies SUCO mutations in mesial temporal lobe epilepsy.

    PubMed

    Sha, Zhiqiang; Sha, Longze; Li, Wenting; Dou, Wanchen; Shen, Yan; Wu, Liwen; Xu, Qi

    2015-03-30

    Mesial temporal lobe epilepsy (mTLE) is the main type and most common medically intractable form of epilepsy. Severity of disease-based stratified samples may help identify new disease-associated mutant genes. We analyzed mRNA expression profiles from patient hippocampal tissue. Three of the seven patients had severe mTLE with generalized-onset convulsions and consciousness loss that occurred over many years. We found that compared with other groups, patients with severe mTLE were classified into a distinct group. Whole-exome sequencing and Sanger sequencing validation in all seven patients identified three novel SUN domain-containing ossification factor (SUCO) mutations in severely affected patients. Furthermore, SUCO knock down significantly reduced dendritic length in vitro. Our results indicate that mTLE defects may affect neuronal development, and suggest that neurons have abnormal development due to lack of SUCO, which may be a generalized-onset epilepsy-related gene. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.

  12. Exome Sequence Reveals Mutations in CoA Synthase as a Cause of Neurodegeneration with Brain Iron Accumulation

    PubMed Central

    Dusi, Sabrina; Valletta, Lorella; Haack, Tobias B.; Tsuchiya, Yugo; Venco, Paola; Pasqualato, Sebastiano; Goffrini, Paola; Tigano, Marco; Demchenko, Nikita; Wieland, Thomas; Schwarzmayr, Thomas; Strom, Tim M.; Invernizzi, Federica; Garavaglia, Barbara; Gregory, Allison; Sanford, Lynn; Hamada, Jeffrey; Bettencourt, Conceição; Houlden, Henry; Chiapparini, Luisa; Zorzi, Giovanna; Kurian, Manju A.; Nardocci, Nardo; Prokisch, Holger; Hayflick, Susan; Gout, Ivan; Tiranti, Valeria

    2014-01-01

    Neurodegeneration with brain iron accumulation (NBIA) comprises a clinically and genetically heterogeneous group of disorders with progressive extrapyramidal signs and neurological deterioration, characterized by iron accumulation in the basal ganglia. Exome sequencing revealed the presence of recessive missense mutations in COASY, encoding coenzyme A (CoA) synthase in one NBIA-affected subject. A second unrelated individual carrying mutations in COASY was identified by Sanger sequence analysis. CoA synthase is a bifunctional enzyme catalyzing the final steps of CoA biosynthesis by coupling phosphopantetheine with ATP to form dephospho-CoA and its subsequent phosphorylation to generate CoA. We demonstrate alterations in RNA and protein expression levels of CoA synthase, as well as CoA amount, in fibroblasts derived from the two clinical cases and in yeast. This is the second inborn error of coenzyme A biosynthesis to be implicated in NBIA. PMID:24360804

  13. Novel compound heterozygous mutations in the GPR98 (USH2C) gene identified by whole exome sequencing in a Moroccan deaf family.

    PubMed

    Bousfiha, Amale; Bakhchane, Amina; Charoute, Hicham; Detsouli, Mustapha; Rouba, Hassan; Charif, Majida; Lenaers, Guy; Barakat, Abdelhamid

    2017-10-01

    In the present work, we identified two novel compound heterozygote mutations in the GPR98 (G protein-coupled receptor 98) gene causing Usher syndrome. Whole-exome sequencing was performed to study the genetic causes of Usher syndrome in a Moroccan family with three affected siblings. We identify two novel compound heterozygote mutations (c.1054C > A, c.16544delT) in the GPR98 gene in the three affected siblings carrying post-linguale bilateral moderate hearing loss with normal vestibular functions and before installing visual disturbances. This is the first time that mutations in the GPR98 gene are described in the Moroccan deaf patients.

  14. Nasopharyngeal teratoma, congenital diaphragmatic hernia and Dandy-Walker malformation - a yet uncharacterized syndrome.

    PubMed

    Gupta, N; Shastri, S; Singh, P K; Jana, M; Mridha, A; Verma, G; Kabra, M

    2016-11-01

    An association of congenital diaphragmatic hernia, dandy walker malformation and nasopharyngeal teratoma is very rare. Here, we report a fourth case with this association where chromosomal microarray and whole exome sequencing (WES) was performed to understand the underlying genetic basis. Findings of few variants especially a novel variation in HIRA provided some insights. An association of congenital diaphragmatic hernia, dandy walker malformation and nasopharyngeal teratoma is very rare. Here, we report a fourth case with this association where chromosomal microarray and whole exome sequencing (WES) was performed to understand the underlying genetic basis. Findings of few variants especially a novel variation in HIRA provided some insights. © 2016 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  15. Whole-exome sequencing identifies common and rare variant metabolic QTLs in a Middle Eastern population.

    PubMed

    Yousri, Noha A; Fakhro, Khalid A; Robay, Amal; Rodriguez-Flores, Juan L; Mohney, Robert P; Zeriri, Hassina; Odeh, Tala; Kader, Sara Abdul; Aldous, Eman K; Thareja, Gaurav; Kumar, Manish; Al-Shakaki, Alya; Chidiac, Omar M; Mohamoud, Yasmin A; Mezey, Jason G; Malek, Joel A; Crystal, Ronald G; Suhre, Karsten

    2018-01-23

    Metabolomics-genome-wide association studies (mGWAS) have uncovered many metabolic quantitative trait loci (mQTLs) influencing human metabolic individuality, though predominantly in European cohorts. By combining whole-exome sequencing with a high-resolution metabolomics profiling for a highly consanguineous Middle Eastern population, we discover 21 common variant and 12 functional rare variant mQTLs, of which 45% are novel altogether. We fine-map 10 common variant mQTLs to new metabolite ratio associations, and 11 common variant mQTLs to putative protein-altering variants. This is the first work to report common and rare variant mQTLs linked to diseases and/or pharmacological targets in a consanguineous Arab cohort, with wide implications for precision medicine in the Middle East.

  16. Portero versus portador: Spanish interpretation of genomic terminology during whole exome sequencing results disclosure.

    PubMed

    Gutierrez, Amanda M; Robinson, Jill O; Statham, Emily E; Scollon, Sarah; Bergstrom, Katie L; Slashinski, Melody J; Parsons, Donald W; Plon, Sharon E; McGuire, Amy L; Street, Richard L

    2017-11-01

    Describe modifications to technical genomic terminology made by interpreters during disclosure of whole exome sequencing (WES) results. Using discourse analysis, we identified and categorized interpretations of genomic terminology in 42 disclosure sessions where Spanish-speaking parents received their child's WES results either from a clinician using a medical interpreter, or directly from a bilingual physician. Overall, 76% of genomic terms were interpreted accordantly, 11% were misinterpreted and 13% were omitted. Misinterpretations made by interpreters and bilingual physicians included using literal and nonmedical terminology to interpret genomic concepts. Modifications to genomic terminology made during interpretation highlight the need to standardize bilingual genomic lexicons. We recommend Spanish terms that can be used to refer to genomic concepts.

  17. Exome-wide association analysis reveals novel coding sequence variants associated with lipid traits in Chinese.

    PubMed

    Tang, Clara S; Zhang, He; Cheung, Chloe Y Y; Xu, Ming; Ho, Jenny C Y; Zhou, Wei; Cherny, Stacey S; Zhang, Yan; Holmen, Oddgeir; Au, Ka-Wing; Yu, Haiyi; Xu, Lin; Jia, Jia; Porsch, Robert M; Sun, Lijie; Xu, Weixian; Zheng, Huiping; Wong, Lai-Yung; Mu, Yiming; Dou, Jingtao; Fong, Carol H Y; Wang, Shuyu; Hong, Xueyu; Dong, Liguang; Liao, Yanhua; Wang, Jiansong; Lam, Levina S M; Su, Xi; Yan, Hua; Yang, Min-Lee; Chen, Jin; Siu, Chung-Wah; Xie, Gaoqiang; Woo, Yu-Cho; Wu, Yangfeng; Tan, Kathryn C B; Hveem, Kristian; Cheung, Bernard M Y; Zöllner, Sebastian; Xu, Aimin; Eugene Chen, Y; Jiang, Chao Qiang; Zhang, Youyi; Lam, Tai-Hing; Ganesh, Santhi K; Huo, Yong; Sham, Pak C; Lam, Karen S L; Willer, Cristen J; Tse, Hung-Fat; Gao, Wei

    2015-12-22

    Blood lipids are important risk factors for coronary artery disease (CAD). Here we perform an exome-wide association study by genotyping 12,685 Chinese, using a custom Illumina HumanExome BeadChip, to identify additional loci influencing lipid levels. Single-variant association analysis on 65,671 single nucleotide polymorphisms reveals 19 loci associated with lipids at exome-wide significance (P<2.69 × 10(-7)), including three Asian-specific coding variants in known genes (CETP p.Asp459Gly, PCSK9 p.Arg93Cys and LDLR p.Arg257Trp). Furthermore, missense variants at two novel loci-PNPLA3 p.Ile148Met and PKD1L3 p.Thr429Ser-also influence levels of triglycerides and low-density lipoprotein cholesterol, respectively. Another novel gene, TEAD2, is found to be associated with high-density lipoprotein cholesterol through gene-based association analysis. Most of these newly identified coding variants show suggestive association (P<0.05) with CAD. These findings demonstrate that exome-wide genotyping on samples of non-European ancestry can identify additional population-specific possible causal variants, shedding light on novel lipid biology and CAD.

  18. Analysis of Sequence Data Under Multivariate Trait-Dependent Sampling.

    PubMed

    Tao, Ran; Zeng, Donglin; Franceschini, Nora; North, Kari E; Boerwinkle, Eric; Lin, Dan-Yu

    2015-06-01

    High-throughput DNA sequencing allows for the genotyping of common and rare variants for genetic association studies. At the present time and for the foreseeable future, it is not economically feasible to sequence all individuals in a large cohort. A cost-effective strategy is to sequence those individuals with extreme values of a quantitative trait. We consider the design under which the sampling depends on multiple quantitative traits. Under such trait-dependent sampling, standard linear regression analysis can result in bias of parameter estimation, inflation of type I error, and loss of power. We construct a likelihood function that properly reflects the sampling mechanism and utilizes all available data. We implement a computationally efficient EM algorithm and establish the theoretical properties of the resulting maximum likelihood estimators. Our methods can be used to perform separate inference on each trait or simultaneous inference on multiple traits. We pay special attention to gene-level association tests for rare variants. We demonstrate the superiority of the proposed methods over standard linear regression through extensive simulation studies. We provide applications to the Cohorts for Heart and Aging Research in Genomic Epidemiology Targeted Sequencing Study and the National Heart, Lung, and Blood Institute Exome Sequencing Project.

  19. Implementing genomic medicine in pathology.

    PubMed

    Williams, Eli S; Hegde, Madhuri

    2013-07-01

    The finished sequence of the Human Genome Project, published 50 years after Watson and Crick's seminal paper on the structure of DNA, pushed human genetics into the public eye and ushered in the genomic era. A significant, if overlooked, aspect of the race to complete the genome was the technology that propelled scientists to the finish line. DNA sequencing technologies have become more standardized, automated, and capable of higher throughput. This technology has continued to grow at an astounding rate in the decade since the Human Genome Project was completed. Today, massively parallel sequencing, or next-generation sequencing (NGS), allows the detection of genetic variants across the entire genome. This ability has led to the identification of new causes of disease and is changing the way we categorize, treat, and manage disease. NGS approaches such as whole-exome sequencing and whole-genome sequencing are rapidly becoming an affordable genetic testing strategy for the clinical laboratory. One test can now provide vast amounts of health information pertaining not only to the disease of interest, but information that may also predict adult-onset disease, reveal carrier status for a rare disease and predict drug responsiveness. The issue of what to do with these incidental findings, along with questions pertaining to NGS testing strategies, data interpretation and storage, and applying genetic testing results into patient care, remains without a clear answer. This review will explore these issues and others relevant to the implementation of NGS in the clinical laboratory.

  20. Whole Exome Analysis of Early Onset Alzheimer’s Disease

    DTIC Science & Technology

    2016-04-01

    Early Onset Alzheimer’s Disease 5a. CONTRACT NUMBER W81XWH-12-1-0013 5b. GRANT NUMBER 5c. PROGRAM ELEMENT NUMBER 6. AUTHOR( S ) Margaret A. Pericak...relationship between SORL1, AD, and Parkinsonism . 16 Appendix V: ABCA7 Frameshift Deletion Associated with Alzheimer’s Disease in African Americans...onset Alzheimer disease identified using whole-exome sequencing G. W. Beecham1, B. W. Kunkle1, B. Vardarajan2, P. L. Whitehead1, S . Rolati1, E. R

  1. Exome-first approach identified a novel gloss deletion associated with Lowe syndrome.

    PubMed

    Watanabe, Miki; Nakagawa, Ryuji; Kohmoto, Tomohiro; Naruto, Takuya; Suga, Ken-Ichi; Goji, Aya; Horikawa, Hideaki; Masuda, Kiyoshi; Kagami, Shoji; Imoto, Issei

    2016-01-01

    Lowe syndrome (LS) is an X-linked disorder affecting the eyes, nervous system and kidneys, typically caused by missense or nonsense/frameshift OCRL mutations. We report a 6-month-old male clinically suspected to have LS, but without the Fanconi-type renal dysfunction. Using a targeted-exome sequencing-first approach, LS was diagnosed by the identification of a deletion involving 1.7 Mb at Xq25-q26.1, encompassing the entire OCRL gene and neighboring loci.

  2. Detection of 1p36 deletion by clinical exome-first diagnostic approach.

    PubMed

    Watanabe, Miki; Hayabuchi, Yasunobu; Ono, Akemi; Naruto, Takuya; Horikawa, Hideaki; Kohmoto, Tomohiro; Masuda, Kiyoshi; Nakagawa, Ryuji; Ito, Hiromichi; Kagami, Shoji; Imoto, Issei

    2016-01-01

    Although chromosome 1p36 deletion syndrome is considered clinically recognizable based on characteristic features, the clinical manifestations of patients during infancy are often not consistent with those observed later in life. We report a 4-month-old girl who showed multiple congenital anomalies and developmental delay, but no clinical signs of syndromic disease caused by a terminal deletion in 1p36.32-p36.33 that was first identified by targeted-exome sequencing for molecular diagnosis.

  3. Detection of 1p36 deletion by clinical exome-first diagnostic approach

    PubMed Central

    Watanabe, Miki; Hayabuchi, Yasunobu; Ono, Akemi; Naruto, Takuya; Horikawa, Hideaki; Kohmoto, Tomohiro; Masuda, Kiyoshi; Nakagawa, Ryuji; Ito, Hiromichi; Kagami, Shoji; Imoto, Issei

    2016-01-01

    Although chromosome 1p36 deletion syndrome is considered clinically recognizable based on characteristic features, the clinical manifestations of patients during infancy are often not consistent with those observed later in life. We report a 4-month-old girl who showed multiple congenital anomalies and developmental delay, but no clinical signs of syndromic disease caused by a terminal deletion in 1p36.32-p36.33 that was first identified by targeted-exome sequencing for molecular diagnosis. PMID:28428889

  4. Exome-first approach identified a novel gloss deletion associated with Lowe syndrome

    PubMed Central

    Watanabe, Miki; Nakagawa, Ryuji; Kohmoto, Tomohiro; Naruto, Takuya; Suga, Ken-ichi; Goji, Aya; Horikawa, Hideaki; Masuda, Kiyoshi; Kagami, Shoji; Imoto, Issei

    2016-01-01

    Lowe syndrome (LS) is an X-linked disorder affecting the eyes, nervous system and kidneys, typically caused by missense or nonsense/frameshift OCRL mutations. We report a 6-month-old male clinically suspected to have LS, but without the Fanconi-type renal dysfunction. Using a targeted-exome sequencing-first approach, LS was diagnosed by the identification of a deletion involving 1.7 Mb at Xq25-q26.1, encompassing the entire OCRL gene and neighboring loci. PMID:27867521

  5. Next-generation sequencing of translocation renal cell carcinoma reveals novel RNA splicing partners and frequent mutations of chromatin-remodeling genes.

    PubMed

    Malouf, Gabriel G; Su, Xiaoping; Yao, Hui; Gao, Jianjun; Xiong, Liangwen; He, Qiuming; Compérat, Eva; Couturier, Jérôme; Molinié, Vincent; Escudier, Bernard; Camparo, Philippe; Doss, Denaha J; Thompson, Erika J; Khayat, David; Wood, Christopher G; Yu, Willie; Teh, Bin T; Weinstein, John; Tannir, Nizar M

    2014-08-01

    MITF/TFE translocation renal cell carcinoma (TRCC) is a rare subtype of kidney cancer. Its incidence and the genome-wide characterization of its genetic origin have not been fully elucidated. We performed RNA and exome sequencing on an exploratory set of TRCC (n = 7), and validated our findings using The Cancer Genome Atlas (TCGA) clear-cell RCC (ccRCC) dataset (n = 460). Using the TCGA dataset, we identified seven TRCC (1.5%) cases and determined their genomic profile. We discovered three novel partners of MITF/TFE (LUC7L3, KHSRP, and KHDRBS2) that are involved in RNA splicing. TRCC displayed a unique gene expression signature as compared with other RCC types, and showed activation of MITF, the transforming growth factor β1 and the PI3K complex targets. Genes differentially spliced between TRCC and other RCC types were enriched for MITF and ID2 targets. Exome sequencing of TRCC revealed a distinct mutational spectrum as compared with ccRCC, with frequent mutations in chromatin-remodeling genes (six of eight cases, three of which were from the TCGA). In two cases, we identified mutations in INO80D, an ATP-dependent chromatin-remodeling gene, previously shown to control the amplitude of the S phase. Knockdown of INO80D decreased cell proliferation in a novel cell line bearing LUC7L3-TFE3 translocation. This genome-wide study defines the incidence of TRCC within a ccRCC-directed project and expands the genomic spectrum of TRCC by identifying novel MITF/TFE partners involved in RNA splicing and frequent mutations in chromatin-remodeling genes. ©2014 American Association for Cancer Research.

  6. Epistasis analysis for quantitative traits by functional regression model.

    PubMed

    Zhang, Futao; Boerwinkle, Eric; Xiong, Momiao

    2014-06-01

    The critical barrier in interaction analysis for rare variants is that most traditional statistical methods for testing interactions were originally designed for testing the interaction between common variants and are difficult to apply to rare variants because of their prohibitive computational time and poor ability. The great challenges for successful detection of interactions with next-generation sequencing (NGS) data are (1) lack of methods for interaction analysis with rare variants, (2) severe multiple testing, and (3) time-consuming computations. To meet these challenges, we shift the paradigm of interaction analysis between two loci to interaction analysis between two sets of loci or genomic regions and collectively test interactions between all possible pairs of SNPs within two genomic regions. In other words, we take a genome region as a basic unit of interaction analysis and use high-dimensional data reduction and functional data analysis techniques to develop a novel functional regression model to collectively test interactions between all possible pairs of single nucleotide polymorphisms (SNPs) within two genome regions. By intensive simulations, we demonstrate that the functional regression models for interaction analysis of the quantitative trait have the correct type 1 error rates and a much better ability to detect interactions than the current pairwise interaction analysis. The proposed method was applied to exome sequence data from the NHLBI's Exome Sequencing Project (ESP) and CHARGE-S study. We discovered 27 pairs of genes showing significant interactions after applying the Bonferroni correction (P-values < 4.58 × 10(-10)) in the ESP, and 11 were replicated in the CHARGE-S study. © 2014 Zhang et al.; Published by Cold Spring Harbor Laboratory Press.

  7. The evolution of the search for novel genes in mammalian sex determination: from mice to men.

    PubMed

    Arboleda, Valerie A; Vilain, Eric

    2011-01-01

    Disorders of sex determination are a genetically heterogeneous group of rare disorders, presenting with sex-specific phenotypes and variable expressivity. Prior to the advent of the Human Genome Project, the identification of novel mammalian sex determination genes was hindered by the rarity of disorders of sex determination and small family sizes that made traditional linkage approaches difficult, if not impossible. This article reviews the revolutionary role of the Human Genome Project in the history of sex determination research and highlights the important role of inbred mouse models in elucidating the role of identified sex determination genes in mammalian sex determination. Next generation sequencing technologies has made it possible to sequence complete human genomes or exomes for the purpose of providing a genetic diagnosis to more patients with unexplained disorders of sex determination and identifying novel sex determination genes. However, beyond novel gene discovery, these tools have the power to inform us on more intricate and complex regulation-taking place within the heterogeneous cells that make up the testis and ovary. Copyright © 2011 Elsevier Inc. All rights reserved.

  8. Functional Regression Models for Epistasis Analysis of Multiple Quantitative Traits.

    PubMed

    Zhang, Futao; Xie, Dan; Liang, Meimei; Xiong, Momiao

    2016-04-01

    To date, most genetic analyses of phenotypes have focused on analyzing single traits or analyzing each phenotype independently. However, joint epistasis analysis of multiple complementary traits will increase statistical power and improve our understanding of the complicated genetic structure of the complex diseases. Despite their importance in uncovering the genetic structure of complex traits, the statistical methods for identifying epistasis in multiple phenotypes remains fundamentally unexplored. To fill this gap, we formulate a test for interaction between two genes in multiple quantitative trait analysis as a multiple functional regression (MFRG) in which the genotype functions (genetic variant profiles) are defined as a function of the genomic position of the genetic variants. We use large-scale simulations to calculate Type I error rates for testing interaction between two genes with multiple phenotypes and to compare the power with multivariate pairwise interaction analysis and single trait interaction analysis by a single variate functional regression model. To further evaluate performance, the MFRG for epistasis analysis is applied to five phenotypes of exome sequence data from the NHLBI's Exome Sequencing Project (ESP) to detect pleiotropic epistasis. A total of 267 pairs of genes that formed a genetic interaction network showed significant evidence of epistasis influencing five traits. The results demonstrate that the joint interaction analysis of multiple phenotypes has a much higher power to detect interaction than the interaction analysis of a single trait and may open a new direction to fully uncovering the genetic structure of multiple phenotypes.

  9. Further delineation of the SATB2 phenotype.

    PubMed

    Döcker, Dennis; Schubach, Max; Menzel, Moritz; Munz, Marita; Spaich, Christiane; Biskup, Saskia; Bartholdi, Deborah

    2014-08-01

    SATB2 is an evolutionarily highly conserved chromatin remodeling gene located on chromosome 2q33.1. Vertebrate animal models have shown that Satb2 has a crucial role in craniofacial patterning and osteoblast differentiation, as well as in determining the fates of neuronal projections in the developing neocortex. In humans, chromosomal translocations and deletions of 2q33.1 leading to SATB2 haploinsufficiency are associated with cleft palate (CP), facial dysmorphism and intellectual disability (ID). A single patient carrying a nonsense mutation in SATB2 has been described to date. In this study, we performed trio-exome sequencing in a 3-year-old girl with CP and severely delayed speech development, and her unaffected parents. Previously, the girl had undergone conventional and molecular karyotyping (microarray analysis), as well as targeted analysis for different diseases associated with developmental delay, including Angelman syndrome, Rett syndrome and Fragile X syndrome. No diagnosis could be established. Exome sequencing revealed a de novo nonsense mutation in the SATB2 gene (c.715C>T; p.R239*). The identification of a second patient carrying a de novo nonsense mutation in SATB2 confirms that this gene is essential for normal craniofacial patterning and cognitive development. Based on our data and the literature published so far, we propose a new clinically recognizable syndrome - the SATB2-associated syndrome (SAS). SAS is likely to be underdiagnosed and should be considered in children with ID, severe speech delay, cleft or high-arched palate and abnormal dentition with crowded and irregularly shaped teeth.

  10. CIDR

    Science.gov Websites

    Genotyping General Information Genome Wide Association Custom FFPE Sample Options Methylation Linkage Enrichment Options 51 Mb 51 Mb plus 6.8 - 24Mb custom option 54 Mb Clinical Exome 71 Mb (includes UTRs) Next Generation Sequencing Platform Illumina HiSeq sequencers Options for Formalin-Fixed Paraffin-Embedded (FFPE

  11. The genetic architecture of long QT syndrome: A critical reappraisal.

    PubMed

    Giudicessi, John R; Wilde, Arthur A M; Ackerman, Michael J

    2018-03-30

    Collectively, the completion of the Human Genome Project and subsequent development of high-throughput next-generation sequencing methodologies have revolutionized genomic research. However, the rapid sequencing and analysis of thousands upon thousands of human exomes and genomes has taught us that most genes, including those known to cause heritable cardiovascular disorders such as long QT syndrome, harbor an unexpected background rate of rare, and presumably innocuous, non-synonymous genetic variation. In this Review, we aim to reappraise the genetic architecture underlying both the acquired and congenital forms of long QT syndrome by examining how the clinical phenotype associated with and background genetic variation in long QT syndrome-susceptibility genes impacts the clinical validity of existing gene-disease associations and the variant classification and reporting strategies that serve as the foundation for diagnostic long QT syndrome genetic testing. Copyright © 2018 Elsevier Inc. All rights reserved.

  12. LipidSeq: a next-generation clinical resequencing panel for monogenic dyslipidemias.

    PubMed

    Johansen, Christopher T; Dubé, Joseph B; Loyzer, Melissa N; MacDonald, Austin; Carter, David E; McIntyre, Adam D; Cao, Henian; Wang, Jian; Robinson, John F; Hegele, Robert A

    2014-04-01

    We report the design of a targeted resequencing panel for monogenic dyslipidemias, LipidSeq, for the purpose of replacing Sanger sequencing in the clinical detection of dyslipidemia-causing variants. We also evaluate the performance of the LipidSeq approach versus Sanger sequencing in 84 patients with a range of phenotypes including extreme blood lipid concentrations as well as additional dyslipidemias and related metabolic disorders. The panel performs well, with high concordance (95.2%) in samples with known mutations based on Sanger sequencing and a high detection rate (57.9%) of mutations likely to be causative for disease in samples not previously sequenced. Clinical implementation of LipidSeq has the potential to aid in the molecular diagnosis of patients with monogenic dyslipidemias with a high degree of speed and accuracy and at lower cost than either Sanger sequencing or whole exome sequencing. Furthermore, LipidSeq will help to provide a more focused picture of monogenic and polygenic contributors that underlie dyslipidemia while excluding the discovery of incidental pathogenic clinically actionable variants in nonmetabolism-related genes, such as oncogenes, that would otherwise be identified by a whole exome approach, thus minimizing potential ethical issues.

  13. LipidSeq: a next-generation clinical resequencing panel for monogenic dyslipidemias[S

    PubMed Central

    Johansen, Christopher T.; Dubé, Joseph B.; Loyzer, Melissa N.; MacDonald, Austin; Carter, David E.; McIntyre, Adam D.; Cao, Henian; Wang, Jian; Robinson, John F.; Hegele, Robert A.

    2014-01-01

    We report the design of a targeted resequencing panel for monogenic dyslipidemias, LipidSeq, for the purpose of replacing Sanger sequencing in the clinical detection of dyslipidemia-causing variants. We also evaluate the performance of the LipidSeq approach versus Sanger sequencing in 84 patients with a range of phenotypes including extreme blood lipid concentrations as well as additional dyslipidemias and related metabolic disorders. The panel performs well, with high concordance (95.2%) in samples with known mutations based on Sanger sequencing and a high detection rate (57.9%) of mutations likely to be causative for disease in samples not previously sequenced. Clinical implementation of LipidSeq has the potential to aid in the molecular diagnosis of patients with monogenic dyslipidemias with a high degree of speed and accuracy and at lower cost than either Sanger sequencing or whole exome sequencing. Furthermore, LipidSeq will help to provide a more focused picture of monogenic and polygenic contributors that underlie dyslipidemia while excluding the discovery of incidental pathogenic clinically actionable variants in nonmetabolism-related genes, such as oncogenes, that would otherwise be identified by a whole exome approach, thus minimizing potential ethical issues. PMID:24503134

  14. Comprehensive Rare Variant Analysis via Whole-Genome Sequencing to Determine the Molecular Pathology of Inherited Retinal Disease.

    PubMed

    Carss, Keren J; Arno, Gavin; Erwood, Marie; Stephens, Jonathan; Sanchis-Juan, Alba; Hull, Sarah; Megy, Karyn; Grozeva, Detelina; Dewhurst, Eleanor; Malka, Samantha; Plagnol, Vincent; Penkett, Christopher; Stirrups, Kathleen; Rizzo, Roberta; Wright, Genevieve; Josifova, Dragana; Bitner-Glindzicz, Maria; Scott, Richard H; Clement, Emma; Allen, Louise; Armstrong, Ruth; Brady, Angela F; Carmichael, Jenny; Chitre, Manali; Henderson, Robert H H; Hurst, Jane; MacLaren, Robert E; Murphy, Elaine; Paterson, Joan; Rosser, Elisabeth; Thompson, Dorothy A; Wakeling, Emma; Ouwehand, Willem H; Michaelides, Michel; Moore, Anthony T; Webster, Andrew R; Raymond, F Lucy

    2017-01-05

    Inherited retinal disease is a common cause of visual impairment and represents a highly heterogeneous group of conditions. Here, we present findings from a cohort of 722 individuals with inherited retinal disease, who have had whole-genome sequencing (n = 605), whole-exome sequencing (n = 72), or both (n = 45) performed, as part of the NIHR-BioResource Rare Diseases research study. We identified pathogenic variants (single-nucleotide variants, indels, or structural variants) for 404/722 (56%) individuals. Whole-genome sequencing gives unprecedented power to detect three categories of pathogenic variants in particular: structural variants, variants in GC-rich regions, which have significantly improved coverage compared to whole-exome sequencing, and variants in non-coding regulatory regions. In addition to previously reported pathogenic regulatory variants, we have identified a previously unreported pathogenic intronic variant in CHM in two males with choroideremia. We have also identified 19 genes not previously known to be associated with inherited retinal disease, which harbor biallelic predicted protein-truncating variants in unsolved cases. Whole-genome sequencing is an increasingly important comprehensive method with which to investigate the genetic causes of inherited retinal disease. Copyright © 2017. Published by Elsevier Inc.

  15. Exome Sequencing Identifies Truncating Mutations in Human SERPINF1 in Autosomal-Recessive Osteogenesis Imperfecta

    PubMed Central

    Becker, Jutta; Semler, Oliver; Gilissen, Christian; Li, Yun; Bolz, Hanno Jörn; Giunta, Cecilia; Bergmann, Carsten; Rohrbach, Marianne; Koerber, Friederike; Zimmermann, Katharina; de Vries, Petra; Wirth, Brunhilde; Schoenau, Eckhard; Wollnik, Bernd; Veltman, Joris A.; Hoischen, Alexander; Netzer, Christian

    2011-01-01

    Osteogenesis imperfecta (OI) is a heterogeneous genetic disorder characterized by bone fragility and susceptibility to fractures after minimal trauma. After mutations in all known OI genes had been excluded by Sanger sequencing, we applied next-generation sequencing to analyze the exome of a single individual who has a severe form of the disease and whose parents are second cousins. A total of 26,922 variations from the human reference genome sequence were subjected to several filtering steps. In addition, we extracted the genotypes of all dbSNP130-annotated SNPs from the exome sequencing data and used these 299,494 genotypes as markers for the genome-wide identification of homozygous regions. A single homozygous truncating mutation, affecting SERPINF1 on chromosome 17p13.3, that was embedded into a homozygous stretch of 2.99 Mb remained. The mutation was also homozygous in the affected brother of the index patient. Subsequently, we identified homozygosity for two different truncating SERPINF1 mutations in two unrelated patients with OI and parental consanguinity. All four individuals with SERPINF1 mutations have severe OI. Fractures of long bones and severe vertebral compression fractures with resulting deformities were observed as early as the first year of life in these individuals. Collagen analyses with cultured dermal fibroblasts displayed no evidence for impaired collagen folding, posttranslational modification, or secretion. SERPINF1 encodes pigment epithelium-derived factor (PEDF), a secreted glycoprotein of the serpin superfamily. PEDF is a multifunctional protein and one of the strongest inhibitors of angiogenesis currently known in humans. Our data provide genetic evidence for PEDF involvement in human bone homeostasis. PMID:21353196

  16. Mutation analysis in 129 genes associated with other forms of retinal dystrophy in 157 families with retinitis pigmentosa based on exome sequencing.

    PubMed

    Xu, Yan; Guan, Liping; Xiao, Xueshan; Zhang, Jianguo; Li, Shiqiang; Jiang, Hui; Jia, Xiaoyun; Yang, Jianhua; Guo, Xiangming; Yin, Ye; Wang, Jun; Zhang, Qingjiong

    2015-01-01

    Mutations in 60 known genes were previously identified by exome sequencing in 79 of 157 families with retinitis pigmentosa (RP). This study analyzed variants in 129 genes associated with other forms of hereditary retinal dystrophy in the same cohort. Apart from the 73 genes previously analyzed, a further 129 genes responsible for other forms of hereditary retinal dystrophy were selected based on RetNet. Variants in the 129 genes determined by whole exome sequencing were selected and filtered by bioinformatics analysis. Candidate variants were confirmed by Sanger sequencing and validated by analysis of available family members and controls. A total of 90 candidate variants were present in the 129 genes. Sanger sequencing confirmed 83 of the 90 variants. Analysis of family members and controls excluded 76 of these 83 variants. The remaining seven variants were considered to be potential pathogenic mutations; these were c.899A>G, c.1814C>G, and c.2107C>T in BBS2; c.1073C>T and c.1669C>T in INPP5E; and c.3582C>G and c.5704-5C>G in CACNA1F. Six of these seven mutations were novel. The mutations were detected in five unrelated patients without a family history, including three patients with homozygous or compound heterozygous mutations in BBS2 and INPP5E, and two patients with hemizygous mutations in CACNA1F. None of the patients had mutations in the genes associated with autosome dominant retinal dystrophy. Only a small portion of patients with RP, about 3% (5/157), had causative mutations in the 129 genes associated with other forms of hereditary retinal dystrophy.

  17. Targeted exome sequencing for the identification of a protective variant against Internet gaming disorder at rs2229910 of neurotrophic tyrosine kinase receptor, type 3 (NTRK3): A pilot study

    PubMed Central

    Kim, Jeong-Yu; Jeong, Jo-Eun; Rhee, Je-Keun; Cho, Hyun; Chun, Ji-Won; Kim, Tae-Min; Choi, Sam-Wook; Choi, Jung-Seok; Kim, Dai-Jin

    2016-01-01

    Background and aims Internet gaming disorder (IGD) has gained recognition as a potential new diagnosis in the fifth revision of the Diagnostic and Statistical Manual of Mental Disorders, but genetic evidence supporting this disorder remains scarce. Methods In this study, targeted exome sequencing was conducted in 30 IGD patients and 30 control subjects with a focus on genes linked to various neurotransmitters associated with substance and non-substance addictions, depression, and attention deficit hyperactivity disorder. Results rs2229910 of neurotrophic tyrosine kinase receptor, type 3 (NTRK3) was the only single nucleotide polymorphism (SNP) that exhibited a significantly different minor allele frequency in IGD subjects compared to controls (p = .01932), suggesting that this SNP has a protective effect against IGD (odds ratio = 0.1541). The presence of this potentially protective allele was also associated with less time spent on Internet gaming and lower scores on the Young’s Internet Addiction Test and Korean Internet Addiction Proneness Scale for Adults. Conclusions The results of this first targeted exome sequencing study of IGD subjects indicate that rs2229910 of NTRK3 is a genetic variant that is significantly related to IGD. These findings may have significant implications for future research investigating the genetics of IGD and other behavioral addictions. PMID:27826991

  18. Asparagine synthetase deficiency detected by whole exome sequencing causes congenital microcephaly, epileptic encephalopathy and psychomotor delay.

    PubMed

    Ben-Salem, Salma; Gleeson, Joseph G; Al-Shamsi, Aisha M; Islam, Barira; Hertecant, Jozef; Ali, Bassam R; Al-Gazali, Lihadh

    2015-06-01

    Deficiency of Asparagine Synthetase (ASNSD, MIM 615574) is a very rare autosomal recessive disorder presenting with some brain abnormalities. Affected individuals have congenital microcephaly and progressive encephalopathy associated with severe intellectual disability and intractable seizures. The loss of function of the asparagine synthetase (ASNS, EC 6.3.5.4), particularly in the brain, is the major cause of this particular congenital microcephaly. In this study, we clinically evaluated an affected child from a consanguineous Emirati family presenting with congenital microcephaly and epileptic encephalopathy. In addition, whole-exome sequencing revealed a novel homozygous substitution mutation (c.1193A > C) in the ASNS gene. This mutation resulted in the substitution of highly conserved tyrosine residue by cysteine (p.Y398C). Molecular modeling analysis predicts hypomorphic and damaging effects of this mutation on the protein structure and altering its enzymatic activity. Therefore, we conclude that the loss of ASNS function is most likely the cause of this condition in the studied family. This report brings the number of reported families with this very rare disorder to five and the number of pathogenic mutations in the ASNS gene to four. This finding extends the ASNS pathogenic mutations spectrum and highlights the utility of whole-exome sequencing in elucidation the causes of rare recessive disorders that are heterogeneous and/or overlap with other conditions.

  19. Clinical multiplexed exome sequencing distinguishes adult oligodendroglial neoplasms from astrocytic and mixed lineage gliomas.

    PubMed

    Cryan, Jane B; Haidar, Sam; Ramkissoon, Lori A; Bi, Wenya Linda; Knoff, David S; Schultz, Nikolaus; Abedalthagafi, Malak; Brown, Loreal; Wen, Patrick Y; Reardon, David A; Dunn, Ian F; Folkerth, Rebecca D; Santagata, Sandro; Lindeman, Neal I; Ligon, Azra H; Beroukhim, Rameen; Hornick, Jason L; Alexander, Brian M; Ligon, Keith L; Ramkissoon, Shakti H

    2014-09-30

    Classifying adult gliomas remains largely a histologic diagnosis based on morphology; however astrocytic, oligodendroglial and mixed lineage tumors can display overlapping histologic features. We used multiplexed exome sequencing (OncoPanel) on 108 primary or recurrent adult gliomas, comprising 65 oligodendrogliomas, 28 astrocytomas and 15 mixed oligoastrocytomas to identify lesions that could enhance lineage classification. Mutations in TP53 (20/28, 71%) and ATRX (15/28, 54%) were enriched in astrocytic tumors compared to oligodendroglial tumors of which 4/65 (6%) had mutations in TP53 and 2/65 (3%) had ATRX mutations. We found that oligoastrocytomas harbored mutations in TP53 (80%, 12/15) and ATRX (60%, 9/15) at frequencies similar to pure astrocytic tumors, suggesting that oligoastrocytomas and astrocytomas may represent a single genetic or biological entity. p53 protein expression correlated with mutation status and showed significant increases in astrocytomas and oligoastrocytomas compared to oligodendrogliomas, a finding that also may facilitate accurate classification. Furthermore our OncoPanel analysis revealed that 15% of IDH1/2 mutant gliomas would not be detected by traditional IDH1 (p.R132H) antibody testing, supporting the use of genomic technologies in providing clinically relevant data. In all, our results demonstrate that multiplexed exome sequencing can support evaluation and classification of adult low-grade gliomas with a single clinical test.

  20. Use of whole exome sequencing for the identification of Ito-based arrhythmia mechanism and therapy.

    PubMed

    Sturm, Amy C; Kline, Crystal F; Glynn, Patric; Johnson, Benjamin L; Curran, Jerry; Kilic, Ahmet; Higgins, Robert S D; Binkley, Philip F; Janssen, Paul M L; Weiss, Raul; Raman, Subha V; Fowler, Steven J; Priori, Silvia G; Hund, Thomas J; Carnes, Cynthia A; Mohler, Peter J

    2015-05-26

    Identified genetic variants are insufficient to explain all cases of inherited arrhythmia. We tested whether the integration of whole exome sequencing with well-established clinical, translational, and basic science platforms could provide rapid and novel insight into human arrhythmia pathophysiology and disease treatment. We report a proband with recurrent ventricular fibrillation, resistant to standard therapeutic interventions. Using whole-exome sequencing, we identified a variant in a previously unidentified exon of the dipeptidyl aminopeptidase-like protein-6 (DPP6) gene. This variant is the first identified coding mutation in DPP6 and augments cardiac repolarizing current (Ito) causing pathological changes in Ito and action potential morphology. We designed a therapeutic regimen incorporating dalfampridine to target Ito. Dalfampridine, approved for multiple sclerosis, normalized the ECG and reduced arrhythmia burden in the proband by >90-fold. This was combined with cilostazol to accelerate the heart rate to minimize the reverse-rate dependence of augmented Ito. We describe a novel arrhythmia mechanism and therapeutic approach to ameliorate the disease. Specifically, we identify the first coding variant of DPP6 in human ventricular fibrillation. These findings illustrate the power of genetic approaches for the elucidation and treatment of disease when carefully integrated with clinical and basic/translational research teams. © 2015 The Authors. Published on behalf of the American Heart Association, Inc., by Wiley Blackwell.

  1. Exome sequencing identifies a novel mutation of the GDI1 gene in a Chinese non-syndromic X-linked intellectual disability family

    PubMed Central

    Duan, Yongheng; Lin, Sheng; Xie, Lichun; Zheng, Kaifeng; Chen, Shiguo; Song, Hui; Zeng, Xuchun; Gu, Xueying; Wang, Heyun; Zhang, Linghua; Shao, Hao; Hong, Wenxu; Zhang, Lijie; Duan, Shan

    2017-01-01

    Abstract X-linked intellectual disability (XLID) has been associated with various genes. Diagnosis of XLID, especially for non-syndromic ones (NS-XLID), is often hampered by the heterogeneity of this disease. Here we report the case of a Chinese family in which three males suffer from intellectual disability (ID). The three patients shared the same phenotype: no typical clinical manifestation other than IQ score ≤ 70. For a genetic diagnosis for this family we carried out whole exome sequencing on the proband, and validated 16 variants of interest in the genomic DNA of all the family members. A missense mutation (c.710G > T), which mapped to exon 6 of the Rab GDP-Dissociation Inhibitor 1 (GDI1) gene, was found segregating with the ID phenotype, and this mutation changes the 237th position in the guanosine diphosphate dissociation inhibitor (GDI) protein from glycine to valine (p. Gly237Val). Through molecular dynamics simulations we found that this substitution results in a conformational change of GDI, possibly affecting the Rab-binding capacity of this protein. In conclusion, our study identified a novel GDI1 mutation that is possibly NS-XLID causative, and showed that whole exome sequencing provides advantages for detecting novel ID-associated variants and can greatly facilitate the genetic diagnosis of the disease. PMID:28863211

  2. Computational evaluation of exome sequence data using human and model organism phenotypes improves diagnostic efficiency

    PubMed Central

    Bone, William P.; Washington, Nicole L.; Buske, Orion J.; Adams, David R.; Davis, Joie; Draper, David; Flynn, Elise D.; Girdea, Marta; Godfrey, Rena; Golas, Gretchen; Groden, Catherine; Jacobsen, Julius; Köhler, Sebastian; Lee, Elizabeth M. J.; Links, Amanda E.; Markello, Thomas C.; Mungall, Christopher J.; Nehrebecky, Michele; Robinson, Peter N.; Sincan, Murat; Soldatos, Ariane G.; Tifft, Cynthia J.; Toro, Camilo; Trang, Heather; Valkanas, Elise; Vasilevsky, Nicole; Wahl, Colleen; Wolfe, Lynne A.; Boerkoel, Cornelius F.; Brudno, Michael; Haendel, Melissa A.; Gahl, William A.; Smedley, Damian

    2016-01-01

    Purpose: Medical diagnosis and molecular or biochemical confirmation typically rely on the knowledge of the clinician. Although this is very difficult in extremely rare diseases, we hypothesized that the recording of patient phenotypes in Human Phenotype Ontology (HPO) terms and computationally ranking putative disease-associated sequence variants improves diagnosis, particularly for patients with atypical clinical profiles. Genet Med 18 6, 608–617. Methods: Using simulated exomes and the National Institutes of Health Undiagnosed Diseases Program (UDP) patient cohort and associated exome sequence, we tested our hypothesis using Exomiser. Exomiser ranks candidate variants based on patient phenotype similarity to (i) known disease–gene phenotypes, (ii) model organism phenotypes of candidate orthologs, and (iii) phenotypes of protein–protein association neighbors. Genet Med 18 6, 608–617. Results: Benchmarking showed Exomiser ranked the causal variant as the top hit in 97% of known disease–gene associations and ranked the correct seeded variant in up to 87% when detectable disease–gene associations were unavailable. Using UDP data, Exomiser ranked the causative variant(s) within the top 10 variants for 11 previously diagnosed variants and achieved a diagnosis for 4 of 23 cases undiagnosed by clinical evaluation. Genet Med 18 6, 608–617. Conclusion: Structured phenotyping of patients and computational analysis are effective adjuncts for diagnosing patients with genetic disorders. Genet Med 18 6, 608–617. PMID:26562225

  3. Adrenal Insufficiency, Sex Reversal, and Angelman Syndrome due to Uniparental Disomy Unmasking a Mutation in CYP11A1.

    PubMed

    Kim, Ahlee; Fujimoto, Masanobu; Hwa, Vivian; Backeljauw, Philippe; Dauber, Andrew

    2018-01-01

    Cholesterol side-chain cleavage enzyme (P450scc) deficiency is a rare genetic disorder causing primary adrenal insufficiency with or without a 46,XY disorder of sexual development (DSD). Herein, we report a case of the combination of primary adrenal insufficiency, a DSD (testes with female external genitalia in a setting of a 47,XXY karyotype), and Angelman syndrome. Comprehensive genetic analyses were performed, including a single nucleotide polymorphism microarray and whole-exome sequencing. In vitro studies were performed to evaluate the pathogenicity of the novel mutation that was identified by whole-exome sequencing. The patient was found to have segmental uniparental disomy (UPD) of chromosome 15 explaining her diagnosis of Angelman syndrome. Whole-exome sequencing further revealed a novel homozygous intronic variant in CYP11A1, the gene encoding P450scc, found within the region of UPD. In vitro studies confirmed that this variant led to decreased efficiency of CYP11A1 splicing. We report the first case of the combination of 2 rare genetic disorders, Angelman syndrome, and P450scc deficiency. After 20 years of diagnostic efforts, significant advances in genetic diagnostic technology allowed us to determine that these 2 disorders originate from a unified genetic etiology, segmental UPD unmasking a novel recessive mutation in CYP11A1. © 2018 S. Karger AG, Basel.

  4. Mapping autosomal recessive intellectual disability: combined microarray and exome sequencing identifies 26 novel candidate genes in 192 consanguineous families.

    PubMed

    Harripaul, R; Vasli, N; Mikhailov, A; Rafiq, M A; Mittal, K; Windpassinger, C; Sheikh, T I; Noor, A; Mahmood, H; Downey, S; Johnson, M; Vleuten, K; Bell, L; Ilyas, M; Khan, F S; Khan, V; Moradi, M; Ayaz, M; Naeem, F; Heidari, A; Ahmed, I; Ghadami, S; Agha, Z; Zeinali, S; Qamar, R; Mozhdehipanah, H; John, P; Mir, A; Ansar, M; French, L; Ayub, M; Vincent, J B

    2018-04-01

    Approximately 1% of the global population is affected by intellectual disability (ID), and the majority receive no molecular diagnosis. Previous studies have indicated high levels of genetic heterogeneity, with estimates of more than 2500 autosomal ID genes, the majority of which are autosomal recessive (AR). Here, we combined microarray genotyping, homozygosity-by-descent (HBD) mapping, copy number variation (CNV) analysis, and whole exome sequencing (WES) to identify disease genes/mutations in 192 multiplex Pakistani and Iranian consanguineous families with non-syndromic ID. We identified definite or candidate mutations (or CNVs) in 51% of families in 72 different genes, including 26 not previously reported for ARID. The new ARID genes include nine with loss-of-function mutations (ABI2, MAPK8, MPDZ, PIDD1, SLAIN1, TBC1D23, TRAPPC6B, UBA7 and USP44), and missense mutations include the first reports of variants in BDNF or TET1 associated with ID. The genes identified also showed overlap with de novo gene sets for other neuropsychiatric disorders. Transcriptional studies showed prominent expression in the prenatal brain. The high yield of AR mutations for ID indicated that this approach has excellent clinical potential and should inform clinical diagnostics, including clinical whole exome and genome sequencing, for populations in which consanguinity is common. As with other AR disorders, the relevance will also apply to outbred populations.

  5. Whole exome sequencing reveals recurrent mutations in BRCA2 and FAT genes in acinar cell carcinomas of the pancreas.

    PubMed

    Furukawa, Toru; Sakamoto, Hitomi; Takeuchi, Shoko; Ameri, Mitra; Kuboki, Yuko; Yamamoto, Toshiyuki; Hatori, Takashi; Yamamoto, Masakazu; Sugiyama, Masanori; Ohike, Nobuyuki; Yamaguchi, Hiroshi; Shimizu, Michio; Shibata, Noriyuki; Shimizu, Kyoko; Shiratori, Keiko

    2015-03-06

    Acinar cell carcinoma of the pancreas is a rare tumor with a poor prognosis. Compared to pancreatic ductal adenocarcinoma, its molecular features are poorly known. We studied a total of 11 acinar cell carcinomas, including 3 by exome and 4 by target sequencing. Exome sequencing revealed 65 nonsynonymous mutations and 22 indels with a mutation rate of 3.4 mutations/Mb per tumor, on average. By accounting for not only somatic but also germline mutations with loss of the wild-type allele, we identified recurrent mutations of BRCA2 and FAT genes. BRCA2 showed somatic or germline premature termination mutations, with loss of the wild-type allele in 3 of 7 tumors. FAT1, FAT3, and FAT4 showed somatic or germline missense mutations in 4 of 7 tumors. The germline FAT mutations were with loss of the wild-type allele. Loss of BRCA2 expression was observed in 5 of 11 tumors. One patient with a BRCA2-mutated tumor experienced complete remission of liver metastasis following cisplatinum chemotherapy. In conclusion, acinar cell carcinomas show a distinct mutation pattern and often harbor somatic or germline mutations of BRCA2 and FAT genes. This result may warrant assessment of BRCA2 abrogation in patients with the carcinoma to determine their sensitivity to chemotherapy.

  6. Identification of Candidate Gene Variants in Korean MODY Families by Whole-Exome Sequencing.

    PubMed

    Shim, Ye Jee; Kim, Jung Eun; Hwang, Su-Kyeong; Choi, Bong Seok; Choi, Byung Ho; Cho, Eun-Mi; Jang, Kyoung Mi; Ko, Cheol Woo

    2015-01-01

    To date, 13 genes causing maturity-onset diabetes of the young (MODY) have been identified. However, there is a big discrepancy in the genetic locus between Asian and Caucasian patients with MODY. Thus, we conducted whole-exome sequencing in Korean MODY families to identify causative gene variants. Six MODY probands and their family members were included. Variants in the dbSNP135 and TIARA databases for Koreans and the variants with minor allele frequencies >0.5% of the 1000 Genomes database were excluded. We selected only the functional variants (gain of stop codon, frameshifts and nonsynonymous single-nucleotide variants) and conducted a case-control comparison in the family members. The selected variants were scanned for the previously introduced gene set implicated in glucose metabolism. Three variants c.620C>T:p.Thr207Ile in PTPRD, c.559C>G:p.Gln187Glu in SYT9, and c.1526T>G:p.Val509Gly in WFS1 were respectively identified in 3 families. We could not find any disease-causative alleles of known MODY 1-13 genes. Based on the predictive program, Thr207Ile in PTPRD was considered pathogenic. Whole-exome sequencing is a valuable method for the genetic diagnosis of MODY. Further evaluation is necessary about the role of PTPRD, SYT9 and WFS1 in normal insulin release from pancreatic beta cells. © 2015 S. Karger AG, Basel.

  7. Exome sequencing of a colorectal cancer family reveals shared mutation pattern and predisposition circuitry along tumor pathways.

    PubMed

    Suleiman, Suleiman H; Koko, Mahmoud E; Nasir, Wafaa H; Elfateh, Ommnyiah; Elgizouli, Ubai K; Abdallah, Mohammed O E; Alfarouk, Khalid O; Hussain, Ayman; Faisal, Shima; Ibrahim, Fathelrahamn M A; Romano, Maurizio; Sultan, Ali; Banks, Lawrence; Newport, Melanie; Baralle, Francesco; Elhassan, Ahmed M; Mohamed, Hiba S; Ibrahim, Muntaser E

    2015-01-01

    The molecular basis of cancer and cancer multiple phenotypes are not yet fully understood. Next Generation Sequencing promises new insight into the role of genetic interactions in shaping the complexity of cancer. Aiming to outline the differences in mutation patterns between familial colorectal cancer cases and controls we analyzed whole exomes of cancer tissues and control samples from an extended colorectal cancer pedigree, providing one of the first data sets of exome sequencing of cancer in an African population against a background of large effective size typically with excess of variants. Tumors showed hMSH2 loss of function SNV consistent with Lynch syndrome. Sets of genes harboring insertions-deletions in tumor tissues revealed, however, significant GO enrichment, a feature that was not seen in control samples, suggesting that ordered insertions-deletions are central to tumorigenesis in this type of cancer. Network analysis identified multiple hub genes of centrality. ELAVL1/HuR showed remarkable centrality, interacting specially with genes harboring non-synonymous SNVs thus reinforcing the proposition of targeted mutagenesis in cancer pathways. A likely explanation to such mutation pattern is DNA/RNA editing, suggested here by nucleotide transition-to-transversion ratio that significantly departed from expected values (p-value 5e-6). NFKB1 also showed significant centrality along with ELAVL1, raising the suspicion of viral etiology given the known interaction between oncogenic viruses and these proteins.

  8. Novel inborn error of folate metabolism: identification by exome capture and sequencing of mutations in the MTHFD1 gene in a single proband.

    PubMed

    Watkins, David; Schwartzentruber, Jeremy A; Ganesh, Jaya; Orange, Jordan S; Kaplan, Bernard S; Nunez, Laura Dempsey; Majewski, Jacek; Rosenblatt, David S

    2011-09-01

    An infant was investigated because of megaloblastic anaemia, atypical hemolytic uraemic syndrome, severe combined immune deficiency, elevated blood levels of homocysteine and methylmalonic acid, and a selective decreased synthesis of methylcobalamin in cultured fibroblasts. Exome sequencing was performed on patient genomic DNA. Two mutations were identified in the MTHFD1 gene, which encodes a protein that catalyses three reactions involved in cellular folate metabolism. This protein is essential for the generation of formyltetrahydrofolate and methylenetetrahydrofolate and important for nucleotide and homocysteine metabolism. One mutation (c.727+1G>A) affects the splice acceptor site of intron 8. The second mutation, c.517C>T (p.R173C), changes a critical arginine residue in the NADP-binding site of the protein. Mutations affecting this arginine have previously been shown to affect enzyme activity. Both parents carry a single mutation and an unaffected sibling carries neither mutation. The combination of two mutations in the MTHFRD1 gene, predicted to have severe consequences, in the patient and their absence in the unaffected sibling, supports causality. This patient represents the first case of an inborn error of folate metabolism affecting the trifunctional MTHFD1 protein. This report reinforces the power of exome capture and sequencing for the discovery of novel genes, even when only a single proband is available for study.

  9. Loss-of-function mutations in APOC3, triglycerides, and coronary disease.

    PubMed

    Crosby, Jacy; Peloso, Gina M; Auer, Paul L; Crosslin, David R; Stitziel, Nathan O; Lange, Leslie A; Lu, Yingchang; Tang, Zheng-zheng; Zhang, He; Hindy, George; Masca, Nicholas; Stirrups, Kathleen; Kanoni, Stavroula; Do, Ron; Jun, Goo; Hu, Youna; Kang, Hyun Min; Xue, Chenyi; Goel, Anuj; Farrall, Martin; Duga, Stefano; Merlini, Pier Angelica; Asselta, Rosanna; Girelli, Domenico; Olivieri, Oliviero; Martinelli, Nicola; Yin, Wu; Reilly, Dermot; Speliotes, Elizabeth; Fox, Caroline S; Hveem, Kristian; Holmen, Oddgeir L; Nikpay, Majid; Farlow, Deborah N; Assimes, Themistocles L; Franceschini, Nora; Robinson, Jennifer; North, Kari E; Martin, Lisa W; DePristo, Mark; Gupta, Namrata; Escher, Stefan A; Jansson, Jan-Håkan; Van Zuydam, Natalie; Palmer, Colin N A; Wareham, Nicholas; Koch, Werner; Meitinger, Thomas; Peters, Annette; Lieb, Wolfgang; Erbel, Raimund; Konig, Inke R; Kruppa, Jochen; Degenhardt, Franziska; Gottesman, Omri; Bottinger, Erwin P; O'Donnell, Christopher J; Psaty, Bruce M; Ballantyne, Christie M; Abecasis, Goncalo; Ordovas, Jose M; Melander, Olle; Watkins, Hugh; Orho-Melander, Marju; Ardissino, Diego; Loos, Ruth J F; McPherson, Ruth; Willer, Cristen J; Erdmann, Jeanette; Hall, Alistair S; Samani, Nilesh J; Deloukas, Panos; Schunkert, Heribert; Wilson, James G; Kooperberg, Charles; Rich, Stephen S; Tracy, Russell P; Lin, Dan-Yu; Altshuler, David; Gabriel, Stacey; Nickerson, Deborah A; Jarvik, Gail P; Cupples, L Adrienne; Reiner, Alex P; Boerwinkle, Eric; Kathiresan, Sekar

    2014-07-03

    Plasma triglyceride levels are heritable and are correlated with the risk of coronary heart disease. Sequencing of the protein-coding regions of the human genome (the exome) has the potential to identify rare mutations that have a large effect on phenotype. We sequenced the protein-coding regions of 18,666 genes in each of 3734 participants of European or African ancestry in the Exome Sequencing Project. We conducted tests to determine whether rare mutations in coding sequence, individually or in aggregate within a gene, were associated with plasma triglyceride levels. For mutations associated with triglyceride levels, we subsequently evaluated their association with the risk of coronary heart disease in 110,970 persons. An aggregate of rare mutations in the gene encoding apolipoprotein C3 (APOC3) was associated with lower plasma triglyceride levels. Among the four mutations that drove this result, three were loss-of-function mutations: a nonsense mutation (R19X) and two splice-site mutations (IVS2+1G→A and IVS3+1G→T). The fourth was a missense mutation (A43T). Approximately 1 in 150 persons in the study was a heterozygous carrier of at least one of these four mutations. Triglyceride levels in the carriers were 39% lower than levels in noncarriers (P<1×10(-20)), and circulating levels of APOC3 in carriers were 46% lower than levels in noncarriers (P=8×10(-10)). The risk of coronary heart disease among 498 carriers of any rare APOC3 mutation was 40% lower than the risk among 110,472 noncarriers (odds ratio, 0.60; 95% confidence interval, 0.47 to 0.75; P=4×10(-6)). Rare mutations that disrupt APOC3 function were associated with lower levels of plasma triglycerides and APOC3. Carriers of these mutations were found to have a reduced risk of coronary heart disease. (Funded by the National Heart, Lung, and Blood Institute and others.).

  10. Loss-of-Function Mutations in APOC3, Triglycerides, and Coronary Disease

    PubMed Central

    2014-01-01

    Background Plasma triglyceride levels are heritable and are correlated with the risk of coronary heart disease. Sequencing of the protein-coding regions of the human genome (the exome) has the potential to identify rare mutations that have a large effect on phenotype. Methods We sequenced the protein-coding regions of 18,666 genes in each of 3734 participants of European or African ancestry in the Exome Sequencing Project. We conducted tests to determine whether rare mutations in coding sequence, individually or in aggregate within a gene, were associated with plasma triglyceride levels. For mutations associated with triglyceride levels, we subsequently evaluated their association with the risk of coronary heart disease in 110,970 persons. Results An aggregate of rare mutations in the gene encoding apolipoprotein C3 (APOC3) was associated with lower plasma triglyceride levels. Among the four mutations that drove this result, three were loss-of-function mutations: a nonsense mutation (R19X) and two splice-site mutations (IVS2+1G→A and IVS3+1G→T). The fourth was a missense mutation (A43T). Approximately 1 in 150 persons in the study was a heterozygous carrier of at least one of these four mutations. Triglyceride levels in the carriers were 39% lower than levels in noncarriers (P<1×10−20), and circulating levels of APOC3 in carriers were 46% lower than levels in noncarriers (P = 8×10−10). The risk of coronary heart disease among 498 carriers of any rare APOC3 mutation was 40% lower than the risk among 110,472 noncarriers (odds ratio, 0.60; 95% confidence interval, 0.47 to 0.75; P = 4×10−6). Conclusions Rare mutations that disrupt APOC3 function were associated with lower levels of plasma triglycerides and APOC3. Carriers of these mutations were found to have a reduced risk of coronary heart disease. (Funded by the National Heart, Lung, and Blood Institute and others.) PMID:24941081

  11. Whole-exome sequencing identified a homozygous FNBP4 mutation in a family with a condition similar to microphthalmia with limb anomalies.

    PubMed

    Kondo, Yukiko; Koshimizu, Eriko; Megarbane, Andre; Hamanoue, Haruka; Okada, Ippei; Nishiyama, Kiyomi; Kodera, Hirofumi; Miyatake, Satoko; Tsurusaki, Yoshinori; Nakashima, Mitsuko; Doi, Hiroshi; Miyake, Noriko; Saitsu, Hirotomo; Matsumoto, Naomichi

    2013-07-01

    Microphthalmia with limb anomalies (MLA), also known as Waardenburg anophthalmia syndrome or ophthalmoacromelic syndrome, is a rare autosomal recessive disorder. Recently, we and others successfully identified SMOC1 as the causative gene for MLA. However, there are several MLA families without SMOC1 abnormality, suggesting locus heterogeneity in MLA. We aimed to identify a pathogenic mutation in one Lebanese family having an MLA-like condition without SMOC1 mutation by whole-exome sequencing (WES) combined with homozygosity mapping. A c.683C>T (p.Thr228Met) in FNBP4 was found as a primary candidate, drawing the attention that FNBP4 and SMOC1 may potentially modulate BMP signaling. Copyright © 2013 Wiley Periodicals, Inc.

  12. Exome sequencing links corticospinal motor neuron disease to common neurodegenerative disorders.

    PubMed

    Novarino, Gaia; Fenstermaker, Ali G; Zaki, Maha S; Hofree, Matan; Silhavy, Jennifer L; Heiberg, Andrew D; Abdellateef, Mostafa; Rosti, Basak; Scott, Eric; Mansour, Lobna; Masri, Amira; Kayserili, Hulya; Al-Aama, Jumana Y; Abdel-Salam, Ghada M H; Karminejad, Ariana; Kara, Majdi; Kara, Bulent; Bozorgmehri, Bita; Ben-Omran, Tawfeg; Mojahedi, Faezeh; El Din Mahmoud, Iman Gamal; Bouslam, Naima; Bouhouche, Ahmed; Benomar, Ali; Hanein, Sylvain; Raymond, Laure; Forlani, Sylvie; Mascaro, Massimo; Selim, Laila; Shehata, Nabil; Al-Allawi, Nasir; Bindu, P S; Azam, Matloob; Gunel, Murat; Caglayan, Ahmet; Bilguvar, Kaya; Tolun, Aslihan; Issa, Mahmoud Y; Schroth, Jana; Spencer, Emily G; Rosti, Rasim O; Akizu, Naiara; Vaux, Keith K; Johansen, Anide; Koh, Alice A; Megahed, Hisham; Durr, Alexandra; Brice, Alexis; Stevanin, Giovanni; Gabriel, Stacy B; Ideker, Trey; Gleeson, Joseph G

    2014-01-31

    Hereditary spastic paraplegias (HSPs) are neurodegenerative motor neuron diseases characterized by progressive age-dependent loss of corticospinal motor tract function. Although the genetic basis is partly understood, only a fraction of cases can receive a genetic diagnosis, and a global view of HSP is lacking. By using whole-exome sequencing in combination with network analysis, we identified 18 previously unknown putative HSP genes and validated nearly all of these genes functionally or genetically. The pathways highlighted by these mutations link HSP to cellular transport, nucleotide metabolism, and synapse and axon development. Network analysis revealed a host of further candidate genes, of which three were mutated in our cohort. Our analysis links HSP to other neurodegenerative disorders and can facilitate gene discovery and mechanistic understanding of disease.

  13. Whole Exome Sequencing identifies a splicing mutation in NSUN2 as a cause of a Dubowitz-like syndrome

    PubMed Central

    Martinez, Fernando; Lee, Jeong Ho; Lee, Ji Eun; Blanco, Sandra; Nickerson, Elizabeth; Gabriel, Stacey; Frye, Michaela; Al-Gazali, Lihadh; Gleeson, Joseph G.

    2016-01-01

    Dubowitz Syndrome is an autosomal recessive disorder characterized by the constellation of mild microcephaly, growth and mental retardation, eczema and peculiar facies, but causes are still unknown. We studied a multiplex consanguineous family with many features of Dubowitz syndrome using whole exome sequencing and identified a splice mutation in NSUN2, encoding a conserved RNA methyltransferase. NSUN2 has been implicated in Myc-induced cell proliferation and mitotic spindle stability, which might help explain the varied clinical presentations that can include chromosomal instability and immunological defects. Patient cells displayed loss of NSUN2-specific methylation at two residues of the aspartate tRNA. Our findings establish NSUN2 as the first causal gene with relationship to the Dubowitz syndrome spectrum phenotype. PMID:22577224

  14. Clinical sequencing in leukemia with the assistance of artificial intelligence.

    PubMed

    Tojo, Arinobu

    2017-01-01

    Next generation sequencing (NGS) of cancer genomes is now becoming a prerequisite for accurate diagnosis and proper treatment in clinical oncology. Because the genomic regions for NGS expand from a certain set of genes to the whole exome or whole genome, the resulting sequence data becomes incredibly enormous and makes it quite laborious to translate the genomic data into medicine, so-called annotation and curation. We organized a clinical sequencing team and established a bidirectional (bed-to-bench and bench-to-bed) system to integrate clinical and genomic data for hematological malignancies. We also started a collaborative research project with IBM Japan to adopt the artificial intelligence Watson for Genomics (WfG) to the pipeline of medical informatics. Genomic DNA was prepared from malignant as well as normal tissues in each patient and subjected to NGS. Sequence data was analyzed using an in-house semi-automated pipeline in combination with WfG, which was used to identify candidate driver mutations and relevant pathways from which applicable drug information was deduced. Currently, we have analyzed more than 150 patients with hematological disorders, including AML and ALL, and obtained many informative findings. In this presentation, I will introduce some of the achievements we have made so far.

  15. Whole-Exome Sequencing Study of Thyrotropin-Secreting Pituitary Adenomas.

    PubMed

    Sapkota, Santosh; Horiguchi, Kazuhiko; Tosaka, Masahiko; Yamada, Syozo; Yamada, Masanobu

    2017-02-01

    Thyrotropin (TSH)-secreting pituitary adenomas (TSHomas) are a rare cause of hyperthyroidism, and the genetic aberrations responsible remain unknown. To identify somatic genetic abnormalities in TSHomas. A single-nucleotide polymorphism (SNP) array analysis was performed on 8 TSHomas. Four tumors with no allelic losses or limited loss of heterozygosity were selected, and whole-exome sequencing was performed, including their corresponding blood samples. Somatic variants were confirmed by Sanger sequencing. A set of 8 tumors was also assessed to validate candidate genes. Twelve patients with sporadic TSHomas were examined. The overall performance of whole-exome sequencing was good, with an average coverage of each base in the targeted region of 97.6%. Six DNA variants were confirmed as candidate driver mutations, with an average of 1.5 somatic mutations per tumor. No mutations were recurrent. Two of these mutations were found in genes with an established role in malignant tumorigenesis (SMOX and SYTL3), and 4 had unknown roles (ZSCAN23, ASTN2, R3HDM2, and CWH43). Similarly, an SNP array analysis revealed frequent chromosomal regions of copy number gains, including recurrent gains at loci harboring 4 of these 6 genes. Several candidate somatic mutations and changes in copy numbers for TSHomas were identified. The results showed no recurrence of mutations in the tumors studied but a low number of mutations, thereby highlighting their benign nature. Further studies on a larger cohort of TSHomas, along with the use of epigenetic and transcriptomic approaches, may reveal the underlying genetic lesions. Copyright © 2017 by the Endocrine Society

  16. Whole-exome sequencing of primary plasma cell leukemia discloses heterogeneous mutational patterns.

    PubMed

    Cifola, Ingrid; Lionetti, Marta; Pinatel, Eva; Todoerti, Katia; Mangano, Eleonora; Pietrelli, Alessandro; Fabris, Sonia; Mosca, Laura; Simeon, Vittorio; Petrucci, Maria Teresa; Morabito, Fortunato; Offidani, Massimo; Di Raimondo, Francesco; Falcone, Antonietta; Caravita, Tommaso; Battaglia, Cristina; De Bellis, Gianluca; Palumbo, Antonio; Musto, Pellegrino; Neri, Antonino

    2015-07-10

    Primary plasma cell leukemia (pPCL) is a rare and aggressive form of plasma cell dyscrasia and may represent a valid model for high-risk multiple myeloma (MM). To provide novel information concerning the mutational profile of this disease, we performed the whole-exome sequencing of a prospective series of 12 pPCL cases included in a Phase II multicenter clinical trial and previously characterized at clinical and molecular levels. We identified 1, 928 coding somatic non-silent variants on 1, 643 genes, with a mean of 166 variants per sample, and only few variants and genes recurrent in two or more samples. An excess of C > T transitions and the presence of two main mutational signatures (related to APOBEC over-activity and aging) occurring in different translocation groups were observed. We identified 14 candidate cancer driver genes, mainly involved in cell-matrix adhesion, cell cycle, genome stability, RNA metabolism and protein folding. Furthermore, integration of mutation data with copy number alteration profiles evidenced biallelically disrupted genes with potential tumor suppressor functions. Globally, cadherin/Wnt signaling, extracellular matrix and cell cycle checkpoint resulted the most affected functional pathways. Sequencing results were finally combined with gene expression data to better elucidate the biological relevance of mutated genes. This study represents the first whole-exome sequencing screen of pPCL and evidenced a remarkable genetic heterogeneity of mutational patterns. This may provide a contribution to the comprehension of the pathogenetic mechanisms associated with this aggressive form of PC dyscrasia and potentially with high-risk MM.

  17. Gratitude, protective buffering, and cognitive dissonance: How families respond to pediatric whole exome sequencing in the absence of actionable results.

    PubMed

    Werner-Lin, Allison; Zaspel, Lori; Carlson, Mae; Mueller, Rebecca; Walser, Sarah A; Desai, Ria; Bernhardt, Barbara A

    2018-03-01

    Clinical genome and exome sequencing (CGES) may identify variants leading to targeted management of existing conditions. Yet, CGES often fails to identify pathogenic diagnostic variants and introduces uncertainties by detecting variants of uncertain significance (VUS) and secondary findings. This study investigated how families understand findings and adjust their perspectives on CGES. As part of NIH's Clinical Sequencing Exploratory Research Consortium, children were recruited from clinics at the Children's Hospital of Pennsylvania (CHOP) and offered exome sequencing. Primary pathogenic and possibly pathogenic, and some secondary findings were returned. Investigators digitally recorded results disclosure sessions and conducted 3-month follow up interviews with 10 adolescents and a parent. An interdisciplinary team coded all transcripts. Participants were initially disappointed with findings, yet reactions evolved within disclosure sessions and at 3-month interviews toward acceptance and satisfaction. Families erroneously expected, and prepared extensively, to learn about risk for common conditions. During disclosure sessions, parents and adolescents varied in how they monitored and responded to each others reactions. Several misinterpreted, or overestimated, the utility of findings to attribute meaning and achieve closure for the CGES experience. Participants perceived testing as an opportunity to improve disease management despite results that did not introduce new treatments or diagnoses. Future research may examine whether families experience cognitive dissonance regarding discrepancies between expectations and findings, and how protective buffering minimizes the burden of disappointment on loved ones. As CGES is increasingly integrated into clinical care providers must contend with tempering family expectations and interpretations of findings while managing complex medical care. © 2018 Wiley Periodicals, Inc.

  18. Exome and deep sequencing of clinically aggressive neuroblastoma reveal somatic mutations that affect key pathways involved in cancer progression

    PubMed Central

    Lasorsa, Vito Alessandro; Formicola, Daniela; Pignataro, Piero; Cimmino, Flora; Calabrese, Francesco Maria; Mora, Jaume; Esposito, Maria Rosaria; Pantile, Marcella; Zanon, Carlo; De Mariano, Marilena; Longo, Luca; Hogarty, Michael D.; de Torres, Carmen; Tonini, Gian Paolo; Iolascon, Achille; Capasso, Mario

    2016-01-01

    The spectrum of somatic mutation of the most aggressive forms of neuroblastoma is not completely determined. We sought to identify potential cancer drivers in clinically aggressive neuroblastoma. Whole exome sequencing was conducted on 17 germline and tumor DNA samples from high-risk patients with adverse events within 36 months from diagnosis (HR-Event3) to identify somatic mutations and deep targeted sequencing of 134 genes selected from the initial screening in additional 48 germline and tumor pairs (62.5% HR-Event3 and high-risk patients), 17 HR-Event3 tumors and 17 human-derived neuroblastoma cell lines. We revealed 22 significantly mutated genes, many of which implicated in cancer progression. Fifteen genes (68.2%) were highly expressed in neuroblastoma supporting their involvement in the disease. CHD9, a cancer driver gene, was the most significantly altered (4.0% of cases) after ALK. Other genes (PTK2, NAV3, NAV1, FZD1 and ATRX), expressed in neuroblastoma and involved in cell invasion and migration were mutated at frequency ranged from 4% to 2%. Focal adhesion and regulation of actin cytoskeleton pathways, were frequently disrupted (14.1% of cases) thus suggesting potential novel therapeutic strategies to prevent disease progression. Notably BARD1, CHEK2 and AXIN2 were enriched in rare, potentially pathogenic, germline variants. In summary, whole exome and deep targeted sequencing identified novel cancer genes of clinically aggressive neuroblastoma. Our analyses show pathway-level implications of infrequently mutated genes in leading neuroblastoma progression. PMID:27009842

  19. Exome and deep sequencing of clinically aggressive neuroblastoma reveal somatic mutations that affect key pathways involved in cancer progression.

    PubMed

    Lasorsa, Vito Alessandro; Formicola, Daniela; Pignataro, Piero; Cimmino, Flora; Calabrese, Francesco Maria; Mora, Jaume; Esposito, Maria Rosaria; Pantile, Marcella; Zanon, Carlo; De Mariano, Marilena; Longo, Luca; Hogarty, Michael D; de Torres, Carmen; Tonini, Gian Paolo; Iolascon, Achille; Capasso, Mario

    2016-04-19

    The spectrum of somatic mutation of the most aggressive forms of neuroblastoma is not completely determined. We sought to identify potential cancer drivers in clinically aggressive neuroblastoma.Whole exome sequencing was conducted on 17 germline and tumor DNA samples from high-risk patients with adverse events within 36 months from diagnosis (HR-Event3) to identify somatic mutations and deep targeted sequencing of 134 genes selected from the initial screening in additional 48 germline and tumor pairs (62.5% HR-Event3 and high-risk patients), 17 HR-Event3 tumors and 17 human-derived neuroblastoma cell lines.We revealed 22 significantly mutated genes, many of which implicated in cancer progression. Fifteen genes (68.2%) were highly expressed in neuroblastoma supporting their involvement in the disease. CHD9, a cancer driver gene, was the most significantly altered (4.0% of cases) after ALK.Other genes (PTK2, NAV3, NAV1, FZD1 and ATRX), expressed in neuroblastoma and involved in cell invasion and migration were mutated at frequency ranged from 4% to 2%.Focal adhesion and regulation of actin cytoskeleton pathways, were frequently disrupted (14.1% of cases) thus suggesting potential novel therapeutic strategies to prevent disease progression.Notably BARD1, CHEK2 and AXIN2 were enriched in rare, potentially pathogenic, germline variants.In summary, whole exome and deep targeted sequencing identified novel cancer genes of clinically aggressive neuroblastoma. Our analyses show pathway-level implications of infrequently mutated genes in leading neuroblastoma progression.

  20. Muscle RAS oncogene homolog (MRAS) recurrent mutation in Borrmann type IV gastric cancer.

    PubMed

    Yasumoto, Makiko; Sakamoto, Etsuko; Ogasawara, Sachiko; Isobe, Taro; Kizaki, Junya; Sumi, Akiko; Kusano, Hironori; Akiba, Jun; Torimura, Takuji; Akagi, Yoshito; Itadani, Hiraku; Kobayashi, Tsutomu; Hasako, Shinichi; Kumazaki, Masafumi; Mizuarai, Shinji; Oie, Shinji; Yano, Hirohisa

    2017-01-01

    The prognosis of patients with Borrmann type IV gastric cancer (Type IV) is extremely poor. Thus, there is an urgent need to elucidate the molecular mechanisms underlying the oncogenesis of Type IV and to identify new therapeutic targets. Although previous studies using whole-exome and whole-genome sequencing have elucidated genomic alterations in gastric cancer, none has focused on comprehensive genetic analysis of Type IV. To discover cancer-relevant genes in Type IV, we performed whole-exome sequencing and genome-wide copy number analysis on 13 patients with Type IV. Exome sequencing identified 178 somatic mutations in protein-coding sequences or at splice sites. Among the mutations, we found a mutation in muscle RAS oncogene homolog (MRAS), which is predicted to cause molecular dysfunction. MRAS belongs to the Ras subgroup of small G proteins, which includes the prototypic RAS oncogenes. We analyzed an additional 46 Type IV samples to investigate the frequency of MRAS mutation. There were eight nonsynonymous mutations (mutation frequency, 17%), showing that MRAS is recurrently mutated in Type IV. Copy number analysis identified six focal amplifications and one homozygous deletion, including insulin-like growth factor 1 receptor (IGF1R) amplification. The samples with IGF1R amplification had remarkably higher IGF1R mRNA and protein expression levels compared with the other samples. This is the first report of MRAS recurrent mutation in human tumor samples. Our results suggest that MRAS mutation and IGF1R amplification could drive tumorigenesis of Type IV and could be new therapeutic targets. © 2016 The Authors. Cancer Medicine published by John Wiley & Sons Ltd.

  1. Whole-exome sequencing identifies novel candidate predisposition genes for familial polycythemia vera.

    PubMed

    Hirvonen, Elina A M; Pitkänen, Esa; Hemminki, Kari; Aaltonen, Lauri A; Kilpivaara, Outi

    2017-04-20

    Polycythemia vera (PV), characterized by massive production of erythrocytes, is one of the myeloproliferative neoplasms. Most patients carry a somatic gain-of-function mutation in JAK2, c.1849G > T (p.Val617Phe), leading to constitutive activation of JAK-STAT signaling pathway. Familial clustering is also observed occasionally, but high-penetrance predisposition genes to PV have remained unidentified. We studied the predisposition to PV by exome sequencing (three cases) in a Finnish PV family with four patients. The 12 shared variants (maximum allowed minor allele frequency <0.001 in Finnish population in ExAC database) predicted damaging in silico and absent in an additional control set of over 500 Finns were further validated by Sanger sequencing in a fourth affected family member. Three novel predisposition candidate variants were identified: c.1254C > G (p.Phe418Leu) in ZXDC, c.1931C > G (p.Pro644Arg) in ATN1, and c.701G > A (p.Arg234Gln) in LRRC3. We also observed a rare, predicted benign germline variant c.2912C > G (p.Ala971Gly) in BCORL1 in all four patients. Somatic mutations in BCORL1 have been reported in myeloid malignancies. We further screened the variants in eight PV patients in six other Finnish families, but no other carriers were found. Exome sequencing provides a powerful tool for the identification of novel variants, and understanding the familial predisposition of diseases. This is the first report on Finnish familial PV cases, and we identified three novel candidate variants that may predispose to the disease.

  2. Construction of an Exome-Wide Risk Score for Schizophrenia Based on a Weighted Burden Test.

    PubMed

    Curtis, David

    2018-01-01

    Polygenic risk scores obtained as a weighted sum of associated variants can be used to explore association in additional data sets and to assign risk scores to individuals. The methods used to derive polygenic risk scores from common SNPs are not suitable for variants detected in whole exome sequencing studies. Rare variants, which may have major effects, are seen too infrequently to judge whether they are associated and may not be shared between training and test subjects. A method is proposed whereby variants are weighted according to their frequency, their annotations and the genes they affect. A weighted sum across all variants provides an individual risk score. Scores constructed in this way are used in a weighted burden test and are shown to be significantly different between schizophrenia cases and controls using a five-way cross-validation procedure. This approach represents a first attempt to summarise exome sequence variation into a summary risk score, which could be combined with risk scores from common variants and from environmental factors. It is hoped that the method could be developed further. © 2017 John Wiley & Sons Ltd/University College London.

  3. Missense-depleted regions in population exomes implicate ras superfamily nucleotide-binding protein alteration in patients with brain malformation

    PubMed Central

    Ge, Xiaoyan; Gong, Henry; Dumas, Kevin; Litwin, Jessica; Phillips, Joanna J; Waisfisz, Quinten; Weiss, Marjan M; Hendriks, Yvonne; Stuurman, Kyra E; Nelson, Stanley F; Grody, Wayne W; Lee, Hane; Kwok, Pui-Yan; Shieh, Joseph T C

    2016-01-01

    Genomic sequence interpretation can miss clinically relevant missense variants for several reasons. Rare missense variants are numerous in the exome and difficult to prioritise. Affected genes may also not have existing disease association. To improve variant prioritisation, we leverage population exome data to identify intragenic missense-depleted regions (MDRs) genome-wide that may be important in disease. We then use missense depletion analyses to help prioritise undiagnosed disease exome variants. We demonstrate application of this strategy to identify a novel gene association for human brain malformation. We identified de novo missense variants that affect the GDP/GTP-binding site of ARF1 in three unrelated patients. Corresponding functional analysis suggests ARF1 GDP/GTP-activation is affected by the specific missense mutations associated with heterotopia. These findings expand the genetic pathway underpinning neurologic disease that classically includes FLNA. ARF1 along with ARFGEF2 add further evidence implicating ARF/GEFs in the brain. Using functional ontology, top MDR-containing genes were highly enriched for nucleotide-binding function, suggesting these may be candidates for human disease. Routine consideration of MDR in the interpretation of exome data for rare diseases may help identify strong genetic factors for many severe conditions, infertility/reduction in reproductive capability, and embryonic conditions contributing to preterm loss. PMID:28868155

  4. Genetic Mapping and Exome Sequencing Identify Variants Associated with Five Novel Diseases

    PubMed Central

    Puffenberger, Erik G.; Jinks, Robert N.; Sougnez, Carrie; Cibulskis, Kristian; Willert, Rebecca A.; Achilly, Nathan P.; Cassidy, Ryan P.; Fiorentini, Christopher J.; Heiken, Kory F.; Lawrence, Johnny J.; Mahoney, Molly H.; Miller, Christopher J.; Nair, Devika T.; Politi, Kristin A.; Worcester, Kimberly N.; Setton, Roni A.; DiPiazza, Rosa; Sherman, Eric A.; Eastman, James T.; Francklyn, Christopher; Robey-Bond, Susan; Rider, Nicholas L.; Gabriel, Stacey; Morton, D. Holmes; Strauss, Kevin A.

    2012-01-01

    The Clinic for Special Children (CSC) has integrated biochemical and molecular methods into a rural pediatric practice serving Old Order Amish and Mennonite (Plain) children. Among the Plain people, we have used single nucleotide polymorphism (SNP) microarrays to genetically map recessive disorders to large autozygous haplotype blocks (mean = 4.4 Mb) that contain many genes (mean = 79). For some, uninformative mapping or large gene lists preclude disease-gene identification by Sanger sequencing. Seven such conditions were selected for exome sequencing at the Broad Institute; all had been previously mapped at the CSC using low density SNP microarrays coupled with autozygosity and linkage analyses. Using between 1 and 5 patient samples per disorder, we identified sequence variants in the known disease-causing genes SLC6A3 and FLVCR1, and present evidence to strongly support the pathogenicity of variants identified in TUBGCP6, BRAT1, SNIP1, CRADD, and HARS. Our results reveal the power of coupling new genotyping technologies to population-specific genetic knowledge and robust clinical data. PMID:22279524

  5. De novo germline and postzygotic mutations in AKT3, PIK3R2 and PIK3CA cause a spectrum of related megalencephaly syndromes

    PubMed Central

    Rivière, Jean-Baptiste; Mirzaa, Ghayda M.; O’Roak, Brian J.; Beddaoui, Margaret; Alcantara, Diana; Conway, Robert L.; St-Onge, Judith; Schwartzentruber, Jeremy A.; Gripp, Karen W.; Nikkel, Sarah M.; Worthylake, Thea; Sullivan, Christopher T.; Ward, Thomas R.; Butler, Hailly E.; Kramer, Nancy A.; Albrecht, Beate; Armour, Christine M.; Armstrong, Linlea; Caluseriu, Oana; Cytrynbaum, Cheryl; Drolet, Beth A.; Innes, A. Micheil; Lauzon, Julie L.; Lin, Angela E.; Mancini, Grazia M. S.; Meschino, Wendy S.; Reggin, James D.; Saggar, Anand K.; Lerman-Sagie, Tally; Uyanik, Gökhan; Weksberg, Rosanna; Zirn, Birgit; Beaulieu, Chandree L.; Majewski, Jacek; Bulman, Dennis E.; O’Driscoll, Mark; Shendure, Jay; Graham, John M.; Boycott, Kym M.; Dobyns, William B.

    2012-01-01

    Megalencephaly-capillary malformation (MCAP) and megalencephaly-polymicrogyria-polydactyly-hydrocephalus (MPPH) syndromes are sporadic overgrowth disorders associated with markedly enlarged brain size and other recognizable features1-5. We performed exome sequencing in three families with MCAP or MPPH and confirmed our initial observations in exomes from 7 MCAP and 174 control individuals, as well as in 40 additional megalencephaly subjects using a combination of Sanger sequencing, restriction-enzyme assays, and targeted deep sequencing. We identified de novo germline or postzygotic mutations in three core components of the phosphatidylinositol-3-kinase (PI3K)/AKT pathway. These include two mutations of AKT3, one recurrent mutation of PIK3R2 in 11 unrelated MPPH families, and 15 mostly postzygotic mutations of PIK3CA in 23 MCAP and one MPPH patients. Our data highlight the central role of PI3K/AKT signaling in vascular, limb and brain development, and emphasize the power of massively parallel sequencing in a challenging context of phenotypic and genetic heterogeneity combined with postzygotic mosaicism. PMID:22729224

  6. De novo germline and postzygotic mutations in AKT3, PIK3R2 and PIK3CA cause a spectrum of related megalencephaly syndromes.

    PubMed

    Rivière, Jean-Baptiste; Mirzaa, Ghayda M; O'Roak, Brian J; Beddaoui, Margaret; Alcantara, Diana; Conway, Robert L; St-Onge, Judith; Schwartzentruber, Jeremy A; Gripp, Karen W; Nikkel, Sarah M; Worthylake, Thea; Sullivan, Christopher T; Ward, Thomas R; Butler, Hailly E; Kramer, Nancy A; Albrecht, Beate; Armour, Christine M; Armstrong, Linlea; Caluseriu, Oana; Cytrynbaum, Cheryl; Drolet, Beth A; Innes, A Micheil; Lauzon, Julie L; Lin, Angela E; Mancini, Grazia M S; Meschino, Wendy S; Reggin, James D; Saggar, Anand K; Lerman-Sagie, Tally; Uyanik, Gökhan; Weksberg, Rosanna; Zirn, Birgit; Beaulieu, Chandree L; Majewski, Jacek; Bulman, Dennis E; O'Driscoll, Mark; Shendure, Jay; Graham, John M; Boycott, Kym M; Dobyns, William B

    2012-06-24

    Megalencephaly-capillary malformation (MCAP) and megalencephaly-polymicrogyria-polydactyly-hydrocephalus (MPPH) syndromes are sporadic overgrowth disorders associated with markedly enlarged brain size and other recognizable features. We performed exome sequencing in 3 families with MCAP or MPPH, and our initial observations were confirmed in exomes from 7 individuals with MCAP and 174 control individuals, as well as in 40 additional subjects with megalencephaly, using a combination of Sanger sequencing, restriction enzyme assays and targeted deep sequencing. We identified de novo germline or postzygotic mutations in three core components of the phosphatidylinositol 3-kinase (PI3K)-AKT pathway. These include 2 mutations in AKT3, 1 recurrent mutation in PIK3R2 in 11 unrelated families with MPPH and 15 mostly postzygotic mutations in PIK3CA in 23 individuals with MCAP and 1 with MPPH. Our data highlight the central role of PI3K-AKT signaling in vascular, limb and brain development and emphasize the power of massively parallel sequencing in a challenging context of phenotypic and genetic heterogeneity combined with postzygotic mosaicism.

  7. Use of whole-exome sequencing to determine the genetic basis of multiple mitochondrial respiratory chain complex deficiencies.

    PubMed

    Taylor, Robert W; Pyle, Angela; Griffin, Helen; Blakely, Emma L; Duff, Jennifer; He, Langping; Smertenko, Tania; Alston, Charlotte L; Neeve, Vivienne C; Best, Andrew; Yarham, John W; Kirschner, Janbernd; Schara, Ulrike; Talim, Beril; Topaloglu, Haluk; Baric, Ivo; Holinski-Feder, Elke; Abicht, Angela; Czermin, Birgit; Kleinle, Stephanie; Morris, Andrew A M; Vassallo, Grace; Gorman, Grainne S; Ramesh, Venkateswaran; Turnbull, Douglass M; Santibanez-Koref, Mauro; McFarland, Robert; Horvath, Rita; Chinnery, Patrick F

    2014-07-02

    Mitochondrial disorders have emerged as a common cause of inherited disease, but their diagnosis remains challenging. Multiple respiratory chain complex defects are particularly difficult to diagnose at the molecular level because of the massive number of nuclear genes potentially involved in intramitochondrial protein synthesis, with many not yet linked to human disease. To determine the molecular basis of multiple respiratory chain complex deficiencies. We studied 53 patients referred to 2 national centers in the United Kingdom and Germany between 2005 and 2012. All had biochemical evidence of multiple respiratory chain complex defects but no primary pathogenic mitochondrial DNA mutation. Whole-exome sequencing was performed using 62-Mb exome enrichment, followed by variant prioritization using bioinformatic prediction tools, variant validation by Sanger sequencing, and segregation of the variant with the disease phenotype in the family. Presumptive causal variants were identified in 28 patients (53%; 95% CI, 39%-67%) and possible causal variants were identified in 4 (8%; 95% CI, 2%-18%). Together these accounted for 32 patients (60% 95% CI, 46%-74%) and involved 18 different genes. These included recurrent mutations in RMND1, AARS2, and MTO1, each on a haplotype background consistent with a shared founder allele, and potential novel mutations in 4 possible mitochondrial disease genes (VARS2, GARS, FLAD1, and PTCD1). Distinguishing clinical features included deafness and renal involvement associated with RMND1 and cardiomyopathy with AARS2 and MTO1. However, atypical clinical features were present in some patients, including normal liver function and Leigh syndrome (subacute necrotizing encephalomyelopathy) seen in association with TRMU mutations and no cardiomyopathy with founder SCO2 mutations. It was not possible to confidently identify the underlying genetic basis in 21 patients (40%; 95% CI, 26%-54%). Exome sequencing enhances the ability to identify potential nuclear gene mutations in patients with biochemically defined defects affecting multiple mitochondrial respiratory chain complexes. Additional study is required in independent patient populations to determine the utility of this approach in comparison with traditional diagnostic methods.

  8. Motivations, concerns and preferences of personal genome sequencing research participants: Baseline findings from the HealthSeq project

    PubMed Central

    Sanderson, Saskia C; Linderman, Michael D; Suckiel, Sabrina A; Diaz, George A; Zinberg, Randi E; Ferryman, Kadija; Wasserstein, Melissa; Kasarskis, Andrew; Schadt, Eric E

    2016-01-01

    Whole exome/genome sequencing (WES/WGS) is increasingly offered to ostensibly healthy individuals. Understanding the motivations and concerns of research participants seeking out personal WGS and their preferences regarding return-of-results and data sharing will help optimize protocols for WES/WGS. Baseline interviews including both qualitative and quantitative components were conducted with research participants (n=35) in the HealthSeq project, a longitudinal cohort study of individuals receiving personal WGS results. Data sharing preferences were recorded during informed consent. In the qualitative interview component, the dominant motivations that emerged were obtaining personal disease risk information, satisfying curiosity, contributing to research, self-exploration and interest in ancestry, and the dominant concern was the potential psychological impact of the results. In the quantitative component, 57% endorsed concerns about privacy. Most wanted to receive all personal WGS results (94%) and their raw data (89%); a third (37%) consented to having their data shared to the Database of Genotypes and Phenotypes (dbGaP). Early adopters of personal WGS in the HealthSeq project express a variety of health- and non-health-related motivations. Almost all want all available findings, while also expressing concerns about the psychological impact and privacy of their results. PMID:26036856

  9. GenomeVIP: a cloud platform for genomic variant discovery and interpretation

    PubMed Central

    Mashl, R. Jay; Scott, Adam D.; Huang, Kuan-lin; Wyczalkowski, Matthew A.; Yoon, Christopher J.; Niu, Beifang; DeNardo, Erin; Yellapantula, Venkata D.; Handsaker, Robert E.; Chen, Ken; Koboldt, Daniel C.; Ye, Kai; Fenyö, David; Raphael, Benjamin J.; Wendl, Michael C.; Ding, Li

    2017-01-01

    Identifying genomic variants is a fundamental first step toward the understanding of the role of inherited and acquired variation in disease. The accelerating growth in the corpus of sequencing data that underpins such analysis is making the data-download bottleneck more evident, placing substantial burdens on the research community to keep pace. As a result, the search for alternative approaches to the traditional “download and analyze” paradigm on local computing resources has led to a rapidly growing demand for cloud-computing solutions for genomics analysis. Here, we introduce the Genome Variant Investigation Platform (GenomeVIP), an open-source framework for performing genomics variant discovery and annotation using cloud- or local high-performance computing infrastructure. GenomeVIP orchestrates the analysis of whole-genome and exome sequence data using a set of robust and popular task-specific tools, including VarScan, GATK, Pindel, BreakDancer, Strelka, and Genome STRiP, through a web interface. GenomeVIP has been used for genomic analysis in large-data projects such as the TCGA PanCanAtlas and in other projects, such as the ICGC Pilots, CPTAC, ICGC-TCGA DREAM Challenges, and the 1000 Genomes SV Project. Here, we demonstrate GenomeVIP's ability to provide high-confidence annotated somatic, germline, and de novo variants of potential biological significance using publicly available data sets. PMID:28522612

  10. Navigating highly homologous genes in a molecular diagnostic setting: a resource for clinical next-generation sequencing.

    PubMed

    Mandelker, Diana; Schmidt, Ryan J; Ankala, Arunkanth; McDonald Gibson, Kristin; Bowser, Mark; Sharma, Himanshu; Duffy, Elizabeth; Hegde, Madhuri; Santani, Avni; Lebo, Matthew; Funke, Birgit

    2016-12-01

    Next-generation sequencing (NGS) is now routinely used to interrogate large sets of genes in a diagnostic setting. Regions of high sequence homology continue to be a major challenge for short-read technologies and can lead to false-positive and false-negative diagnostic errors. At the scale of whole-exome sequencing (WES), laboratories may be limited in their knowledge of genes and regions that pose technical hurdles due to high homology. We have created an exome-wide resource that catalogs highly homologous regions that is tailored toward diagnostic applications. This resource was developed using a mappability-based approach tailored to current Sanger and NGS protocols. Gene-level and exon-level lists delineate regions that are difficult or impossible to analyze via standard NGS. These regions are ranked by degree of affectedness, annotated for medical relevance, and classified by the type of homology (within-gene, different functional gene, known pseudogene, uncharacterized noncoding region). Additionally, we provide a list of exons that cannot be analyzed by short-amplicon Sanger sequencing. This resource can help guide clinical test design, supplemental assay implementation, and results interpretation in the context of high homology.Genet Med 18 12, 1282-1289.

  11. Whole exome sequencing reveals concomitant mutations of multiple FA genes in individual Fanconi anemia patients

    PubMed Central

    2014-01-01

    Background Fanconi anemia (FA) is a rare inherited genetic syndrome with highly variable clinical manifestations. Fifteen genetic subtypes of FA have been identified. Traditional complementation tests for grouping studies have been used generally in FA patients and in stepwise methods to identify the FA type, which can result in incomplete genetic information from FA patients. Methods We diagnosed five pediatric patients with FA based on clinical manifestations, and we performed exome sequencing of peripheral blood specimens from these patients and their family members. The related sequencing data were then analyzed by bioinformatics, and the FANC gene mutations identified by exome sequencing were confirmed by PCR re-sequencing. Results Homozygous and compound heterozygous mutations of FANC genes were identified in all of the patients. The FA subtypes of the patients included FANCA, FANCM and FANCD2. Interestingly, four FA patients harbored multiple mutations in at least two FA genes, and some of these mutations have not been previously reported. These patients’ clinical manifestations were vastly different from each other, as were their treatment responses to androstanazol and prednisone. This finding suggests that heterozygous mutation(s) in FA genes could also have diverse biological and/or pathophysiological effects on FA patients or FA gene carriers. Interestingly, we were not able to identify de novo mutations in the genes implicated in DNA repair pathways when the sequencing data of patients were compared with those of their parents. Conclusions Our results indicate that Chinese FA patients and carriers might have higher and more complex mutation rates in FANC genes than have been conventionally recognized. Testing of the fifteen FANC genes in FA patients and their family members should be a regular clinical practice to determine the optimal care for the individual patient, to counsel the family and to obtain a better understanding of FA pathophysiology. PMID:24885126

  12. Whole exome sequencing reveals concomitant mutations of multiple FA genes in individual Fanconi anemia patients.

    PubMed

    Chang, Lixian; Yuan, Weiping; Zeng, Huimin; Zhou, Quanquan; Wei, Wei; Zhou, Jianfeng; Li, Miaomiao; Wang, Xiaomin; Xu, Mingjiang; Yang, Fengchun; Yang, Yungui; Cheng, Tao; Zhu, Xiaofan

    2014-05-15

    Fanconi anemia (FA) is a rare inherited genetic syndrome with highly variable clinical manifestations. Fifteen genetic subtypes of FA have been identified. Traditional complementation tests for grouping studies have been used generally in FA patients and in stepwise methods to identify the FA type, which can result in incomplete genetic information from FA patients. We diagnosed five pediatric patients with FA based on clinical manifestations, and we performed exome sequencing of peripheral blood specimens from these patients and their family members. The related sequencing data were then analyzed by bioinformatics, and the FANC gene mutations identified by exome sequencing were confirmed by PCR re-sequencing. Homozygous and compound heterozygous mutations of FANC genes were identified in all of the patients. The FA subtypes of the patients included FANCA, FANCM and FANCD2. Interestingly, four FA patients harbored multiple mutations in at least two FA genes, and some of these mutations have not been previously reported. These patients' clinical manifestations were vastly different from each other, as were their treatment responses to androstanazol and prednisone. This finding suggests that heterozygous mutation(s) in FA genes could also have diverse biological and/or pathophysiological effects on FA patients or FA gene carriers. Interestingly, we were not able to identify de novo mutations in the genes implicated in DNA repair pathways when the sequencing data of patients were compared with those of their parents. Our results indicate that Chinese FA patients and carriers might have higher and more complex mutation rates in FANC genes than have been conventionally recognized. Testing of the fifteen FANC genes in FA patients and their family members should be a regular clinical practice to determine the optimal care for the individual patient, to counsel the family and to obtain a better understanding of FA pathophysiology.

  13. Whole-exome sequencing reveals a rare interferon gamma receptor 1 mutation associated with myasthenia gravis.

    PubMed

    Qi, Guoyan; Liu, Peng; Gu, Shanshan; Yang, Hongxia; Dong, Huimin; Xue, Yinping

    2018-04-01

    Our study is aimed to explore the underlying genetic basis of myasthenia gravis. We collected a Chinese pedigree with myasthenia gravis, and whole-exome sequencing was performed on the two affected siblings and their parents. The candidate pathogenic gene was identified by bioinformatics filtering, which was further verified by Sanger sequencing. The homozygous mutation c.G40A (p.V14M) in interferon gamma receptor 1was identified. Moreover, the mutation was also detected in 3 cases of 44 sporadic myasthenia gravis patients. The p.V14M substitution in interferon gamma receptor 1 may affect the signal peptide function and the translocation on cell membrane, which could disrupt the binding of the ligand of interferon gamma and antibody production, contributing to myasthenia gravis susceptibility. We discovered that a rare variant c.G40A in interferon gamma receptor 1 potentially contributes to the myasthenia gravis pathogenesis. Further functional studies are needed to confirm the effect of the interferon gamma receptor 1 on the myasthenia gravis phenotype.

  14. Homozygous nonsense mutation in SGCA is a common cause of limb-girdle muscular dystrophy in Assiut, Egypt.

    PubMed

    Reddy, Hemakumar M; Hamed, Sherifa A; Lek, Monkol; Mitsuhashi, Satomi; Estrella, Elicia; Jones, Michael D; Mahoney, Lane J; Duncan, Anna R; Cho, Kyung-Ah; Macarthur, Daniel G; Kunkel, Louis M; Kang, Peter B

    2016-10-01

    The genetic causes of limb-girdle muscular dystrophy (LGMD) have been studied in numerous countries, but such investigations have been limited in Egypt. A cohort of 30 families with suspected LGMD from Assiut, Egypt, was studied using immunohistochemistry, homozygosity mapping, Sanger sequencing, and whole exome sequencing. Six families were confirmed to have pathogenic mutations, 4 in SGCA and 2 in DMD. Of these, 3 families harbored a single nonsense mutation in SGCA, suggesting that this may be a common mutation in Assiut, Egypt, originating from a founder effect. The Assiut region in Egypt appears to share at least several of the common LGMD genes found in other parts of the world. It is notable that 4 of the 6 mutations were ascertained by means of whole exome sequencing, even though it was the last approach adopted. This illustrates the power of this technique for identifying causative mutations for muscular dystrophies. Muscle Nerve 54: 690-695, 2016. © 2016 Wiley Periodicals, Inc.

  15. Whole exome sequencing identifies driver mutations in asymptomatic computed tomography-detected lung cancers with normal karyotype.

    PubMed

    Belloni, Elena; Veronesi, Giulia; Rotta, Luca; Volorio, Sara; Sardella, Domenico; Bernard, Loris; Pece, Salvatore; Di Fiore, Pier Paolo; Fumagalli, Caterina; Barberis, Massimo; Spaggiari, Lorenzo; Pelicci, Pier Giuseppe; Riva, Laura

    2015-04-01

    The efficacy of curative surgery for lung cancer could be largely improved by non-invasive screening programs, which can detect the disease at early stages. We previously showed that 18% of screening-identified lung cancers demonstrate a normal karyotype and, following high-density genome scanning, can be subdivided into samples with 1) numerous; 2) none; and 3) few copy number alterations. Whole exome sequencing was applied to the two normal karyotype, screening-detected lung cancers, constituting group 2, as well as normal controls. We identified mutations in both tumors, including KEAP1 (commonly mutated in lung cancers) in one, and TP53, PMS1, and MSH3 (well-characterized DNA-repair genes) in the other. The two normal karyotype screening-detected lung tumors displayed a typical lung cancer mutational profile that only next generation sequencing could reveal, which offered an additional contribution to the over-diagnosis bias concept hypothesized within lung cancer screening programs. Copyright © 2015 Elsevier Inc. All rights reserved.

  16. Distribution and clinical impact of functional variants in 50,726 whole-exome sequences from the DiscovEHR study.

    PubMed

    Dewey, Frederick E; Murray, Michael F; Overton, John D; Habegger, Lukas; Leader, Joseph B; Fetterolf, Samantha N; O'Dushlaine, Colm; Van Hout, Cristopher V; Staples, Jeffrey; Gonzaga-Jauregui, Claudia; Metpally, Raghu; Pendergrass, Sarah A; Giovanni, Monica A; Kirchner, H Lester; Balasubramanian, Suganthi; Abul-Husn, Noura S; Hartzel, Dustin N; Lavage, Daniel R; Kost, Korey A; Packer, Jonathan S; Lopez, Alexander E; Penn, John; Mukherjee, Semanti; Gosalia, Nehal; Kanagaraj, Manoj; Li, Alexander H; Mitnaul, Lyndon J; Adams, Lance J; Person, Thomas N; Praveen, Kavita; Marcketta, Anthony; Lebo, Matthew S; Austin-Tse, Christina A; Mason-Suares, Heather M; Bruse, Shannon; Mellis, Scott; Phillips, Robert; Stahl, Neil; Murphy, Andrew; Economides, Aris; Skelding, Kimberly A; Still, Christopher D; Elmore, James R; Borecki, Ingrid B; Yancopoulos, George D; Davis, F Daniel; Faucett, William A; Gottesman, Omri; Ritchie, Marylyn D; Shuldiner, Alan R; Reid, Jeffrey G; Ledbetter, David H; Baras, Aris; Carey, David J

    2016-12-23

    The DiscovEHR collaboration between the Regeneron Genetics Center and Geisinger Health System couples high-throughput sequencing to an integrated health care system using longitudinal electronic health records (EHRs). We sequenced the exomes of 50,726 adult participants in the DiscovEHR study to identify ~4.2 million rare single-nucleotide variants and insertion/deletion events, of which ~176,000 are predicted to result in a loss of gene function. Linking these data to EHR-derived clinical phenotypes, we find clinical associations supporting therapeutic targets, including genes encoding drug targets for lipid lowering, and identify previously unidentified rare alleles associated with lipid levels and other blood level traits. About 3.5% of individuals harbor deleterious variants in 76 clinically actionable genes. The DiscovEHR data set provides a blueprint for large-scale precision medicine initiatives and genomics-guided therapeutic discovery. Copyright © 2016, American Association for the Advancement of Science.

  17. Exome sequence reveals mutations in CoA synthase as a cause of neurodegeneration with brain iron accumulation.

    PubMed

    Dusi, Sabrina; Valletta, Lorella; Haack, Tobias B; Tsuchiya, Yugo; Venco, Paola; Pasqualato, Sebastiano; Goffrini, Paola; Tigano, Marco; Demchenko, Nikita; Wieland, Thomas; Schwarzmayr, Thomas; Strom, Tim M; Invernizzi, Federica; Garavaglia, Barbara; Gregory, Allison; Sanford, Lynn; Hamada, Jeffrey; Bettencourt, Conceição; Houlden, Henry; Chiapparini, Luisa; Zorzi, Giovanna; Kurian, Manju A; Nardocci, Nardo; Prokisch, Holger; Hayflick, Susan; Gout, Ivan; Tiranti, Valeria

    2014-01-02

    Neurodegeneration with brain iron accumulation (NBIA) comprises a clinically and genetically heterogeneous group of disorders with progressive extrapyramidal signs and neurological deterioration, characterized by iron accumulation in the basal ganglia. Exome sequencing revealed the presence of recessive missense mutations in COASY, encoding coenzyme A (CoA) synthase in one NBIA-affected subject. A second unrelated individual carrying mutations in COASY was identified by Sanger sequence analysis. CoA synthase is a bifunctional enzyme catalyzing the final steps of CoA biosynthesis by coupling phosphopantetheine with ATP to form dephospho-CoA and its subsequent phosphorylation to generate CoA. We demonstrate alterations in RNA and protein expression levels of CoA synthase, as well as CoA amount, in fibroblasts derived from the two clinical cases and in yeast. This is the second inborn error of coenzyme A biosynthesis to be implicated in NBIA. Copyright © 2014 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.

  18. Novel SNP array analysis and exome sequencing detect a homozygous exon 7 deletion of MEGF10 causing early onset myopathy, areflexia, respiratory distress and dysphagia (EMARDD)

    PubMed Central

    Pierson, Tyler Mark; Markello, Thomas; Accardi, John; Wolfe, Lynne; Adams, David; Sincan, Murat; Tarazi, Noor M.; Fajardo, Karin Fuentes; Cherukuri, Praveen F.; Bajraktari, Ilda; Meilleur, Katy G.; Donkervoort, Sandra; Jain, Mina; Hu, Ying; Lehky, Tanya J.; Cruz, Pedro; Mullikin, James C.; Bonnemann, Carsten; Gahl, William A.; Boerkoel, Cornelius F.; Tifft, Cynthia J.

    2013-01-01

    Early-onset myopathy, areflexia, respiratory distress and dysphagia (EMARDD) is a myopathic disorder associated with mutations in MEGF10. By novel analysis of SNP array hybridization and exome sequence coverage, we diagnosed a 10-year old girl with EMARDD following identification of a novel homozygous deletion of exon 7 in MEGF10. In contrast to previously reported EMARDD patients, her weakness was more prominent proximally than distally, and involved her legs more than her arms. MRI of her pelvis and thighs showed muscle atrophy and fatty replacement. Ultrasound of several muscle groups revealed dense homogenous increases in echogenicity. Cloning and sequencing of the deletion breakpoint identified features suggesting the mutation arose by fork stalling and template switching. These findings constitute the first genomic deletion causing EMARDD, expand the clinical phenotype, and provide new insight into the pattern and histology of its muscular pathology. PMID:23453856

  19. Clinical Sequencing Exploratory Research Consortium: Accelerating Evidence-Based Practice of Genomic Medicine.

    PubMed

    Green, Robert C; Goddard, Katrina A B; Jarvik, Gail P; Amendola, Laura M; Appelbaum, Paul S; Berg, Jonathan S; Bernhardt, Barbara A; Biesecker, Leslie G; Biswas, Sawona; Blout, Carrie L; Bowling, Kevin M; Brothers, Kyle B; Burke, Wylie; Caga-Anan, Charlisse F; Chinnaiyan, Arul M; Chung, Wendy K; Clayton, Ellen W; Cooper, Gregory M; East, Kelly; Evans, James P; Fullerton, Stephanie M; Garraway, Levi A; Garrett, Jeremy R; Gray, Stacy W; Henderson, Gail E; Hindorff, Lucia A; Holm, Ingrid A; Lewis, Michelle Huckaby; Hutter, Carolyn M; Janne, Pasi A; Joffe, Steven; Kaufman, David; Knoppers, Bartha M; Koenig, Barbara A; Krantz, Ian D; Manolio, Teri A; McCullough, Laurence; McEwen, Jean; McGuire, Amy; Muzny, Donna; Myers, Richard M; Nickerson, Deborah A; Ou, Jeffrey; Parsons, Donald W; Petersen, Gloria M; Plon, Sharon E; Rehm, Heidi L; Roberts, J Scott; Robinson, Dan; Salama, Joseph S; Scollon, Sarah; Sharp, Richard R; Shirts, Brian; Spinner, Nancy B; Tabor, Holly K; Tarczy-Hornoch, Peter; Veenstra, David L; Wagle, Nikhil; Weck, Karen; Wilfond, Benjamin S; Wilhelmsen, Kirk; Wolf, Susan M; Wynn, Julia; Yu, Joon-Ho

    2016-06-02

    Despite rapid technical progress and demonstrable effectiveness for some types of diagnosis and therapy, much remains to be learned about clinical genome and exome sequencing (CGES) and its role within the practice of medicine. The Clinical Sequencing Exploratory Research (CSER) consortium includes 18 extramural research projects, one National Human Genome Research Institute (NHGRI) intramural project, and a coordinating center funded by the NHGRI and National Cancer Institute. The consortium is exploring analytic and clinical validity and utility, as well as the ethical, legal, and social implications of sequencing via multidisciplinary approaches; it has thus far recruited 5,577 participants across a spectrum of symptomatic and healthy children and adults by utilizing both germline and cancer sequencing. The CSER consortium is analyzing data and creating publically available procedures and tools related to participant preferences and consent, variant classification, disclosure and management of primary and secondary findings, health outcomes, and integration with electronic health records. Future research directions will refine measures of clinical utility of CGES in both germline and somatic testing, evaluate the use of CGES for screening in healthy individuals, explore the penetrance of pathogenic variants through extensive phenotyping, reduce discordances in public databases of genes and variants, examine social and ethnic disparities in the provision of genomics services, explore regulatory issues, and estimate the value and downstream costs of sequencing. The CSER consortium has established a shared community of research sites by using diverse approaches to pursue the evidence-based development of best practices in genomic medicine. Copyright © 2016 American Society of Human Genetics. All rights reserved.

  20. An Exome Sequencing Study to Assess the Role of Rare Genetic Variation in Pulmonary Fibrosis.

    PubMed

    Petrovski, Slavé; Todd, Jamie L; Durheim, Michael T; Wang, Quanli; Chien, Jason W; Kelly, Fran L; Frankel, Courtney; Mebane, Caroline M; Ren, Zhong; Bridgers, Joshua; Urban, Thomas J; Malone, Colin D; Finlen Copeland, Ashley; Brinkley, Christie; Allen, Andrew S; O'Riordan, Thomas; McHutchison, John G; Palmer, Scott M; Goldstein, David B

    2017-07-01

    Idiopathic pulmonary fibrosis (IPF) is an increasingly recognized, often fatal lung disease of unknown etiology. The aim of this study was to use whole-exome sequencing to improve understanding of the genetic architecture of pulmonary fibrosis. We performed a case-control exome-wide collapsing analysis including 262 unrelated individuals with pulmonary fibrosis clinically classified as IPF according to American Thoracic Society/European Respiratory Society/Japanese Respiratory Society/Latin American Thoracic Association guidelines (81.3%), usual interstitial pneumonia secondary to autoimmune conditions (11.5%), or fibrosing nonspecific interstitial pneumonia (7.2%). The majority (87%) of case subjects reported no family history of pulmonary fibrosis. We searched 18,668 protein-coding genes for an excess of rare deleterious genetic variation using whole-exome sequence data from 262 case subjects with pulmonary fibrosis and 4,141 control subjects drawn from among a set of individuals of European ancestry. Comparing genetic variation across 18,668 protein-coding genes, we found a study-wide significant (P < 4.5 × 10 -7 ) case enrichment of qualifying variants in TERT, RTEL1, and PARN. A model qualifying ultrarare, deleterious, nonsynonymous variants implicated TERT and RTEL1, and a model specifically qualifying loss-of-function variants implicated RTEL1 and PARN. A subanalysis of 186 case subjects with sporadic IPF confirmed TERT, RTEL1, and PARN as study-wide significant contributors to sporadic IPF. Collectively, 11.3% of case subjects with sporadic IPF carried a qualifying variant in one of these three genes compared with the 0.3% carrier rate observed among control subjects (odds ratio, 47.7; 95% confidence interval, 21.5-111.6; P = 5.5 × 10 -22 ). We identified TERT, RTEL1, and PARN-three telomere-related genes previously implicated in familial pulmonary fibrosis-as significant contributors to sporadic IPF. These results support the idea that telomere dysfunction is involved in IPF pathogenesis.

  1. Estimating genotype error rates from high-coverage next-generation sequence data.

    PubMed

    Wall, Jeffrey D; Tang, Ling Fung; Zerbe, Brandon; Kvale, Mark N; Kwok, Pui-Yan; Schaefer, Catherine; Risch, Neil

    2014-11-01

    Exome and whole-genome sequencing studies are becoming increasingly common, but little is known about the accuracy of the genotype calls made by the commonly used platforms. Here we use replicate high-coverage sequencing of blood and saliva DNA samples from four European-American individuals to estimate lower bounds on the error rates of Complete Genomics and Illumina HiSeq whole-genome and whole-exome sequencing. Error rates for nonreference genotype calls range from 0.1% to 0.6%, depending on the platform and the depth of coverage. Additionally, we found (1) no difference in the error profiles or rates between blood and saliva samples; (2) Complete Genomics sequences had substantially higher error rates than Illumina sequences had; (3) error rates were higher (up to 6%) for rare or unique variants; (4) error rates generally declined with genotype quality (GQ) score, but in a nonlinear fashion for the Illumina data, likely due to loss of specificity of GQ scores greater than 60; and (5) error rates increased with increasing depth of coverage for the Illumina data. These findings, especially (3)-(5), suggest that caution should be taken in interpreting the results of next-generation sequencing-based association studies, and even more so in clinical application of this technology in the absence of validation by other more robust sequencing or genotyping methods. © 2014 Wall et al.; Published by Cold Spring Harbor Laboratory Press.

  2. Exome analysis in clinical practice: expanding the phenotype of Bartsocas-Papas syndrome.

    PubMed

    Gripp, Karen W; Ennis, Sara; Napoli, Joseph

    2013-05-01

    Exome analysis has had a dramatic impact on genetic research. We present the application of such newly generated information to patient care. The patient was a female, born with normal growth parameters to nonconsanguineous parents after an uneventful pregnancy. She had bilateral cleft lip/palate and ankyloblepharon. Sparse hair, dysplastic nails and hypohidrosis were subsequently noted. With exception of speech related issues, her development was normal. A clinical diagnosis of ankyloblepharon-ectodermal defects-cleft lip/palate or Hay-Wells syndrome resulted in TP63 sequence analysis. TP63 sequence and deletion/duplication analysis of all coding exons had a normal result, as did chromosome and SNP array analysis. Diagnostic exome analysis revealed a heterozygous nonsense mutation in KRT83 categorized as deleterious and associated with monilethrix. In addition, a homozygous missense variant of unknown clinical significance was reported in RIPK4. Using research based exome analysis, RIPK4 had just a few months prior been identified as pathogenic for Bartsocas-Papas syndrome. While the clinical diagnostic report implied the KRT83 mutation as a more likely cause for the patient's phenotype, clinical correlation, literature review and use of computerized mutation analysis programs allowed us to identify the homozygous RIPK4 (c.488G > A; p.Gly163Asp) mutation as the underlying pathogenic change. Consequently, we expand the phenotype of Bartsocas-Papas syndrome to an attenuated presentation resembling Hay-Wells syndrome, lacking lethality and pterygia. In contrast to the autosomal dominant Hay-Wells syndrome, Bartsocas-Papas syndrome is autosomal recessive, implying a 25% recurrence risk. Copyright © 2013 Wiley Periodicals, Inc.

  3. A variational Bayes discrete mixture test for rare variant association

    PubMed Central

    Logsdon, Benjamin A.; Dai, James Y.; Auer, Paul L.; Johnsen, Jill M.; Ganesh, Santhi K.; Smith, Nicholas L.; Wilson, James G.; Tracy, Russell P.; Lange, Leslie A.; Jiao, Shuo; Rich, Stephen S.; Lettre, Guillaume; Carlson, Christopher S.; Jackson, Rebecca D.; O’Donnell, Christopher J.; Wurfel, Mark M.; Nickerson, Deborah A.; Tang, Hua; Reiner, Alexander P.; Kooperberg, Charles

    2014-01-01

    Recently, many statistical methods have been proposed to test for associations between rare genetic variants and complex traits. Most of these methods test for association by aggregating genetic variations within a predefined region, such as a gene. Although there is evidence that “aggregate” tests are more powerful than the single marker test, these tests generally ignore neutral variants and therefore are unable to identify specific variants driving the association with phenotype. We propose a novel aggregate rare-variant test that explicitly models a fraction of variants as neutral, tests associations at the gene-level, and infers the rare-variants driving the association. Simulations show that in the practical scenario where there are many variants within a given region of the genome with only a fraction causal our approach has greater power compared to other popular tests such as the Sequence Kernel Association Test (SKAT), the Weighted Sum Statistic (WSS), and the collapsing method of Morris and Zeggini (MZ). Our algorithm leverages a fast variational Bayes approximate inference methodology to scale to exome-wide analyses, a significant computational advantage over exact inference model selection methodologies. To demonstrate the efficacy of our methodology we test for associations between von Willebrand Factor (VWF) levels and VWF missense rare-variants imputed from the National Heart, Lung, and Blood Institute’s Exome Sequencing project into 2,487 African Americans within the VWF gene. Our method suggests that a relatively small fraction (~10%) of the imputed rare missense variants within VWF are strongly associated with lower VWF levels in African Americans. PMID:24482836

  4. A variational Bayes discrete mixture test for rare variant association.

    PubMed

    Logsdon, Benjamin A; Dai, James Y; Auer, Paul L; Johnsen, Jill M; Ganesh, Santhi K; Smith, Nicholas L; Wilson, James G; Tracy, Russell P; Lange, Leslie A; Jiao, Shuo; Rich, Stephen S; Lettre, Guillaume; Carlson, Christopher S; Jackson, Rebecca D; O'Donnell, Christopher J; Wurfel, Mark M; Nickerson, Deborah A; Tang, Hua; Reiner, Alexander P; Kooperberg, Charles

    2014-01-01

    Recently, many statistical methods have been proposed to test for associations between rare genetic variants and complex traits. Most of these methods test for association by aggregating genetic variations within a predefined region, such as a gene. Although there is evidence that "aggregate" tests are more powerful than the single marker test, these tests generally ignore neutral variants and therefore are unable to identify specific variants driving the association with phenotype. We propose a novel aggregate rare-variant test that explicitly models a fraction of variants as neutral, tests associations at the gene-level, and infers the rare-variants driving the association. Simulations show that in the practical scenario where there are many variants within a given region of the genome with only a fraction causal our approach has greater power compared to other popular tests such as the Sequence Kernel Association Test (SKAT), the Weighted Sum Statistic (WSS), and the collapsing method of Morris and Zeggini (MZ). Our algorithm leverages a fast variational Bayes approximate inference methodology to scale to exome-wide analyses, a significant computational advantage over exact inference model selection methodologies. To demonstrate the efficacy of our methodology we test for associations between von Willebrand Factor (VWF) levels and VWF missense rare-variants imputed from the National Heart, Lung, and Blood Institute's Exome Sequencing project into 2,487 African Americans within the VWF gene. Our method suggests that a relatively small fraction (~10%) of the imputed rare missense variants within VWF are strongly associated with lower VWF levels in African Americans.

  5. Further delineation of the SATB2 phenotype

    PubMed Central

    Döcker, Dennis; Schubach, Max; Menzel, Moritz; Munz, Marita; Spaich, Christiane; Biskup, Saskia; Bartholdi, Deborah

    2014-01-01

    SATB2 is an evolutionarily highly conserved chromatin remodeling gene located on chromosome 2q33.1. Vertebrate animal models have shown that Satb2 has a crucial role in craniofacial patterning and osteoblast differentiation, as well as in determining the fates of neuronal projections in the developing neocortex. In humans, chromosomal translocations and deletions of 2q33.1 leading to SATB2 haploinsufficiency are associated with cleft palate (CP), facial dysmorphism and intellectual disability (ID). A single patient carrying a nonsense mutation in SATB2 has been described to date. In this study, we performed trio-exome sequencing in a 3-year-old girl with CP and severely delayed speech development, and her unaffected parents. Previously, the girl had undergone conventional and molecular karyotyping (microarray analysis), as well as targeted analysis for different diseases associated with developmental delay, including Angelman syndrome, Rett syndrome and Fragile X syndrome. No diagnosis could be established. Exome sequencing revealed a de novo nonsense mutation in the SATB2 gene (c.715C>T; p.R239*). The identification of a second patient carrying a de novo nonsense mutation in SATB2 confirms that this gene is essential for normal craniofacial patterning and cognitive development. Based on our data and the literature published so far, we propose a new clinically recognizable syndrome – the SATB2-associated syndrome (SAS). SAS is likely to be underdiagnosed and should be considered in children with ID, severe speech delay, cleft or high-arched palate and abnormal dentition with crowded and irregularly shaped teeth. PMID:24301056

  6. A novel homozygous missense variant in NECTIN4 (PVRL4) causing ectodermal dysplasia cutaneous syndactyly syndrome.

    PubMed

    Ahmad, Farooq; Nasir, Abdul; Thiele, Holger; Umair, Muhammad; Borck, Guntram; Ahmad, Wasim

    2018-02-12

    Ectodermal dysplasia syndactyly syndrome 1 (EDSS1) is a rare form of ectodermal dysplasia including anomalies of hair, nails, and teeth along with bilateral cutaneous syndactyly of hands and feet. In the present report, we performed a clinical and genetic characterization of a consanguineous Pakistani family with four individuals affected by EDSS1. We performed exome sequencing using DNA of one affected individual. Exome data analysis identified a novel homozygous missense variant (c.242T>C; p.(Leu81Pro)) in NECTIN4 (PVRL4). Sanger sequencing validated this variant and confirmed its cosegregation with the disease phenotype in the family members. Thus, our report adds a novel variant to the NECTIN4 mutation spectrum and contributes to the NECTIN4-related clinical characterization. © 2018 John Wiley & Sons Ltd/University College London.

  7. A Case of KCNQ2-Associated Movement Disorder Triggered by Fever.

    PubMed

    Dhamija, Radhika; Goodkin, Howard P; Bailey, Russell; Chambers, Chelsea; Brenton, J Nicholas

    2017-12-01

    The differential diagnosis of fever-induced movement disorders in childhood is broad. Whole exome sequencing has yielded new insights into those cases with a suspected genetic basis. We report the case of an 8-year-old boy with a history of neonatal seizures who presented with near-continuous hyperkinetic movements of his limbs during a febrile illness. Initial diagnostic testing did not explain his abnormalities; however, given the suspicion for a channelopathy, whole exome sequencing was performed and it demonstrated a de novo pathogenic heterozygous variant in KCNQ2. There is an expanding phenotypic spectrum of heterozygous alterations in KCNQ2; however, this report provides the first description of a pathogenic KCNQ2 variant fever-induced hyperkinetic movement disorder in childhood. We also review the literature of cases previously published with the same pathogenic variant.

  8. Development of Genetic Markers in Eucalyptus Species by Target Enrichment and Exome Sequencing

    PubMed Central

    Dasgupta, Modhumita Ghosh; Dharanishanthi, Veeramuthu; Agarwal, Ishangi; Krutovsky, Konstantin V.

    2015-01-01

    The advent of next-generation sequencing has facilitated large-scale discovery, validation and assessment of genetic markers for high density genotyping. The present study was undertaken to identify markers in genes supposedly related to wood property traits in three Eucalyptus species. Ninety four genes involved in xylogenesis were selected for hybridization probe based nuclear genomic DNA target enrichment and exome sequencing. Genomic DNA was isolated from the leaf tissues and used for on-array probe hybridization followed by Illumina sequencing. The raw sequence reads were trimmed and high-quality reads were mapped to the E. grandis reference sequence and the presence of single nucleotide variants (SNVs) and insertions/ deletions (InDels) were identified across the three species. The average read coverage was 216X and a total of 2294 SNVs and 479 InDels were discovered in E. camaldulensis, 2383 SNVs and 518 InDels in E. tereticornis, and 1228 SNVs and 409 InDels in E. grandis. Additionally, SNV calling and InDel detection were conducted in pair-wise comparisons of E. tereticornis vs. E. grandis, E. camaldulensis vs. E. tereticornis and E. camaldulensis vs. E. grandis. This study presents an efficient and high throughput method on development of genetic markers for family– based QTL and association analysis in Eucalyptus. PMID:25602379

  9. Exome sequencing identifies putative drivers of progression of transient myeloproliferative disorder to AMKL in infants with Down syndrome.

    PubMed

    Nikolaev, Sergey I; Santoni, Federico; Vannier, Anne; Falconnet, Emilie; Giarin, Emanuela; Basso, Giuseppe; Hoischen, Alexander; Veltman, Joris A; Groet, Jurgen; Nizetic, Dean; Antonarakis, Stylianos E

    2013-07-25

    Some neonates with Down syndrome (DS) are diagnosed with self-regressing transient myeloproliferative disorder (TMD), and 20% to 30% of those progress to acute megakaryoblastic leukemia (AMKL). We performed exome sequencing in 7 TMD/AMKL cases and copy-number analysis in these and 10 additional cases. All TMD/AMKL samples contained GATA1 mutations. No exome-sequenced TMD/AMKL sample had other recurrently mutated genes. However, 2 of 5 TMD cases, and all AMKL cases, showed mutations/deletions other than GATA1, in genes proven as transformation drivers in non-DS leukemia (EZH2, APC, FLT3, JAK1, PARK2-PACRG, EXT1, DLEC1, and SMC3). One patient at the TMD stage revealed 2 clonal expansions with different GATA1 mutations, of which 1 clone had an additional driver mutation. Interestingly, it was the other clone that gave rise to AMKL after accumulating mutations in 7 other genes. Data suggest that GATA1 mutations alone are sufficient for clonal expansions, and additional driver mutations at the TMD stage do not necessarily predict AMKL progression. Later in infancy, leukemic progression requires "third-hit driver" mutations/somatic copy-number alterations found in non-DS leukemias. Putative driver mutations affecting WNT (wingless-related integration site), JAK-STAT (Janus kinase/signal transducer and activator of transcription), or MAPK/PI3K (mitogen-activated kinase/phosphatidylinositol-3 kinase) pathways were found in all cases, aberrant activation of which converges on overexpression of MYC.

  10. Whole exome sequencing and array-based molecular karyotyping as aids to prenatal diagnosis in fetuses with suspected Simpson-Golabi-Behmel syndrome.

    PubMed

    Kehrer, Christina; Hoischen, Alexander; Menkhaus, Ralf; Schwab, Eva; Müller, Andreas; Kim, Sarah; Kreiß, Martina; Weitensteiner, Valerie; Hilger, Alina; Berg, Christoph; Geipel, Anne; Reutter, Heiko; Gembruch, Ulrich

    2016-10-01

    Simpson-Golabi-Behmel (SGBS) syndrome type 1 and type 2 represent rare X-linked prenatal overgrowth disorders. The aim of our study is to describe the prenatal sonographic features as well as the genetic work-up. Retrospective analysis of four cases with a pre- or postnatal diagnosis of SGBS in a single tertiary referral center within a period of 4 years. In the study period, four male fetuses with SGBS were detected. The final diagnosis was made prenatally in three cases. In all cases the second trimester anomaly scan revealed left sided congenital diaphragmatic hernia (CDH) with additional anomalies; three fetuses with SGBS type 1 showed fetal overgrowth. In two of these, whole exome sequencing showed a possible frameshift mutation and a point mutation in the gene GPC3, respectively. In the third case, multiplex ligation-dependent probe amplification (MLPA) revealed a hemizygous duplication of exon 3-7 in the gene GPC3. In the fourth case, SGBS type 2 was confirmed by array comparative genomic hybridization (CGH) of amniotic fluid cells showing a deletion of the gene OFD1. We could demonstrate, that in the presence of a CDH, syndromes of the fetus can be increasingly differentiated by detailed sonography followed by a selective and graded molecular diagnostic using microarray techniques and whole exome sequencing. © 2016 John Wiley & Sons, Ltd. © 2016 John Wiley & Sons, Ltd.

  11. Novel homozygous variants in ATCAY, MCOLN1, and SACS in complex neurological disorders.

    PubMed

    Manzoor, Humera; Brüggemann, Norbert; Hussain, Hafiz Muhammad Jafar; Bäumer, Tobias; Hinrichs, Frauke; Wajid, Muhammad; Münchau, Alexander; Naz, Sadaf; Lohmann, Katja

    2018-06-01

    Neurological disorders comprise a large group of clinically and genetically heterogeneous disorders, many of which have a genetic cause. In addition to a detailed neurological examination, exome sequencing is being increasingly used as a complementary diagnostic tool to identify the underlying genetic cause in patients with unclear, supposedly genetically determined disorders. To identify the genetic cause of a complex movement disorder in five consanguineous Pakistani families. We included five consanguineous Pakistani families with complex recessively inherited movement disorders. Clinical investigation including videotaping was carried out in a total of 59 family members (4-21 per family) and MRI in six patients. Exome sequencing was performed in 4-5 family members per pedigree to explore the underlying genetic cause. Patients presented a wide spectrum of neurological symptoms including ataxia and/or dystonia. We identified three novel homozygous, segregating variants in ATCAY (p.Pro200Profs*20), MCOLN1 (p.Ile184Thr), and SACS (p.Asn3040Lysfs*4) in three of the families. Thus, we were able to identify the likely cause of the disease in a considerable number of families (60%) with the relatively simple and nowadays widely available method of exome sequencing. Of note, close collaboration of neurologists and geneticists was instrumental for proper data interpretation. We expand the phenotypic, genotypic, and ethnical spectrum of mutations in these genes. Our findings alert neurologists that rare genetic causes should be considered in complex phenotypes regardless of ethnicity. Copyright © 2018 Elsevier Ltd. All rights reserved.

  12. Exome sequencing of a colorectal cancer family reveals shared mutation pattern and predisposition circuitry along tumor pathways

    PubMed Central

    Suleiman, Suleiman H.; Koko, Mahmoud E.; Nasir, Wafaa H.; Elfateh, Ommnyiah; Elgizouli, Ubai K.; Abdallah, Mohammed O. E.; Alfarouk, Khalid O.; Hussain, Ayman; Faisal, Shima; Ibrahim, Fathelrahamn M. A.; Romano, Maurizio; Sultan, Ali; Banks, Lawrence; Newport, Melanie; Baralle, Francesco; Elhassan, Ahmed M.; Mohamed, Hiba S.; Ibrahim, Muntaser E.

    2015-01-01

    The molecular basis of cancer and cancer multiple phenotypes are not yet fully understood. Next Generation Sequencing promises new insight into the role of genetic interactions in shaping the complexity of cancer. Aiming to outline the differences in mutation patterns between familial colorectal cancer cases and controls we analyzed whole exomes of cancer tissues and control samples from an extended colorectal cancer pedigree, providing one of the first data sets of exome sequencing of cancer in an African population against a background of large effective size typically with excess of variants. Tumors showed hMSH2 loss of function SNV consistent with Lynch syndrome. Sets of genes harboring insertions–deletions in tumor tissues revealed, however, significant GO enrichment, a feature that was not seen in control samples, suggesting that ordered insertions–deletions are central to tumorigenesis in this type of cancer. Network analysis identified multiple hub genes of centrality. ELAVL1/HuR showed remarkable centrality, interacting specially with genes harboring non-synonymous SNVs thus reinforcing the proposition of targeted mutagenesis in cancer pathways. A likely explanation to such mutation pattern is DNA/RNA editing, suggested here by nucleotide transition-to-transversion ratio that significantly departed from expected values (p-value 5e-6). NFKB1 also showed significant centrality along with ELAVL1, raising the suspicion of viral etiology given the known interaction between oncogenic viruses and these proteins. PMID:26442106

  13. Variants in SKP1, PROB1, and IL17B genes at keratoconus 5q31.1–q35.3 susceptibility locus identified by whole-exome sequencing

    PubMed Central

    Karolak, Justyna A; Gambin, Tomasz; Pitarque, Jose A; Molinari, Andrea; Jhangiani, Shalini; Stankiewicz, Pawel; Lupski, James R; Gajecka, Marzena

    2017-01-01

    Keratoconus (KTCN) is a protrusion and thinning of the cornea, resulting in impairment of visual function. The extreme genetic heterogeneity makes it difficult to discover factors unambiguously influencing the KTCN phenotype. In this study, we used whole-exome sequencing (WES) and Sanger sequencing to reduce the number of candidate genes at the 5q31.1–q35.3 locus and to prioritize other potentially relevant variants in an Ecuadorian family with KTCN. We applied WES in two affected KTCN individuals from the Ecuadorian family that showed a suggestive linkage between the KTCN phenotype and the 5q31.1–q35.3 locus. Putative variants identified by WES were further evaluated in this family using Sanger sequencing. Exome capture discovered a total of 173 rare (minor allele frequency <0.001 in control population) nonsynonymous variants in both affected individuals. Among them, 16 SNVs were selected for further evaluation. Segregation analysis revealed that variants c.475T>G in SKP1, c.671G>A in PROB1, and c.527G>A in IL17B in the 5q31.1–q35.3 linkage region, and c.850G>A in HKDC1 in the 10q22 locus completely segregated with the phenotype in the studied KTCN family. We demonstrate that a combination of various techniques significantly narrowed the studied genomic region and reduced the list of the putative exonic variants. Moreover, since this locus overlapped two other chromosomal regions previously recognized in distinct KTCN studies, our findings suggest that this 5q31.1–q35.3 locus might be linked with KTCN. PMID:27703147

  14. Copy number variation in CEP57L1 predisposes to congenital absence of bilateral ACL and PCL ligaments.

    PubMed

    Liu, Yichuan; Li, Yun; March, Michael E; Nguyen, Kenny; Kenny, Nguyen; Xu, Kexiang; Wang, Fengxiang; Guo, Yiran; Keating, Brendan; Glessner, Joseph; Li, Jiankang; Ganley, Theodore J; Zhang, Jianguo; Deardorff, Matthew A; Xu, Xun; Hakonarson, Hakon

    2015-11-11

    Absence of the anterior (ACL) or posterior cruciate ligament (PCL) are rare congenital malformations that result in knee joint instability, with a prevalence of 1.7 per 100,000 live births and can be associated with other lower-limb abnormalities such as ACL agnesia and absence of the menisci of the knee. While a few cases of absence of ACL/PCL are reported in the literature, a number of large familial case series of related conditions such as ACL agnesia suggest a potential underlying monogenic etiology. We performed whole exome sequencing of a family with two individuals affected by ACL/PCL. We identified copy number variation (CNV) deletion impacting the exon sequences of CEP57L1, present in the affected mother and her affected daughter based on the exome sequencing data. The deletion was validated using quantitative PCR (qPCR), and the gene was confirmed to be expressed in ACL ligament tissue. Interestingly, we detected reduced expression of CEP57L1 in Epstein-Barr virus (EBV) cells from the two patients in comparison with healthy controls. Evaluation of 3D protein structure showed that the helix-binding sites of the protein remain intact with the deletion, but other functional binding sites related to microtubule attachment are missing. The specificity of the CNV deletion was confirmed by showing that it was absent in ~700 exome sequencing samples as well as in the database of genomic variations (DGV), a database containing large numbers of annotated CNVs from previous scientific reports. We identified a novel CNV deletion that was inherited through an autosomal dominant transmission from an affected mother to her affected daughter, both of whom suffered from the absence of the anterior and posterior cruciate ligaments of the knees.

  15. Increased Probability of Co-Occurrence of Two Rare Diseases in Consanguineous Families and Resolution of a Complex Phenotype by Next Generation Sequencing

    PubMed Central

    Lal, Dennis; Neubauer, Bernd A.; Toliat, Mohammad R.; Altmüller, Janine; Thiele, Holger; Nürnberg, Peter; Kamrath, Clemens; Schänzer, Anne; Sander, Thomas; Hahn, Andreas; Nothnagel, Michael

    2016-01-01

    Massively parallel sequencing of whole genomes and exomes has facilitated a direct assessment of causative genetic variation, now enabling the identification of genetic factors involved in rare diseases (RD) with Mendelian inheritance patterns on an almost routine basis. Here, we describe the illustrative case of a single consanguineous family where this strategy suffered from the difficulty to distinguish between two etiologically distinct disorders, namely the co-occurrence of hereditary hypophosphatemic rickets (HRR) and congenital myopathies (CM), by their phenotypic manifestation alone. We used parametric linkage analysis, homozygosity mapping and whole exome-sequencing to identify mutations underlying HRR and CM. We also present an approximate approach for assessing the probability of co-occurrence of two unlinked recessive RD in a single family as a function of the degree of consanguinity and the frequency of the disease-causing alleles. Linkage analysis and homozygosity mapping yielded elusive results when assuming a single RD, but whole-exome sequencing helped to identify two mutations in two genes, namely SLC34A3 and SEPN1, that segregated independently in this family and that have previously been linked to two etiologically different diseases. We assess the increase in chance co-occurrence of rare diseases due to consanguinity, i.e. under circumstances that generally favor linkage mapping of recessive disease, and show that this probability can increase by several orders of magnitudes. We conclude that such potential co-occurrence represents an underestimated risk when analyzing rare or undefined diseases in consanguineous families and should be given more consideration in the clinical and genetic evaluation. PMID:26789268

  16. Genetics Home Reference: melorheostosis

    MedlinePlus

    ... to the production of a version of MEK1 protein kinase that is overactive, which increases RAS/MAPK signaling in bone tissue. ... BA, Griffith OL, Mardis ER. Melorheostosis: Exome sequencing of an associated dermatosis implicates postzygotic mosaicism of ...

  17. Performance comparison of SNP detection tools with illumina exome sequencing data—an assessment using both family pedigree information and sample-matched SNP array data

    PubMed Central

    Yi, Ming; Zhao, Yongmei; Jia, Li; He, Mei; Kebebew, Electron; Stephens, Robert M.

    2014-01-01

    To apply exome-seq-derived variants in the clinical setting, there is an urgent need to identify the best variant caller(s) from a large collection of available options. We have used an Illumina exome-seq dataset as a benchmark, with two validation scenarios—family pedigree information and SNP array data for the same samples, permitting global high-throughput cross-validation, to evaluate the quality of SNP calls derived from several popular variant discovery tools from both the open-source and commercial communities using a set of designated quality metrics. To the best of our knowledge, this is the first large-scale performance comparison of exome-seq variant discovery tools using high-throughput validation with both Mendelian inheritance checking and SNP array data, which allows us to gain insights into the accuracy of SNP calling through such high-throughput validation in an unprecedented way, whereas the previously reported comparison studies have only assessed concordance of these tools without directly assessing the quality of the derived SNPs. More importantly, the main purpose of our study was to establish a reusable procedure that applies high-throughput validation to compare the quality of SNP discovery tools with a focus on exome-seq, which can be used to compare any forthcoming tool(s) of interest. PMID:24831545

  18. High-Throughput Sequencing of Germline and Tumor From Men with Early-Onset Metastatic Prostate Cancer

    DTIC Science & Technology

    2016-12-01

    AWARD NUMBER: W81XWH-13-1-0371 TITLE: High-Throughput Sequencing of Germline and Tumor From Men with Early- Onset Metastatic Prostate Cancer...DATES COVERED 30 Sep 2013 - 29 Sep 2016 4. TITLE AND SUBTITLE 5a. CONTRACT NUMBER High-Throughput Sequencing of Germline and Tumor From Men with...presenting with metastatic prostate cancer at a young age (before age 60 years). Whole exome sequencing identified a panel of germline variants that have

  19. Sensitivity to sequencing depth in single-cell cancer genomics.

    PubMed

    Alves, João M; Posada, David

    2018-04-16

    Querying cancer genomes at single-cell resolution is expected to provide a powerful framework to understand in detail the dynamics of cancer evolution. However, given the high costs currently associated with single-cell sequencing, together with the inevitable technical noise arising from single-cell genome amplification, cost-effective strategies that maximize the quality of single-cell data are critically needed. Taking advantage of previously published single-cell whole-genome and whole-exome cancer datasets, we studied the impact of sequencing depth and sampling effort towards single-cell variant detection. Five single-cell whole-genome and whole-exome cancer datasets were independently downscaled to 25, 10, 5, and 1× sequencing depth. For each depth level, ten technical replicates were generated, resulting in a total of 6280 single-cell BAM files. The sensitivity of variant detection, including structural and driver mutations, genotyping, clonal inference, and phylogenetic reconstruction to sequencing depth was evaluated using recent tools specifically designed for single-cell data. Altogether, our results suggest that for relatively large sample sizes (25 or more cells) sequencing single tumor cells at depths > 5× does not drastically improve somatic variant discovery, characterization of clonal genotypes, or estimation of single-cell phylogenies. We suggest that sequencing multiple individual tumor cells at a modest depth represents an effective alternative to explore the mutational landscape and clonal evolutionary patterns of cancer genomes.

  20. New splicing mutation in the choline kinase beta (CHKB) gene causing a muscular dystrophy detected by whole-exome sequencing.

    PubMed

    Oliveira, Jorge; Negrão, Luís; Fineza, Isabel; Taipa, Ricardo; Melo-Pires, Manuel; Fortuna, Ana Maria; Gonçalves, Ana Rita; Froufe, Hugo; Egas, Conceição; Santos, Rosário; Sousa, Mário

    2015-06-01

    Muscular dystrophies (MDs) are a group of hereditary muscle disorders that include two particularly heterogeneous subgroups: limb-girdle MD and congenital MD, linked to 52 different genes (seven common to both subgroups). Massive parallel sequencing technology may avoid the usual stepwise gene-by-gene analysis. We report the whole-exome sequencing (WES) analysis of a patient with childhood-onset progressive MD, also presenting mental retardation and dilated cardiomyopathy. Conventional sequencing had excluded eight candidate genes. WES of the trio (patient and parents) was performed using the ion proton sequencing system. Data analysis resorted to filtering steps using the GEMINI software revealed a novel silent variant in the choline kinase beta (CHKB) gene. Inspection of sequence alignments ultimately identified the causal variant (CHKB:c.1031+3G>C). This splice site mutation was confirmed using Sanger sequencing and its effect was further evaluated with gene expression analysis. On reassessment of the muscle biopsy, typical abnormal mitochondrial oxidative changes were observed. Mutations in CHKB have been shown to cause phosphatidylcholine deficiency in myofibers, causing a rare form of CMD (only 21 patients reported). Notwithstanding interpretative difficulties that need to be overcome before the integration of WES in the diagnostic workflow, this work corroborates its utility in solving cases from highly heterogeneous groups of diseases, in which conventional diagnostic approaches fail to provide a definitive diagnosis.

  1. [Identification of novel pathogenic gene mutations in pediatric acute myeloid leukemia by whole-exome resequencing].

    PubMed

    Shiba, Norio

    2015-12-01

    A new class of gene mutations, identified in the pathogenesis of adult acute myeloid leukemia (AML), includes DNMT3A, IDH1/2, TET2 and EZH2. However, these mutations are rare in pediatric AML cases, indicating that pathogeneses differ between adult and pediatric forms of AML. Meanwhile, the recent development of massively parallel sequencing technologies has provided a new opportunity to discover genetic changes across entire genomes or proteincoding sequences. In order to reveal a complete registry of gene mutations, we performed whole exome resequencing of paired tumor-normal specimens from 19 pediatric AML cases using Illumina HiSeq 2000. In total, 80 somatic mutations or 4.2 mutations per sample were identified. Many of the recurrent mutations identified in this study involved previously reported targets in AML, such as FLT3, CEBPA, KIT, CBL, NRAS, WT1 and EZH2. On the other hand, several genes were newly identified in the current study, including BCORL1 and major cohesin components such as SMC3 and RAD21. Whole exome resequencing revealed a complex array of gene mutations in pediatric AML genomes. Our results indicate that a subset of pediatric AML represents a discrete entity that could be discriminated from its adult counterpart, in terms of the spectrum of gene mutations.

  2. Walking the interactome for candidate prioritization in exome sequencing studies of Mendelian diseases

    DOE PAGES

    Smedley, Damian; Kohler, Sebastian; Czeschik, Johanna Christina; ...

    2014-07-30

    Here, whole-exome sequencing (WES) has opened up previously unheard of possibilities for identifying novel disease genes in Mendelian disorders, only about half of which have been elucidated to date. However, interpretation of WES data remains challenging. As a result, we analyze protein–protein association (PPA) networks to identify candidate genes in the vicinity of genes previously implicated in a disease. The analysis, using a random-walk with restart (RWR) method, is adapted to the setting of WES by developing a composite variant-gene relevance score based on the rarity, location and predicted pathogenicity of variants and the RWR evaluation of genes harboring themore » variants. Benchmarking using known disease variants from 88 disease-gene families reveals that the correct gene is ranked among the top 10 candidates in ≥50% of cases, a figure which we confirmed using a prospective study of disease genes identified in 2012 and PPA data produced before that date. In conclusion, we implement our method in a freely available Web server, ExomeWalker, that displays a ranked list of candidates together with information on PPAs, frequency and predicted pathogenicity of the variants to allow quick and effective searches for candidates that are likely to reward closer investigation.« less

  3. Walking the interactome for candidate prioritization in exome sequencing studies of Mendelian diseases

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Smedley, Damian; Kohler, Sebastian; Czeschik, Johanna Christina

    Here, whole-exome sequencing (WES) has opened up previously unheard of possibilities for identifying novel disease genes in Mendelian disorders, only about half of which have been elucidated to date. However, interpretation of WES data remains challenging. As a result, we analyze protein–protein association (PPA) networks to identify candidate genes in the vicinity of genes previously implicated in a disease. The analysis, using a random-walk with restart (RWR) method, is adapted to the setting of WES by developing a composite variant-gene relevance score based on the rarity, location and predicted pathogenicity of variants and the RWR evaluation of genes harboring themore » variants. Benchmarking using known disease variants from 88 disease-gene families reveals that the correct gene is ranked among the top 10 candidates in ≥50% of cases, a figure which we confirmed using a prospective study of disease genes identified in 2012 and PPA data produced before that date. In conclusion, we implement our method in a freely available Web server, ExomeWalker, that displays a ranked list of candidates together with information on PPAs, frequency and predicted pathogenicity of the variants to allow quick and effective searches for candidates that are likely to reward closer investigation.« less

  4. Predicting discovery rates of genomic features.

    PubMed

    Gravel, Simon

    2014-06-01

    Successful sequencing experiments require judicious sample selection. However, this selection must often be performed on the basis of limited preliminary data. Predicting the statistical properties of the final sample based on preliminary data can be challenging, because numerous uncertain model assumptions may be involved. Here, we ask whether we can predict "omics" variation across many samples by sequencing only a fraction of them. In the infinite-genome limit, we find that a pilot study sequencing 5% of a population is sufficient to predict the number of genetic variants in the entire population within 6% of the correct value, using an estimator agnostic to demography, selection, or population structure. To reach similar accuracy in a finite genome with millions of polymorphisms, the pilot study would require ∼15% of the population. We present computationally efficient jackknife and linear programming methods that exhibit substantially less bias than the state of the art when applied to simulated data and subsampled 1000 Genomes Project data. Extrapolating based on the National Heart, Lung, and Blood Institute Exome Sequencing Project data, we predict that 7.2% of sites in the capture region would be variable in a sample of 50,000 African Americans and 8.8% in a European sample of equal size. Finally, we show how the linear programming method can also predict discovery rates of various genomic features, such as the number of transcription factor binding sites across different cell types. Copyright © 2014 by the Genetics Society of America.

  5. Ethical and legal implications of whole genome and whole exome sequencing in African populations.

    PubMed

    Wright, Galen E B; Koornhof, Pieter G J; Adeyemo, Adebowale A; Tiffin, Nicki

    2013-05-28

    Rapid advances in high throughput genomic technologies and next generation sequencing are making medical genomic research more readily accessible and affordable, including the sequencing of patient and control whole genomes and exomes in order to elucidate genetic factors underlying disease. Over the next five years, the Human Heredity and Health in Africa (H3Africa) Initiative, funded by the Wellcome Trust (United Kingdom) and the National Institutes of Health (United States of America), will contribute greatly towards sequencing of numerous African samples for biomedical research. Funding agencies and journals often require submission of genomic data from research participants to databases that allow open or controlled data access for all investigators. Access to such genotype-phenotype and pedigree data, however, needs careful control in order to prevent identification of individuals or families. This is particularly the case in Africa, where many researchers and their patients are inexperienced in the ethical issues accompanying whole genome and exome research; and where an historical unidirectional flow of samples and data out of Africa has created a sense of exploitation and distrust. In the current study, we analysed the implications of the anticipated surge of next generation sequencing data in Africa and the subsequent data sharing concepts on the protection of privacy of research subjects. We performed a retrospective analysis of the informed consent process for the continent and the rest-of-the-world and examined relevant legislation, both current and proposed. We investigated the following issues: (i) informed consent, including guidelines for performing culturally-sensitive next generation sequencing research in Africa and availability of suitable informed consent documents; (ii) data security and subject privacy whilst practicing data sharing; (iii) conveying the implications of such concepts to research participants in resource limited settings. We conclude that, in order to meet the unique requirements of performing next generation sequencing-related research in African populations, novel approaches to the informed consent process are required. This will help to avoid infringement of privacy of individual subjects as well as to ensure that informed consent adheres to acceptable data protection levels with regard to use and transfer of such information.

  6. Ethical and legal implications of whole genome and whole exome sequencing in African populations

    PubMed Central

    2013-01-01

    Background Rapid advances in high throughput genomic technologies and next generation sequencing are making medical genomic research more readily accessible and affordable, including the sequencing of patient and control whole genomes and exomes in order to elucidate genetic factors underlying disease. Over the next five years, the Human Heredity and Health in Africa (H3Africa) Initiative, funded by the Wellcome Trust (United Kingdom) and the National Institutes of Health (United States of America), will contribute greatly towards sequencing of numerous African samples for biomedical research. Discussion Funding agencies and journals often require submission of genomic data from research participants to databases that allow open or controlled data access for all investigators. Access to such genotype-phenotype and pedigree data, however, needs careful control in order to prevent identification of individuals or families. This is particularly the case in Africa, where many researchers and their patients are inexperienced in the ethical issues accompanying whole genome and exome research; and where an historical unidirectional flow of samples and data out of Africa has created a sense of exploitation and distrust. In the current study, we analysed the implications of the anticipated surge of next generation sequencing data in Africa and the subsequent data sharing concepts on the protection of privacy of research subjects. We performed a retrospective analysis of the informed consent process for the continent and the rest-of-the-world and examined relevant legislation, both current and proposed. We investigated the following issues: (i) informed consent, including guidelines for performing culturally-sensitive next generation sequencing research in Africa and availability of suitable informed consent documents; (ii) data security and subject privacy whilst practicing data sharing; (iii) conveying the implications of such concepts to research participants in resource limited settings. Summary We conclude that, in order to meet the unique requirements of performing next generation sequencing-related research in African populations, novel approaches to the informed consent process are required. This will help to avoid infringement of privacy of individual subjects as well as to ensure that informed consent adheres to acceptable data protection levels with regard to use and transfer of such information. PMID:23714101

  7. Atlas2 Cloud: a framework for personal genome analysis in the cloud

    PubMed Central

    2012-01-01

    Background Until recently, sequencing has primarily been carried out in large genome centers which have invested heavily in developing the computational infrastructure that enables genomic sequence analysis. The recent advancements in next generation sequencing (NGS) have led to a wide dissemination of sequencing technologies and data, to highly diverse research groups. It is expected that clinical sequencing will become part of diagnostic routines shortly. However, limited accessibility to computational infrastructure and high quality bioinformatic tools, and the demand for personnel skilled in data analysis and interpretation remains a serious bottleneck. To this end, the cloud computing and Software-as-a-Service (SaaS) technologies can help address these issues. Results We successfully enabled the Atlas2 Cloud pipeline for personal genome analysis on two different cloud service platforms: a community cloud via the Genboree Workbench, and a commercial cloud via the Amazon Web Services using Software-as-a-Service model. We report a case study of personal genome analysis using our Atlas2 Genboree pipeline. We also outline a detailed cost structure for running Atlas2 Amazon on whole exome capture data, providing cost projections in terms of storage, compute and I/O when running Atlas2 Amazon on a large data set. Conclusions We find that providing a web interface and an optimized pipeline clearly facilitates usage of cloud computing for personal genome analysis, but for it to be routinely used for large scale projects there needs to be a paradigm shift in the way we develop tools, in standard operating procedures, and in funding mechanisms. PMID:23134663

  8. Atlas2 Cloud: a framework for personal genome analysis in the cloud.

    PubMed

    Evani, Uday S; Challis, Danny; Yu, Jin; Jackson, Andrew R; Paithankar, Sameer; Bainbridge, Matthew N; Jakkamsetti, Adinarayana; Pham, Peter; Coarfa, Cristian; Milosavljevic, Aleksandar; Yu, Fuli

    2012-01-01

    Until recently, sequencing has primarily been carried out in large genome centers which have invested heavily in developing the computational infrastructure that enables genomic sequence analysis. The recent advancements in next generation sequencing (NGS) have led to a wide dissemination of sequencing technologies and data, to highly diverse research groups. It is expected that clinical sequencing will become part of diagnostic routines shortly. However, limited accessibility to computational infrastructure and high quality bioinformatic tools, and the demand for personnel skilled in data analysis and interpretation remains a serious bottleneck. To this end, the cloud computing and Software-as-a-Service (SaaS) technologies can help address these issues. We successfully enabled the Atlas2 Cloud pipeline for personal genome analysis on two different cloud service platforms: a community cloud via the Genboree Workbench, and a commercial cloud via the Amazon Web Services using Software-as-a-Service model. We report a case study of personal genome analysis using our Atlas2 Genboree pipeline. We also outline a detailed cost structure for running Atlas2 Amazon on whole exome capture data, providing cost projections in terms of storage, compute and I/O when running Atlas2 Amazon on a large data set. We find that providing a web interface and an optimized pipeline clearly facilitates usage of cloud computing for personal genome analysis, but for it to be routinely used for large scale projects there needs to be a paradigm shift in the way we develop tools, in standard operating procedures, and in funding mechanisms.

  9. Mutations in RSPH1 cause primary ciliary dyskinesia with a unique clinical and ciliary phenotype.

    PubMed

    Knowles, Michael R; Ostrowski, Lawrence E; Leigh, Margaret W; Sears, Patrick R; Davis, Stephanie D; Wolf, Whitney E; Hazucha, Milan J; Carson, Johnny L; Olivier, Kenneth N; Sagel, Scott D; Rosenfeld, Margaret; Ferkol, Thomas W; Dell, Sharon D; Milla, Carlos E; Randell, Scott H; Yin, Weining; Sannuti, Aruna; Metjian, Hilda M; Noone, Peadar G; Noone, Peter J; Olson, Christina A; Patrone, Michael V; Dang, Hong; Lee, Hye-Seung; Hurd, Toby W; Gee, Heon Yung; Otto, Edgar A; Halbritter, Jan; Kohl, Stefan; Kircher, Martin; Krischer, Jeffrey; Bamshad, Michael J; Nickerson, Deborah A; Hildebrandt, Friedhelm; Shendure, Jay; Zariwala, Maimoona A

    2014-03-15

    Primary ciliary dyskinesia (PCD) is a genetically heterogeneous recessive disorder of motile cilia, but the genetic cause is not defined for all patients with PCD. To identify disease-causing mutations in novel genes, we performed exome sequencing, follow-up characterization, mutation scanning, and genotype-phenotype studies in patients with PCD. Whole-exome sequencing was performed using NimbleGen capture and Illumina HiSeq sequencing. Sanger-based sequencing was used for mutation scanning, validation, and segregation analysis. We performed exome sequencing on an affected sib-pair with normal ultrastructure in more than 85% of cilia. A homozygous splice-site mutation was detected in RSPH1 in both siblings; parents were carriers. Screening RSPH1 in 413 unrelated probands, including 325 with PCD and 88 with idiopathic bronchiectasis, revealed biallelic loss-of-function mutations in nine additional probands. Five affected siblings of probands in RSPH1 families harbored the familial mutations. The 16 individuals with RSPH1 mutations had some features of PCD; however, nasal nitric oxide levels were higher than in patients with PCD with other gene mutations (98.3 vs. 20.7 nl/min; P < 0.0003). Additionally, individuals with RSPH1 mutations had a lower prevalence (8 of 16) of neonatal respiratory distress, and later onset of daily wet cough than typical for PCD, and better lung function (FEV1), compared with 75 age- and sex-matched PCD cases (73.0 vs. 61.8, FEV1 % predicted; P = 0.043). Cilia from individuals with RSPH1 mutations had normal beat frequency (6.1 ± Hz at 25°C), but an abnormal, circular beat pattern. The milder clinical disease and higher nasal nitric oxide in individuals with biallelic mutations in RSPH1 provides evidence of a unique genotype-phenotype relationship in PCD, and suggests that mutations in RSPH1 may be associated with residual ciliary function.

  10. CIDR

    Science.gov Websites

    CIDR Skip navigation Home About CIDR General Highlights Newsletter Staff Employment Opportunities Genotyping General Information Genome Wide Association Custom FFPE Sample Options Methylation Linkage Consortium Developed Mouse Whole Genome Sequencing General Information Whole Genome Whole Exome Custom

  11. CIDR

    Science.gov Websites

    Initiation Application Schedule Service Information and Pricing Services Sample Requirements Pricing SNP Genotyping General Information Genome Wide Association Custom FFPE Sample Options Methylation Linkage Consortium Developed Mouse Whole Genome Sequencing General Information Whole Genome Whole Exome Custom

  12. Genetics Home Reference: Miller syndrome

    MedlinePlus

    ... a limb bud, which grows outward. Many different proteins are involved in the normal shaping (patterning) of each limb. Once the overall pattern of a ... for This Page Biesecker LG. Exome sequencing makes medical genomics a reality. Nat Genet. 2010 ...

  13. Tumor Biology and Immunology | Center for Cancer Research

    Cancer.gov

    Tumor Biology and Immunology The Comparative Brain Tumor Consortium is collaborating with National Center for Advanced Translational Sciences to complete whole exome sequencing on canine meningioma samples. Results will be published and made publicly available.

  14. North Carolina Genomic Evaluation by Next-generation Exome Sequencing, 2

    ClinicalTrials.gov

    2018-06-06

    Epilepsy; Seizure; Neuromuscular Diseases; Brain Malformation; Intellectual Disability; Autism Spectrum Disorder; Hypotonia; Inborn Errors of Metabolism; Movement Disorders; Genetic Disease; Development Delay; Chromosome Abnormality; Hearing Loss; Dysmorphic Features; Skeletal Dysplasia; Congenital Abnormality; Microcephaly; Macrocephaly

  15. Mutational and structural analysis of diffuse large B-cell lymphoma using whole genome sequencing | Office of Cancer Genomics

    Cancer.gov

    Abstract: Diffuse large B-cell lymphoma (DLBCL) is a genetically heterogeneous cancer comprising at least two molecular subtypes that differ in gene expression and distribution of mutations. Recently, application of genome/exome sequencing and RNA-seq to DLBCL has revealed numerous genes that are recurrent targets of somatic point mutation in this disease.

  16. SLC52A2 mutations cause SCABD2 phenotype: A second report.

    PubMed

    Babanejad, Mojgan; Adeli, Omid Ali; Nikzat, Nooshin; Beheshtian, Maryam; Azarafra, Hakimeh; Sadeghnia, Farnaz; Mohseni, Marzieh; Najmabadi, Hossein; Kahrizi, Kimia

    2018-01-01

    Autosomal recessive cerebellar ataxias (ARCAs) are a large group of neurodegenerative disorders that manifest mainly in children and young adults. Most ARCAs are heterogeneous with respect to age at onset, severity of disease progression, and frequency of extracerebellar and systemic signs. The phenotype of a consanguineous Iranian family was characterized using clinical testing and pedigree analysis. Whole-exome sequencing was used to identify the disease-causing gene in this family. Using whole exome sequencing (WES), a novel missense mutation in SLC52A2 gene is reported in a consanguineous Iranian family with progressive severe hearing loss, optic atrophy and ataxia. This is the second report of the genotype-phenotype correlation between this syndrome named spinocerebellar ataxia with blindness and deafness type 2 (SCABD2) and SLC52A2 gene. Copyright © 2017 Elsevier B.V. All rights reserved.

  17. Case Report: Whole exome sequencing helps in accurate molecular diagnosis in siblings with a rare co-occurrence of paternally inherited 22q12 duplication and autosomal recessive non-syndromic ichthyosis.

    PubMed Central

    Gupta, Aayush; Sharma, Yugal; Deo, Kirti; Vellarikkal, Shamsudheen; Jayarajan, Rijith; Dixit, Vishal; Verma, Ankit; Scaria, Vinod; Sivasubbu, Sridhar

    2015-01-01

    Lamellar ichthyosis (LI), considered an autosomal recessive monogenic genodermatosis, has an incidence of approximately 1 in 250,000. Usually associated with mutations in the transglutaminase gene ( TGM1), mutations in six other genes have, less frequently, been shown to be causative. Two siblings, born in a collodion membrane, presented with fish like scales all over the body. Karyotyping revealed duplication of the chromosome arm on 22q12+ in the father and two siblings. Whole exome sequencing revealed a homozygous p.Gly218Ser variation in TGM1; a variation reported earlier in an isolated Finnish population in association with autosomal recessive non-syndromic ichthyosis. This concurrence of a potentially benign 22q12+ duplication and LI, both rare individually, is reported here likely for the first time. PMID:26594337

  18. Whole Exome Sequencing of a Patient with Metastatic Hidradenocarcinoma and Review of the Literature

    PubMed Central

    Gupta, Eva; Guthrie, Kimberly J.; Krishna, Murli; Asmann, Yan; Parker, Alexander S.; Joseph, Richard W.

    2015-01-01

    Hidradenocarcinoma is a rare malignancy of the sweat glands with only a few cases reported in literature. The management of these tumors is based on the extent of disease with local disease managed with surgical resection. These can tumors carry a high potential of lymphatic and vascular spread and local and distant metastases are not uncommon. Given the rarity of the tumor and lack of genetic and clinical data about these tumors, there is no consensus on the proper management of metastatic disease. Here in we report the first case of metastatic hidradenocarcinoma with detailed molecular profiling including whole exome sequencing. We identified mutations in multiple genes including two that are potentially targetable: PTCH1 and TCF7L1. Further work is necessary to not only confirm the presence of these mutations but also to confirm the clinical significance. PMID:25918615

  19. Whole exome sequencing of a patient with metastatic hidradenocarcinoma and review of the literature.

    PubMed

    Gupta, Eva; Guthrie, Kimberly J; Krishna, Murli; Asmann, Yan; Parker, Alexander S; Joseph, Richard W

    2015-02-11

    Hidradenocarcinoma is a rare malignancy of the sweat glands with only a few cases reported in literature. The management of these tumors is based on the extent of disease with local disease managed with surgical resection. These can tumors carry a high potential of lymphatic and vascular spread and local and distant metastases are not uncommon. Given the rarity of the tumor and lack of genetic and clinical data about these tumors, there is no consensus on the proper management of metastatic disease. Here in we report the first case of metastatic hidradenocarcinoma with detailed molecular profiling including whole exome sequencing. We identified mutations in multiple genes including two that are potentially targetable: PTCH1 and TCF7L1. Further work is necessary to not only confirm the presence of these mutations but also to confirm the clinical significance.

  20. A polygenic burden of rare disruptive mutations in schizophrenia

    PubMed Central

    Purcell, Shaun M.; Moran, Jennifer L.; Fromer, Menachem; Ruderfer, Douglas; Solovieff, Nadia; Roussos, Panos; O’Dushlaine, Colm; Chambert, Kimberly; Bergen, Sarah E.; Kähler, Anna; Duncan, Laramie; Stahl, Eli; Genovese, Giulio; Fernández, Esperanza; Collins, Mark O; Komiyama, Noboru H.; Choudhary, Jyoti S.; Magnusson, Patrik K. E.; Banks, Eric; Shakir, Khalid; Garimella, Kiran; Fennell, Tim; de Pristo, Mark; Grant, Seth G.N.; Haggarty, Stephen; Gabriel, Stacey; Scolnick, Edward M.; Lander, Eric S.; Hultman, Christina; Sullivan, Patrick F.; McCarroll, Steven A.; Sklar, Pamela

    2014-01-01

    By analyzing the exome sequences of 2,536 schizophrenia cases and 2,543 controls, we have demonstrated a polygenic burden primarily arising from rare (<1/10,000), disruptive mutations distributed across many genes. Especially enriched genesets included the voltage-gated calcium ion channel and the signaling complex formed by the activity-regulated cytoskeleton-associated (ARC) scaffold protein of the postsynaptic density (PSD), sets previously implicated by genome-wide association studies (GWAS) and copy-number variation (CNV) studies. Similar to reports in autism, targets of the fragile × mental retardation protein (FMRP, product of FMR1) were enriched for case mutations. No individual gene-based test achieved significance after correction for multiple testing and we did not detect any alleles of moderately low frequency (~0.5-1%) and moderately large effect. Taken together, these data suggest that population-based exome sequencing can discover risk alleles and complements established gene mapping paradigms in neuropsychiatric disease. PMID:24463508

  1. Exome sequencing supports a de novo mutational paradigm for schizophrenia

    PubMed Central

    Xu, Bin; Roos, J. Louw; Dexheimer, Phillip; Boone, Braden; Plummer, Brooks; Levy, Shawn; Gogos, Joseph A.; Karayiorgou, Maria

    2011-01-01

    Despite high heritability, a large fraction of cases with schizophrenia do not have a family history of the disease (sporadic cases). Here, we examine the possibility that rare de novo protein-altering mutations contribute to the genetic component of schizophrenia by sequencing the exome of 53 sporadic cases, 22 unaffected controls and their parents. We identified 40 de novo mutations in 27 patients affecting 40 genes including a potentially disruptive mutation in DGCR2, a gene removed by the recurrent schizophrenia-predisposing 22q11.2 microdeletion. Comparison to rare inherited variants revealed that the identified de novo mutations show a large excess of nonsynonymous changes in cases, as well as a greater potential to affect protein structure and function. Our analysis reveals a major role of de novo mutations in schizophrenia and also a large mutational target, which together provide a plausible explanation for the high global incidence and persistence of the disease. PMID:21822266

  2. De novo gene mutations highlight patterns of genetic and neural complexity in schizophrenia

    PubMed Central

    Xu, Bin; Ionita-Laza, Iuliana; Roos, J. Louw; Boone, Braden; Woodrick, Scarlet; Sun, Yan; Levy, Shawn; Gogos, Joseph A.; Karayiorgou, Maria

    2013-01-01

    To evaluate evidence for de novo etiologies in schizophrenia, we sequenced at high coverage the exomes of families recruited from two populations with distinct demographic structure and history. We sequenced a total of 795 exomes from 231 parent-proband trios enriched for sporadic schizophrenia cases, as well as 34 unaffected trios. We observed in cases an excess of non-synonymous single nucleotide variants as well as a higher prevalence of gene-disruptive de novo mutations. We found four genes (LAMA2, DPYD, TRRAP and VPS39) affected by recurrent de novo events within or across the two populations, a finding unlikely to have occurred by chance. We show that de novo mutations affect genes with diverse functions and developmental profiles but we also find a substantial contribution of mutations in genes with higher expression in early fetal life. Our results help define the pattern of genomic and neural architecture of schizophrenia. PMID:23042115

  3. Whole-exome sequencing links TMCO1 defect syndrome with cerebro-facio-thoracic dysplasia.

    PubMed

    Pehlivan, Davut; Karaca, Ender; Aydin, Hatip; Beck, Christine R; Gambin, Tomasz; Muzny, Donna M; Bilge Geckinli, B; Karaman, Ali; Jhangiani, Shalini N; Gibbs, Richard A; Lupski, James R

    2014-09-01

    Whole-exome sequencing (WES) is a type of disruptive technology that has tremendous influence on human and clinical genetics research. An efficient and cost-effective method, WES is now widely used as a diagnostic tool for identifying the molecular basis of genetic syndromes that are often challenging to diagnose. Here we report a patient with a clinical diagnosis of cerebro-facio-thoracic dysplasia (CFTD; MIM#213980) in whom we identified a homozygous splice-site mutation in the transmembrane and coiled-coil domains 1 (TMCO1) gene using WES. TMCO1 mutations cause craniofacial dysmorphism, skeletal anomalies characterized by multiple malformations of the vertebrae and ribs, and intellectual disability (MIM#614132). A retrospective review revealed that clinical manifestations of both syndromes are very similar and overlap remarkably. We propose that mutations of TMCO1 are not only responsible for craniofacial dysmorphism, skeletal anomalies and mental retardation syndrome but also for CFTD.

  4. Toward clinical genomics in everyday medicine: perspectives and recommendations.

    PubMed

    Delaney, Susan K; Hultner, Michael L; Jacob, Howard J; Ledbetter, David H; McCarthy, Jeanette J; Ball, Michael; Beckman, Kenneth B; Belmont, John W; Bloss, Cinnamon S; Christman, Michael F; Cosgrove, Andy; Damiani, Stephen A; Danis, Timothy; Delledonne, Massimo; Dougherty, Michael J; Dudley, Joel T; Faucett, W Andrew; Friedman, Jennifer R; Haase, David H; Hays, Tom S; Heilsberg, Stu; Huber, Jeff; Kaminsky, Leah; Ledbetter, Nikki; Lee, Warren H; Levin, Elissa; Libiger, Ondrej; Linderman, Michael; Love, Richard L; Magnus, David C; Martland, AnneMarie; McClure, Susan L; Megill, Scott E; Messier, Helen; Nussbaum, Robert L; Palaniappan, Latha; Patay, Bradley A; Popovich, Bradley W; Quackenbush, John; Savant, Mark J; Su, Michael M; Terry, Sharon F; Tucker, Steven; Wong, William T; Green, Robert C

    2016-01-01

    Precision or personalized medicine through clinical genome and exome sequencing has been described by some as a revolution that could transform healthcare delivery, yet it is currently used in only a small fraction of patients, principally for the diagnosis of suspected Mendelian conditions and for targeting cancer treatments. Given the burden of illness in our society, it is of interest to ask how clinical genome and exome sequencing can be constructively integrated more broadly into the routine practice of medicine for the betterment of public health. In November 2014, 46 experts from academia, industry, policy and patient advocacy gathered in a conference sponsored by Illumina, Inc. to discuss this question, share viewpoints and propose recommendations. This perspective summarizes that work and identifies some of the obstacles and opportunities that must be considered in translating advances in genomics more widely into the practice of medicine.

  5. Toward clinical genomics in everyday medicine: perspectives and recommendations

    PubMed Central

    Delaney, Susan K.; Hultner, Michael L.; Jacob, Howard J.; Ledbetter, David H.; McCarthy, Jeanette J.; Ball, Michael; Beckman, Kenneth B.; Belmont, John W.; Bloss, Cinnamon S.; Christman, Michael F.; Cosgrove, Andy; Damiani, Stephen A.; Danis, Timothy; Delledonne, Massimo; Dougherty, Michael J.; Dudley, Joel T.; Faucett, W. Andrew; Friedman, Jennifer R.; Haase, David H.; Hays, Tom S.; Heilsberg, Stu; Huber, Jeff; Kaminsky, Leah; Ledbetter, Nikki; Lee, Warren H.; Levin, Elissa; Libiger, Ondrej; Linderman, Michael; Love, Richard L.; Magnus, David C.; Martland, AnneMarie; McClure, Susan L.; Megill, Scott E.; Messier, Helen; Nussbaum, Robert L.; Palaniappan, Latha; Patay, Bradley A.; Popovich, Bradley W.; Quackenbush, John; Savant, Mark J.; Su, Michael M.; Terry, Sharon F.; Tucker, Steven; Wong, William T.; Green, Robert C.

    2016-01-01

    ABSTRACT Precision or personalized medicine through clinical genome and exome sequencing has been described by some as a revolution that could transform healthcare delivery, yet it is currently used in only a small fraction of patients, principally for the diagnosis of suspected Mendelian conditions and for targeting cancer treatments. Given the burden of illness in our society, it is of interest to ask how clinical genome and exome sequencing can be constructively integrated more broadly into the routine practice of medicine for the betterment of public health. In November 2014, 46 experts from academia, industry, policy and patient advocacy gathered in a conference sponsored by Illumina, Inc. to discuss this question, share viewpoints and propose recommendations. This perspective summarizes that work and identifies some of the obstacles and opportunities that must be considered in translating advances in genomics more widely into the practice of medicine. PMID:26810587

  6. Exome sequence analysis suggests genetic burden contributes to phenotypic variability and complex neuropathy

    PubMed Central

    Gonzaga-Jauregui, Claudia; Harel, Tamar; Gambin, Tomasz; Kousi, Maria; Griffin, Laurie B.; Francescatto, Ludmila; Ozes, Burcak; Karaca, Ender; Jhangiani, Shalini; Bainbridge, Matthew N.; Lawson, Kim S.; Pehlivan, Davut; Okamoto, Yuji; Withers, Marjorie; Mancias, Pedro; Slavotinek, Anne; Reitnauer, Pamela J; Goksungur, Meryem T.; Shy, Michael; Crawford, Thomas O.; Koenig, Michel; Willer, Jason; Flores, Brittany N.; Pediaditrakis, Igor; Us, Onder; Wiszniewski, Wojciech; Parman, Yesim; Antonellis, Anthony; Muzny, Donna M.; Katsanis, Nicholas; Battaloglu, Esra; Boerwinkle, Eric; Gibbs, Richard A.; Lupski, James R.

    2015-01-01

    Charcot-Marie-Tooth (CMT) disease is a clinically and genetically heterogeneous distal symmetric polyneuropathy. Whole-exome sequencing (WES) of 40 individuals from 37 unrelated families with CMT-like peripheral neuropathy refractory to molecular diagnosis identified apparent causal mutations in ~45% (17/37) of families. Three candidate disease genes are proposed, supported by a combination of genetic and in vivo studies. Aggregate analysis of mutation data revealed a significantly increased number of rare variants across 58 neuropathy associated genes in subjects versus controls; confirmed in a second ethnically discrete neuropathy cohort, suggesting mutation burden potentially contributes to phenotypic variability. Neuropathy genes shown to have highly penetrant Mendelizing variants (HMPVs) and implicated by burden in families were shown to interact genetically in a zebrafish assay exacerbating the phenotype established by the suppression of single genes. Our findings suggest that the combinatorial effect of rare variants contributes to disease burden and variable expressivity. PMID:26257172

  7. Whole-exome sequencing links TMCO1 defect syndrome with cerebro-facio-thoracic dysplasia

    PubMed Central

    Pehlivan, Davut; Karaca, Ender; Aydin, Hatip; Beck, Christine R; Gambin, Tomasz; Muzny, Donna M; Bilge Geckinli, B; Karaman, Ali; Jhangiani, Shalini N; Gibbs, Richard A; Lupski, James R

    2014-01-01

    Whole-exome sequencing (WES) is a type of disruptive technology that has tremendous influence on human and clinical genetics research. An efficient and cost-effective method, WES is now widely used as a diagnostic tool for identifying the molecular basis of genetic syndromes that are often challenging to diagnose. Here we report a patient with a clinical diagnosis of cerebro-facio-thoracic dysplasia (CFTD; MIM#213980) in whom we identified a homozygous splice-site mutation in the transmembrane and coiled-coil domains 1 (TMCO1) gene using WES. TMCO1 mutations cause craniofacial dysmorphism, skeletal anomalies characterized by multiple malformations of the vertebrae and ribs, and intellectual disability (MIM#614132). A retrospective review revealed that clinical manifestations of both syndromes are very similar and overlap remarkably. We propose that mutations of TMCO1 are not only responsible for craniofacial dysmorphism, skeletal anomalies and mental retardation syndrome but also for CFTD. PMID:24424126

  8. Late-onset spastic paraplegia: Aberrant SPG11 transcripts generated by a novel splice site donor mutation.

    PubMed

    Kawarai, Toshitaka; Miyamoto, Ryosuke; Mori, Atsuko; Oki, Ryosuke; Tsukamoto-Miyashiro, Ai; Matsui, Naoko; Miyazaki, Yoshimichi; Orlacchio, Antonio; Izumi, Yuishin; Nishida, Yoshihiko; Kaji, Ryuji

    2015-12-15

    We identified a novel homozygous mutation in the splice site donor (SSD) of intron 30 (c.5866+1G>A) in consanguineous Japanese SPG11 siblings showing late-onset spastic paraplegia using the whole-exome sequencing. Phenotypic variability was observed, including age-at-onset, dysarthria and pes cavus. Coding DNA sequencing revealed that the mutation affected the recognition of the constitutive SSD of intron 30, splicing upstream onto a nearby cryptic SSD in exon 30. The use of constitutive splice sites of intron 29 was confirmed by sequencing. The mutant transcripts are mostly subject to degradation by the nonsense-mediated mRNA decay system. SPG11 transcripts, escaping from the nonsense-mediated mRNA decay pathway, would generate a truncated protein (p.Tyr1900Phefs5X) containing the first 1899 amino acids and followed by 4 aberrant amino acids. This study showed a successful clinical application of whole-exome sequencing in spastic paraplegia and demonstrated a further evidence of allelic heterogeneity in SPG11. The confirmation of aberrant transcript by splice site mutation is a prerequisite for a more precise molecular diagnosis. Copyright © 2015 Elsevier B.V. All rights reserved.

  9. Impact of NGS in the medical sciences: Genetic syndromes with an increased risk of developing cancer as an example of the use of new technologies

    PubMed Central

    Lapunzina, Pablo; López, Rocío Ortiz; Rodríguez-Laguna, Lara; García-Miguel, Purificación; Martínez, Augusto Rojas; Martínez-Glez, Víctor

    2014-01-01

    The increased speed and decreasing cost of sequencing, along with an understanding of the clinical relevance of emerging information for patient management, has led to an explosion of potential applications in healthcare. Currently, SNP arrays and Next-Generation Sequencing (NGS) technologies are relatively new techniques used to scan genomes for gains and losses, losses of heterozygosity (LOH), SNPs, and indel variants as well as to perform complete sequencing of a panel of candidate genes, the entire exome (whole exome sequencing) or even the whole genome. As a result, these new high-throughput technologies have facilitated progress in the understanding and diagnosis of genetic syndromes and cancers, two disorders traditionally considered to be separate diseases but that can share causal genetic alterations in a group of developmental disorders associated with congenital malformations and cancer risk. The purpose of this work is to review these syndromes as an example of a group of disorders that has been included in a panel of genes for NGS analysis. We also highlight the relationship between development and cancer and underline the connections between these syndromes. PMID:24764758

  10. A Novel Computational Strategy to Identify A-to-I RNA Editing Sites by RNA-Seq Data: De Novo Detection in Human Spinal Cord Tissue

    PubMed Central

    Picardi, Ernesto; Gallo, Angela; Galeano, Federica; Tomaselli, Sara; Pesole, Graziano

    2012-01-01

    RNA editing is a post-transcriptional process occurring in a wide range of organisms. In human brain, the A-to-I RNA editing, in which individual adenosine (A) bases in pre-mRNA are modified to yield inosine (I), is the most frequent event. Modulating gene expression, RNA editing is essential for cellular homeostasis. Indeed, its deregulation has been linked to several neurological and neurodegenerative diseases. To date, many RNA editing sites have been identified by next generation sequencing technologies employing massive transcriptome sequencing together with whole genome or exome sequencing. While genome and transcriptome reads are not always available for single individuals, RNA-Seq data are widespread through public databases and represent a relevant source of yet unexplored RNA editing sites. In this context, we propose a simple computational strategy to identify genomic positions enriched in novel hypothetical RNA editing events by means of a new two-steps mapping procedure requiring only RNA-Seq data and no a priori knowledge of RNA editing characteristics and genomic reads. We assessed the suitability of our procedure by confirming A-to-I candidates using conventional Sanger sequencing and performing RNA-Seq as well as whole exome sequencing of human spinal cord tissue from a single individual. PMID:22957051

  11. Kassiopeia: a database and web application for the analysis of mutually exclusive exomes of eukaryotes

    PubMed Central

    2014-01-01

    Background Alternative splicing is an important process in higher eukaryotes that allows obtaining several transcripts from one gene. A specific case of alternative splicing is mutually exclusive splicing, in which exactly one exon out of a cluster of neighbouring exons is spliced into the mature transcript. Recently, a new algorithm for the prediction of these exons has been developed based on the preconditions that the exons of the cluster have similar lengths, sequence homology, and conserved splice sites, and that they are translated in the same reading frame. Description In this contribution we introduce Kassiopeia, a database and web application for the generation, storage, and presentation of genome-wide analyses of mutually exclusive exomes. Currently, Kassiopeia provides access to the mutually exclusive exomes of twelve Drosophila species, the thale cress Arabidopsis thaliana, the flatworm Caenorhabditis elegans, and human. Mutually exclusive spliced exons (MXEs) were predicted based on gene reconstructions from Scipio. Based on the standard prediction values, with which 83.5% of the annotated MXEs of Drosophila melanogaster were reconstructed, the exomes contain surprisingly more MXEs than previously supposed and identified. The user can search Kassiopeia using BLAST or browse the genes of each species optionally adjusting the parameters used for the prediction to reveal more divergent or only very similar exon candidates. Conclusions We developed a pipeline to predict MXEs in the genomes of several model organisms and a web interface, Kassiopeia, for their visualization. For each gene Kassiopeia provides a comprehensive gene structure scheme, the sequences and predicted secondary structures of the MXEs, and, if available, further evidence for MXE candidates from cDNA/EST data, predictions of MXEs in homologous genes of closely related species, and RNA secondary structure predictions. Kassiopeia can be accessed at http://www.motorprotein.de/kassiopeia. PMID:24507667

  12. Whole exome sequencing identifies novel candidate genes that modify chronic obstructive pulmonary disease susceptibility.

    PubMed

    Bruse, Shannon; Moreau, Michael; Bromberg, Yana; Jang, Jun-Ho; Wang, Nan; Ha, Hongseok; Picchi, Maria; Lin, Yong; Langley, Raymond J; Qualls, Clifford; Klensney-Tait, Julia; Zabner, Joseph; Leng, Shuguang; Mao, Jenny; Belinsky, Steven A; Xing, Jinchuan; Nyunoya, Toru

    2016-01-07

    Chronic obstructive pulmonary disease (COPD) is characterized by an irreversible airflow limitation in response to inhalation of noxious stimuli, such as cigarette smoke. However, only 15-20 % smokers manifest COPD, suggesting a role for genetic predisposition. Although genome-wide association studies have identified common genetic variants that are associated with susceptibility to COPD, effect sizes of the identified variants are modest, as is the total heritability accounted for by these variants. In this study, an extreme phenotype exome sequencing study was combined with in vitro modeling to identify COPD candidate genes. We performed whole exome sequencing of 62 highly susceptible smokers and 30 exceptionally resistant smokers to identify rare variants that may contribute to disease risk or resistance to COPD. This was a cross-sectional case-control study without therapeutic intervention or longitudinal follow-up information. We identified candidate genes based on rare variant analyses and evaluated exonic variants to pinpoint individual genes whose function was computationally established to be significantly different between susceptible and resistant smokers. Top scoring candidate genes from these analyses were further filtered by requiring that each gene be expressed in human bronchial epithelial cells (HBECs). A total of 81 candidate genes were thus selected for in vitro functional testing in cigarette smoke extract (CSE)-exposed HBECs. Using small interfering RNA (siRNA)-mediated gene silencing experiments, we showed that silencing of several candidate genes augmented CSE-induced cytotoxicity in vitro. Our integrative analysis through both genetic and functional approaches identified two candidate genes (TACC2 and MYO1E) that augment cigarette smoke (CS)-induced cytotoxicity and, potentially, COPD susceptibility.

  13. Excessive burden of lysosomal storage disorder gene variants in Parkinson's disease.

    PubMed

    Robak, Laurie A; Jansen, Iris E; van Rooij, Jeroen; Uitterlinden, André G; Kraaij, Robert; Jankovic, Joseph; Heutink, Peter; Shulman, Joshua M

    2017-12-01

    Mutations in the glucocerebrosidase gene (GBA), which cause Gaucher disease, are also potent risk factors for Parkinson's disease. We examined whether a genetic burden of variants in other lysosomal storage disorder genes is more broadly associated with Parkinson's disease susceptibility. The sequence kernel association test was used to interrogate variant burden among 54 lysosomal storage disorder genes, leveraging whole exome sequencing data from 1156 Parkinson's disease cases and 1679 control subjects. We discovered a significant burden of rare, likely damaging lysosomal storage disorder gene variants in association with Parkinson's disease risk. The association signal was robust to the exclusion of GBA, and consistent results were obtained in two independent replication cohorts, including 436 cases and 169 controls with whole exome sequencing and an additional 6713 cases and 5964 controls with exome-wide genotyping. In secondary analyses designed to highlight the specific genes driving the aggregate signal, we confirmed associations at the GBA and SMPD1 loci and newly implicate CTSD, SLC17A5, and ASAH1 as candidate Parkinson's disease susceptibility genes. In our discovery cohort, the majority of Parkinson's disease cases (56%) have at least one putative damaging variant in a lysosomal storage disorder gene, and 21% carry multiple alleles. Our results highlight several promising new susceptibility loci and reinforce the importance of lysosomal mechanisms in Parkinson's disease pathogenesis. We suggest that multiple genetic hits may act in combination to degrade lysosomal function, enhancing Parkinson's disease susceptibility. © The Author (2017). Published by Oxford University Press on behalf of the Guarantors of Brain. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  14. Whole exome sequencing in neurogenetic odysseys: An effective, cost- and time-saving diagnostic approach.

    PubMed

    Córdoba, Marta; Rodriguez-Quiroga, Sergio Alejandro; Vega, Patricia Analía; Salinas, Valeria; Perez-Maturo, Josefina; Amartino, Hernán; Vásquez-Dusefante, Cecilia; Medina, Nancy; González-Morón, Dolores; Kauffman, Marcelo Andrés

    2018-01-01

    Diagnostic trajectories for neurogenetic disorders frequently require the use of considerable time and resources, exposing patients and families to so-called "diagnostic odysseys". Previous studies have provided strong evidence for increased diagnostic and clinical utility of whole-exome sequencing in medical genetics. However, specific reports assessing its utility in a setting such as ours- a neurogeneticist led academic group serving in a low-income country-are rare. To assess the diagnostic yield of WES in patients suspected of having a neurogenetic condition and explore the cost-effectiveness of its implementation in a research group located in an Argentinean public hospital. This is a prospective study of the clinical utility of WES in a series of 40 consecutive patients selected from a Neurogenetic Clinic of a tertiary Hospital in Argentina. We evaluated patients retrospectively for previous diagnostic trajectories. Diagnostic yield, clinical impact on management and economic diagnostic burden were evaluated. We demonstrated the clinical utility of Whole Exome Sequencing in our patient cohort, obtaining a diagnostic yield of 40% (95% CI, 24.8%-55.2%) among a diverse group of neurological disorders. The average age at the time of WES was 23 (range 3-70). The mean time elapsed from symptom onset to WES was 11 years (range 3-42). The mean cost of the diagnostic workup prior to WES was USD 1646 (USD 1439 to 1853), which is 60% higher than WES cost in our center. WES for neurogenetics proved to be an effective, cost- and time-saving approach for the molecular diagnosis of this heterogeneous and complex group of patients.

  15. Exome Sequencing Identifies Three Novel Candidate Genes Implicated in Intellectual Disability

    PubMed Central

    Azam, Maleeha; Ayub, Humaira; Vissers, Lisenka E. L. M.; Gilissen, Christian; Ali, Syeda Hafiza Benish; Riaz, Moeen; Veltman, Joris A.; Pfundt, Rolph; van Bokhoven, Hans; Qamar, Raheel

    2014-01-01

    Intellectual disability (ID) is a major health problem mostly with an unknown etiology. Recently exome sequencing of individuals with ID identified novel genes implicated in the disease. Therefore the purpose of the present study was to identify the genetic cause of ID in one syndromic and two non-syndromic Pakistani families. Whole exome of three ID probands was sequenced. Missense variations in two plausible novel genes implicated in autosomal recessive ID were identified: lysine (K)-specific methyltransferase 2B (KMT2B), zinc finger protein 589 (ZNF589), as well as hedgehog acyltransferase (HHAT) with a de novo mutation with autosomal dominant mode of inheritance. The KMT2B recessive variant is the first report of recessive Kleefstra syndrome-like phenotype. Identification of plausible causative mutations for two recessive and a dominant type of ID, in genes not previously implicated in disease, underscores the large genetic heterogeneity of ID. These results also support the viewpoint that large number of ID genes converge on limited number of common networks i.e. ZNF589 belongs to KRAB-domain zinc-finger proteins previously implicated in ID, HHAT is predicted to affect sonic hedgehog, which is involved in several disorders with ID, KMT2B associated with syndromic ID fits the epigenetic module underlying the Kleefstra syndromic spectrum. The association of these novel genes in three different Pakistani ID families highlights the importance of screening these genes in more families with similar phenotypes from different populations to confirm the involvement of these genes in pathogenesis of ID. PMID:25405613

  16. Whole Exome Sequencing, Familial Genomic Triangulation, and Systems Biology Converge to Identify a Novel Nonsense Mutation in TAB2-encoded TGF-beta Activated Kinase 1 in a Child with Polyvalvular Syndrome.

    PubMed

    Ackerman, Jaeger P; Smestad, John A; Tester, David J; Qureshi, Muhammad Y; Crabb, Beau A; Mendelsohn, Nancy J; Ackerman, Michael J

    2016-09-01

    To use whole exome sequencing (WES) of a family trio to identify a genetic cause for polyvalvular syndrome. A male child was born with mild pulmonary valve stenosis and mild aortic root dilatation, and an atrial septal defect, ventricular septal defect, and patent ductus arteriosus that were closed surgically. Subsequently, the phenotype of polyvalvular syndrome with involvement of both semilunar and both atrioventricular valves emerged. His family history was negative for congenital heart disease. Because of hypotonia, myopia, soft pale skin, joint hypermobility, and mild facial dysmorphism, either Noonan syndrome- or William syndrome-spectrum disorders were suspected clinically. However, chromosomal analysis was normal and commercially available Noonan syndrome and William syndrome genetic tests were negative. Whole exome sequencing of the patient and both parents was performed. Variants were analyzed by sporadic and autosomal recessive inheritance models. A sporadic mutation, annotated as c.1491 T > A, in TAB2, resulting in a nonsense mutation, p.Y497X, in the TAB2-encoded TGF-beta activated kinase 1 (TAK1) was identified as the most likely disease-susceptibility gene. This mutation results in elimination of the terminal 197 amino acids, including the C-terminal binding motif critical for interactions with TRAF6 and TAK1. The combination of WES, genomic triangulation, and systems biology has uncovered perturbations in TGF-beta activated kinase 1 signaling as a novel pathogenic substrate for polyvalvular syndrome. © 2016 Wiley Periodicals, Inc.

  17. Exome Sequencing Establishes Diagnosis of Alström Syndrome in an Infant Presenting with Non-Syndromic Dilated Cardiomyopathy

    PubMed Central

    Long, Pamela A.; Evans, Jared M.; Olson, Timothy M.

    2015-01-01

    Idiopathic dilated cardiomyopathy is a heritable, genetically heterogeneous disorder characterized by progressive heart failure. Dilated cardiomyopathy typically exhibits autosomal dominant inheritance, yet frequently remains clinically silent until adulthood. We sought to discover the molecular basis of idiopathic, non-syndromic dilated cardiomyopathy in a one-month-old male presenting with severe heart failure. Previous comprehensive testing of blood, urine, and skin biopsy specimen was negative for metabolic, mitochondrial, storage, and infectious etiologies. Ophthalmologic examination was normal. Chromosomal microarray and commercial dilated cardiomyopathy gene panel testing failed to identify a causative mutation. Parental screening echocardiograms revealed no evidence of clinically silent dilated cardiomyopathy. Whole exome sequencing was carried out on the family trio on a research basis, filtering for rare, deleterious, recessive and de novo genetic variants. Pathogenic compound heterozygous truncating mutations were identified in ALMS1, diagnostic of Alström syndrome and prompting disclosure of genetic findings. Alström syndrome is a known cause for dilated cardiomyopathy in children yet delayed and mis-diagnosis are common owing to its rarity and age-dependent emergence of multisystem clinical manifestations. At six months of age the patient ultimately developed bilateral nystagmus and hyperopia, features characteristic of the syndrome. Early diagnosis is guiding clinical monitoring of other organ systems and allowing for presymptomatic intervention. Furthermore, recognition of recessive inheritance as the mechanism for sporadic disease has informed family planning. This case highlights a limitation of standard gene testing panels for pediatric dilated cardiomyopathy and exemplifies the potential for whole exome sequencing to solve a diagnostic dilemma and enable personalized care. PMID:25706677

  18. Post-mortem whole-exome analysis in a large sudden infant death syndrome cohort with a focus on cardiovascular and metabolic genetic diseases.

    PubMed

    Neubauer, Jacqueline; Lecca, Maria Rita; Russo, Giancarlo; Bartsch, Christine; Medeiros-Domingo, Argelia; Berger, Wolfgang; Haas, Cordula

    2017-04-01

    Sudden infant death syndrome (SIDS) is described as the sudden and unexplained death of an apparently healthy infant younger than one year of age. Genetic studies indicate that up to 35% of SIDS cases might be explained by familial or genetic diseases such as cardiomyopathies, ion channelopathies or metabolic disorders that remained undetected during conventional forensic autopsy procedures. Post-mortem genetic testing by using massive parallel sequencing (MPS) approaches represents an efficient and rapid tool to further investigate unexplained death cases and might help to elucidate pathogenic genetic variants and mechanisms in cases without a conclusive cause of death. In this study, we performed whole-exome sequencing (WES) in 161 European SIDS infants with focus on 192 genes associated with cardiovascular and metabolic diseases. Potentially causative variants were detected in 20% of the SIDS cases. The majority of infants had variants with likely functional effects in genes associated with channelopathies (9%), followed by cardiomyopathies (7%) and metabolic diseases (1%). Although lethal arrhythmia represents the most plausible and likely cause of death, the majority of SIDS cases still remains elusive and might be explained by a multifactorial etiology, triggered by a combination of different genetic and environmental risk factors. As WES is not substantially more expensive than a targeted sequencing approach, it represents an unbiased screening of the exome, which could help to investigate different pathogenic mechanisms within the genetically heterogeneous SIDS cohort. Additionally, re-analysis of the datasets provides the basis to identify new candidate genes in sudden infant death.

  19. Whole exome sequence analysis of Peters anomaly

    PubMed Central

    Weh, Eric; Reis, Linda M.; Happ, Hannah C.; Levin, Alex V.; Wheeler, Patricia G.; David, Karen L.; Carney, Erin; Angle, Brad; Hauser, Natalie

    2015-01-01

    Peters anomaly is a rare form of anterior segment ocular dysgenesis, which can also be associated with additional systemic defects. At this time, the majority of cases of Peters anomaly lack a genetic diagnosis. We performed whole exome sequencing of 27 patients with syndromic or isolated Peters anomaly to search for pathogenic mutations in currently known ocular genes. Among the eight previously recognized Peters anomaly genes, we identified a de novo missense mutation in PAX6, c.155G>A, p.(Cys52Tyr), in one patient. Analysis of 691 additional genes currently associated with a different ocular phenotype identified a heterozygous splicing mutation c.1025+2T>A in TFAP2A, a de novo heterozygous nonsense mutation c.715C>T, p.(Gln239*) in HCCS, a hemizygous mutation c.385G>A, p.(Glu129Lys) in NDP, a hemizygous mutation c.3446C>T, p.(Pro1149Leu) in FLNA, and compound heterozygous mutations c.1422T>A, p.(Tyr474*) and c.2544G>A, p.(Met848Ile) in SLC4A11; all mutations, except for the FLNA and SLC4A11 c.2544G>A alleles, are novel. This is the frst study to use whole exome sequencing to discern the genetic etiology of a large cohort of patients with syndromic or isolated Peters anomaly. We report five new genes associated with this condition and suggest screening of TFAP2A and FLNA in patients with Peters anomaly and relevant syndromic features and HCCS, NDP and SLC4A11 in patients with isolated Peters anomaly. PMID:25182519

  20. Variability of Creatine Metabolism Genes in Children with Autism Spectrum Disorder.

    PubMed

    Cameron, Jessie M; Levandovskiy, Valeriy; Roberts, Wendy; Anagnostou, Evdokia; Scherer, Stephen; Loh, Alvin; Schulze, Andreas

    2017-07-31

    Creatine deficiency syndrome (CDS) comprises three separate enzyme deficiencies with overlapping clinical presentations: arginine:glycine amidinotransferase ( GATM gene, glycine amidinotransferase), guanidinoacetate methyltransferase ( GAMT gene), and creatine transporter deficiency ( SLC6A8 gene, solute carrier family 6 member 8). CDS presents with developmental delays/regression, intellectual disability, speech and language impairment, autistic behaviour, epileptic seizures, treatment-refractory epilepsy, and extrapyramidal movement disorders; symptoms that are also evident in children with autism. The objective of the study was to test the hypothesis that genetic variability in creatine metabolism genes is associated with autism. We sequenced GATM , GAMT and SLC6A8 genes in 166 patients with autism (coding sequence, introns and adjacent untranslated regions). A total of 29, 16 and 25 variants were identified in each gene, respectively. Four variants were novel in GATM , and 5 in SLC6A8 (not present in the 1000 Genomes, Exome Sequencing Project (ESP) or Exome Aggregation Consortium (ExAC) databases). A single variant in each gene was identified as non-synonymous, and computationally predicted to be potentially damaging. Nine variants in GATM were shown to have a lower minor allele frequency (MAF) in the autism population than in the 1000 Genomes database, specifically in the East Asian population (Fisher's exact test). Two variants also had lower MAFs in the European population. In summary, there were no apparent associations of variants in GAMT and SLC6A8 genes with autism. The data implying there could be a lower association of some specific GATM gene variants with autism is an observation that would need to be corroborated in a larger group of autism patients, and with sub-populations of Asian ethnicities. Overall, our findings suggest that the genetic variability of creatine synthesis/transport is unlikely to play a part in the pathogenesis of autism spectrum disorder (ASD) in children.

  1. Whole-exome sequencing revealed two novel mutations in Usher syndrome.

    PubMed

    Koparir, Asuman; Karatas, Omer Faruk; Atayoglu, Ali Timucin; Yuksel, Bayram; Sagiroglu, Mahmut Samil; Seven, Mehmet; Ulucan, Hakan; Yuksel, Adnan; Ozen, Mustafa

    2015-06-01

    Usher syndrome is a clinically and genetically heterogeneous autosomal recessive inherited disorder accompanied by hearing loss and retinitis pigmentosa (RP). Since the associated genes are various and quite large, we utilized whole-exome sequencing (WES) as a diagnostic tool to identify the molecular basis of Usher syndrome. DNA from a 12-year-old male diagnosed with Usher syndrome was analyzed by WES. Mutations detected were confirmed by Sanger sequencing. The pathogenicity of these mutations was determined by in silico analysis. A maternally inherited deleterious frameshift mutation, c.14439_14454del in exon 66 and a paternally inherited non-sense c.10830G>A stop-gain SNV in exon 55 of USH2A were found as two novel compound heterozygous mutations. Both of these mutations disrupt the C terminal of USH2A protein. As a result, WES revealed two novel compound heterozygous mutations in a Turkish USH2A patient. This approach gave us an opportunity to have an appropriate diagnosis and provide genetic counseling to the family within a reasonable time. Copyright © 2015 Elsevier B.V. All rights reserved.

  2. Whole-exome sequencing, without prior linkage, identifies a mutation in LAMB3 as a cause of dominant hypoplastic amelogenesis imperfecta.

    PubMed

    Poulter, James A; El-Sayed, Walid; Shore, Roger C; Kirkham, Jennifer; Inglehearn, Chris F; Mighell, Alan J

    2014-01-01

    The conventional approach to identifying the defective gene in a family with an inherited disease is to find the disease locus through family studies. However, the rapid development and decreasing cost of next generation sequencing facilitates a more direct approach. Here, we report the identification of a frameshift mutation in LAMB3 as a cause of dominant hypoplastic amelogenesis imperfecta (AI). Whole-exome sequencing of three affected family members and subsequent filtering of shared variants, without prior genetic linkage, sufficed to identify the pathogenic variant. Simultaneous analysis of multiple family members confirms segregation, enhancing the power to filter the genetic variation found and leading to rapid identification of the pathogenic variant. LAMB3 encodes a subunit of Laminin-5, one of a family of basement membrane proteins with essential functions in cell growth, movement and adhesion. Homozygous LAMB3 mutations cause junctional epidermolysis bullosa (JEB) and enamel defects are seen in JEB cases. However, to our knowledge, this is the first report of dominant AI due to a LAMB3 mutation in the absence of JEB.

  3. Targeted exome sequencing of Korean triple-negative breast cancer reveals homozygous deletions associated with poor prognosis of adjuvant chemotherapy-treated patients

    PubMed Central

    Jeong, Hae Min; Kim, Ryong Nam; Kwon, Mi Jeong; Oh, Ensel; Han, Jinil; Lee, Se Kyung; Choi, Jong-Sun; Park, Sara; Nam, Seok Jin; Gong, Gyung Yup; Nam, Jin Wu; Choi, Doo Ho; Lee, Hannah; Nam, Byung-Ho; Choi, Yoon-La; Shin, Young Kee

    2017-01-01

    Triple-negative breast cancer is characterized by the absence of estrogen and progesterone receptors and human epidermal growth factor receptor 2, and is associated with a poorer outcome than other subtypes of breast cancer. Moreover, there are no accurate prognostic genes or effective therapeutic targets, thereby necessitating continued intensive investigation. This study analyzed the genetic mutation landscape in 70 patients with triple-negative breast cancer by targeted exome sequencing of tumor and matched normal samples. Sequencing showed that more than 50% of these patients had deleterious mutations and homozygous deletions of DNA repair genes, such as ATM, BRCA1, BRCA2, WRN, and CHEK2. These findings suggested that a large number of patients with triple-negative breast cancer have impaired DNA repair function and that therefore a poly ADP-ribose polymerase inhibitor may be an effective drug in the treatment of this disease. Notably, homozygous deletion of three genes, EPHA5, MITF, and ACSL3, was significantly associated with an increased risk of recurrence or distant metastasis in adjuvant chemotherapy-treated patients. PMID:28977883

  4. Identification of a Novel De Novo Variant in the PAX3 Gene in Waardenburg Syndrome by Diagnostic Exome Sequencing: The First Molecular Diagnosis in Korea.

    PubMed

    Jang, Mi-Ae; Lee, Taeheon; Lee, Junnam; Cho, Eun-Hae; Ki, Chang-Seok

    2015-05-01

    Waardenburg syndrome (WS) is a clinically and genetically heterogeneous hereditary auditory pigmentary disorder characterized by congenital sensorineural hearing loss and iris discoloration. Many genes have been linked to WS, including PAX3, MITF, SNAI2, EDNRB, EDN3, and SOX10, and many additional genes have been associated with disorders with phenotypic overlap with WS. To screen all possible genes associated with WS and congenital deafness simultaneously, we performed diagnostic exome sequencing (DES) in a male patient with clinical features consistent with WS. Using DES, we identified a novel missense variant (c.220C>G; p.Arg74Gly) in exon 2 of the PAX3 gene in the patient. Further analysis by Sanger sequencing of the patient and his parents revealed a de novo occurrence of the variant. Our findings show that DES can be a useful tool for the identification of pathogenic gene variants in WS patients and for differentiation between WS and similar disorders. To the best of our knowledge, this is the first report of genetically confirmed WS in Korea.

  5. Exome-wide DNA capture and next generation sequencing in domestic and wild species.

    PubMed

    Cosart, Ted; Beja-Pereira, Albano; Chen, Shanyuan; Ng, Sarah B; Shendure, Jay; Luikart, Gordon

    2011-07-05

    Gene-targeted and genome-wide markers are crucial to advance evolutionary biology, agriculture, and biodiversity conservation by improving our understanding of genetic processes underlying adaptation and speciation. Unfortunately, for eukaryotic species with large genomes it remains costly to obtain genome sequences and to develop genome resources such as genome-wide SNPs. A method is needed to allow gene-targeted, next-generation sequencing that is flexible enough to include any gene or number of genes, unlike transcriptome sequencing. Such a method would allow sequencing of many individuals, avoiding ascertainment bias in subsequent population genetic analyses.We demonstrate the usefulness of a recent technology, exon capture, for genome-wide, gene-targeted marker discovery in species with no genome resources. We use coding gene sequences from the domestic cow genome sequence (Bos taurus) to capture (enrich for), and subsequently sequence, thousands of exons of B. taurus, B. indicus, and Bison bison (wild bison). Our capture array has probes for 16,131 exons in 2,570 genes, including 203 candidate genes with known function and of interest for their association with disease and other fitness traits. We successfully sequenced and mapped exon sequences from across the 29 autosomes and X chromosome in the B. taurus genome sequence. Exon capture and high-throughput sequencing identified thousands of putative SNPs spread evenly across all reference chromosomes, in all three individuals, including hundreds of SNPs in our targeted candidate genes. This study shows exon capture can be customized for SNP discovery in many individuals and for non-model species without genomic resources. Our captured exome subset was small enough for affordable next-generation sequencing, and successfully captured exons from a divergent wild species using the domestic cow genome as reference.

  6. Whole Exome Sequencing Identifies New Host Genomic Susceptibility Factors in Empyema Caused by Streptococcus pneumoniae in Children: A Pilot Study.

    PubMed

    Salas, Antonio; Pardo-Seco, Jacobo; Barral-Arca, Ruth; Cebey-López, Miriam; Gómez-Carballa, Alberto; Rivero-Calle, Irene; Pischedda, Sara; Currás-Tuala, María-José; Amigo, Jorge; Gómez-Rial, José; Martinón-Torres, Federico

    2018-05-03

    Pneumonia is the leading cause of death amongst infectious diseases. Streptococcus pneumoniae is responsible for about 25% of pneumonia cases worldwide, and it is a major cause of childhood mortality. We carried out a whole exome sequencing (WES) study in eight patients with complicated cases of pneumococcal pneumonia (empyema). An initial assessment of statistical association of WES variation with pneumonia was carried out using data from the 1000 Genomes Project (1000G) for the Iberian Peninsula (IBS) as reference controls. Pseudo-replication statistical analyses were carried out using different European control groups. Association tests pointed to single nucleotide polymorphism (SNP) rs201967957 (gene MEIS1 ; chromosome 2; p -value IBS = 3.71 × 10 -13 ) and rs576099063 (gene TSPAN15 ; chromosome 10; p -value IBS = 2.36 × 10 -8 ) as the best candidate variants associated to pneumococcal pneumonia. A burden gene test of pathogenicity signaled four genes, namely, OR9G9 , MUC6 , MUC3A and APOB , which carry significantly increased pathogenic variation when compared to controls. By analyzing various transcriptomic data repositories, we found strong supportive evidence for the role of MEIS1, TSPAN15 and APOBR (encoding the receptor of the APOB protein) in pneumonia in mouse and human models. Furthermore, the association of the olfactory receptor gene OR9G9 has recently been related to some viral infectious diseases, while the role of mucin genes ( MUC6 and MUC3A ), encoding mucin glycoproteins, are well-known factors related to chronic obstructive airway disease. WES emerges as a promising technique to disentangle the genetic basis of host genome susceptibility to infectious respiratory diseases.

  7. PUF60 variants cause a syndrome of ID, short stature, microcephaly, coloboma, craniofacial, cardiac, renal and spinal features

    PubMed Central

    Low, Karen J; Ansari, Morad; Abou Jamra, Rami; Clarke, Angus; El Chehadeh, Salima; FitzPatrick, David R; Greenslade, Mark; Henderson, Alex; Hurst, Jane; Keller, Kory; Kuentz, Paul; Prescott, Trine; Roessler, Franziska; Selmer, Kaja K; Schneider, Michael C; Stewart, Fiona; Tatton-Brown, Katrina; Thevenon, Julien; Vigeland, Magnus D; Vogt, Julie; Willems, Marjolaine; Zonana, Jonathan; Study, D D D; Smithson, Sarah F

    2017-01-01

    PUF60 encodes a nucleic acid-binding protein, a component of multimeric complexes regulating RNA splicing and transcription. In 2013, patients with microdeletions of chromosome 8q24.3 including PUF60 were found to have developmental delay, microcephaly, craniofacial, renal and cardiac defects. Very similar phenotypes have been described in six patients with variants in PUF60, suggesting that it underlies the syndrome. We report 12 additional patients with PUF60 variants who were ascertained using exome sequencing: six through the Deciphering Developmental Disorders Study and six through similar projects. Detailed phenotypic analysis of all patients was undertaken. All 12 patients had de novo heterozygous PUF60 variants on exome analysis, each confirmed by Sanger sequencing: four frameshift variants resulting in premature stop codons, three missense variants that clustered within the RNA recognition motif of PUF60 and five essential splice-site (ESS) variant. Analysis of cDNA from a fibroblast cell line derived from one of the patients with an ESS variants revealed aberrant splicing. The consistent feature was developmental delay and most patients had short stature. The phenotypic variability was striking; however, we observed similarities including spinal segmentation anomalies, congenital heart disease, ocular colobomata, hand anomalies and (in two patients) unilateral renal agenesis/horseshoe kidney. Characteristic facial features included micrognathia, a thin upper lip and long philtrum, narrow almond-shaped palpebral fissures, synophrys, flared eyebrows and facial hypertrichosis. Heterozygote loss-of-function variants in PUF60 cause a phenotype comprising growth/developmental delay and craniofacial, cardiac, renal, ocular and spinal anomalies, adding to disorders of human development resulting from aberrant RNA processing/spliceosomal function. PMID:28327570

  8. Exome sequencing of oral squamous cell carcinoma in users of Arabian snuff reveals novel candidates for driver genes.

    PubMed

    Al-Hebshi, Nezar Noor; Li, Shiyong; Nasher, Akram Thabet; El-Setouhy, Maged; Alsanosi, Rashad; Blancato, Jan; Loffredo, Christopher

    2016-07-15

    The study sought to identify genetic aberrations driving oral squamous cell carcinoma (OSCC) development among users of shammah, an Arabian preparation of smokeless tobacco. Twenty archival OSCC samples, 15 of which with a history of shammah exposure, were whole-exome sequenced at an average depth of 127×. Somatic mutations were identified using a novel, matched controls-independent filtration algorithm. CODEX and Exomedepth coupled with a novel, Database of Genomic Variant-based filter were employed to call somatic gene-copy number variations. Significantly mutated genes were identified with Oncodrive FM and the Youn and Simon's method. Candidate driver genes were nominated based on Gene Set Enrichment Analysis. The observed mutational spectrum was similar to that reported by the TCGA project. In addition to confirming known genes of OSCC (TP53, CDKNA2, CASP8, PIK3CA, HRAS, FAT1, TP63, CCND1 and FADD) the analysis identified several candidate novel driver events including mutations of NOTCH3, CSMD3, CRB1, CLTCL1, OSMR and TRPM2, amplification of the proto-oncogenes FOSL1, RELA, TRAF6, MDM2, FRS2 and BAG1, and deletion of the recently described tumor suppressor SMARCC1. Analysis also revealed significantly altered pathways not previously implicated in OSCC including Oncostatin-M signalling pathway, AP-1 and C-MYB transcription networks and endocytosis. There was a trend for higher number of mutations, amplifications and driver events in samples with history of shammah exposure particularly those that tested EBV positive, suggesting an interaction between tobacco exposure and EBV. The work provides further evidence for the genetic heterogeneity of oral cancer and suggests shammah-associated OSCC is characterized by extensive amplification of oncogenes. © 2016 UICC.

  9. Whole Exome Sequencing Identifies New Host Genomic Susceptibility Factors in Empyema Caused by Streptococcus pneumoniae in Children: A Pilot Study

    PubMed Central

    Salas, Antonio; Barral-Arca, Ruth; Cebey-López, Miriam; Pischedda, Sara; Currás-Tuala, María-José; Gómez-Rial, José

    2018-01-01

    Pneumonia is the leading cause of death amongst infectious diseases. Streptococcus pneumoniae is responsible for about 25% of pneumonia cases worldwide, and it is a major cause of childhood mortality. We carried out a whole exome sequencing (WES) study in eight patients with complicated cases of pneumococcal pneumonia (empyema). An initial assessment of statistical association of WES variation with pneumonia was carried out using data from the 1000 Genomes Project (1000G) for the Iberian Peninsula (IBS) as reference controls. Pseudo-replication statistical analyses were carried out using different European control groups. Association tests pointed to single nucleotide polymorphism (SNP) rs201967957 (gene MEIS1; chromosome 2; p-valueIBS = 3.71 × 10−13) and rs576099063 (gene TSPAN15; chromosome 10; p-valueIBS = 2.36 × 10−8) as the best candidate variants associated to pneumococcal pneumonia. A burden gene test of pathogenicity signaled four genes, namely, OR9G9, MUC6, MUC3A and APOB, which carry significantly increased pathogenic variation when compared to controls. By analyzing various transcriptomic data repositories, we found strong supportive evidence for the role of MEIS1, TSPAN15 and APOBR (encoding the receptor of the APOB protein) in pneumonia in mouse and human models. Furthermore, the association of the olfactory receptor gene OR9G9 has recently been related to some viral infectious diseases, while the role of mucin genes (MUC6 and MUC3A), encoding mucin glycoproteins, are well-known factors related to chronic obstructive airway disease. WES emerges as a promising technique to disentangle the genetic basis of host genome susceptibility to infectious respiratory diseases. PMID:29751582

  10. Comprehensive benchmarking of SNV callers for highly admixed tumor data

    PubMed Central

    Bohnert, Regina; Vivas, Sonia

    2017-01-01

    Precision medicine attempts to individualize cancer therapy by matching tumor-specific genetic changes with effective targeted therapies. A crucial first step in this process is the reliable identification of cancer-relevant variants, which is considerably complicated by the impurity and heterogeneity of clinical tumor samples. We compared the impact of admixture of non-cancerous cells and low somatic allele frequencies on the sensitivity and precision of 19 state-of-the-art SNV callers. We studied both whole exome and targeted gene panel data and up to 13 distinct parameter configurations for each tool. We found vast differences among callers. Based on our comprehensive analyses we recommend joint tumor-normal calling with MuTect, EBCall or Strelka for whole exome somatic variant calling, and HaplotypeCaller or FreeBayes for whole exome germline calling. For targeted gene panel data on a single tumor sample, LoFreqStar performed best. We further found that tumor impurity and admixture had a negative impact on precision, and in particular, sensitivity in whole exome experiments. At admixture levels of 60% to 90% sometimes seen in pathological biopsies, sensitivity dropped significantly, even when variants were originally present in the tumor at 100% allele frequency. Sensitivity to low-frequency SNVs improved with targeted panel data, but whole exome data allowed more efficient identification of germline variants. Effective somatic variant calling requires high-quality pathological samples with minimal admixture, a consciously selected sequencing strategy, and the appropriate variant calling tool with settings optimized for the chosen type of data. PMID:29020110

  11. Effectiveness of exome and genome sequencing guided by acuity of illness for diagnosis of neurodevelopmental disorders

    PubMed Central

    Soden, Sarah E.; Saunders, Carol J.; Willig, Laurel K.; Farrow, Emily G.; Smith, Laurie D.; Petrikin, Josh E.; LePichon, Jean-Baptiste; Miller, Neil A.; Thiffault, Isabelle; Dinwiddie, Darrell L.; Twist, Greyson; Noll, Aaron; Heese, Bryce A.; Zellmer, Lee; Atherton, Andrea M.; Abdelmoity, Ahmed T.; Safina, Nicole; Nyp, Sarah S.; Zuccarelli, Britton; Larson, Ingrid A.; Modrcin, Ann; Herd, Suzanne; Creed, Mitchell; Ye, Zhaohui; Yuan, Xuan; Brodsky, Robert A.; Kingsmore, Stephen F.

    2014-01-01

    Neurodevelopmental disorders (NDDs) affect more than 3% of children and are attributable to single-gene mutations at more than 1000 loci. Traditional methods yield molecular diagnoses in less than one-half of children with NDD. Whole-genome sequencing (WGS) and whole-exome sequencing (WES) can enable diagnosis of NDD, but their clinical and cost-effectiveness are unknown. One hundred families with 119 children affected by NDD received diagnostic WGS and/or WES of parent-child trios, wherein the sequencing approach was guided by acuity of illness. Forty-five percent received molecular diagnoses. An accelerated sequencing modality, rapid WGS, yielded diagnoses in 73% of families with acutely ill children (11 of 15). Forty percent of families with children with nonacute NDD, followed in ambulatory care clinics (34 of 85), received diagnoses: 33 by WES and 1 by staged WES then WGS. The cost of prior negative tests in the nonacute patients was $19,100 per family, suggesting sequencing to be cost-effective at up to $7640 per family. A change in clinical care or impression of the pathophysiology was reported in 49% of newly diagnosed families. If WES or WGS had been performed at symptom onset, genomic diagnoses may have been made 77 months earlier than occurred in this study. It is suggested that initial diagnostic evaluation of children with NDD should include trio WGS or WES, with extension of accelerated sequencing modalities to high-acuity patients. PMID:25473036

  12. Genetics Home Reference: 3-M syndrome

    MedlinePlus

    ... complex, interfering with the process of tagging unneeded proteins for degradation. The body's response to growth hormones may be ... Kirk J, Chandler K, Kingston H, Donnai D, Clayton PE, Black GC. Exome sequencing identifies CCDC8 mutations in 3-M syndrome, suggesting ...

  13. A scalable, fully automated process for construction of sequence-ready human exome targeted capture libraries

    PubMed Central

    2011-01-01

    Genome targeting methods enable cost-effective capture of specific subsets of the genome for sequencing. We present here an automated, highly scalable method for carrying out the Solution Hybrid Selection capture approach that provides a dramatic increase in scale and throughput of sequence-ready libraries produced. Significant process improvements and a series of in-process quality control checkpoints are also added. These process improvements can also be used in a manual version of the protocol. PMID:21205303

  14. Compound heterozygous alterations in intraflagellar transport protein CLUAP1 in a child with a novel Joubert and oral-facial-digital overlap syndrome.

    PubMed

    Johnston, Jennifer J; Lee, Chanjae; Wentzensen, Ingrid M; Parisi, Melissa A; Crenshaw, Molly M; Sapp, Julie C; Gross, Jeffrey M; Wallingford, John B; Biesecker, Leslie G

    2017-07-01

    Disruption of normal ciliary function results in a range of diseases collectively referred to as ciliopathies. Here we report a child with a phenotype that overlapped with Joubert, oral-facial-digital, and Pallister-Hall syndromes including brain, limb, and craniofacial anomalies. We performed exome-sequence analysis on a proband and both parents, filtered for putative causative variants, and Sanger-verified variants of interest. Identified variants in CLUAP1 were functionally analyzed in a Xenopus system to determine their effect on ciliary function. Two variants in CLUAP1 were identified through exome-sequence analysis, Chr16:g.3558407T>G, c.338T>G, p.(Met113Arg) and Chr16:g.3570011C>T, c.688C>T, p.(Arg230Ter). These variants were rare in the Exome Aggregation Consortium (ExAC) data set of 65,000 individuals (one and two occurrences, respectively). Transfection of mutant CLUAP1 constructs into Xenopus embryos showed reduced protein levels p.(Arg230Ter) and reduced intraflagellar transport p.(Met113Arg). The genetic data show that these variants are present in an affected child, are rare in the population, and result in reduced, but not absent, intraflagellar transport. We conclude that biallelic mutations in CLUAP1 resulted in this novel ciliopathy syndrome in the proband. © 2017 Johnston et al.; Published by Cold Spring Harbor Laboratory Press.

  15. Compound heterozygous alterations in intraflagellar transport protein CLUAP1 in a child with a novel Joubert and oral–facial–digital overlap syndrome

    PubMed Central

    Johnston, Jennifer J.; Lee, Chanjae; Wentzensen, Ingrid M.; Parisi, Melissa A.; Crenshaw, Molly M.; Sapp, Julie C.; Gross, Jeffrey M.; Wallingford, John B.; Biesecker, Leslie G.

    2017-01-01

    Disruption of normal ciliary function results in a range of diseases collectively referred to as ciliopathies. Here we report a child with a phenotype that overlapped with Joubert, oral–facial–digital, and Pallister–Hall syndromes including brain, limb, and craniofacial anomalies. We performed exome-sequence analysis on a proband and both parents, filtered for putative causative variants, and Sanger-verified variants of interest. Identified variants in CLUAP1 were functionally analyzed in a Xenopus system to determine their effect on ciliary function. Two variants in CLUAP1 were identified through exome-sequence analysis, Chr16:g.3558407T>G, c.338T>G, p.(Met113Arg) and Chr16:g.3570011C>T, c.688C>T, p.(Arg230Ter). These variants were rare in the Exome Aggregation Consortium (ExAC) data set of 65,000 individuals (one and two occurrences, respectively). Transfection of mutant CLUAP1 constructs into Xenopus embryos showed reduced protein levels p.(Arg230Ter) and reduced intraflagellar transport p.(Met113Arg). The genetic data show that these variants are present in an affected child, are rare in the population, and result in reduced, but not absent, intraflagellar transport. We conclude that biallelic mutations in CLUAP1 resulted in this novel ciliopathy syndrome in the proband. PMID:28679688

  16. AUDIOME: a tiered exome sequencing-based comprehensive gene panel for the diagnosis of heterogeneous nonsyndromic sensorineural hearing loss.

    PubMed

    Guan, Qiaoning; Balciuniene, Jorune; Cao, Kajia; Fan, Zhiqian; Biswas, Sawona; Wilkens, Alisha; Gallo, Daniel J; Bedoukian, Emma; Tarpinian, Jennifer; Jayaraman, Pushkala; Sarmady, Mahdi; Dulik, Matthew; Santani, Avni; Spinner, Nancy; Abou Tayoun, Ahmad N; Krantz, Ian D; Conlin, Laura K; Luo, Minjie

    2018-03-29

    PurposeHereditary hearing loss is highly heterogeneous. To keep up with rapidly emerging disease-causing genes, we developed the AUDIOME test for nonsyndromic hearing loss (NSHL) using an exome sequencing (ES) platform and targeted analysis for the curated genes.MethodsA tiered strategy was implemented for this test. Tier 1 includes combined Sanger and targeted deletion analyses of the two most common NSHL genes and two mitochondrial genes. Nondiagnostic tier 1 cases are subjected to ES and array followed by targeted analysis of the remaining AUDIOME genes.ResultsES resulted in good coverage of the selected genes with 98.24% of targeted bases at >15 ×. A fill-in strategy was developed for the poorly covered regions, which generally fell within GC-rich or highly homologous regions. Prospective testing of 33 patients with NSHL revealed a diagnosis in 11 (33%) and a possible diagnosis in 8 cases (24.2%). Among those, 10 individuals had variants in tier 1 genes. The ES data in the remaining nondiagnostic cases are readily available for further analysis.ConclusionThe tiered and ES-based test provides an efficient and cost-effective diagnostic strategy for NSHL, with the potential to reflex to full exome to identify causal changes outside of the AUDIOME test.Genetics in Medicine advance online publication, 29 March 2018; doi:10.1038/gim.2018.48.

  17. X-exome sequencing identifies a HDAC8 variant in a large pedigree with X-linked intellectual disability, truncal obesity, gynaecomastia, hypogonadism and unusual face.

    PubMed

    Harakalova, Magdalena; van den Boogaard, Marie-Jose; Sinke, Richard; van Lieshout, Stef; van Tuil, Marc C; Duran, Karen; Renkens, Ivo; Terhal, Paulien A; de Kovel, Carolien; Nijman, Ies J; van Haelst, Mieke; Knoers, Nine V A M; van Haaften, Gijs; Kloosterman, Wigard; Hennekam, Raoul C M; Cuppen, Edwin; Ploos van Amstel, Hans Kristian

    2012-08-01

    We present a large Dutch family with seven males affected by a novel syndrome of X-linked intellectual disability, hypogonadism, gynaecomastia, truncal obesity, short stature and recognisable craniofacial manifestations resembling but not identical to Wilson-Turner syndrome. Seven female relatives show a much milder expression of the phenotype. We performed X chromosome exome (X-exome) sequencing in five individuals from this family and identified a novel intronic variant in the histone deacetylase 8 gene (HDAC8), c.164+5G>A, which disturbs the normal splicing of exon 2 resulting in exon skipping, and introduces a premature stop at the beginning of the histone deacetylase catalytic domain. The identified variant completely segregates in this family and was absent in 96 Dutch controls and available databases. Affected female carriers showed a notably skewed X-inactivation pattern in lymphocytes in which the mutated X-chromosome was completely inactivated. HDAC8 is a member of the protein family of histone deacetylases that play a major role in epigenetic gene silencing during development. HDAC8 specifically controls the patterning of the skull with the mouse HDAC8 knock-out showing craniofacial deformities of the skull. The present family provides the first evidence for involvement of HDAC8 in a syndromic form of intellectual disability.

  18. 76 FR 49777 - Government-Owned Inventions; Availability for Licensing

    Federal Register 2010, 2011, 2012, 2013, 2014

    2011-08-11

    ... Treatment of Melanoma Description of Technology: Using whole-exome sequencing of matched normal and.../ transcription domain-associated protein (TRRAP) gene, found the glutamate receptor ionotropic N-methyl D... therapeutic proteins that target this pathway. Potential Commercial Applications: Diagnostic array for the...

  19. Genetics Home Reference: MEGDEL syndrome

    MedlinePlus

    ... MEGDEL syndrome lead to little or no SERAC1 protein function. As a result, phosphatidylglycerol remodeling is impaired, which ... E, Jiménez-Almazán J, Dopazo J, Briones P, Elpeleg O, Ribes A. Exome sequencing identifies a new mutation in SERAC1 in a ...

  20. Genetics Home Reference: anauxetic dysplasia

    MedlinePlus

    ... one gene that provides instructions for making a protein component of the RNase MRP enzyme complex can also cause anauxetic ... A, Donskoi M, Kenna TJ, Thomas GP, Clark GR, Duncan EL, Brown MA. Whole-exome re-sequencing in a family quartet identifies POP1 mutations as ...

  1. Looking beyond the exome: a phenotype-first approach to molecular diagnostic resolution in rare and undiagnosed diseases.

    PubMed

    Pena, Loren D M; Jiang, Yong-Hui; Schoch, Kelly; Spillmann, Rebecca C; Walley, Nicole; Stong, Nicholas; Rapisardo Horn, Sarah; Sullivan, Jennifer A; McConkie-Rosell, Allyn; Kansagra, Sujay; Smith, Edward C; El-Dairi, Mays; Bellet, Jane; Keels, Martha Ann; Jasien, Joan; Kranz, Peter G; Noel, Richard; Nagaraj, Shashi K; Lark, Robert K; Wechsler, Daniel S G; Del Gaudio, Daniela; Leung, Marco L; Hendon, Laura G; Parker, Collette C; Jones, Kelly L; Goldstein, David B; Shashi, Vandana

    2018-04-01

    PurposeTo describe examples of missed pathogenic variants on whole-exome sequencing (WES) and the importance of deep phenotyping for further diagnostic testing.MethodsGuided by phenotypic information, three children with negative WES underwent targeted single-gene testing.ResultsIndividual 1 had a clinical diagnosis consistent with infantile systemic hyalinosis, although WES and a next-generation sequencing (NGS)-based ANTXR2 test were negative. Sanger sequencing of ANTXR2 revealed a homozygous single base pair insertion, previously missed by the WES variant caller software. Individual 2 had neurodevelopmental regression and cerebellar atrophy, with no diagnosis on WES. New clinical findings prompted Sanger sequencing and copy number testing of PLA2G6. A novel homozygous deletion of the noncoding exon 1 (not included in the WES capture kit) was detected, with extension into the promoter, confirming the clinical suspicion of infantile neuroaxonal dystrophy. Individual 3 had progressive ataxia, spasticity, and magnetic resonance image changes of vanishing white matter leukoencephalopathy. An NGS leukodystrophy gene panel and WES showed a heterozygous pathogenic variant in EIF2B5; no deletions/duplications were detected. Sanger sequencing of EIF2B5 showed a frameshift indel, probably missed owing to failure of alignment.ConclusionThese cases illustrate potential pitfalls of WES/NGS testing and the importance of phenotype-guided molecular testing in yielding diagnoses.

  2. Whole-Exome Sequencing to Identify Novel Biological Pathways Associated With Infertility After Pelvic Inflammatory Disease.

    PubMed

    Taylor, Brandie D; Zheng, Xiaojing; Darville, Toni; Zhong, Wujuan; Konganti, Kranti; Abiodun-Ojo, Olayinka; Ness, Roberta B; O'Connell, Catherine M; Haggerty, Catherine L

    2017-01-01

    Ideal management of sexually transmitted infections (STI) may require risk markers for pathology or vaccine development. Previously, we identified common genetic variants associated with chlamydial pelvic inflammatory disease (PID) and reduced fecundity. As this explains only a proportion of the long-term morbidity risk, we used whole-exome sequencing to identify biological pathways that may be associated with STI-related infertility. We obtained stored DNA from 43 non-Hispanic black women with PID from the PID Evaluation and Clinical Health Study. Infertility was assessed at a mean of 84 months. Principal component analysis revealed no population stratification. Potential covariates did not significantly differ between groups. Sequencing kernel association test was used to examine associations between aggregates of variants on a single gene and infertility. The results from the sequencing kernel association test were used to choose "focus genes" (P < 0.01; n = 150) for subsequent Ingenuity Pathway Analysis to identify "gene sets" that are enriched in biologically relevant pathways. Pathway analysis revealed that focus genes were enriched in canonical pathways including, IL-1 signaling, P2Y purinergic receptor signaling, and bone morphogenic protein signaling. Focus genes were enriched in pathways that impact innate and adaptive immunity, protein kinase A activity, cellular growth, and DNA repair. These may alter host resistance or immunopathology after infection. Targeted sequencing of biological pathways identified in this study may provide insight into STI-related infertility.

  3. Lessons learned from whole exome sequencing in multiplex families affected by a complex genetic disorder, intracranial aneurysm.

    PubMed

    Farlow, Janice L; Lin, Hai; Sauerbeck, Laura; Lai, Dongbing; Koller, Daniel L; Pugh, Elizabeth; Hetrick, Kurt; Ling, Hua; Kleinloog, Rachel; van der Vlies, Pieter; Deelen, Patrick; Swertz, Morris A; Verweij, Bon H; Regli, Luca; Rinkel, Gabriel J E; Ruigrok, Ynte M; Doheny, Kimberly; Liu, Yunlong; Broderick, Joseph; Foroud, Tatiana

    2015-01-01

    Genetic risk factors for intracranial aneurysm (IA) are not yet fully understood. Genomewide association studies have been successful at identifying common variants; however, the role of rare variation in IA susceptibility has not been fully explored. In this study, we report the use of whole exome sequencing (WES) in seven densely-affected families (45 individuals) recruited as part of the Familial Intracranial Aneurysm study. WES variants were prioritized by functional prediction, frequency, predicted pathogenicity, and segregation within families. Using these criteria, 68 variants in 68 genes were prioritized across the seven families. Of the genes that were expressed in IA tissue, one gene (TMEM132B) was differentially expressed in aneurysmal samples (n=44) as compared to control samples (n=16) (false discovery rate adjusted p-value=0.023). We demonstrate that sequencing of densely affected families permits exploration of the role of rare variants in a relatively common disease such as IA, although there are important study design considerations for applying sequencing to complex disorders. In this study, we explore methods of WES variant prioritization, including the incorporation of unaffected individuals, multipoint linkage analysis, biological pathway information, and transcriptome profiling. Further studies are needed to validate and characterize the set of variants and genes identified in this study.

  4. In silico Derivation of HLA-Specific Alloreactivity Potential from Whole Exome Sequencing of Stem-Cell Transplant Donors and Recipients: Understanding the Quantitative Immunobiology of Allogeneic Transplantation

    PubMed Central

    Jameson-Lee, Max; Koparde, Vishal; Griffith, Phil; Scalora, Allison F.; Sampson, Juliana K.; Khalid, Haniya; Sheth, Nihar U.; Batalo, Michael; Serrano, Myrna G.; Roberts, Catherine H.; Hess, Michael L.; Buck, Gregory A.; Neale, Michael C.; Manjili, Masoud H.; Toor, Amir Ahmed

    2014-01-01

    Donor T-cell mediated graft versus host (GVH) effects may result from the aggregate alloreactivity to minor histocompatibility antigens (mHA) presented by the human leukocyte antigen (HLA) molecules in each donor–recipient pair undergoing stem-cell transplantation (SCT). Whole exome sequencing has previously demonstrated a large number of non-synonymous single nucleotide polymorphisms (SNP) present in HLA-matched recipients of SCT donors (GVH direction). The nucleotide sequence flanking each of these SNPs was obtained and the amino acid sequence determined. All the possible nonameric peptides incorporating the variant amino acid resulting from these SNPs were interrogated in silico for their likelihood to be presented by the HLA class I molecules using the Immune Epitope Database stabilized matrix method (SMM) and NetMHCpan algorithms. The SMM algorithm predicted that a median of 18,396 peptides weakly bound HLA class I molecules in individual SCT recipients, and 2,254 peptides displayed strong binding. A similar library of presented peptides was identified when the data were interrogated using the NetMHCpan algorithm. The bioinformatic algorithm presented here demonstrates that there may be a high level of mHA variation in HLA-matched individuals, constituting a HLA-specific alloreactivity potential. PMID:25414699

  5. In silico Derivation of HLA-Specific Alloreactivity Potential from Whole Exome Sequencing of Stem-Cell Transplant Donors and Recipients: Understanding the Quantitative Immunobiology of Allogeneic Transplantation.

    PubMed

    Jameson-Lee, Max; Koparde, Vishal; Griffith, Phil; Scalora, Allison F; Sampson, Juliana K; Khalid, Haniya; Sheth, Nihar U; Batalo, Michael; Serrano, Myrna G; Roberts, Catherine H; Hess, Michael L; Buck, Gregory A; Neale, Michael C; Manjili, Masoud H; Toor, Amir Ahmed

    2014-01-01

    Donor T-cell mediated graft versus host (GVH) effects may result from the aggregate alloreactivity to minor histocompatibility antigens (mHA) presented by the human leukocyte antigen (HLA) molecules in each donor-recipient pair undergoing stem-cell transplantation (SCT). Whole exome sequencing has previously demonstrated a large number of non-synonymous single nucleotide polymorphisms (SNP) present in HLA-matched recipients of SCT donors (GVH direction). The nucleotide sequence flanking each of these SNPs was obtained and the amino acid sequence determined. All the possible nonameric peptides incorporating the variant amino acid resulting from these SNPs were interrogated in silico for their likelihood to be presented by the HLA class I molecules using the Immune Epitope Database stabilized matrix method (SMM) and NetMHCpan algorithms. The SMM algorithm predicted that a median of 18,396 peptides weakly bound HLA class I molecules in individual SCT recipients, and 2,254 peptides displayed strong binding. A similar library of presented peptides was identified when the data were interrogated using the NetMHCpan algorithm. The bioinformatic algorithm presented here demonstrates that there may be a high level of mHA variation in HLA-matched individuals, constituting a HLA-specific alloreactivity potential.

  6. Exome Sequencing Identified a Recessive RDH12 Mutation in a Family with Severe Early-Onset Retinitis Pigmentosa

    PubMed Central

    Gong, Bo; Wei, Bo; Huang, Lulin; Hao, Jilong; Li, Xiulan; Yang, Yin; Zhou, Yu; Hao, Fang; Cui, Zhihua; Zhang, Dingding; Wang, Le

    2015-01-01

    Retinitis pigmentosa (RP) is the most important hereditary retinal disease caused by progressive degeneration of the photoreceptor cells. This study is to identify gene mutations responsible for autosomal recessive retinitis pigmentosa (arRP) in a Chinese family using next-generation sequencing technology. A Chinese family with 7 members including two individuals affected with severe early-onset RP was studied. All patients underwent a complete ophthalmic examination. Exome sequencing was performed on a single RP patient (the proband of this family) and direct Sanger sequencing on other family members and normal controls was followed to confirm the causal mutations. A homozygous mutation c.437T

  7. BALSA: integrated secondary analysis for whole-genome and whole-exome sequencing, accelerated by GPU.

    PubMed

    Luo, Ruibang; Wong, Yiu-Lun; Law, Wai-Chun; Lee, Lap-Kei; Cheung, Jeanno; Liu, Chi-Man; Lam, Tak-Wah

    2014-01-01

    This paper reports an integrated solution, called BALSA, for the secondary analysis of next generation sequencing data; it exploits the computational power of GPU and an intricate memory management to give a fast and accurate analysis. From raw reads to variants (including SNPs and Indels), BALSA, using just a single computing node with a commodity GPU board, takes 5.5 h to process 50-fold whole genome sequencing (∼750 million 100 bp paired-end reads), or just 25 min for 210-fold whole exome sequencing. BALSA's speed is rooted at its parallel algorithms to effectively exploit a GPU to speed up processes like alignment, realignment and statistical testing. BALSA incorporates a 16-genotype model to support the calling of SNPs and Indels and achieves competitive variant calling accuracy and sensitivity when compared to the ensemble of six popular variant callers. BALSA also supports efficient identification of somatic SNVs and CNVs; experiments showed that BALSA recovers all the previously validated somatic SNVs and CNVs, and it is more sensitive for somatic Indel detection. BALSA outputs variants in VCF format. A pileup-like SNAPSHOT format, while maintaining the same fidelity as BAM in variant calling, enables efficient storage and indexing, and facilitates the App development of downstream analyses. BALSA is available at: http://sourceforge.net/p/balsa.

  8. A survey of current practices for genomic sequencing test interpretation and reporting processes in US laboratories.

    PubMed

    O'Daniel, Julianne M; McLaughlin, Heather M; Amendola, Laura M; Bale, Sherri J; Berg, Jonathan S; Bick, David; Bowling, Kevin M; Chao, Elizabeth C; Chung, Wendy K; Conlin, Laura K; Cooper, Gregory M; Das, Soma; Deignan, Joshua L; Dorschner, Michael O; Evans, James P; Ghazani, Arezou A; Goddard, Katrina A; Gornick, Michele; Farwell Hagman, Kelly D; Hambuch, Tina; Hegde, Madhuri; Hindorff, Lucia A; Holm, Ingrid A; Jarvik, Gail P; Knight Johnson, Amy; Mighion, Lindsey; Morra, Massimo; Plon, Sharon E; Punj, Sumit; Richards, C Sue; Santani, Avni; Shirts, Brian H; Spinner, Nancy B; Tang, Sha; Weck, Karen E; Wolf, Susan M; Yang, Yaping; Rehm, Heidi L

    2017-05-01

    While the diagnostic success of genomic sequencing expands, the complexity of this testing should not be overlooked. Numerous laboratory processes are required to support the identification, interpretation, and reporting of clinically significant variants. This study aimed to examine the workflow and reporting procedures among US laboratories to highlight shared practices and identify areas in need of standardization. Surveys and follow-up interviews were conducted with laboratories offering exome and/or genome sequencing to support a research program or for routine clinical services. The 73-item survey elicited multiple choice and free-text responses that were later clarified with phone interviews. Twenty-one laboratories participated. Practices highly concordant across all groups included consent documentation, multiperson case review, and enabling patient opt-out of incidental or secondary findings analysis. Noted divergence included use of phenotypic data to inform case analysis and interpretation and reporting of case-specific quality metrics and methods. Few laboratory policies detailed procedures for data reanalysis, data sharing, or patient access to data. This study provides an overview of practices and policies of experienced exome and genome sequencing laboratories. The results enable broader consideration of which practices are becoming standard approaches, where divergence remains, and areas of development in best practice guidelines that may be helpful.Genet Med advance online publication 03 Novemeber 2016.

  9. REVEL: An Ensemble Method for Predicting the Pathogenicity of Rare Missense Variants.

    PubMed

    Ioannidis, Nilah M; Rothstein, Joseph H; Pejaver, Vikas; Middha, Sumit; McDonnell, Shannon K; Baheti, Saurabh; Musolf, Anthony; Li, Qing; Holzinger, Emily; Karyadi, Danielle; Cannon-Albright, Lisa A; Teerlink, Craig C; Stanford, Janet L; Isaacs, William B; Xu, Jianfeng; Cooney, Kathleen A; Lange, Ethan M; Schleutker, Johanna; Carpten, John D; Powell, Isaac J; Cussenot, Olivier; Cancel-Tassin, Geraldine; Giles, Graham G; MacInnis, Robert J; Maier, Christiane; Hsieh, Chih-Lin; Wiklund, Fredrik; Catalona, William J; Foulkes, William D; Mandal, Diptasri; Eeles, Rosalind A; Kote-Jarai, Zsofia; Bustamante, Carlos D; Schaid, Daniel J; Hastie, Trevor; Ostrander, Elaine A; Bailey-Wilson, Joan E; Radivojac, Predrag; Thibodeau, Stephen N; Whittemore, Alice S; Sieh, Weiva

    2016-10-06

    The vast majority of coding variants are rare, and assessment of the contribution of rare variants to complex traits is hampered by low statistical power and limited functional data. Improved methods for predicting the pathogenicity of rare coding variants are needed to facilitate the discovery of disease variants from exome sequencing studies. We developed REVEL (rare exome variant ensemble learner), an ensemble method for predicting the pathogenicity of missense variants on the basis of individual tools: MutPred, FATHMM, VEST, PolyPhen, SIFT, PROVEAN, MutationAssessor, MutationTaster, LRT, GERP, SiPhy, phyloP, and phastCons. REVEL was trained with recently discovered pathogenic and rare neutral missense variants, excluding those previously used to train its constituent tools. When applied to two independent test sets, REVEL had the best overall performance (p < 10 -12 ) as compared to any individual tool and seven ensemble methods: MetaSVM, MetaLR, KGGSeq, Condel, CADD, DANN, and Eigen. Importantly, REVEL also had the best performance for distinguishing pathogenic from rare neutral variants with allele frequencies <0.5%. The area under the receiver operating characteristic curve (AUC) for REVEL was 0.046-0.182 higher in an independent test set of 935 recent SwissVar disease variants and 123,935 putatively neutral exome sequencing variants and 0.027-0.143 higher in an independent test set of 1,953 pathogenic and 2,406 benign variants recently reported in ClinVar than the AUCs for other ensemble methods. We provide pre-computed REVEL scores for all possible human missense variants to facilitate the identification of pathogenic variants in the sea of rare variants discovered as sequencing studies expand in scale. Copyright © 2016 American Society of Human Genetics. All rights reserved.

  10. Characterization of X Chromosome Inactivation Using Integrated Analysis of Whole-Exome and mRNA Sequencing

    PubMed Central

    Szelinger, Szabolcs; Malenica, Ivana; Corneveaux, Jason J.; Siniard, Ashley L.; Kurdoglu, Ahmet A.; Ramsey, Keri M.; Schrauwen, Isabelle; Trent, Jeffrey M.; Narayanan, Vinodh; Huentelman, Matthew J.; Craig, David W.

    2014-01-01

    In females, X chromosome inactivation (XCI) is an epigenetic, gene dosage compensatory mechanism by inactivation of one copy of X in cells. Random XCI of one of the parental chromosomes results in an approximately equal proportion of cells expressing alleles from either the maternally or paternally inherited active X, and is defined by the XCI ratio. Skewed XCI ratio is suggestive of non-random inactivation, which can play an important role in X-linked genetic conditions. Current methods rely on indirect, semi-quantitative DNA methylation-based assay to estimate XCI ratio. Here we report a direct approach to estimate XCI ratio by integrated, family-trio based whole-exome and mRNA sequencing using phase-by-transmission of alleles coupled with allele-specific expression analysis. We applied this method to in silico data and to a clinical patient with mild cognitive impairment but no clear diagnosis or understanding molecular mechanism underlying the phenotype. Simulation showed that phased and unphased heterozygous allele expression can be used to estimate XCI ratio. Segregation analysis of the patient's exome uncovered a de novo, interstitial, 1.7 Mb deletion on Xp22.31 that originated on the paternally inherited X and previously been associated with heterogeneous, neurological phenotype. Phased, allelic expression data suggested an 83∶20 moderately skewed XCI that favored the expression of the maternally inherited, cytogenetically normal X and suggested that the deleterious affect of the de novo event on the paternal copy may be offset by skewed XCI that favors expression of the wild-type X. This study shows the utility of integrated sequencing approach in XCI ratio estimation. PMID:25503791

  11. Clinical characterization and diagnosis of cystic fibrosis through exome sequencing in Chinese infants with Bartter-syndrome-like hypokalemia alkalosis.

    PubMed

    Qiu, Liru; Yang, Fengjie; He, Yonghua; Yuan, Huiqing; Zhou, Jianhua

    2018-03-09

    Cystic fibrosis (CF) is a fatal autosomal-recessive disease caused by mutations in the CF transmembrane conductance regulator (CFTR) gene. CF is characterized by recurrent pulmonary infection with obstructive pulmonary disease. CF is common in the Caucasian population but is rare in the Chinese population. The symptoms of early-stage CF are often untypical and may sometimes manifest as Bartter syndrome (BS)-like hypokalemic alkalosis. Therefore, the ability of doctors to differentiate CF from BS-like hypokalemic alkalosis in Chinese infants is a great challenge in the timely and accurate diagnosis of CF. In China, sporadic CF has not been diagnosed in children younger than three years of age to date. Three infants, who were initially admitted to our hospital over the period of June 2013 to September 2014 with BS-like hypokalemic alkalosis, were diagnosed with CF through exome sequencing and sweat chloride measurement. The compound heterozygous mutations of the CFTR gene were detected in two infants, and a homozygous missense mutation was found in one infant. Among the six identified mutations, two are novel point mutations (c.1526G > C and c.3062C > T) that are possibly pathogenic. The three infants are the youngest Chinese patients to have been diagnosed with sporadic CF at a very early stage. Follow-up examination showed that all of the cases remained symptom-free after early intervention, indicating the potential benefit of very early diagnosis and timely intervention in children with CF. Our results demonstrate the necessity of distinguishing CF from BS in Chinese infants with hypokalemic alkalosis and the significant diagnostic value of powerful exome sequencing for rare genetic diseases. Furthermore, our findings expand the CFTR mutation spectrum associated with CF.

  12. Whole exome sequencing in neurogenetic odysseys: An effective, cost- and time-saving diagnostic approach

    PubMed Central

    Córdoba, Marta; Rodriguez-Quiroga, Sergio Alejandro; Vega, Patricia Analía; Salinas, Valeria; Perez-Maturo, Josefina; Amartino, Hernán; Vásquez-Dusefante, Cecilia; Medina, Nancy; González-Morón, Dolores; Kauffman, Marcelo Andrés

    2018-01-01

    Background Diagnostic trajectories for neurogenetic disorders frequently require the use of considerable time and resources, exposing patients and families to so-called “diagnostic odysseys”. Previous studies have provided strong evidence for increased diagnostic and clinical utility of whole-exome sequencing in medical genetics. However, specific reports assessing its utility in a setting such as ours- a neurogeneticist led academic group serving in a low-income country—are rare. Objectives To assess the diagnostic yield of WES in patients suspected of having a neurogenetic condition and explore the cost-effectiveness of its implementation in a research group located in an Argentinean public hospital. Methods This is a prospective study of the clinical utility of WES in a series of 40 consecutive patients selected from a Neurogenetic Clinic of a tertiary Hospital in Argentina. We evaluated patients retrospectively for previous diagnostic trajectories. Diagnostic yield, clinical impact on management and economic diagnostic burden were evaluated. Results We demonstrated the clinical utility of Whole Exome Sequencing in our patient cohort, obtaining a diagnostic yield of 40% (95% CI, 24.8%-55.2%) among a diverse group of neurological disorders. The average age at the time of WES was 23 (range 3–70). The mean time elapsed from symptom onset to WES was 11 years (range 3–42). The mean cost of the diagnostic workup prior to WES was USD 1646 (USD 1439 to 1853), which is 60% higher than WES cost in our center. Conclusions WES for neurogenetics proved to be an effective, cost- and time-saving approach for the molecular diagnosis of this heterogeneous and complex group of patients. PMID:29389947

  13. IMPACT: a whole-exome sequencing analysis pipeline for integrating molecular profiles with actionable therapeutics in clinical samples

    PubMed Central

    Hintzsche, Jennifer; Kim, Jihye; Yadav, Vinod; Amato, Carol; Robinson, Steven E; Seelenfreund, Eric; Shellman, Yiqun; Wisell, Joshua; Applegate, Allison; McCarter, Martin; Box, Neil; Tentler, John; De, Subhajyoti

    2016-01-01

    Objective Currently, there is a disconnect between finding a patient’s relevant molecular profile and predicting actionable therapeutics. Here we develop and implement the Integrating Molecular Profiles with Actionable Therapeutics (IMPACT) analysis pipeline, linking variants detected from whole-exome sequencing (WES) to actionable therapeutics. Methods and materials The IMPACT pipeline contains 4 analytical modules: detecting somatic variants, calling copy number alterations, predicting drugs against deleterious variants, and analyzing tumor heterogeneity. We tested the IMPACT pipeline on whole-exome sequencing data in The Cancer Genome Atlas (TCGA) lung adenocarcinoma samples with known EGFR mutations. We also used IMPACT to analyze melanoma patient tumor samples before treatment, after BRAF-inhibitor treatment, and after BRAF- and MEK-inhibitor treatment. Results IMPACT Food and Drug Administration (FDA) correctly identified known EGFR mutations in the TCGA lung adenocarcinoma samples. IMPACT linked these EGFR mutations to the appropriate FDA-approved EGFR inhibitors. For the melanoma patient samples, we identified NRAS p.Q61K as an acquired resistance mutation to BRAF-inhibitor treatment. We also identified CDKN2A deletion as a novel acquired resistance mutation to BRAFi/MEKi inhibition. The IMPACT analysis pipeline predicts these somatic variants to actionable therapeutics. We observed the clonal dynamic in the tumor samples after various treatments. We showed that IMPACT not only helped in successful prioritization of clinically relevant variants but also linked these variations to possible targeted therapies. Conclusion IMPACT provides a new bioinformatics strategy to delineate candidate somatic variants and actionable therapies. This approach can be applied to other patient tumor samples to discover effective drug targets for personalized medicine. IMPACT is publicly available at http://tanlab.ucdenver.edu/IMPACT. PMID:27026619

  14. Whole Genome SNP Genotyping and Exome Sequencing Reveal Novel Genetic Variants and Putative Causative Genes in Congenital Hyperinsulinism

    PubMed Central

    Proverbio, Maria Carla; Mangano, Eleonora; Gessi, Alessandra; Bordoni, Roberta; Spinelli, Roberta; Asselta, Rosanna; Valin, Paola Sogno; Di Candia, Stefania; Zamproni, Ilaria; Diceglie, Cecilia; Mora, Stefano; Caruso-Nicoletti, Manuela; Salvatoni, Alessandro; De Bellis, Gianluca; Battaglia, Cristina

    2013-01-01

    Congenital hyperinsulinism of infancy (CHI) is a rare disorder characterized by severe hypoglycemia due to inappropriate insulin secretion. The genetic causes of CHI have been found in genes regulating insulin secretion from pancreatic β-cells; recessive inactivating mutations in the ABCC8 and KCNJ11 genes represent the most common events. Despite the advances in understanding the molecular pathogenesis of CHI, specific genetic determinants in about 50 % of the CHI patients remain unknown, suggesting additional locus heterogeneity. In order to search for novel loci contributing to the pathogenesis of CHI, we combined a family-based association study, using the transmission disequilibrium test on 17 CHI patients lacking mutations in ABCC8/KCNJ11, with a whole-exome sequencing analysis performed on 10 probands. This strategy allowed the identification of the potential causative mutations in genes implicated in the regulation of insulin secretion such as transmembrane proteins (CACNA1A, KCNH6, KCNJ10, NOTCH2, RYR3, SCN8A, TRPV3, TRPC5), cytosolic (ACACB, CAMK2D, CDKAL1, GNAS, NOS2, PDE4C, PIK3R3) and mitochondrial enzymes (PC, SLC24A6), and in four genes (CSMD1, SLC37A3, SULF1, TLL1) suggested by TDT family-based association study. Moreover, the exome-sequencing approach resulted to be an efficient diagnostic tool for CHI, allowing the identification of mutations in three causative CHI genes (ABCC8, GLUD1, and HNF1A) in four out of 10 patients. Overall, the present study should be considered as a starting point to design further investigations: our results might indeed contribute to meta-analysis studies, aimed at the identification/confirmation of novel causative or modifier genes. PMID:23869231

  15. Identification of Inherited Retinal Disease-Associated Genetic Variants in 11 Candidate Genes.

    PubMed

    Astuti, Galuh D N; van den Born, L Ingeborgh; Khan, M Imran; Hamel, Christian P; Bocquet, Béatrice; Manes, Gaël; Quinodoz, Mathieu; Ali, Manir; Toomes, Carmel; McKibbin, Martin; El-Asrag, Mohammed E; Haer-Wigman, Lonneke; Inglehearn, Chris F; Black, Graeme C M; Hoyng, Carel B; Cremers, Frans P M; Roosing, Susanne

    2018-01-10

    Inherited retinal diseases (IRDs) display an enormous genetic heterogeneity. Whole exome sequencing (WES) recently identified genes that were mutated in a small proportion of IRD cases. Consequently, finding a second case or family carrying pathogenic variants in the same candidate gene often is challenging. In this study, we searched for novel candidate IRD gene-associated variants in isolated IRD families, assessed their causality, and searched for novel genotype-phenotype correlations. Whole exome sequencing was performed in 11 probands affected with IRDs. Homozygosity mapping data was available for five cases. Variants with minor allele frequencies ≤ 0.5% in public databases were selected as candidate disease-causing variants. These variants were ranked based on their: (a) presence in a gene that was previously implicated in IRD; (b) minor allele frequency in the Exome Aggregation Consortium database (ExAC); (c) in silico pathogenicity assessment using the combined annotation dependent depletion (CADD) score; and (d) interaction of the corresponding protein with known IRD-associated proteins. Twelve unique variants were found in 11 different genes in 11 IRD probands. Novel autosomal recessive and dominant inheritance patterns were found for variants in Small Nuclear Ribonucleoprotein U5 Subunit 200 ( SNRNP200 ) and Zinc Finger Protein 513 ( ZNF513 ), respectively. Using our pathogenicity assessment, a variant in DEAH-Box Helicase 32 ( DHX32 ) was the top ranked novel candidate gene to be associated with IRDs, followed by eight medium and lower ranked candidate genes. The identification of candidate disease-associated sequence variants in 11 single families underscores the notion that the previously identified IRD-associated genes collectively carry > 90% of the defects implicated in IRDs. To identify multiple patients or families with variants in the same gene and thereby provide extra proof for pathogenicity, worldwide data sharing is needed.

  16. Evaluation of somatic copy number estimation tools for whole-exome sequencing data.

    PubMed

    Nam, Jae-Yong; Kim, Nayoung K D; Kim, Sang Cheol; Joung, Je-Gun; Xi, Ruibin; Lee, Semin; Park, Peter J; Park, Woong-Yang

    2016-03-01

    Whole-exome sequencing (WES) has become a standard method for detecting genetic variants in human diseases. Although the primary use of WES data has been the identification of single nucleotide variations and indels, these data also offer a possibility of detecting copy number variations (CNVs) at high resolution. However, WES data have uneven read coverage along the genome owing to the target capture step, and the development of a robust WES-based CNV tool is challenging. Here, we evaluate six WES somatic CNV detection tools: ADTEx, CONTRA, Control-FREEC, EXCAVATOR, ExomeCNV and Varscan2. Using WES data from 50 kidney chromophobe, 50 bladder urothelial carcinoma, and 50 stomach adenocarcinoma patients from The Cancer Genome Atlas, we compared the CNV calls from the six tools with a reference CNV set that was identified by both single nucleotide polymorphism array 6.0 and whole-genome sequencing data. We found that these algorithms gave highly variable results: visual inspection reveals significant differences between the WES-based segmentation profiles and the reference profile, as well as among the WES-based profiles. Using a 50% overlap criterion, 13-77% of WES CNV calls were covered by CNVs from the reference set, up to 21% of the copy gains were called as losses or vice versa, and dramatic differences in CNV sizes and CNV numbers were observed. Overall, ADTEx and EXCAVATOR had the best performance with relatively high precision and sensitivity. We suggest that the current algorithms for somatic CNV detection from WES data are limited in their performance and that more robust algorithms are needed. © The Author 2015. Published by Oxford University Press. For Permissions, please email: journals.permissions@oup.com.

  17. Exome Sequencing Identifies Mitochondrial Alanyl-tRNA Synthetase Mutations in Infantile Mitochondrial Cardiomyopathy

    PubMed Central

    Götz, Alexandra; Tyynismaa, Henna; Euro, Liliya; Ellonen, Pekka; Hyötyläinen, Tuulia; Ojala, Tiina; Hämäläinen, Riikka H.; Tommiska, Johanna; Raivio, Taneli; Oresic, Matej; Karikoski, Riitta; Tammela, Outi; Simola, Kalle O.J.; Paetau, Anders; Tyni, Tiina; Suomalainen, Anu

    2011-01-01

    Infantile cardiomyopathies are devastating fatal disorders of the neonatal period or the first year of life. Mitochondrial dysfunction is a common cause of this group of diseases, but the underlying gene defects have been characterized in only a minority of cases, because tissue specificity of the manifestation hampers functional cloning and the heterogeneity of causative factors hinders collection of informative family materials. We sequenced the exome of a patient who died at the age of 10 months of hypertrophic mitochondrial cardiomyopathy with combined cardiac respiratory chain complex I and IV deficiency. Rigorous data analysis allowed us to identify a homozygous missense mutation in AARS2, which we showed to encode the mitochondrial alanyl-tRNA synthetase (mtAlaRS). Two siblings from another family, both of whom died perinatally of hypertrophic cardiomyopathy, had the same mutation, compound heterozygous with another missense mutation. Protein structure modeling of mtAlaRS suggested that one of the mutations affected a unique tRNA recognition site in the editing domain, leading to incorrect tRNA aminoacylation, whereas the second mutation severely disturbed the catalytic function, preventing tRNA aminoacylation. We show here that mutations in AARS2 cause perinatal or infantile cardiomyopathy with near-total combined mitochondrial respiratory chain deficiency in the heart. Our results indicate that exome sequencing is a powerful tool for identifying mutations in single patients and allows recognition of the genetic background in single-gene disorders of variable clinical manifestation and tissue-specific disease. Furthermore, we show that mitochondrial disorders extend to prenatal life and are an important cause of early infantile cardiac failure. PMID:21549344

  18. IMPACT: a whole-exome sequencing analysis pipeline for integrating molecular profiles with actionable therapeutics in clinical samples.

    PubMed

    Hintzsche, Jennifer; Kim, Jihye; Yadav, Vinod; Amato, Carol; Robinson, Steven E; Seelenfreund, Eric; Shellman, Yiqun; Wisell, Joshua; Applegate, Allison; McCarter, Martin; Box, Neil; Tentler, John; De, Subhajyoti; Robinson, William A; Tan, Aik Choon

    2016-07-01

    Currently, there is a disconnect between finding a patient's relevant molecular profile and predicting actionable therapeutics. Here we develop and implement the Integrating Molecular Profiles with Actionable Therapeutics (IMPACT) analysis pipeline, linking variants detected from whole-exome sequencing (WES) to actionable therapeutics. The IMPACT pipeline contains 4 analytical modules: detecting somatic variants, calling copy number alterations, predicting drugs against deleterious variants, and analyzing tumor heterogeneity. We tested the IMPACT pipeline on whole-exome sequencing data in The Cancer Genome Atlas (TCGA) lung adenocarcinoma samples with known EGFR mutations. We also used IMPACT to analyze melanoma patient tumor samples before treatment, after BRAF-inhibitor treatment, and after BRAF- and MEK-inhibitor treatment. IMPACT Food and Drug Administration (FDA) correctly identified known EGFR mutations in the TCGA lung adenocarcinoma samples. IMPACT linked these EGFR mutations to the appropriate FDA-approved EGFR inhibitors. For the melanoma patient samples, we identified NRAS p.Q61K as an acquired resistance mutation to BRAF-inhibitor treatment. We also identified CDKN2A deletion as a novel acquired resistance mutation to BRAFi/MEKi inhibition. The IMPACT analysis pipeline predicts these somatic variants to actionable therapeutics. We observed the clonal dynamic in the tumor samples after various treatments. We showed that IMPACT not only helped in successful prioritization of clinically relevant variants but also linked these variations to possible targeted therapies. IMPACT provides a new bioinformatics strategy to delineate candidate somatic variants and actionable therapies. This approach can be applied to other patient tumor samples to discover effective drug targets for personalized medicine.IMPACT is publicly available at http://tanlab.ucdenver.edu/IMPACT. © The Author 2016. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  19. Genetic compendium of 1511 human brains available through the UK Medical Research Council Brain Banks Network Resource.

    PubMed

    Keogh, Michael J; Wei, Wei; Wilson, Ian; Coxhead, Jon; Ryan, Sarah; Rollinson, Sara; Griffin, Helen; Kurzawa-Akanbi, Marzena; Santibanez-Koref, Mauro; Talbot, Kevin; Turner, Martin R; McKenzie, Chris-Anne; Troakes, Claire; Attems, Johannes; Smith, Colin; Al Sarraj, Safa; Morris, Chris M; Ansorge, Olaf; Pickering-Brown, Stuart; Ironside, James W; Chinnery, Patrick F

    2017-01-01

    Given the central role of genetic factors in the pathogenesis of common neurodegenerative disorders, it is critical that mechanistic studies in human tissue are interpreted in a genetically enlightened context. To address this, we performed exome sequencing and copy number variant analysis on 1511 frozen human brains with a diagnosis of Alzheimer's disease (AD, n = 289), frontotemporal dementia/amyotrophic lateral sclerosis (FTD/ALS, n = 252), Creutzfeldt-Jakob disease (CJD, n = 239), Parkinson's disease (PD, n = 39), dementia with Lewy bodies (DLB, n = 58), other neurodegenerative, vascular, or neurogenetic disorders (n = 266), and controls with no significant neuropathology (n = 368). Genomic DNA was extracted from brain tissue in all cases before exome sequencing (Illumina Nextera 62 Mb capture) with variants called by FreeBayes; copy number variant (CNV) analysis (Illumina HumanOmniExpress-12 BeadChip); C9orf72 repeat expansion detection; and APOE genotyping. Established or likely pathogenic heterozygous, compound heterozygous, or homozygous variants, together with the C9orf72 hexanucleotide repeat expansions and a copy number gain of APP, were found in 61 brains. In addition to known risk alleles in 349 brains (23.9% of 1461 undergoing exome sequencing), we saw an association between rare variants in GRN and DLB. Rare CNVs were found in <1.5% of brains, including copy number gains of PRPH that were overrepresented in AD. Clinical, pathological, and genetic data are available, enabling the retrieval of specific frozen brains through the UK Medical Research Council Brain Banks Network. This allows direct access to pathological and control human brain tissue based on an individual's genetic architecture, thus enabling the functional validation of known genetic risk factors and potentially pathogenic alleles identified in future studies. © 2017 Keogh et al.; Published by Cold Spring Harbor Laboratory Press.

  20. [Pharmacogenomics study of 620 whole-exome sequencing: focusing on aspirin application].

    PubMed

    Yang, L; Lu, Y L; Wang, H J; Zhou, W H

    2016-05-01

    To investigate the allele frequencies of aspirin-response-related variants in different population. The allele frequencies of reported clinically significant aspirin-response-related variants were evaluated based on 620 whole exome sequencing (WES) data collected from 2013 to 2016 in Children's Hospital of Fudan University.Then the local allele frequencies were compared with 1 000 Genomes project database, and χ(2) test was used. Thirty-eight aspirin-response-related variants that had clinical significance had been detected in the 620 WES data.Ten (26%) of them were related with drug efficacy while 28 (74%) were related with toxicity or adverse drug reaction (ADR). These variants were distributed in 33 genes.There were 23 aspirin-related variants further analysised, and the frequency of 7 (rs1050891, rs6065, rs7862221, rs1065776, rs3818822, rs3775291 and rs1126643) had no significant difference compared with frequency of European and East Asian population of 1 000 Genome project (P>0.01 for both), 10 (rs2228079, rs1613662, rs4523, rs28360521, rs1131882, rs1047626, rs3856806, rs2768759, rs7572857 and rs1126510) of them had no significant difference compared with East Asian but were significantly different from European population, 1 (rs2075797) had no significant difference compared with frequency of European and different with frequency of East Asian, and 5 variants(rs10279545, rs730012, rs16851030, rs1353411, rs1800469)were different from frequency of both East Asian(0.019, 0.058, 0.167, 0.452, 0.340 vs. 0.100, 0.151, 0.396, 0.568, 0.453, χ(2)=21.798, 20.400, 67.543, 16.531, 15.807, P all<0.01) and European population(0.531, 0.312, 0.037, 0.179, 0.688, χ(2)=325.799, 92.877, 144.811, 156.471, 174.533, P all<0.01). Most variants that have clinical significance in aspirin response are related with drug efficacy or drug toxicity or ADR, indicating the urgency of variants screen in clinical practice.Significant population-specificity is detected in local 620 WES data in aspirin-response-related variants.

  1. Lessons learned from additional research analyses of unsolved clinical exome cases.

    PubMed

    Eldomery, Mohammad K; Coban-Akdemir, Zeynep; Harel, Tamar; Rosenfeld, Jill A; Gambin, Tomasz; Stray-Pedersen, Asbjørg; Küry, Sébastien; Mercier, Sandra; Lessel, Davor; Denecke, Jonas; Wiszniewski, Wojciech; Penney, Samantha; Liu, Pengfei; Bi, Weimin; Lalani, Seema R; Schaaf, Christian P; Wangler, Michael F; Bacino, Carlos A; Lewis, Richard Alan; Potocki, Lorraine; Graham, Brett H; Belmont, John W; Scaglia, Fernando; Orange, Jordan S; Jhangiani, Shalini N; Chiang, Theodore; Doddapaneni, Harsha; Hu, Jianhong; Muzny, Donna M; Xia, Fan; Beaudet, Arthur L; Boerwinkle, Eric; Eng, Christine M; Plon, Sharon E; Sutton, V Reid; Gibbs, Richard A; Posey, Jennifer E; Yang, Yaping; Lupski, James R

    2017-03-21

    Given the rarity of most single-gene Mendelian disorders, concerted efforts of data exchange between clinical and scientific communities are critical to optimize molecular diagnosis and novel disease gene discovery. We designed and implemented protocols for the study of cases for which a plausible molecular diagnosis was not achieved in a clinical genomics diagnostic laboratory (i.e. unsolved clinical exomes). Such cases were recruited to a research laboratory for further analyses, in order to potentially: (1) accelerate novel disease gene discovery; (2) increase the molecular diagnostic yield of whole exome sequencing (WES); and (3) gain insight into the genetic mechanisms of disease. Pilot project data included 74 families, consisting mostly of parent-offspring trios. Analyses performed on a research basis employed both WES from additional family members and complementary bioinformatics approaches and protocols. Analysis of all possible modes of Mendelian inheritance, focusing on both single nucleotide variants (SNV) and copy number variant (CNV) alleles, yielded a likely contributory variant in 36% (27/74) of cases. If one includes candidate genes with variants identified within a single family, a potential contributory variant was identified in a total of ~51% (38/74) of cases enrolled in this pilot study. The molecular diagnosis was achieved in 30/63 trios (47.6%). Besides this, the analysis workflow yielded evidence for pathogenic variants in disease-associated genes in 4/6 singleton cases (66.6%), 1/1 multiplex family involving three affected siblings, and 3/4 (75%) quartet families. Both the analytical pipeline and the collaborative efforts between the diagnostic and research laboratories provided insights that allowed recent disease gene discoveries (PURA, TANGO2, EMC1, GNB5, ATAD3A, and MIPEP) and increased the number of novel genes, defined in this study as genes identified in more than one family (DHX30 and EBF3). An efficient genomics pipeline in which clinical sequencing in a diagnostic laboratory is followed by the detailed reanalysis of unsolved cases in a research environment, supplemented with WES data from additional family members, and subject to adjuvant bioinformatics analyses including relaxed variant filtering parameters in informatics pipelines, can enhance the molecular diagnostic yield and provide mechanistic insights into Mendelian disorders. Implementing these approaches requires collaborative clinical molecular diagnostic and research efforts.

  2. Genetics Home Reference: gray platelet syndrome

    MedlinePlus

    ... granules leak out into the bone marrow. The proteins lead to fibrosis that affects the bone marrow's ability to make new blood ... PA, Stephens J, Voss K, Nurden A, Rendon A, Nurden P, Ouwehand WH. Exome sequencing identifies NBEAL2 as the causative gene for gray ...

  3. Lessons learned from implementing a national infrastructure in Sweden for storage and analysis of next-generation sequencing data

    PubMed Central

    2013-01-01

    Analyzing and storing data and results from next-generation sequencing (NGS) experiments is a challenging task, hampered by ever-increasing data volumes and frequent updates of analysis methods and tools. Storage and computation have grown beyond the capacity of personal computers and there is a need for suitable e-infrastructures for processing. Here we describe UPPNEX, an implementation of such an infrastructure, tailored to the needs of data storage and analysis of NGS data in Sweden serving various labs and multiple instruments from the major sequencing technology platforms. UPPNEX comprises resources for high-performance computing, large-scale and high-availability storage, an extensive bioinformatics software suite, up-to-date reference genomes and annotations, a support function with system and application experts as well as a web portal and support ticket system. UPPNEX applications are numerous and diverse, and include whole genome-, de novo- and exome sequencing, targeted resequencing, SNP discovery, RNASeq, and methylation analysis. There are over 300 projects that utilize UPPNEX and include large undertakings such as the sequencing of the flycatcher and Norwegian spruce. We describe the strategic decisions made when investing in hardware, setting up maintenance and support, allocating resources, and illustrate major challenges such as managing data growth. We conclude with summarizing our experiences and observations with UPPNEX to date, providing insights into the successful and less successful decisions made. PMID:23800020

  4. Whole exome sequencing identifies a POLRID mutation segregating in a father and two daughters with findings of Klippel-Feil and Treacher Collins syndromes.

    PubMed

    Giampietro, Philip F; Armstrong, Linlea; Stoddard, Alex; Blank, Robert D; Livingston, Janet; Raggio, Cathy L; Rasmussen, Kristen; Pickart, Michael; Lorier, Rachel; Turner, Amy; Sund, Sarah; Sobrera, Nara; Neptune, Enid; Sweetser, David; Santiago-Cornier, Alberto; Broeckel, Ulrich

    2015-01-01

    We report on a father and his two daughters diagnosed with Klippel-Feil syndrome (KFS) but with craniofacial differences (zygomatic and mandibular hypoplasia and cleft palate) and external ear abnormalities suggestive of Treacher Collins syndrome (TCS). The diagnosis of KFS was favored, given that the neck anomalies were the predominant manifestations, and that the diagnosis predated later recognition of the association between spinal segmentation abnormalities and TCS. Genetic heterogeneity and the rarity of large families with KFS have limited the ability to identify mutations by traditional methods. Whole exome sequencing identified a nonsynonymous mutation in POLR1D (subunit of RNA polymerase I and II): exon2:c.T332C:p.L111P. Mutations in POLR1D are present in about 5% of individuals diagnosed with TCS. We propose that this mutation is causal in this family, suggesting a pathogenetic link between KFS and TCS. © 2014 Wiley Periodicals, Inc.

  5. Myopathic mtDNA Depletion Syndrome Due to Mutation in TK2 Gene.

    PubMed

    Martín-Hernández, Elena; García-Silva, María Teresa; Quijada-Fraile, Pilar; Rodríguez-García, María Elena; Hernández-Laín, Aurelio; Coca-Robinot, David; Rivera, Henry; Fernández-Toral, Joaquín; Arenas, Joaquín; Martín, MiguelÁngel; Martínez-Azorín, Francisco

    2016-02-29

    Whole-exome sequencing (WES) was used to identify the disease gene(s) in a Spanish girl with failure to thrive, muscle weakness, mild facial weakness, elevated creatine kinase (CK), deficiency of mitochondrial complex III and depletion of mtDNA. With WES data, it was possible to get the whole mtDNA sequencing and discard any pathogenic variant in this genome. The analysis of whole exome uncovered a homozygous pathogenic mutation in Thymidine kinase 2 gene (TK2; NM_004614.4:c.323C>T, p.T108M). TK2 mutations have been identified mainly in patients with the myopathic form of mtDNA depletion syndromes (MDS). This patient presents an atypical TK2 related-myopathic form of MDS, because despite having a very low content of mtDNA (<20%), she presents a slower and less severe evolution of the disease. In conclusion, our data confirm the role of TK2 gene in MDS and expanded the phenotypic spectrum.

  6. FOXI2: a possible gene contributing to ectodermal dysplasia.

    PubMed

    Kurban, Mazen; Zeineddine, Savo Bou; Hamie, Lamiaa; Safi, Remi; Abbas, Ossama; Kibbi, Abdul Ghani; Bitar, Fadi; Nemer, Georges

    2017-12-01

    Cardio-facio-cutaneous syndrome (CFC), Noonan syndrome (NS), and Costello syndrome are a group of diseases that belong to the RASopathies. The syndromes share clinical features making diagnosis a challenge. To investigate the phenotype and genotype of a 10-year-old Iraqi girl with overlapping features of CFC, NS, and Costello syndromes, with additional features of ectodermal dysplasia. DNA was examined by exome sequencing and protein expression by immunohistochemistry. Exome sequencing identified a mutation in the SOS1 gene and a de novo deletion in the FOXI2 gene which was neither present in the international databases, nor in 400 chromosomes from the same population. Based on immunohistochemical staining, FOXI2 was identified in the basal cell layer of the skin and overlapped with the expression of P63, a major player in ectodermal dysplasia. We therefore suggest screening for FOXI2 mutation in the setting of ectodermal features that are not associated with genes known to contribute to ectodermal dysplasia.

  7. Discovery of somatic mutations in the progression of chronic myeloid leukemia by whole-exome sequencing.

    PubMed

    Huang, Y; Zheng, J; Hu, J D; Wu, Y A; Zheng, X Y; Liu, T B; Chen, F L

    2014-02-19

    We performed whole-exome sequencing in samples representing accelerated phase (AP) and blastic crisis (BC) in a subject with chronic myeloid leukemia (CML). A total of 12.74 Gb clean data were generated, achieving a mean depth coverage of 64.45 and 69.53 for AP and BC samples, respectively, of the target region. A total of 148 somatic variants were detected, including 76 insertions and deletions (indels), 64 single-nucleotide variations (SNV), and 8 structural variations (SV). On the basis of annotation and functional prediction analysis, we identified 3 SNVs and 6 SVs that showed a potential association with CML progression. Among the genes that harbor the identified variants, GATA2 has previously been reported to play important roles in the progression from AP to BC in CML. Identification of these genes will allow us to gain a better understanding of the pathological mechanism of CML and represents a critical advance toward new molecular diagnostic tests for the development of potential therapies for CML.

  8. EIF2AK4 Mutations in Pulmonary Capillary Hemangiomatosis

    PubMed Central

    Best, D. Hunter; Sumner, Kelli L.; Austin, Eric D.; Chung, Wendy K.; Brown, Lynette M.; Borczuk, Alain C.; Rosenzweig, Erika B.; Bayrak-Toydemir, Pinar; Mao, Rong; Cahill, Barbara C.; Tazelaar, Henry D.; Leslie, Kevin O.; Hemnes, Anna R.; Robbins, Ivan M.

    2014-01-01

    Background: Pulmonary capillary hemangiomatosis (PCH) is a rare disease of capillary proliferation of unknown cause and with a high mortality. Families with multiple affected individuals with PCH suggest a heritable cause although the genetic etiology remains unknown. Methods: We used exome sequencing to identify a candidate gene for PCH in a family with two affected brothers. We then screened 11 unrelated patients with familial (n = 1) or sporadic (n = 10) PCH for mutations. Results: Using exome sequencing, we identified compound mutations in eukaryotic translation initiation factor 2 α kinase 4 (EIF2AK4) (formerly known as GCN2) in both affected brothers. Both parents and an unaffected sister were heterozygous carriers. In addition, we identified two EIF2AK4 mutations in each of two of 10 unrelated individuals with sporadic PCH. EIF2AK4 belongs to a family of kinases that regulate angiogenesis in response to cellular stress. Conclusions: Mutations in EIF2AK4 are likely to cause autosomal-recessive PCH in familial and some nonfamilial cases. PMID:24135949

  9. Practices and Policies of Clinical Exome Sequencing Providers: Analysis and Implications

    PubMed Central

    Jamal, Seema M.; Yu, Joon-Ho; Chong, Jessica X.; Dent, Karin M.; Conta, Jessie H.; Tabor, Holly K.; Bamshad, Michael J.

    2013-01-01

    Exome and whole genome sequencing (ES/WGS) offer potential advantages over traditional approaches to diagnostic genetic testing. Consequently, use of ES/WGS in clinical settings is rapidly becoming commonplace. Yet there are myriad moral, ethical, and perhaps legal implications attached to the use of ES and health care professionals and institutions will need to consider these implications in the context of the varied practices and policies of ES service providers. We developed “core elements” of content and procedures for informed consent, data sharing, and results management and a quantitative scale to assess the extent to which research protocols met the standards established by these core elements. We then used these tools to evaluate the practices and policies of each of the 6 U.S. CLIA-certified labs offering clinical ES. Approaches toward informed consent, data sharing, and results return vary widely among ES providers as do the overall potential merits and disadvantages of each, and more importantly, the balance between the two. PMID:23610049

  10. Exome sequencing identifies MAX mutations as a cause of hereditary pheochromocytoma.

    PubMed

    Comino-Méndez, Iñaki; Gracia-Aznárez, Francisco J; Schiavi, Francesca; Landa, Iñigo; Leandro-García, Luis J; Letón, Rocío; Honrado, Emiliano; Ramos-Medina, Rocío; Caronia, Daniela; Pita, Guillermo; Gómez-Graña, Alvaro; de Cubas, Aguirre A; Inglada-Pérez, Lucía; Maliszewska, Agnieszka; Taschin, Elisa; Bobisse, Sara; Pica, Giuseppe; Loli, Paola; Hernández-Lavado, Rafael; Díaz, José A; Gómez-Morales, Mercedes; González-Neira, Anna; Roncador, Giovanna; Rodríguez-Antona, Cristina; Benítez, Javier; Mannelli, Massimo; Opocher, Giuseppe; Robledo, Mercedes; Cascón, Alberto

    2011-06-19

    Hereditary pheochromocytoma (PCC) is often caused by germline mutations in one of nine susceptibility genes described to date, but there are familial cases without mutations in these known genes. We sequenced the exomes of three unrelated individuals with hereditary PCC (cases) and identified mutations in MAX, the MYC associated factor X gene. Absence of MAX protein in the tumors and loss of heterozygosity caused by uniparental disomy supported the involvement of MAX alterations in the disease. A follow-up study of a selected series of 59 cases with PCC identified five additional MAX mutations and suggested an association with malignant outcome and preferential paternal transmission of MAX mutations. The involvement of the MYC-MAX-MXD1 network in the development and progression of neural crest cell tumors is further supported by the lack of functional MAX in rat PCC (PC12) cells and by the amplification of MYCN in neuroblastoma and suggests that loss of MAX function is correlated with metastatic potential.

  11. Fine-scale patterns of population stratification confound rare variant association tests.

    PubMed

    O'Connor, Timothy D; Kiezun, Adam; Bamshad, Michael; Rich, Stephen S; Smith, Joshua D; Turner, Emily; Leal, Suzanne M; Akey, Joshua M

    2013-01-01

    Advances in next-generation sequencing technology have enabled systematic exploration of the contribution of rare variation to Mendelian and complex diseases. Although it is well known that population stratification can generate spurious associations with common alleles, its impact on rare variant association methods remains poorly understood. Here, we performed exhaustive coalescent simulations with demographic parameters calibrated from exome sequence data to evaluate the performance of nine rare variant association methods in the presence of fine-scale population structure. We find that all methods have an inflated spurious association rate for parameter values that are consistent with levels of differentiation typical of European populations. For example, at a nominal significance level of 5%, some test statistics have a spurious association rate as high as 40%. Finally, we empirically assess the impact of population stratification in a large data set of 4,298 European American exomes. Our results have important implications for the design, analysis, and interpretation of rare variant genome-wide association studies.

  12. Exome Sequencing Links Mutations in PARN and RTEL1 with Familial Pulmonary Fibrosis and Telomere Shortening

    PubMed Central

    Stuart, Bridget D.; Choi, Jungmin; Zaidi, Samir; Xing, Chao; Holohan, Brody; Chen, Rui; Choi, Mihwa; Dharwadkar, Pooja; Torres, Fernando; Girod, Carlos E.; Weissler, Jonathan; Fitzgerald, John; Kershaw, Corey; Klesney-Tait, Julia; Mageto, Yolanda; Shay, Jerry W.; Ji, Weizhen; Bilguvar, Kaya; Mane, Shrikant; Lifton, Richard P.; Garcia, Christine Kim

    2015-01-01

    Idiopathic pulmonary fibrosis (IPF) is an age-related disease featuring progressive lung scarring. To elucidate the molecular basis of IPF, we performed exome sequencing of familial pulmonary fibrosis kindreds. Gene burden analysis comparing 78 European cases and 2,816 controls implicated PARN, an exoribonuclease with no prior connection to telomere biology or disease, with five novel heterozygous damaging mutations in unrelated cases and none in controls (P-value = 1.3 × 10−8); mutations were shared by all affected relatives (odds in favor of linkage = 4,096:1). RTEL1, an established locus for dyskeratosis congenita, harbored significantly more novel damaging and missense variants at conserved residues in cases than controls (P = 1.6 × 10−6). PARN and RTEL1 mutation carriers had shortened leukocyte telomere lengths and epigenetic inheritance of short telomeres was seen in family members. Together these genes explain ~7% of familial pulmonary fibrosis and strengthen the link between lung fibrosis and telomere dysfunction. PMID:25848748

  13. Exome sequencing links mutations in PARN and RTEL1 with familial pulmonary fibrosis and telomere shortening.

    PubMed

    Stuart, Bridget D; Choi, Jungmin; Zaidi, Samir; Xing, Chao; Holohan, Brody; Chen, Rui; Choi, Mihwa; Dharwadkar, Pooja; Torres, Fernando; Girod, Carlos E; Weissler, Jonathan; Fitzgerald, John; Kershaw, Corey; Klesney-Tait, Julia; Mageto, Yolanda; Shay, Jerry W; Ji, Weizhen; Bilguvar, Kaya; Mane, Shrikant; Lifton, Richard P; Garcia, Christine Kim

    2015-05-01

    Idiopathic pulmonary fibrosis (IPF) is an age-related disease featuring progressive lung scarring. To elucidate the molecular basis of IPF, we performed exome sequencing of familial kindreds with pulmonary fibrosis. Gene burden analysis comparing 78 European cases and 2,816 controls implicated PARN, an exoribonuclease with no previous connection to telomere biology or disease, with five new heterozygous damaging mutations in unrelated cases and none in controls (P = 1.3 × 10(-8)); mutations were shared by all affected relatives (odds in favor of linkage = 4,096:1). RTEL1, an established locus for dyskeratosis congenita, harbored significantly more new damaging and missense variants at conserved residues in cases than in controls (P = 1.6 × 10(-6)). PARN and RTEL1 mutation carriers had shortened leukocyte telomere lengths, and we observed epigenetic inheritance of short telomeres in family members. Together, these genes explain ~7% of familial pulmonary fibrosis and strengthen the link between lung fibrosis and telomere dysfunction.

  14. LIPT1 deficiency presenting as early infantile epileptic encephalopathy, Leigh disease, and secondary pyruvate dehydrogenase complex deficiency.

    PubMed

    Stowe, Robert C; Sun, Qin; Elsea, Sarah H; Scaglia, Fernando

    2018-05-01

    Lipoic acid is an essential cofactor for the mitochondrial 2-ketoacid dehydrogenase complexes and the glycine cleavage system. Lipoyltransferase 1 catalyzes the covalent attachment of lipoate to these enzyme systems. Pathogenic variants in LIPT1 gene have recently been described in four patients from three families, commonly presenting with severe lactic acidosis resulting in neonatal death and/or poor neurocognitive outcomes. We report a 2-month-old male with severe lactic acidosis, refractory status epilepticus, and brain imaging suggestive of Leigh disease. Exome sequencing implicated compound heterozygous LIPT1 pathogenic variants. We describe the fifth case of LIPT1 deficiency, whose phenotype progressed to that of an early infantile epileptic encephalopathy, which is novel compared to previously described patients whom we will review. Due to the significant biochemical and phenotypic overlap that LIPT1 deficiency and mitochondrial energy cofactor disorders have with pyruvate dehydrogenase deficiency and/or nonketotic hyperglycinemia, they are and have been presumptively under-diagnosed without exome sequencing. © 2018 Wiley Periodicals, Inc.

  15. Next generation sequencing in clinical medicine: Challenges and lessons for pathology and biomedical informatics.

    PubMed

    Gullapalli, Rama R; Desai, Ketaki V; Santana-Santos, Lucas; Kant, Jeffrey A; Becich, Michael J

    2012-01-01

    The Human Genome Project (HGP) provided the initial draft of mankind's DNA sequence in 2001. The HGP was produced by 23 collaborating laboratories using Sanger sequencing of mapped regions as well as shotgun sequencing techniques in a process that occupied 13 years at a cost of ~$3 billion. Today, Next Generation Sequencing (NGS) techniques represent the next phase in the evolution of DNA sequencing technology at dramatically reduced cost compared to traditional Sanger sequencing. A single laboratory today can sequence the entire human genome in a few days for a few thousand dollars in reagents and staff time. Routine whole exome or even whole genome sequencing of clinical patients is well within the realm of affordability for many academic institutions across the country. This paper reviews current sequencing technology methods and upcoming advancements in sequencing technology as well as challenges associated with data generation, data manipulation and data storage. Implementation of routine NGS data in cancer genomics is discussed along with potential pitfalls in the interpretation of the NGS data. The overarching importance of bioinformatics in the clinical implementation of NGS is emphasized.[7] We also review the issue of physician education which also is an important consideration for the successful implementation of NGS in the clinical workplace. NGS technologies represent a golden opportunity for the next generation of pathologists to be at the leading edge of the personalized medicine approaches coming our way. Often under-emphasized issues of data access and control as well as potential ethical implications of whole genome NGS sequencing are also discussed. Despite some challenges, it's hard not to be optimistic about the future of personalized genome sequencing and its potential impact on patient care and the advancement of knowledge of human biology and disease in the near future.

  16. Next generation sequencing in clinical medicine: Challenges and lessons for pathology and biomedical informatics

    PubMed Central

    Gullapalli, Rama R.; Desai, Ketaki V.; Santana-Santos, Lucas; Kant, Jeffrey A.; Becich, Michael J.

    2012-01-01

    The Human Genome Project (HGP) provided the initial draft of mankind's DNA sequence in 2001. The HGP was produced by 23 collaborating laboratories using Sanger sequencing of mapped regions as well as shotgun sequencing techniques in a process that occupied 13 years at a cost of ~$3 billion. Today, Next Generation Sequencing (NGS) techniques represent the next phase in the evolution of DNA sequencing technology at dramatically reduced cost compared to traditional Sanger sequencing. A single laboratory today can sequence the entire human genome in a few days for a few thousand dollars in reagents and staff time. Routine whole exome or even whole genome sequencing of clinical patients is well within the realm of affordability for many academic institutions across the country. This paper reviews current sequencing technology methods and upcoming advancements in sequencing technology as well as challenges associated with data generation, data manipulation and data storage. Implementation of routine NGS data in cancer genomics is discussed along with potential pitfalls in the interpretation of the NGS data. The overarching importance of bioinformatics in the clinical implementation of NGS is emphasized.[7] We also review the issue of physician education which also is an important consideration for the successful implementation of NGS in the clinical workplace. NGS technologies represent a golden opportunity for the next generation of pathologists to be at the leading edge of the personalized medicine approaches coming our way. Often under-emphasized issues of data access and control as well as potential ethical implications of whole genome NGS sequencing are also discussed. Despite some challenges, it's hard not to be optimistic about the future of personalized genome sequencing and its potential impact on patient care and the advancement of knowledge of human biology and disease in the near future. PMID:23248761

  17. Whole-genome and multisector exome sequencing of primary and post-treatment glioblastoma reveals patterns of tumor evolution

    PubMed Central

    Kim, Hoon; Zheng, Siyuan; Amini, Seyed S.; Virk, Selene M.; Mikkelsen, Tom; Brat, Daniel J.; Grimsby, Jonna; Sougnez, Carrie; Muller, Florian; Hu, Jian; Sloan, Andrew E.; Cohen, Mark L.; Van Meir, Erwin G.; Scarpace, Lisa; Laird, Peter W.; Weinstein, John N.; Lander, Eric S.; Gabriel, Stacey; Getz, Gad; Meyerson, Matthew; Chin, Lynda; Barnholtz-Sloan, Jill S.

    2015-01-01

    Glioblastoma (GBM) is a prototypical heterogeneous brain tumor refractory to conventional therapy. A small residual population of cells escapes surgery and chemoradiation, resulting in a typically fatal tumor recurrence ∼7 mo after diagnosis. Understanding the molecular architecture of this residual population is critical for the development of successful therapies. We used whole-genome sequencing and whole-exome sequencing of multiple sectors from primary and paired recurrent GBM tumors to reconstruct the genomic profile of residual, therapy resistant tumor initiating cells. We found that genetic alteration of the p53 pathway is a primary molecular event predictive of a high number of subclonal mutations in glioblastoma. The genomic road leading to recurrence is highly idiosyncratic but can be broadly classified into linear recurrences that share extensive genetic similarity with the primary tumor and can be directly traced to one of its specific sectors, and divergent recurrences that share few genetic alterations with the primary tumor and originate from cells that branched off early during tumorigenesis. Our study provides mechanistic insights into how genetic alterations in primary tumors impact the ensuing evolution of tumor cells and the emergence of subclonal heterogeneity. PMID:25650244

  18. Exome sequencing identifies CTSK mutations in patients originally diagnosed as intermediate osteopetrosis☆

    PubMed Central

    Pangrazio, Alessandra; Puddu, Alessandro; Oppo, Manuela; Valentini, Maria; Zammataro, Luca; Vellodi, Ashok; Gener, Blanca; Llano-Rivas, Isabel; Raza, Jamal; Atta, Irum; Vezzoni, Paolo; Superti-Furga, Andrea; Villa, Anna; Sobacchi, Cristina

    2014-01-01

    Autosomal Recessive Osteopetrosis is a genetic disorder characterized by increased bone density due to lack of resorption by the osteoclasts. Genetic studies have widely unraveled the molecular basis of the most severe forms, while cases of intermediate severity are more difficult to characterize, probably because of a large heterogeneity. Here, we describe the use of exome sequencing in the molecular diagnosis of 2 siblings initially thought to be affected by “intermediate osteopetrosis”, which identified a homozygous mutation in the CTSK gene. Prompted by this finding, we tested by Sanger sequencing 25 additional patients addressed to us for recessive osteopetrosis and found CTSK mutations in 4 of them. In retrospect, their clinical and radiographic features were found to be compatible with, but not typical for, Pycnodysostosis. We sought to identify modifier genes that might have played a role in the clinical manifestation of the disease in these patients, but our results were not informative. In conclusion, we underline the difficulties of differential diagnosis in some patients whose clinical appearance does not fit the classical malignant or benign picture and recommend that CTSK gene be included in the molecular diagnosis of high bone density conditions. PMID:24269275

  19. A novel NHS mutation causes Nance-Horan Syndrome in a Chinese family.

    PubMed

    Tian, Qi; Li, Yunping; Kousar, Rizwana; Guo, Hui; Peng, Fenglan; Zheng, Yu; Yang, Xiaohua; Long, Zhigao; Tian, Runyi; Xia, Kun; Lin, Haiying; Pan, Qian

    2017-01-07

    Nance-Horan Syndrome (NHS) (OMIM: 302350) is a rare X-linked developmental disorder characterized by bilateral congenital cataracts, with occasional dental anomalies, characteristic dysmorphic features, brachymetacarpia and mental retardation. Carrier females exhibit similar manifestations that are less severe than in affected males. Here, we report a four-generation Chinese family with multiple affected individuals presenting Nance-Horan Syndrome. Whole-exome sequencing combined with RT-PCR and Sanger sequencing was used to search for a genetic cause underlying the disease phenotype. Whole-exome sequencing identified in all affected individuals of the family a novel donor splicing site mutation (NM_198270: c.1045 + 2T > A) in intron 4 of the gene NHS, which maps to chromosome Xp22.13. The identified mutation results in an RNA processing defect causing a 416-nucleotide addition to exon 4 of the mRNA transcript, likely producing a truncated NHS protein. The donor splicing site mutation NM_198270: c.1045 + 2T > A of the NHS gene is the causative mutation in this Nance-Horan Syndrome family. This research broadens the spectrum of NHS gene mutations, contributing to our understanding of the molecular genetics of NHS.

  20. Clinical Application of Genome and Exome Sequencing as a Diagnostic Tool for Pediatric Patients: a Scoping Review of the Literature.

    PubMed

    Smith, Hadley Stevens; Swint, J Michael; Lalani, Seema R; Yamal, Jose-Miguel; de Oliveira Otto, Marcia C; Castellanos, Stephan; Taylor, Amy; Lee, Brendan H; Russell, Heidi V

    2018-05-14

    Availability of clinical genomic sequencing (CGS) has generated questions about the value of genome and exome sequencing as a diagnostic tool. Analysis of reported CGS application can inform uptake and direct further research. This scoping literature review aims to synthesize evidence on the clinical and economic impact of CGS. PubMed, Embase, and Cochrane were searched for peer-reviewed articles published between 2009 and 2017 on diagnostic CGS for infant and pediatric patients. Articles were classified according to sample size and whether economic evaluation was a primary research objective. Data on patient characteristics, clinical setting, and outcomes were extracted and narratively synthesized. Of 171 included articles, 131 were case reports, 40 were aggregate analyses, and 4 had a primary economic evaluation aim. Diagnostic yield was the only consistently reported outcome. Median diagnostic yield in aggregate analyses was 33.2% but varied by broad clinical categories and test type. Reported CGS use has rapidly increased and spans diverse clinical settings and patient phenotypes. Economic evaluations support the cost-saving potential of diagnostic CGS. Multidisciplinary implementation research, including more robust outcome measurement and economic evaluation, is needed to demonstrate clinical utility and cost-effectiveness of CGS.

  1. Exome sequencing in an admixed isolated population indicates NFXL1 variants confer a risk for specific language impairment.

    PubMed

    Villanueva, Pía; Nudel, Ron; Hoischen, Alexander; Fernández, María Angélica; Simpson, Nuala H; Gilissen, Christian; Reader, Rose H; Jara, Lillian; Echeverry, María Magdalena; Echeverry, Maria Magdalena; Francks, Clyde; Baird, Gillian; Conti-Ramsden, Gina; O'Hare, Anne; Bolton, Patrick F; Hennessy, Elizabeth R; Palomino, Hernán; Carvajal-Carmona, Luis; Veltman, Joris A; Cazier, Jean-Baptiste; De Barbieri, Zulema; Fisher, Simon E; Newbury, Dianne F

    2015-03-01

    Children affected by Specific Language Impairment (SLI) fail to acquire age appropriate language skills despite adequate intelligence and opportunity. SLI is highly heritable, but the understanding of underlying genetic mechanisms has proved challenging. In this study, we use molecular genetic techniques to investigate an admixed isolated founder population from the Robinson Crusoe Island (Chile), who are affected by a high incidence of SLI, increasing the power to discover contributory genetic factors. We utilize exome sequencing in selected individuals from this population to identify eight coding variants that are of putative significance. We then apply association analyses across the wider population to highlight a single rare coding variant (rs144169475, Minor Allele Frequency of 4.1% in admixed South American populations) in the NFXL1 gene that confers a nonsynonymous change (N150K) and is significantly associated with language impairment in the Robinson Crusoe population (p = 2.04 × 10-4, 8 variants tested). Subsequent sequencing of NFXL1 in 117 UK SLI cases identified four individuals with heterozygous variants predicted to be of functional consequence. We conclude that coding variants within NFXL1 confer an increased risk of SLI within a complex genetic model.

  2. Identification of a Novel De Novo Variant in the PAX3 Gene in Waardenburg Syndrome by Diagnostic Exome Sequencing: The First Molecular Diagnosis in Korea

    PubMed Central

    Jang, Mi-Ae; Lee, Taeheon; Lee, Junnam

    2015-01-01

    Waardenburg syndrome (WS) is a clinically and genetically heterogeneous hereditary auditory pigmentary disorder characterized by congenital sensorineural hearing loss and iris discoloration. Many genes have been linked to WS, including PAX3, MITF, SNAI2, EDNRB, EDN3, and SOX10, and many additional genes have been associated with disorders with phenotypic overlap with WS. To screen all possible genes associated with WS and congenital deafness simultaneously, we performed diagnostic exome sequencing (DES) in a male patient with clinical features consistent with WS. Using DES, we identified a novel missense variant (c.220C>G; p.Arg74Gly) in exon 2 of the PAX3 gene in the patient. Further analysis by Sanger sequencing of the patient and his parents revealed a de novo occurrence of the variant. Our findings show that DES can be a useful tool for the identification of pathogenic gene variants in WS patients and for differentiation between WS and similar disorders. To the best of our knowledge, this is the first report of genetically confirmed WS in Korea. PMID:25932447

  3. Novel TMEM98 mutations in pedigrees with autosomal dominant nanophthalmos

    PubMed Central

    Khorram, David; Choi, Michael; Roos, Ben R.; Stone, Edwin M.; Kopel, Teresa; Allen, Richard; Alward, Wallace L.M.; Scheetz, Todd E.

    2015-01-01

    Purpose Autosomal dominant nanophthalmos is an inherited eye disorder characterized by a structurally normal but smaller eye. Patients with nanophthalmos have high hyperopia (far-sightedness), a greater incidence of angle-closure glaucoma, and increased risk of surgical complications. In this study, the clinical features and the genetic basis of nanophthalmos were investigated in two large autosomal dominant nanophthalmos pedigrees. Methods Fourteen members of a Caucasian pedigree from the United States and 15 members of a pedigree from the Mariana Islands enrolled in a genetic study of nanophthalmos and contributed DNA samples. Twenty of 29 family members underwent eye examinations that included measurement of axial eye length and/or refractive error. The genetic basis of nanophthalmos in the pedigrees was studied with linkage analysis, whole exome sequencing, and candidate gene (i.e., TMEM98) sequencing to identify the nanophthalmos-causing gene. Results Nine members of the pedigree from the United States and 11 members of the pedigree from the Mariana Islands were diagnosed with nanophthalmos that is transmitted as an autosomal dominant trait. The patients with nanophthalmos had abnormally short axial eye lengths, which ranged from 15.9 to 18.4 mm. Linkage analysis of the nanophthalmos pedigree from the United States identified nine large regions of the genome (greater than 10 Mbp) that were coinherited with disease in this family. Genes within these “linked regions” were examined for disease-causing mutations using exome sequencing, and a His196Pro mutation was detected in the TMEM98 gene, which was recently reported to be a nanophthalmos gene. Sanger sequencing subsequently showed that all other members of this pedigree with nanophthalmos also carry the His196Pro TMEM98 mutation. Testing the Mariana Islands pedigree for TMEM98 mutations identified a 34 bp heterozygous deletion that spans the 3′ end of exon 4 in all affected family members. Neither TMEM98 mutation was detected in public exome sequence databases. Conclusions A recent report identified a single TMEM98 missense mutation in a nanophthalmos pedigree. Our discovery of two additional TMEM98 mutations confirms the important role of the gene in the pathogenesis of autosomal dominant nanophthalmos. PMID:26392740

  4. Novel TMEM98 mutations in pedigrees with autosomal dominant nanophthalmos.

    PubMed

    Khorram, David; Choi, Michael; Roos, Ben R; Stone, Edwin M; Kopel, Teresa; Allen, Richard; Alward, Wallace L M; Scheetz, Todd E; Fingert, John H

    2015-01-01

    Autosomal dominant nanophthalmos is an inherited eye disorder characterized by a structurally normal but smaller eye. Patients with nanophthalmos have high hyperopia (far-sightedness), a greater incidence of angle-closure glaucoma, and increased risk of surgical complications. In this study, the clinical features and the genetic basis of nanophthalmos were investigated in two large autosomal dominant nanophthalmos pedigrees. Fourteen members of a Caucasian pedigree from the United States and 15 members of a pedigree from the Mariana Islands enrolled in a genetic study of nanophthalmos and contributed DNA samples. Twenty of 29 family members underwent eye examinations that included measurement of axial eye length and/or refractive error. The genetic basis of nanophthalmos in the pedigrees was studied with linkage analysis, whole exome sequencing, and candidate gene (i.e., TMEM98) sequencing to identify the nanophthalmos-causing gene. Nine members of the pedigree from the United States and 11 members of the pedigree from the Mariana Islands were diagnosed with nanophthalmos that is transmitted as an autosomal dominant trait. The patients with nanophthalmos had abnormally short axial eye lengths, which ranged from 15.9 to 18.4 mm. Linkage analysis of the nanophthalmos pedigree from the United States identified nine large regions of the genome (greater than 10 Mbp) that were coinherited with disease in this family. Genes within these "linked regions" were examined for disease-causing mutations using exome sequencing, and a His196Pro mutation was detected in the TMEM98 gene, which was recently reported to be a nanophthalmos gene. Sanger sequencing subsequently showed that all other members of this pedigree with nanophthalmos also carry the His196Pro TMEM98 mutation. Testing the Mariana Islands pedigree for TMEM98 mutations identified a 34 bp heterozygous deletion that spans the 3' end of exon 4 in all affected family members. Neither TMEM98 mutation was detected in public exome sequence databases. A recent report identified a single TMEM98 missense mutation in a nanophthalmos pedigree. Our discovery of two additional TMEM98 mutations confirms the important role of the gene in the pathogenesis of autosomal dominant nanophthalmos.

  5. Exome capture from the spruce and pine giga-genomes.

    PubMed

    Suren, H; Hodgins, K A; Yeaman, S; Nurkowski, K A; Smets, P; Rieseberg, L H; Aitken, S N; Holliday, J A

    2016-09-01

    Sequence capture is a flexible tool for generating reduced representation libraries, particularly in species with massive genomes. We used an exome capture approach to sequence the gene space of two of the dominant species in Canadian boreal and montane forests - interior spruce (Picea glauca x engelmanii) and lodgepole pine (Pinus contorta). Transcriptome data generated with RNA-seq were coupled with draft genome sequences to design baits corresponding to 26 824 genes from pine and 28 649 genes from spruce. A total of 579 samples for spruce and 631 samples for pine were included, as well as two pine congeners and six spruce congeners. More than 50% of targeted regions were sequenced at >10× depth in each species, while ~12% captured near-target regions within 500 bp of a bait position were sequenced to a depth >10×. Much of our read data arose from off-target regions, which was likely due to the fragmented and incomplete nature of the draft genome assemblies. Capture in general was successful for the related species, suggesting that baits designed for a single species are likely to successfully capture sequences from congeners. From these data, we called approximately 10 million SNPs and INDELs in each species from coding regions, introns, untranslated and flanking regions, as well as from the intergenic space. Our study demonstrates the utility of sequence capture for resequencing in complex conifer genomes, suggests guidelines for improving capture efficiency and provides a rich resource of genetic variants for studies of selection and local adaptation in these species. © 2016 John Wiley & Sons Ltd.

  6. Guidance to rational use of pharmaceuticals in gallbladder sarcomatoid carcinoma using patient-derived cancer cells and whole exome sequencing.

    PubMed

    Feng, Feiling; Cheng, Qingbao; Yang, Liang; Zhang, Dadong; Ji, Shunlong; Zhang, Qiangzu; Lin, Yihui; Li, Fugen; Xiong, Lei; Liu, Chen; Jiang, Xiaoqing

    2017-01-17

    Gallbladder sarcomatoid carcinoma is a rare cancer with no clinical standard treatment. With the rapid development of next generation sequencing, it has been able to provide reasonable treatment options for patients based on genetic variations. However, most cancer drugs are not approval for gallbladder sarcomatoid carcinoma indications. The correlation between drug response and a genetic variation needs to be further elucidated. Three patient-derived cells-JXQ-3D-001, JXQ-3D-002, and JXQ-3D-003, were derived from biopsy samples of one gallbladder sarcomatoid carcinoma patient with progression and have been characterized. In order to study the relationship between drug sensitivity and gene alteration, genetic mutations of three patient-derived cells were discovered by whole exome sequencing, and drug screening has been performed based on the gene alterations and related signaling pathways that are associated with drug targets. It has been found that there are differences in biological characteristics such as morphology, cell proliferation, cell migration and colony formation activity among these three patient-derived cells although they are derived from the same patient. Their sensitivities to the chemotherapy drugs-Fluorouracil, Doxorubicin, and Cisplatin are distinct. Moreover, none of common chemotherapy drugs could inhibit the proliferations of all three patient-derived cells. Comprehensive analysis of their whole exome sequencing demonstrated that tumor-associated genes TP53, AKT2, FGFR3, FGF10, SDHA, and PI3KCA were mutated or amplified. Part of these alterations are actionable. By screening a set of compounds that are associated with the genetic alteration, it has been found that GDC-0941 and PF-04691502 for PI3K-AKT-mTOR pathway inhibitors could dramatically decrease the proliferation of three patient-derived cells. Importantly, expression of phosphorylated AKT and phosphorylated S6 were markedly decreased after treatments with PI3K-AKT-mTOR pathway inhibitors GDC-0941 (0.5 μM) and PF-04691502 (0.1 μM) in all three patient-derived cells. These data suggested that inhibition of the PI3K-AKT-mTOR pathway that was activated by PIK3CA amplification in all three patient-derived cells could reduce the cell proliferation. A patient-derived cell model combined with whole exome sequencing is a powerful tool to elucidate relationship between drug sensitivities and genetic alternations. In these gallbladder sarcomatoid carcinoma patient-derived cells, it is found that PIK3CA amplification could be used as a biomarker to indicate PI3K-AKT-mTOR pathway activation. Block of the pathway may benefit the gallbladder sarcomatoid carcinoma patient with this alternation in hypothesis. The real efficacy needs to be confirmed in vivo or in a clinical trial.

  7. CIDR

    Science.gov Websites

    NGS Pretesting and QC Using Illumina Infinium Arrays CIDR IGES Posters - 2017 A Comparison of Methods fragmentation methods for input into library construction protocol Development of a Low Input FFPE workflow for Evaluation of Copy Number Variation (CNV) detection methods in whole exome sequencing (WES) data CIDR AGBT

  8. Efficient genome-wide detection and cataloging of EMS-induced mutations using exome capture and next-generation sequencing

    USDA-ARS?s Scientific Manuscript database

    Chemical mutagenesis efficiently generates phenotypic variation in otherwise homogeneous genetic backgrounds, enabling functional analysis of genes. Advances in mutation detection have brought the utility of induced mutant populations on par with those produced by insertional mutagenesis, but system...

  9. Genetics Home Reference: familial paroxysmal kinesigenic dyskinesia

    MedlinePlus

    ... gene mutations, which reduce the amount of PRRT2 protein, lead to abnormal neuronal signaling. Altered neuronal activity could underlie the ... YF, Zhang QJ, Li HF, Lin Y, Murong SX, Xu J, Wang N, Wu ZY. Exome sequencing identifies truncating mutations in PRRT2 that cause paroxysmal ...

  10. Genetics Home Reference: Ochoa syndrome

    MedlinePlus

    ... Other researchers believe that a defective heparanase 2 protein may lead to problems with the development of the urinary tract or with muscle ... Peng W, Xu J, Li J, Owens KM, Bloom D, Innis JW. Exome capture and massively parallel sequencing identifies a novel HPSE2 mutation in a Saudi ...

  11. Genetics Home Reference: PURA syndrome

    MedlinePlus

    ... lead to a reduced amount of functional Purα protein. Although it is not understood how a partial loss of Purα function leads to the ... KJ, Gawne-Cain M; DDD study, Magee AC, Turnpenny PD, Baralle D. Whole exome sequencing in family trios reveals de novo mutations in ...

  12. NGS-based approach to determine the presence of HPV and their sites of integration in human cancer genome.

    PubMed

    Chandrani, P; Kulkarni, V; Iyer, P; Upadhyay, P; Chaubal, R; Das, P; Mulherkar, R; Singh, R; Dutt, A

    2015-06-09

    Human papilloma virus (HPV) accounts for the most common cause of all virus-associated human cancers. Here, we describe the first graphic user interface (GUI)-based automated tool 'HPVDetector', for non-computational biologists, exclusively for detection and annotation of the HPV genome based on next-generation sequencing data sets. We developed a custom-made reference genome that comprises of human chromosomes along with annotated genome of 143 HPV types as pseudochromosomes. The tool runs on a dual mode as defined by the user: a 'quick mode' to identify presence of HPV types and an 'integration mode' to determine genomic location for the site of integration. The input data can be a paired-end whole-exome, whole-genome or whole-transcriptome data set. The HPVDetector is available in public domain for download: http://www.actrec.gov.in/pi-webpages/AmitDutt/HPVdetector/HPVDetector.html. On the basis of our evaluation of 116 whole-exome, 23 whole-transcriptome and 2 whole-genome data, we were able to identify presence of HPV in 20 exomes and 4 transcriptomes of cervical and head and neck cancer tumour samples. Using the inbuilt annotation module of HPVDetector, we found predominant integration of viral gene E7, a known oncogene, at known 17q21, 3q27, 7q35, Xq28 and novel sites of integration in the human genome. Furthermore, co-infection with high-risk HPVs such as 16 and 31 were found to be mutually exclusive compared with low-risk HPV71. HPVDetector is a simple yet precise and robust tool for detecting HPV from tumour samples using variety of next-generation sequencing platforms including whole genome, whole exome and transcriptome. Two different modes (quick detection and integration mode) along with a GUI widen the usability of HPVDetector for biologists and clinicians with minimal computational knowledge.

  13. Analysis of whole exome sequencing with cardiometabolic traits using family-based linkage and association in the IRAS Family Study

    PubMed Central

    Tabb, Keri L.; Hellwege, Jacklyn N.; Palmer, Nicholette D.; Dimitrov, Latchezar; Sajuthi, Satria; Taylor, Kent D.; NG, Maggie C.Y.; Hawkins, Gregory A.; Chen, Yii-Der Ida; Brown, W. Mark; McWilliams, David; Williams, Adrienne; Lorenzo, Carlos; Norris, Jill M.; Long, Jirong; Rotter, Jerome I.; Curran, Joanne E.; Blangero, John; Wagenknecht, Lynne E.; Langefeld, Carl D.; Bowden, Donald W.

    2017-01-01

    Summary Family-based methods are a potentially powerful tool to identify trait-defining genetic variants in extended families, particularly when used to complement conventional association analysis. We utilized two-point linkage analysis and single variant association analysis to evaluate whole exome sequencing (WES) data from 1,205 Hispanic Americans (78 families) from the Insulin Resistance Atherosclerosis Family Study. WES identified 211,612 variants above the minor allele frequency threshold of ≥0.005. These variants were tested for linkage and/or association with 50 cardiometabolic traits after quality control checks. Two-point linkage analysis yielded 10,580,600 LOD scores with 1,148 LOD scores ≥3, 183 LOD scores ≥4, and 29 LOD scores ≥5. The maximal novel LOD score was 5.50 for rs2289043:T>C, in UNC5C with subcutaneous adipose tissue volume. Association analysis identified 13 variants attaining genome-wide significance (p<5×10-08), with the strongest association between rs651821:C>T in APOA5, and triglyceride levels (p=3.67×10-10). Overall, there was a 5.2-fold increase in the number of informative variants detected by WES compared to exome chip analysis in this population, nearly 30% of which were novel variants relative to dbSNP build 138. Thus, integration of results from two-point linkage and single-variant association analysis from WES data enabled identification of novel signals potentially contributing to cardiometabolic traits. PMID:28067407

  14. Comparison and integration of deleteriousness prediction methods for nonsynonymous SNVs in whole exome sequencing studies

    PubMed Central

    Dong, Chengliang; Wei, Peng; Jian, Xueqiu; Gibbs, Richard; Boerwinkle, Eric; Wang, Kai; Liu, Xiaoming

    2015-01-01

    Accurate deleteriousness prediction for nonsynonymous variants is crucial for distinguishing pathogenic mutations from background polymorphisms in whole exome sequencing (WES) studies. Although many deleteriousness prediction methods have been developed, their prediction results are sometimes inconsistent with each other and their relative merits are still unclear in practical applications. To address these issues, we comprehensively evaluated the predictive performance of 18 current deleteriousness-scoring methods, including 11 function prediction scores (PolyPhen-2, SIFT, MutationTaster, Mutation Assessor, FATHMM, LRT, PANTHER, PhD-SNP, SNAP, SNPs&GO and MutPred), 3 conservation scores (GERP++, SiPhy and PhyloP) and 4 ensemble scores (CADD, PON-P, KGGSeq and CONDEL). We found that FATHMM and KGGSeq had the highest discriminative power among independent scores and ensemble scores, respectively. Moreover, to ensure unbiased performance evaluation of these prediction scores, we manually collected three distinct testing datasets, on which no current prediction scores were tuned. In addition, we developed two new ensemble scores that integrate nine independent scores and allele frequency. Our scores achieved the highest discriminative power compared with all the deleteriousness prediction scores tested and showed low false-positive prediction rate for benign yet rare nonsynonymous variants, which demonstrated the value of combining information from multiple orthologous approaches. Finally, to facilitate variant prioritization in WES studies, we have pre-computed our ensemble scores for 87 347 044 possible variants in the whole-exome and made them publicly available through the ANNOVAR software and the dbNSFP database. PMID:25552646

  15. Worldwide Distribution of Cytochrome P450 Alleles: A Meta-analysis of Population-scale Sequencing Projects.

    PubMed

    Zhou, Y; Ingelman-Sundberg, M; Lauschke, V M

    2017-10-01

    Genetic polymorphisms in cytochrome P450 (CYP) genes can result in altered metabolic activity toward a plethora of clinically important medications. Thus, single nucleotide variants and copy number variations in CYP genes are major determinants of drug pharmacokinetics and toxicity and constitute pharmacogenetic biomarkers for drug dosing, efficacy, and safety. Strikingly, the distribution of CYP alleles differs considerably between populations with important implications for personalized drug therapy and healthcare programs. To provide a global distribution map of CYP alleles with clinical importance, we integrated whole-genome and exome sequencing data from 56,945 unrelated individuals of five major human populations. By combining this dataset with population-specific linkage information, we derive the frequencies of 176 CYP haplotypes, providing an extensive resource for major genetic determinants of drug metabolism. Furthermore, we aggregated this dataset into spectra of predicted functional variability in the respective populations and discuss the implications for population-adjusted pharmacological treatment strategies. © 2017 The Authors Clinical Pharmacology & Therapeutics published by Wiley Periodicals, Inc. on behalf of American Society for Clinical Pharmacology and Therapeutics.

  16. Genetic Architecture of Vitamin B12 and Folate Levels Uncovered Applying Deeply Sequenced Large Datasets

    PubMed Central

    Thorleifsson, Gudmar; Ahluwalia, Tarunveer S.; Steinthorsdottir, Valgerdur; Bjarnason, Helgi; Gudbjartsson, Daniel F.; Magnusson, Olafur T.; Sparsø, Thomas; Albrechtsen, Anders; Kong, Augustine; Masson, Gisli; Tian, Geng; Cao, Hongzhi; Nie, Chao; Kristiansen, Karsten; Husemoen, Lise Lotte; Thuesen, Betina; Li, Yingrui; Nielsen, Rasmus; Linneberg, Allan; Olafsson, Isleifur; Eyjolfsson, Gudmundur I.; Jørgensen, Torben; Wang, Jun; Hansen, Torben; Thorsteinsdottir, Unnur; Stefánsson, Kari; Pedersen, Oluf

    2013-01-01

    Genome-wide association studies have mainly relied on common HapMap sequence variations. Recently, sequencing approaches have allowed analysis of low frequency and rare variants in conjunction with common variants, thereby improving the search for functional variants and thus the understanding of the underlying biology of human traits and diseases. Here, we used a large Icelandic whole genome sequence dataset combined with Danish exome sequence data to gain insight into the genetic architecture of serum levels of vitamin B12 (B12) and folate. Up to 22.9 million sequence variants were analyzed in combined samples of 45,576 and 37,341 individuals with serum B12 and folate measurements, respectively. We found six novel loci associating with serum B12 (CD320, TCN2, ABCD4, MMAA, MMACHC) or folate levels (FOLR3) and confirmed seven loci for these traits (TCN1, FUT6, FUT2, CUBN, CLYBL, MUT, MTHFR). Conditional analyses established that four loci contain additional independent signals. Interestingly, 13 of the 18 identified variants were coding and 11 of the 13 target genes have known functions related to B12 and folate pathways. Contrary to epidemiological studies we did not find consistent association of the variants with cardiovascular diseases, cancers or Alzheimer's disease although some variants demonstrated pleiotropic effects. Although to some degree impeded by low statistical power for some of these conditions, these data suggest that sequence variants that contribute to the population diversity in serum B12 or folate levels do not modify the risk of developing these conditions. Yet, the study demonstrates the value of combining whole genome and exome sequencing approaches to ascertain the genetic and molecular architectures underlying quantitative trait associations. PMID:23754956

  17. PhenomeCentral: A Portal for Phenotypic and Genotypic Matchmaking of Patients with Rare Genetic Diseases

    PubMed Central

    Buske, Orion J.; Girdea, Marta; Dumitriu, Sergiu; Gallinger, Bailey; Hartley, Taila; Trang, Heather; Misyura, Andriy; Friedman, Tal; Beaulieu, Chandree; Bone, William P.; Links, Amanda E.; Washington, Nicole L.; Haendel, Melissa A.; Robinson, Peter N.; Boerkoel, Cornelius F.; Adams, David; Gahl, William A.; Boycott, Kym M.; Brudno, Michael

    2017-01-01

    The discovery of disease-causing mutations typically requires confirmation of the variant or gene in multiple unrelated individuals, and a large number of rare genetic diseases remain unsolved due to difficulty identifying second families. To enable the secure sharing of case records by clinicians and rare disease scientists, we have developed the PhenomeCentral portal (https://phenomecentral.org). Each record includes a phenotypic description and relevant genetic information (exome or candidate genes). PhenomeCentral identifies similar patients in the database based on semantic similarity between clinical features, automatically prioritized genes from whole-exome data, and candidate genes entered by the users, enabling both hypothesis-free and hypothesis-driven matchmaking. Users can then contact other submitters to follow up on promising matches. PhenomeCentral incorporates data for over 1,000 patients with rare genetic diseases, contributed by the FORGE and Care4Rare Canada projects, the US NIH Undiagnosed Diseases Program, the EU Neuromics and ANDDIrare projects, as well as numerous independent clinicians and scientists. Though the majority of these records have associated exome data, most lack a molecular diagnosis. PhenomeCentral has already been used to identify causative mutations for several patients, and its ability to find matching patients and diagnose these diseases will grow with each additional patient that is entered. PMID:26251998

  18. Diversity and population structure of northern switchgrass as revealed though exome capture sequencing

    USDA-ARS?s Scientific Manuscript database

    Switchgrass (Panicum virgatum L.) is a polyploid, perennial grass species that is native to North America, and is being developed as a future biofuels feedstock crop. Switchgrass is present primarily in two ecotypes: a northern upland ecotype composed of tetraploid and octoploid accessions, and a so...

  19. Genetics Home Reference: DOORS syndrome

    MedlinePlus

    ... reduce or eliminate the function of the TBC1D24 protein, but the specific mechanism by which loss of TBC1D24 function leads to the ... RC, Gibbs RA, Lee BH, Sisodiya SM. The genetic basis of DOORS syndrome: an exome-sequencing study. Lancet Neurol. 2014 Jan;13(1):44- ...

  20. Genetics Home Reference: ALG12-congenital disorder of glycosylation

    MedlinePlus

    ... CDG are likely due to impaired glycosylation of proteins and lipids that are needed for normal function of many organs and tissues, ... R, Gibbs RA, Lee BH, Cohn D, Campeau PM. Diagnosis of ALG12-CDG by exome sequencing in a case of severe skeletal dysplasia. Mol ...

Top