sequence variants affecting: Topics by Science.gov

Sample records for sequence variants affecting

Study designs for identification of rare disease variants in complex diseases: the utility of family-based designs.

PubMed

Ionita-Laza, Iuliana; Ottman, Ruth

2011-11-01

The recent progress in sequencing technologies makes possible large-scale medical sequencing efforts to assess the importance of rare variants in complex diseases. The results of such efforts depend heavily on the use of efficient study designs and analytical methods. We introduce here a unified framework for association testing of rare variants in family-based designs or designs based on unselected affected individuals. This framework allows us to quantify the enrichment in rare disease variants in families containing multiple affected individuals and to investigate the optimal design of studies aiming to identify rare disease variants in complex traits. We show that for many complex diseases with small values for the overall sibling recurrence risk ratio, such as Alzheimer's disease and most cancers, sequencing affected individuals with a positive family history of the disease can be extremely advantageous for identifying rare disease variants. In contrast, for complex diseases with large values of the sibling recurrence risk ratio, sequencing unselected affected individuals may be preferable.
Rare variants in RTEL1 are associated with familial interstitial pneumonia.

PubMed

Cogan, Joy D; Kropski, Jonathan A; Zhao, Min; Mitchell, Daphne B; Rives, Lynette; Markin, Cheryl; Garnett, Errine T; Montgomery, Keri H; Mason, Wendi R; McKean, David F; Powers, Julia; Murphy, Elissa; Olson, Lana M; Choi, Leena; Cheng, Dong-Sheng; Blue, Elizabeth Marchani; Young, Lisa R; Lancaster, Lisa H; Steele, Mark P; Brown, Kevin K; Schwarz, Marvin I; Fingerlin, Tasha E; Schwartz, David A; Lawson, William E; Loyd, James E; Zhao, Zhongming; Phillips, John A; Blackwell, Timothy S

2015-03-15

Up to 20% of cases of idiopathic interstitial pneumonia cluster in families, comprising the syndrome of familial interstitial pneumonia (FIP); however, the genetic basis of FIP remains uncertain in most families. We sought to determine whether new disease-causing rare genetic variants could be identified using whole-exome sequencing of affected members from FIP families, providing additional insights into disease pathogenesis. Affected subjects from 25 kindreds were selected from an ongoing FIP registry for whole-exome sequencing from genomic DNA. Candidate rare variants were confirmed by Sanger sequencing, and cosegregation analysis was performed in families, followed by additional sequencing of affected individuals from another 163 kindreds. We identified a potentially damaging rare variant in the gene encoding regulator of telomere elongation helicase 1 (RTEL1) that segregated with disease and was associated with very short telomeres in peripheral blood mononuclear cells in 1 of 25 families in our original whole-exome sequencing cohort. Evaluation of affected individuals in 163 additional kindreds revealed another eight families (4.7%) with heterozygous rare variants in RTEL1 that segregated with clinical FIP. Probands and unaffected carriers of these rare variants had short telomeres (<10% for age) in peripheral blood mononuclear cells and increased T-circle formation, suggesting impaired RTEL1 function. Rare loss-of-function variants in RTEL1 represent a newly defined genetic predisposition for FIP, supporting the importance of telomere-related pathways in pulmonary fibrosis.
Rare Variants in RTEL1 Are Associated with Familial Interstitial Pneumonia

PubMed Central

Cogan, Joy D.; Zhao, Min; Mitchell, Daphne B.; Rives, Lynette; Markin, Cheryl; Garnett, Errine T.; Montgomery, Keri H.; Mason, Wendi R.; McKean, David F.; Powers, Julia; Murphy, Elissa; Olson, Lana M.; Choi, Leena; Cheng, Dong-Sheng; Blue, Elizabeth Marchani; Young, Lisa R.; Lancaster, Lisa H.; Steele, Mark P.; Brown, Kevin K.; Schwarz, Marvin I.; Fingerlin, Tasha E.; Schwartz, David A.; Lawson, William E.; Loyd, James E.; Zhao, Zhongming; Phillips, John A.; Blackwell, Timothy S.

2015-01-01

Rationale: Up to 20% of cases of idiopathic interstitial pneumonia cluster in families, comprising the syndrome of familial interstitial pneumonia (FIP); however, the genetic basis of FIP remains uncertain in most families. Objectives: To determine if new disease-causing rare genetic variants could be identified using whole-exome sequencing of affected members from FIP families, providing additional insights into disease pathogenesis. Methods: Affected subjects from 25 kindreds were selected from an ongoing FIP registry for whole-exome sequencing from genomic DNA. Candidate rare variants were confirmed by Sanger sequencing, and cosegregation analysis was performed in families, followed by additional sequencing of affected individuals from another 163 kindreds. Measurements and Main Results: We identified a potentially damaging rare variant in the gene encoding for regulator of telomere elongation helicase 1 (RTEL1) that segregated with disease and was associated with very short telomeres in peripheral blood mononuclear cells in 1 of 25 families in our original whole-exome sequencing cohort. Evaluation of affected individuals in 163 additional kindreds revealed another eight families (4.7%) with heterozygous rare variants in RTEL1 that segregated with clinical FIP. Probands and unaffected carriers of these rare variants had short telomeres (<10% for age) in peripheral blood mononuclear cells and increased T-circle formation, suggesting impaired RTEL1 function. Conclusions: Rare loss-of-function variants in RTEL1 represent a newly defined genetic predisposition for FIP, supporting the importance of telomere-related pathways in pulmonary fibrosis. PMID:25607374
Expansion of phenotype and genotypic data in CRB2-related syndrome.

PubMed

Lamont, Ryan E; Tan, Wen-Hann; Innes, A Micheil; Parboosingh, Jillian S; Schneidman-Duhovny, Dina; Rajkovic, Aleksandar; Pappas, John; Altschwager, Pablo; DeWard, Stephanie; Fulton, Anne; Gray, Kathryn J; Krall, Max; Mehta, Lakshmi; Rodan, Lance H; Saller, Devereux N; Steele, Deanna; Stein, Deborah; Yatsenko, Svetlana A; Bernier, François P; Slavotinek, Anne M

2016-10-01

Sequence variants in CRB2 cause a syndrome with greatly elevated maternal serum alpha-fetoprotein and amniotic fluid alpha-fetoprotein levels, cerebral ventriculomegaly and renal findings similar to Finnish congenital nephrosis. All reported patients have been homozygotes or compound heterozygotes for sequence variants in the Crumbs, Drosophila, Homolog of, 2 (CRB2) gene. Variants affecting CRB2 function have also been identified in four families with steroid resistant nephrotic syndrome, but without any other known systemic findings. We ascertained five previously unreported individuals with biallelic variants in CRB2 that were predicted to affect function. We compiled the clinical features of reported cases and reviewed available literature for cases with features suggestive of CRB2-related syndrome in order to better understand the phenotypic and genotypic manifestations. Phenotypic analyses showed that ventriculomegaly was a common clinical manifestation (9/11 confirmed cases), in contrast to the original reports, in which patients were ascertained due to renal disease. Two children had minor eye findings and one was diagnosed with a B-cell lymphoma. Further genetic analysis identified one family with two affected siblings who were both heterozygous for a variant in NPHS2 predicted to affect function and separate families with sequence variants in NPHS4 and BBS7 in addition to the CRB2 variants. Our report expands the clinical phenotype of CRB2-related syndrome and establishes ventriculomegaly and hydrocephalus as frequent manifestations. We found additional sequence variants in genes involved in kidney development and ciliopathies in patients with CRB2-related syndrome, suggesting that these variants may modify the phenotype.
Genomic diagnosis for children with intellectual disability and/or developmental delay.

PubMed

Bowling, Kevin M; Thompson, Michelle L; Amaral, Michelle D; Finnila, Candice R; Hiatt, Susan M; Engel, Krysta L; Cochran, J Nicholas; Brothers, Kyle B; East, Kelly M; Gray, David E; Kelley, Whitley V; Lamb, Neil E; Lose, Edward J; Rich, Carla A; Simmons, Shirley; Whittle, Jana S; Weaver, Benjamin T; Nesmith, Amy S; Myers, Richard M; Barsh, Gregory S; Bebin, E Martina; Cooper, Gregory M

2017-05-30

Developmental disabilities have diverse genetic causes that must be identified to facilitate precise diagnoses. We describe genomic data from 371 affected individuals, 309 of whom were sequenced as proband-parent trios. Whole-exome sequences (WES) were generated for 365 individuals (127 affected) and whole-genome sequences (WGS) were generated for 612 individuals (244 affected). Pathogenic or likely pathogenic variants were found in 100 individuals (27%), with variants of uncertain significance in an additional 42 (11.3%). We found that a family history of neurological disease, especially the presence of an affected first-degree relative, reduces the pathogenic/likely pathogenic variant identification rate, reflecting both the disease relevance and ease of interpretation of de novo variants. We also found that improvements to genetic knowledge facilitated interpretation changes in many cases. Through systematic reanalyses, we have thus far reclassified 15 variants, with 11.3% of families who initially were found to harbor a VUS and 4.7% of families with a negative result eventually found to harbor a pathogenic or likely pathogenic variant. To further such progress, the data described here are being shared through ClinVar, GeneMatcher, and dbGaP. Our data strongly support the value of large-scale sequencing, especially WGS within proband-parent trios, as both an effective first-choice diagnostic tool and means to advance clinical and research progress related to pediatric neurological disease.
Exome sequencing and genome-wide linkage analysis in 17 families illustrate the complex contribution of TTN truncating variants to dilated cardiomyopathy.

PubMed

Norton, Nadine; Li, Duanxiang; Rampersaud, Evadnie; Morales, Ana; Martin, Eden R; Zuchner, Stephan; Guo, Shengru; Gonzalez, Michael; Hedges, Dale J; Robertson, Peggy D; Krumm, Niklas; Nickerson, Deborah A; Hershberger, Ray E

2013-04-01

BACKGROUND- Familial dilated cardiomyopathy (DCM) is a genetically heterogeneous disease with >30 known genes. TTN truncating variants were recently implicated in a candidate gene study to cause 25% of familial and 18% of sporadic DCM cases. METHODS AND RESULTS- We used an unbiased genome-wide approach using both linkage analysis and variant filtering across the exome sequences of 48 individuals affected with DCM from 17 families to identify genetic cause. Linkage analysis ranked the TTN region as falling under the second highest genome-wide multipoint linkage peak, multipoint logarithm of odds, 1.59. We identified 6 TTN truncating variants carried by individuals affected with DCM in 7 of 17 DCM families (logarithm of odds, 2.99); 2 of these 7 families also had novel missense variants that segregated with disease. Two additional novel truncating TTN variants did not segregate with DCM. Nucleotide diversity at the TTN locus, including missense variants, was comparable with 5 other known DCM genes. The average number of missense variants in the exome sequences from the DCM cases or the ≈5400 cases from the Exome Sequencing Project was ≈23 per individual. The average number of TTN truncating variants in the Exome Sequencing Project was 0.014 per individual. We also identified a region (chr9q21.11-q22.31) with no known DCM genes with a maximum heterogeneity logarithm of odds score of 1.74. CONCLUSIONS- These data suggest that TTN truncating variants contribute to DCM cause. However, the lack of segregation of all identified TTN truncating variants illustrates the challenge of determining variant pathogenicity even with full exome sequencing.
Lessons learned from whole exome sequencing in multiplex families affected by a complex genetic disorder, intracranial aneurysm.

PubMed

Farlow, Janice L; Lin, Hai; Sauerbeck, Laura; Lai, Dongbing; Koller, Daniel L; Pugh, Elizabeth; Hetrick, Kurt; Ling, Hua; Kleinloog, Rachel; van der Vlies, Pieter; Deelen, Patrick; Swertz, Morris A; Verweij, Bon H; Regli, Luca; Rinkel, Gabriel J E; Ruigrok, Ynte M; Doheny, Kimberly; Liu, Yunlong; Broderick, Joseph; Foroud, Tatiana

2015-01-01

Genetic risk factors for intracranial aneurysm (IA) are not yet fully understood. Genomewide association studies have been successful at identifying common variants; however, the role of rare variation in IA susceptibility has not been fully explored. In this study, we report the use of whole exome sequencing (WES) in seven densely-affected families (45 individuals) recruited as part of the Familial Intracranial Aneurysm study. WES variants were prioritized by functional prediction, frequency, predicted pathogenicity, and segregation within families. Using these criteria, 68 variants in 68 genes were prioritized across the seven families. Of the genes that were expressed in IA tissue, one gene (TMEM132B) was differentially expressed in aneurysmal samples (n=44) as compared to control samples (n=16) (false discovery rate adjusted p-value=0.023). We demonstrate that sequencing of densely affected families permits exploration of the role of rare variants in a relatively common disease such as IA, although there are important study design considerations for applying sequencing to complex disorders. In this study, we explore methods of WES variant prioritization, including the incorporation of unaffected individuals, multipoint linkage analysis, biological pathway information, and transcriptome profiling. Further studies are needed to validate and characterize the set of variants and genes identified in this study.
Whole genome sequencing of an African American family highlights toll like receptor 6 variants in Kawasaki disease susceptibility.

PubMed

Kim, Jihoon; Shimizu, Chisato; Kingsmore, Stephen F; Veeraraghavan, Narayanan; Levy, Eric; Ribeiro Dos Santos, Andre M; Yang, Hai; Flatley, Jay; Hoang, Long Truong; Hibberd, Martin L; Tremoulet, Adriana H; Harismendy, Olivier; Ohno-Machado, Lucila; Burns, Jane C

2017-01-01

Kawasaki disease (KD) is the most common acquired pediatric heart disease. We analyzed Whole Genome Sequences (WGS) from a 6-member African American family in which KD affected two of four children. We sought rare, potentially causative genotypes by sequentially applying the following WGS filters: sequence quality scores, inheritance model (recessive homozygous and compound heterozygous), predicted deleteriousness, allele frequency, genes in KD-associated pathways or with significant associations in published KD genome-wide association studies (GWAS), and with differential expression in KD blood transcriptomes. Biologically plausible genotypes were identified in twelve variants in six genes in the two affected children. The affected siblings were compound heterozygous for the rare variants p.Leu194Pro and p.Arg247Lys in Toll-like receptor 6 (TLR6), which affect TLR6 signaling. The affected children were also homozygous for three common, linked (r² = 1) intronic single nucleotide variants (SNVs) in TLR6 (rs56245262, rs56083757 and rs7669329) that have previously shown association with KD in cohorts of European descent. Using transcriptome data from pre-treatment whole blood of KD subjects (n = 146), expression quantitative trait loci (eQTL) analyses were performed. Subjects homozygous for the intronic risk allele (A allele of TLR6 rs56245262) had differential expression of Interleukin-6 (IL-6) as a function of genotype (p = 0.0007) and a higher erythrocyte sedimentation rate at diagnosis. TLR6 plays an important role in pathogen-associated molecular pattern recognition, and sequence variations may affect binding affinities that in turn influence KD susceptibility. This integrative genomic approach illustrates how the analysis of WGS in multiplex families with a complex genetic disease allows examination of both the common disease-common variant and common disease-rare variant hypotheses.
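The sequential filtering described in this abstract can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the record fields (`qual`, `af`, `deleterious`, `gt`), the thresholds, and the function names are all hypothetical, and only the quality, allele-frequency, deleteriousness, and recessive inheritance filters are shown. Genotypes are assumed to be coded as alternate-allele counts (0, 1, or 2) per family member.

```python
from collections import defaultdict


def passes_basic_filters(variant, min_qual=30.0, max_af=0.01):
    """Quality, rarity, and predicted-deleteriousness filters (hypothetical thresholds)."""
    return (
        variant["qual"] >= min_qual
        and variant["af"] <= max_af
        and variant["deleterious"]
    )


def recessive_genes(variants, affected, mother, father):
    """Map gene -> recessive model satisfied by every affected child.

    'homozygous': all affected children carry two copies of one variant.
    'compound_het': all affected children are heterozygous for at least one
    maternally inherited and one paternally inherited variant in the gene.
    """
    by_gene = defaultdict(list)
    for v in variants:
        if passes_basic_filters(v):
            by_gene[v["gene"]].append(v)

    hits = {}
    for gene, vs in by_gene.items():
        if any(all(v["gt"][s] == 2 for s in affected) for v in vs):
            hits[gene] = "homozygous"
            continue
        # Variants that every affected child carries in the heterozygous state.
        shared = [v for v in vs if all(v["gt"][s] == 1 for s in affected)]
        maternal = [v for v in shared if v["gt"][mother] >= 1 and v["gt"][father] == 0]
        paternal = [v for v in shared if v["gt"][father] >= 1 and v["gt"][mother] == 0]
        if maternal and paternal:
            hits[gene] = "compound_het"
    return hits
```

A fuller version would presumably also exclude genotypes shared with the unaffected siblings and add the pathway, GWAS, and expression filters listed in the abstract; those steps are omitted here because the abstract does not specify how they were applied.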
Identification of rare X-linked neuroligin variants by massively parallel sequencing in males with autism spectrum disorder.

PubMed

Steinberg, Karyn Meltz; Ramachandran, Dhanya; Patel, Viren C; Shetty, Amol C; Cutler, David J; Zwick, Michael E

2012-09-28

Autism spectrum disorder (ASD) is highly heritable, but the genetic risk factors for it remain largely unknown. Although structural variants with large effect sizes may explain up to 15% of ASD, genome-wide association studies have failed to uncover common single nucleotide variants with large effects on phenotype. The focus within ASD genetics is now shifting to the examination of rare sequence variants of modest effect, which is most often achieved via exome selection and sequencing. This strategy has indeed identified some rare candidate variants; however, the approach does not capture the full spectrum of genetic variation that might contribute to the phenotype. We surveyed two loci with known rare variants that contribute to ASD, the X-linked neuroligin genes, by performing massively parallel Illumina sequencing of the coding and noncoding regions from these genes in males from families with multiplex autism. We annotated all variant sites and functionally tested a subset to identify other rare mutations contributing to ASD susceptibility. We found seven rare variants at evolutionarily conserved sites in our study population. Functional analyses of the three 3' UTR variants did not show statistically significant effects on the expression of NLGN3 and NLGN4X. In addition, we identified two NLGN3 intronic variants located within conserved transcription factor binding sites that could potentially affect gene regulation. These data demonstrate the power of massively parallel, targeted sequencing studies of affected individuals for identifying rare, potentially disease-contributing variation. However, they also point out the challenges and limitations of current methods of direct functional testing of rare variants and the difficulties of identifying alleles with modest effects.
Identification of rare X-linked neuroligin variants by massively parallel sequencing in males with autism spectrum disorder

PubMed Central

2012-01-01

Background Autism spectrum disorder (ASD) is highly heritable, but the genetic risk factors for it remain largely unknown. Although structural variants with large effect sizes may explain up to 15% of ASD, genome-wide association studies have failed to uncover common single nucleotide variants with large effects on phenotype. The focus within ASD genetics is now shifting to the examination of rare sequence variants of modest effect, which is most often achieved via exome selection and sequencing. This strategy has indeed identified some rare candidate variants; however, the approach does not capture the full spectrum of genetic variation that might contribute to the phenotype. Methods We surveyed two loci with known rare variants that contribute to ASD, the X-linked neuroligin genes, by performing massively parallel Illumina sequencing of the coding and noncoding regions from these genes in males from families with multiplex autism. We annotated all variant sites and functionally tested a subset to identify other rare mutations contributing to ASD susceptibility. Results We found seven rare variants at evolutionarily conserved sites in our study population. Functional analyses of the three 3’ UTR variants did not show statistically significant effects on the expression of NLGN3 and NLGN4X. In addition, we identified two NLGN3 intronic variants located within conserved transcription factor binding sites that could potentially affect gene regulation. Conclusions These data demonstrate the power of massively parallel, targeted sequencing studies of affected individuals for identifying rare, potentially disease-contributing variation. However, they also point out the challenges and limitations of current methods of direct functional testing of rare variants and the difficulties of identifying alleles with modest effects. PMID:23020841
Efficient analysis of mouse genome sequences reveal many nonsense variants

PubMed Central

Steeland, Sophie; Timmermans, Steven; Van Ryckeghem, Sara; Hulpiau, Paco; Saeys, Yvan; Van Montagu, Marc; Vandenbroucke, Roosmarijn E.; Libert, Claude

2016-01-01

Genetic polymorphisms in coding genes play an important role when using mouse inbred strains as research models. They have been shown to influence research results, explain phenotypical differences between inbred strains, and increase the amount of interesting gene variants present in the many available inbred lines. SPRET/Ei is an inbred strain derived from Mus spretus that has ∼1% sequence difference with the C57BL/6J reference genome. We obtained a listing of all SNPs and insertions/deletions (indels) present in SPRET/Ei from the Mouse Genomes Project (Wellcome Trust Sanger Institute) and processed these data to obtain an overview of all transcripts having nonsynonymous coding sequence variants. We identified 8,883 unique variants affecting 10,096 different transcripts from 6,328 protein-coding genes, which is about 28% of all coding genes. Because only a subset of these variants results in drastic changes in proteins, we focused on variations that are nonsense mutations that ultimately resulted in a gain of a stop codon. These genes were identified by in silico changing the C57BL/6J coding sequences to the SPRET/Ei sequences, converting them to amino acid (AA) sequences, and comparing the AA sequences. All variants and transcripts affected were also stored in a database, which can be browsed using a SPRET/Ei M. spretus variants web tool (www.spretus.org), including a manual. We validated the tool by demonstrating the loss of function of three proteins predicted to be severely truncated, namely Fas, IRAK2, and IFNγR1. PMID:27147605
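The in silico procedure summarized above (substitute strain-specific bases into the reference coding sequence, translate both versions, and compare the amino acid sequences) can be sketched in a few lines of Python. This is an illustrative sketch only, not the authors' code: the function names are hypothetical, only single-nucleotide substitutions are handled (indels are ignored), and positions are assumed to be 0-based offsets into the coding sequence.

```python
# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def apply_snvs(cds, snvs):
    """Return a copy of cds with each (0-based position, alternate base) applied."""
    seq = list(cds)
    for pos, alt in snvs:
        seq[pos] = alt
    return "".join(seq)


def gained_stop(ref_cds, alt_cds):
    """Return the 1-based codon number of the first gained stop codon, or None."""
    pairs = zip(translate(ref_cds), translate(alt_cds))
    for codon_number, (ref_aa, alt_aa) in enumerate(pairs, start=1):
        if alt_aa == "*" and ref_aa != "*":
            return codon_number
    return None


reference = "ATGCAGTGGTAA"                   # toy transcript: Met-Gln-Trp-stop
variant = apply_snvs(reference, [(3, "T")])  # CAG -> TAG introduces a stop
print(gained_stop(reference, variant))
```

Comparing translated sequences, rather than inspecting each variant in isolation, has the practical benefit that two substitutions falling in the same codon are evaluated together.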
Identification of rare paired box 3 variant in strabismus by whole exome sequencing

PubMed Central

Gong, Hui-Min; Wang, Jing; Xu, Jing; Zhou, Zhan-Yu; Li, Jing-Wen; Chen, Shu-Fang

2017-01-01

AIM To identify the potentially pathogenic gene variants that contribute to the etiology of strabismus. METHODS A Chinese pedigree with strabismus was collected and the exomes of two affected individuals were sequenced using next-generation sequencing technology. The resulting variants from exome sequencing were filtered by subsequent bioinformatics methods and the candidate mutation was verified as heterozygous in the affected proposita and her mother by Sanger sequencing. RESULTS Whole exome sequencing and filtering identified a nonsynonymous c.434G-T transversion in paired box 3 (PAX3) in the two affected individuals, which was predicted to be deleterious by more than 4 bioinformatics programs. This altered amino acid residue was located in the conserved PAX domain of PAX3. This gene encodes a member of the PAX family of transcription factors, which play critical roles during fetal development. Mutations in PAX3 were associated with Waardenburg syndrome with strabismus. CONCLUSION Our results indicate that the c.434G-T mutation (p.R145L) in PAX3 may contribute to strabismus, expanding our understanding of the causally relevant genes for this disorder. PMID:28861346
Identification of rare paired box 3 variant in strabismus by whole exome sequencing.

PubMed

Gong, Hui-Min; Wang, Jing; Xu, Jing; Zhou, Zhan-Yu; Li, Jing-Wen; Chen, Shu-Fang

2017-01-01

This study aimed to identify the potentially pathogenic gene variants that contribute to the etiology of strabismus. A Chinese pedigree with strabismus was collected and the exomes of two affected individuals were sequenced using next-generation sequencing technology. The resulting variants from exome sequencing were filtered by subsequent bioinformatics methods and the candidate mutation was verified as heterozygous in the affected proposita and her mother by Sanger sequencing. Whole exome sequencing and filtering identified a nonsynonymous c.434G-T transversion in paired box 3 (PAX3) in the two affected individuals, which was predicted to be deleterious by more than 4 bioinformatics programs. This altered amino acid residue was located in the conserved PAX domain of PAX3. This gene encodes a member of the PAX family of transcription factors, which play critical roles during fetal development. Mutations in PAX3 were associated with Waardenburg syndrome with strabismus. Our results indicate that the c.434G-T mutation (p.R145L) in PAX3 may contribute to strabismus, expanding our understanding of the causally relevant genes for this disorder.
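As a quick consistency check on the variant named in this record, the short Python sketch below maps a coding-DNA position to its codon and classifies a base substitution. The helper names are hypothetical and the position is assumed to be counted from the A of the start codon, as is conventional for c. coordinates.

```python
PURINES = {"A", "G"}


def codon_of(cdna_pos):
    """Return (1-based codon number, 1-based position within that codon)."""
    return (cdna_pos - 1) // 3 + 1, (cdna_pos - 1) % 3 + 1


def substitution_class(ref, alt):
    """Purine<->purine or pyrimidine<->pyrimidine is a transition; otherwise a transversion."""
    same_ring_type = (ref in PURINES) == (alt in PURINES)
    return "transition" if same_ring_type else "transversion"


print(codon_of(434))                  # (145, 2)
print(substitution_class("G", "T"))   # transversion
```

Position 434 falls at the second base of codon 145, which agrees with the reported p.R145L: arginine codons of the form CGN become leucine codons (CTN) when the middle G is replaced by T. A G to T change swaps a purine for a pyrimidine, so by the usual definition it is a transversion.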
αIIbβ3 variants defined by next-generation sequencing: Predicting variants likely to cause Glanzmann thrombasthenia

PubMed Central

Buitrago, Lorena; Rendon, Augusto; Liang, Yupu; Simeoni, Ilenia; Negri, Ana; Filizola, Marta; Ouwehand, Willem H.; Coller, Barry S.; Alessi, Marie-Christine; Ballmaier, Matthias; Bariana, Tadbir; Bellissimo, Daniel; Bertoli, Marta; Bray, Paul; Bury, Loredana; Carrell, Robin; Cattaneo, Marco; Collins, Peter; French, Deborah; Favier, Remi; Freson, Kathleen; Furie, Bruce; Germeshausen, Manuela; Ghevaert, Cedric; Gomez, Keith; Goodeve, Anne; Gresele, Paolo; Guerrero, Jose; Hampshire, Dan J.; Hadinnapola, Charaka; Heemskerk, Johan; Henskens, Yvonne; Hill, Marian; Hogg, Nancy; Johnsen, Jill; Kahr, Walter; Kerr, Ron; Kunishima, Shinji; Laffan, Michael; Natwani, Amit; Neerman-Arbez, Marguerite; Nurden, Paquita; Nurden, Alan; Ormiston, Mark; Othman, Maha; Ouwehand, Willem; Perry, David; Vilk, Shoshana Ravel; Reitsma, Pieter; Rondina, Matthew; Simeoni, Ilenia; Smethurst, Peter; Stephens, Jonathan; Stevenson, William; Szkotak, Artur; Turro, Ernest; Van Geet, Christel; Vries, Minka; Ward, June; Waye, John; Westbury, Sarah; Whiteheart, Sidney; Wilcox, David; Zhang, Bi

2015-01-01

Next-generation sequencing is transforming our understanding of human genetic variation but assessing the functional impact of novel variants presents challenges. We analyzed missense variants in the integrin αIIbβ3 receptor subunit genes ITGA2B and ITGB3 identified by whole-exome or -genome sequencing in the ThromboGenomics project, comprising ∼32,000 alleles from 16,108 individuals. We analyzed the results in comparison with 111 missense variants in these genes previously reported as being associated with Glanzmann thrombasthenia (GT), 20 associated with alloimmune thrombocytopenia, and 5 associated with aniso/macrothrombocytopenia. We identified 114 novel missense variants in ITGA2B (affecting ∼11% of the amino acids) and 68 novel missense variants in ITGB3 (affecting ∼9% of the amino acids). Of the variants, 96% had minor allele frequencies (MAF) < 0.1%, indicating their rarity. Based on sequence conservation, MAF, and location on a complete model of αIIbβ3, we selected three novel variants that affect amino acids previously associated with GT for expression in HEK293 cells. αIIb P176H and β3 C547G severely reduced αIIbβ3 expression, whereas αIIb P943A partially reduced αIIbβ3 expression and had no effect on fibrinogen binding. We used receiver operating characteristic curves of combined annotation-dependent depletion (CADD), PolyPhen-2 HDIV, and sorting intolerant from tolerant (SIFT) to estimate the percentage of novel variants likely to be deleterious. At optimal cut-off values, which had 69–98% sensitivity in detecting GT mutations, between 27% and 71% of the novel αIIb or β3 missense variants were predicted to be deleterious. Our data have implications for understanding the evolutionary pressure on αIIbβ3 and highlight the challenges in predicting the clinical significance of novel missense variants. PMID:25827233
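The receiver operating characteristic step can be illustrated with a small, dependency-free Python sketch. The abstract does not say how the optimal cut-off was defined, so the use of Youden's J statistic (sensitivity + specificity - 1) here is an assumption, as are the function names and the toy scores. The sketch also assumes that higher scores mean "more likely deleterious"; a predictor scored in the opposite direction (SIFT, for example, where lower is more damaging) would need its scores negated first.

```python
def roc_points(scores, labels):
    """Return (threshold, sensitivity, specificity) for each distinct score.

    labels: 1 for known disease-associated variants, 0 for presumed benign ones.
    A variant is called deleterious when its score is >= threshold.
    """
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    points = []
    for t in sorted(set(scores)):
        true_pos = sum(1 for s, y in zip(scores, labels) if y == 1 and s >= t)
        true_neg = sum(1 for s, y in zip(scores, labels) if y == 0 and s < t)
        points.append((t, true_pos / n_pos, true_neg / n_neg))
    return points


def youden_cutoff(scores, labels):
    """Pick the threshold maximizing Youden's J = sensitivity + specificity - 1."""
    return max(roc_points(scores, labels), key=lambda p: p[1] + p[2] - 1)


def fraction_predicted_deleterious(novel_scores, threshold):
    """Share of novel variants scoring at or above the chosen threshold."""
    return sum(1 for s in novel_scores if s >= threshold) / len(novel_scores)


# Toy training set: scores for variants with known status (hypothetical values).
known_scores = [5, 10, 20, 25, 30]
known_labels = [0, 0, 1, 0, 1]
threshold, sensitivity, specificity = youden_cutoff(known_scores, known_labels)
print(threshold, sensitivity, specificity)
print(fraction_predicted_deleterious([10, 20, 30, 40], threshold))
```

The chosen threshold is then applied to the novel variants to estimate what fraction would be called deleterious, which is the kind of percentage the abstract reports for each predictor.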
Comprehensive Evaluation of the Association of APOE Genetic Variation with Plasma Lipoprotein Traits in U.S. Whites and African Blacks

PubMed Central

Radwan, Zaheda H.; Wang, Xingbin; Waqar, Fahad; Pirim, Dilek; Niemsiri, Vipavee; Hokanson, John E.; Hamman, Richard F.; Bunker, Clareann H.; Barmada, M. Michael; Demirci, F. Yesim; Kamboh, M. Ilyas

2014-01-01

Although common APOE genetic variation has a major influence on plasma LDL-cholesterol, its role in affecting HDL-cholesterol and triglycerides is not well established. Recent genome-wide association studies suggest that APOE also affects plasma variation in HDL-cholesterol and triglycerides. It is thus important to resequence the APOE gene to identify both common and uncommon variants that affect plasma lipid profile. Here, we have sequenced the APOE gene in 190 subjects with extreme HDL-cholesterol levels selected from two well-defined epidemiological samples of U.S. non-Hispanic Whites (NHWs) and African Blacks followed by genotyping of identified variants in the entire datasets (623 NHWs, 788 African Blacks) and association analyses with major lipid traits. We identified a total of 40 sequence variants, of which 10 are novel. A total of 32 variants, including common tagSNPs (≥5% frequency) and all uncommon variants (<5% frequency) were successfully genotyped and considered for genotype-phenotype associations. Other than the established associations of APOE*2 and APOE*4 with LDL-cholesterol, we have identified additional independent associations with LDL-cholesterol. We have also identified multiple associations of uncommon and common APOE variants with HDL-cholesterol and triglycerides. Our comprehensive sequencing and genotype-phenotype analyses indicate that APOE genetic variation impacts HDL-cholesterol and triglycerides in addition to affecting LDL-cholesterol. PMID:25502880
BEST1 sequence variants in Italian patients with vitelliform macular dystrophy

PubMed Central

Sodi, Andrea; Passerini, Ilaria; Caputo, Roberto; Bacci, Giacomo Maria; Bodoj, Mirela; Torricelli, Francesca; Menchini, Ugo

2012-01-01

Purpose To analyze the spectrum of sequence variants in the BEST1 gene in a group of Italian patients affected by Best vitelliform macular dystrophy (VMD). Methods Thirty Italian patients with a diagnosis of VMD and 20 clinically healthy relatives were recruited. They belonged to 19 Italian families predominantly originating from central Italy. They received a standard ophthalmologic examination, OCT scan, and electrophysiological tests (ERG and EOG). Fluorescein and ICG angiographies and fundus autofluorescence imaging were performed in selected cases. DNA samples were analyzed for sequence variants of the BEST1 gene by direct sequencing techniques. Results Nine missense variants and one deletion were found in the affected patients; each patient carried one mutation. Five variants [c.73C>T (p.Arg25Trp), c.652C>T (p.Arg218Cys), c.652C>G (p.Arg218Gly), c.728C>T (p.Ala243Val), c.893T>C (p.Phe298Ser)] have already been described in literature while another five variants [c.217A>C (p.Ile73Leu), c.239T>G (p.Phe80Cys), c.883_885del (p.Ile295del), c.907G>A (p.Asp303Asn), c.911A>G (p.Asp304Gly)] had not previously been reported. Affected patients, sometimes even from the same family, occasionally showed variable phenotypes. One heterozygous variant was also found in five clinically healthy relatives with normal fundus, visual acuity and ERG but with abnormal EOG. Conclusions Ten variants in the BEST1 gene were detected in a group of individuals with clinically apparent VMD, and in some clinically normal individuals with an abnormal EOG. The high prevalence of novel variants and the frequent report of a specific variant (p.Arg25Trp) that has rarely been described in other ethnic groups suggests a distribution of BEST1 variants peculiar to Italian VMD patients. PMID:23213274
MYO7A and USH2A gene sequence variants in Italian patients with Usher syndrome.

PubMed

Sodi, Andrea; Mariottini, Alessandro; Passerini, Ilaria; Murro, Vittoria; Tachyla, Iryna; Bianchi, Benedetta; Menchini, Ugo; Torricelli, Francesca

2014-01-01

To analyze the spectrum of sequence variants in the MYO7A and USH2A genes in a group of Italian patients affected by Usher syndrome (USH). Thirty-six Italian patients with a diagnosis of USH were recruited. They received a standard ophthalmologic examination, visual field testing, optical coherence tomography (OCT) scan, and electrophysiological tests. Fluorescein angiography and fundus autofluorescence imaging were performed in selected cases. All the patients underwent an audiologic examination for the 0.25-8,000 Hz frequencies. Vestibular function was evaluated with specific tests. DNA samples were analyzed for sequence variants of the MYO7A gene (for USH1) and the USH2A gene (for USH2) with direct sequencing techniques. A few patients were analyzed for both genes. In the MYO7A gene, ten missense variants were found; three patients were compound heterozygous, and two were homozygous. Thirty-four USH2A gene variants were detected, including eight missense variants, nine nonsense variants, six splicing variants, and 11 duplications/deletions; 19 patients were compound heterozygous, and three were homozygous. Four MYO7A and 17 USH2A variants have already been described in the literature. Among the novel mutations there are four USH2A large deletions, detected with multiplex ligation dependent probe amplification (MLPA) technology. Two potentially pathogenic variants were found in 27 patients (75%). Affected patients showed variable clinical pictures without a clear genotype-phenotype correlation. Ten variants in the MYO7A gene and 34 variants in the USH2A gene were detected in Italian patients with USH at a high detection rate. A selective analysis of these genes may be valuable for molecular analysis, combining diagnostic efficiency with little time wastage and less resource consumption.
Whole exome sequencing for familial bicuspid aortic valve identifies putative variants.

PubMed

Martin, Lisa J; Pilipenko, Valentina; Kaufman, Kenneth M; Cripe, Linda; Kottyan, Leah C; Keddache, Mehdi; Dexheimer, Phillip; Weirauch, Matthew T; Benson, D Woodrow

2014-10-01

Bicuspid aortic valve (BAV) is the most common congenital cardiovascular malformation. Although highly heritable, few causal variants have been identified. The purpose of this study was to identify genetic variants underlying BAV by whole exome sequencing a multiplex BAV kindred. Whole exome sequencing was performed on 17 individuals from a single family (BAV=3; other cardiovascular malformation, 3). Postvariant calling error control metrics were established after examining the relationship between Mendelian inheritance error rate and coverage, quality score, and call rate. To determine the most effective approach to identifying susceptibility variants from among 54 674 variants passing error control metrics, we evaluated 3 variant selection strategies frequently used in whole exome sequencing studies plus extended family linkage. No putative rare, high-effect variants were identified in all affected but no unaffected individuals. Eight high-effect variants were identified by ≥2 of the commonly used selection strategies; however, these were either common in the general population (>10%) or present in the majority of the unaffected family members. However, using extended family linkage, 3 synonymous variants were identified; all 3 variants were identified by at least one other strategy. These results suggest that traditional whole exome sequencing approaches, which assume causal variants alter coding sense, may be insufficient for BAV and other complex traits. Identification of disease-associated variants is facilitated by the use of segregation within families. © 2014 American Heart Association, Inc.
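The strict segregation criterion mentioned above (a variant present in all affected but no unaffected individuals) reduces to set operations. The following is a minimal sketch under stated assumptions: the individual IDs, variant IDs, and function name are hypothetical toy values, not data from the study, and real pipelines also account for genotype quality and missing calls.

```python
def segregating_variants(genotypes, affected, unaffected):
    """Return variants carried by every affected and no unaffected individual.

    genotypes maps an individual ID to the set of variant IDs that person carries.
    """
    if not affected:
        return set()
    # Variants shared by all affected individuals.
    shared = set.intersection(*(genotypes[i] for i in affected))
    # Variants seen in at least one unaffected individual.
    seen_in_unaffected = set().union(*(genotypes[i] for i in unaffected))
    return shared - seen_in_unaffected


# Hypothetical toy pedigree: three affected (A1-A3) and two unaffected (U1-U2).
toy_genotypes = {
    "A1": {"v1", "v2", "v3"},
    "A2": {"v1", "v3"},
    "A3": {"v1", "v3", "v4"},
    "U1": {"v3", "v4"},
    "U2": set(),
}

candidates = segregating_variants(toy_genotypes, ["A1", "A2", "A3"], ["U1", "U2"])
print(sorted(candidates))
```

In the toy data, v3 is shared by all affected individuals but is excluded because an unaffected relative also carries it. A filter this strict can return an empty set, which is the outcome the abstract reports for rare, high-effect variants.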
Exome sequencing in an admixed isolated population indicates NFXL1 variants confer a risk for specific language impairment.

PubMed

Villanueva, Pía; Nudel, Ron; Hoischen, Alexander; Fernández, María Angélica; Simpson, Nuala H; Gilissen, Christian; Reader, Rose H; Jara, Lillian; Echeverry, María Magdalena; Francks, Clyde; Baird, Gillian; Conti-Ramsden, Gina; O'Hare, Anne; Bolton, Patrick F; Hennessy, Elizabeth R; Palomino, Hernán; Carvajal-Carmona, Luis; Veltman, Joris A; Cazier, Jean-Baptiste; De Barbieri, Zulema; Fisher, Simon E; Newbury, Dianne F

2015-03-01

Children affected by Specific Language Impairment (SLI) fail to acquire age appropriate language skills despite adequate intelligence and opportunity. SLI is highly heritable, but the understanding of underlying genetic mechanisms has proved challenging. In this study, we use molecular genetic techniques to investigate an admixed isolated founder population from the Robinson Crusoe Island (Chile), who are affected by a high incidence of SLI, increasing the power to discover contributory genetic factors. We utilize exome sequencing in selected individuals from this population to identify eight coding variants that are of putative significance. We then apply association analyses across the wider population to highlight a single rare coding variant (rs144169475, Minor Allele Frequency of 4.1% in admixed South American populations) in the NFXL1 gene that confers a nonsynonymous change (N150K) and is significantly associated with language impairment in the Robinson Crusoe population (p = 2.04 × 10⁻⁴, 8 variants tested). Subsequent sequencing of NFXL1 in 117 UK SLI cases identified four individuals with heterozygous variants predicted to be of functional consequence. We conclude that coding variants within NFXL1 confer an increased risk of SLI within a complex genetic model.
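The note that 8 variants were tested implies a multiple-testing context. As a worked example, the sketch below compares the reported p-value against a Bonferroni-adjusted threshold. The family-wise alpha of 0.05 and the choice of Bonferroni correction are assumptions made here for illustration; the abstract does not state which correction the authors applied.

```python
def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Per-test significance threshold under a Bonferroni correction."""
    return alpha / n_tests


P_VALUE = 2.04e-4  # reported in the abstract
N_TESTS = 8        # "8 variants tested", from the abstract
ALPHA = 0.05       # assumption: conventional family-wise error rate

threshold = bonferroni_threshold(ALPHA, N_TESTS)
passes = P_VALUE < threshold
print(f"threshold = {threshold:.5f}, p = {P_VALUE:.2e}, significant = {passes}")
```

Under these assumptions the per-test threshold is 0.05 / 8 = 0.00625, and the reported p-value falls below it.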
Supplementation of Nucleosides During Selection can Reduce Sequence Variant Levels in CHO Cells Using GS/MSX Selection System.

PubMed

Tang, Danming; Lam, Cynthia; Louie, Salina; Hoi, Kam Hon; Shaw, David; Yim, Mandy; Snedecor, Brad; Misaghi, Shahram

2018-01-01

In the process of generating stable monoclonal antibody (mAb) producing cell lines, reagents such as methotrexate (MTX) or methionine sulfoximine (MSX) are often used. However, using such selection reagent(s) increases the possibility of having higher occurrence of sequence variants in the expressed antibody molecules due to the effects of MTX or MSX on de novo nucleotide synthesis. Since MSX inhibits glutamine synthetase (GS) and results in both amino acid and nucleoside starvation, it is questioned whether supplementing nucleosides into the media could lower sequence variant levels without affecting titer. The results show that the supplementation of nucleosides to the media during MSX selection decreased genomic DNA mutagenesis rates in the selected cells, probably by reducing nucleotide mis-incorporation into the DNA. Furthermore, addition of nucleosides enhances clone recovery post selection and does not affect antibody expression. It is further observed that nucleoside supplements lowered DNA mutagenesis rates only at the initial stage of the clone selection and do not have any effect on DNA mutagenesis rates after stable cell lines are established. Therefore, the data suggest that addition of nucleosides during early stages of MSX selection can lower sequence variant levels without affecting titer or clone stability in antibody expression. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

Exome Sequence Analysis of 14 Families With High Myopia.

PubMed

Kloss, Bethany A; Tompson, Stuart W; Whisenhunt, Kristina N; Quow, Krystina L; Huang, Samuel J; Pavelec, Derek M; Rosenberg, Thomas; Young, Terri L

2017-04-01

To identify causal gene mutations in 14 families with autosomal dominant (AD) high myopia using exome sequencing. Select individuals from 14 large Caucasian families with high myopia were exome sequenced. Gene variants were filtered to identify potential pathogenic changes. Sanger sequencing was used to confirm variants in original DNA, and to test for disease cosegregation in additional family members. Candidate genes and chromosomal loci previously associated with myopic refractive error and its endophenotypes were comprehensively screened. In 14 high myopia families, we identified 73 rare and 31 novel gene variants as candidates for pathogenicity. In seven of these families, two of the novel and eight of the rare variants were within known myopia loci. A total of 104 heterozygous nonsynonymous rare variants in 104 genes were identified in 10 out of 14 probands. Each variant cosegregated with affection status. No rare variants were identified in genes known to cause myopia or in genes closest to published genome-wide association study association signals for refractive error or its endophenotypes. Whole exome sequencing was performed to determine gene variants implicated in the pathogenesis of AD high myopia. This study provides new genes for consideration in the pathogenesis of high myopia, and may aid in the development of genetic profiling of those at greatest risk for attendant ocular morbidities of this disorder.
Using whole-exome sequencing to identify variants inherited from mosaic parents

PubMed Central

Rios, Jonathan J; Delgado, Mauricio R

2015-01-01

Whole-exome sequencing (WES) has allowed the discovery of genes and variants causing rare human disease. This is often achieved by comparing nonsynonymous variants between unrelated patients, and particularly for sporadic or recessive disease, often identifies a single or few candidate genes for further consideration. However, despite the potential for this approach to elucidate the genetic cause of rare human disease, a majority of patients fail to realize a genetic diagnosis using standard exome analysis methods. Although genetic heterogeneity contributes to the difficulty of exome sequence analysis between patients, it remains plausible that rare human disease is not caused by de novo or recessive variants. Multiple human disorders have been described for which the variant was inherited from a phenotypically normal mosaic parent. Here we highlight the potential for exome sequencing to identify a reasonable number of candidate genes when dominant disease variants are inherited from a mosaic parent. We show the power of WES to identify a limited number of candidate genes using this disease model and how sequence coverage affects identification of mosaic variants by WES. We propose this analysis as an alternative to discover genetic causes of rare human disorders for which typical WES approaches fail to identify likely pathogenic variants. PMID:24986828
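The dependence of mosaic variant detection on sequence coverage can be illustrated with a simple binomial model. This is a rough sketch, not the authors' method: it assumes reads are sampled independently, that a variant is called only when at least a minimum number of alternate-allele reads is observed, and it ignores sequencing error. The function name, the 10% alternate-read fraction, and the 3-read cutoff are hypothetical choices.

```python
from math import comb


def prob_at_least(min_alt_reads: int, depth: int, alt_fraction: float) -> float:
    """P(X >= min_alt_reads) for X ~ Binomial(depth, alt_fraction)."""
    return sum(
        comb(depth, i) * alt_fraction**i * (1 - alt_fraction) ** (depth - i)
        for i in range(min_alt_reads, depth + 1)
    )


ALT_FRACTION = 0.10  # assumption: 10% of reads carry the mosaic variant
MIN_ALT_READS = 3    # assumption: hypothetical calling cutoff

for depth in (20, 50, 100, 200):
    p = prob_at_least(MIN_ALT_READS, depth, ALT_FRACTION)
    print(f"depth {depth:3d}x: P(>= {MIN_ALT_READS} alt reads) = {p:.3f}")
```

Under this model the chance of observing enough supporting reads rises with depth, which is the qualitative relationship the abstract describes.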
Sensitivity of BRCA1/2 testing in high-risk breast/ovarian/male breast cancer families: little contribution of comprehensive RNA/NGS panel testing.

PubMed

Byers, Helen; Wallis, Yvonne; van Veen, Elke M; Lalloo, Fiona; Reay, Kim; Smith, Philip; Wallace, Andrew J; Bowers, Naomi; Newman, William G; Evans, D Gareth

2016-11-01

The sensitivity of testing BRCA1 and BRCA2 remains unresolved as the frequency of deep intronic splicing variants has not been defined in high-risk familial breast/ovarian cancer families. This variant category is reported at significant frequency in other tumour predisposition genes, including NF1 and MSH2. We carried out comprehensive whole gene RNA analysis on 45 high-risk breast/ovary and male breast cancer families with no identified pathogenic variant on exonic sequencing and copy number analysis of BRCA1/2. In addition, we undertook variant screening of a 10-gene high/moderate risk breast/ovarian cancer panel by next-generation sequencing. DNA testing identified the causative variant in 50/56 (89%) breast/ovarian/male breast cancer families with Manchester scores of ≥50 with two variants being confirmed to affect splicing on RNA analysis. RNA sequencing of BRCA1/BRCA2 on 45 individuals from high-risk families identified no deep intronic variants and did not suggest loss of RNA expression as a cause of lost sensitivity. Panel testing in 42 samples identified a known RAD51D variant, a high-risk ATM variant in another breast ovary family and a truncating CHEK2 mutation. Current exonic sequencing and copy number analysis variant detection methods of BRCA1/2 have high sensitivity in high-risk breast/ovarian cancer families. Sequence analysis of RNA does not identify any variants undetected by current analysis of BRCA1/2. However, RNA analysis clarified the pathogenicity of variants of unknown significance detected by current methods. The low diagnostic uplift achieved through sequence analysis of the other known breast/ovarian cancer susceptibility genes indicates that further high-risk genes remain to be identified.
Pedigree Analysis and Exclusion of Alpha-Tocopherol Transfer Protein (TTPA) as a Candidate Gene for Neuroaxonal Dystrophy in the American Quarter Horse

PubMed Central

Finno, C.J.; Famula, T.; Aleman, M.; Higgins, R.J.; Madigan, J.E.; Bannasch, D.L.

2015-01-01

Background Equine neuroaxonal dystrophy/equine degenerative myeloencephalopathy (NAD/EDM) is a neurodegenerative disorder affecting young horses of various breeds that resembles ataxia with vitamin E deficiency in humans, an inherited disorder caused by mutations in the alpha-tocopherol transfer protein gene (TTPA). To evaluate variants found upon sequencing TTPA in the horse, the mode of inheritance for NAD/EDM had to be established. Hypothesis NAD/EDM in the American Quarter Horse (QH) is caused by a mutation in TTPA. Animals 88 clinically phenotyped (35 affected [ataxia score ≥2], 53 unaffected) QHs with a diagnosis of NAD/EDM with 6 affected and 4 unaffected cases confirmed at postmortem examination. Procedures Pedigrees and genotypes across 54,000 single nucleotide polymorphism (SNP) markers were assessed to determine heritability and mode of inheritance of NAD/EDM. TTPA sequence of exon/intron boundaries was evaluated in 2 affected and 2 control horses. An association analysis was performed using 71 SNPs surrounding TTPA and 8 SNPs within TTPA that were discovered by sequencing. RT-PCR for TTPA was performed on mRNA from the liver of 4 affected and 4 control horses. Results Equine NAD/EDM appears to be inherited as a polygenic trait and, within this family of QHs, demonstrates high heritability. Sequencing of TTPA identified 12 variants. No significant association was found using the 79 available variants in and surrounding TTPA. RT-PCR yielded PCR products of equivalent sizes between affected cases and controls. Conclusions and Clinical Importance NAD/EDM demonstrates heritability in this family of QHs. Variants in TTPA are not responsible for NAD/EDM in this study population. PMID:23186252
A novel homozygous missense variant in NECTIN4 (PVRL4) causing ectodermal dysplasia cutaneous syndactyly syndrome.

PubMed

Ahmad, Farooq; Nasir, Abdul; Thiele, Holger; Umair, Muhammad; Borck, Guntram; Ahmad, Wasim

2018-02-12

Ectodermal dysplasia syndactyly syndrome 1 (EDSS1) is a rare form of ectodermal dysplasia including anomalies of hair, nails, and teeth along with bilateral cutaneous syndactyly of hands and feet. In the present report, we performed a clinical and genetic characterization of a consanguineous Pakistani family with four individuals affected by EDSS1. We performed exome sequencing using DNA of one affected individual. Exome data analysis identified a novel homozygous missense variant (c.242T>C; p.(Leu81Pro)) in NECTIN4 (PVRL4). Sanger sequencing validated this variant and confirmed its cosegregation with the disease phenotype in the family members. Thus, our report adds a novel variant to the NECTIN4 mutation spectrum and contributes to the NECTIN4-related clinical characterization. © 2018 John Wiley & Sons Ltd/University College London.
Exome Sequencing in an Admixed Isolated Population Indicates NFXL1 Variants Confer a Risk for Specific Language Impairment

PubMed Central

Villanueva, Pía; Nudel, Ron; Hoischen, Alexander; Fernández, María Angélica; Simpson, Nuala H.; Gilissen, Christian; Reader, Rose H.; Jara, Lillian; Echeverry, Maria Magdalena; Francks, Clyde; Baird, Gillian; Conti-Ramsden, Gina; O’Hare, Anne; Bolton, Patrick F.; Hennessy, Elizabeth R.; Palomino, Hernán; Carvajal-Carmona, Luis; Veltman, Joris A.; Cazier, Jean-Baptiste; De Barbieri, Zulema

2015-01-01

Children affected by Specific Language Impairment (SLI) fail to acquire age appropriate language skills despite adequate intelligence and opportunity. SLI is highly heritable, but the understanding of underlying genetic mechanisms has proved challenging. In this study, we use molecular genetic techniques to investigate an admixed isolated founder population from the Robinson Crusoe Island (Chile), who are affected by a high incidence of SLI, increasing the power to discover contributory genetic factors. We utilize exome sequencing in selected individuals from this population to identify eight coding variants that are of putative significance. We then apply association analyses across the wider population to highlight a single rare coding variant (rs144169475, Minor Allele Frequency of 4.1% in admixed South American populations) in the NFXL1 gene that confers a nonsynonymous change (N150K) and is significantly associated with language impairment in the Robinson Crusoe population (p = 2.04 × 10⁻⁴, 8 variants tested). Subsequent sequencing of NFXL1 in 117 UK SLI cases identified four individuals with heterozygous variants predicted to be of functional consequence. We conclude that coding variants within NFXL1 confer an increased risk of SLI within a complex genetic model. PMID:25781923
Next-generation DNA sequencing identifies novel gene variants and pathways involved in specific language impairment.

PubMed

Chen, Xiaowei Sylvia; Reader, Rose H; Hoischen, Alexander; Veltman, Joris A; Simpson, Nuala H; Francks, Clyde; Newbury, Dianne F; Fisher, Simon E

2017-04-25

A significant proportion of children have unexplained problems acquiring proficient linguistic skills despite adequate intelligence and opportunity. Developmental language disorders are highly heritable with substantial societal impact. Molecular studies have begun to identify candidate loci, but much of the underlying genetic architecture remains undetermined. We performed whole-exome sequencing of 43 unrelated probands affected by severe specific language impairment, followed by independent validations with Sanger sequencing, and analyses of segregation patterns in parents and siblings, to shed new light on aetiology. By first focusing on a pre-defined set of known candidates from the literature, we identified potentially pathogenic variants in genes already implicated in diverse language-related syndromes, including ERC1, GRIN2A, and SRPX2. Complementary analyses suggested novel putative candidates carrying validated variants which were predicted to have functional effects, such as OXR1, SCN9A and KMT2D. We also searched for potential "multiple-hit" cases; one proband carried a rare AUTS2 variant in combination with a rare inherited haplotype affecting STARD9, while another carried a novel nonsynonymous variant in SEMA6D together with a rare stop-gain in SYNPR. On broadening scope to all rare and novel variants throughout the exomes, we identified biological themes that were enriched for such variants, including microtubule transport and cytoskeletal regulation.
Next-generation DNA sequencing identifies novel gene variants and pathways involved in specific language impairment

PubMed Central

Chen, Xiaowei Sylvia; Reader, Rose H.; Hoischen, Alexander; Veltman, Joris A.; Simpson, Nuala H.; Francks, Clyde; Newbury, Dianne F.; Fisher, Simon E.

2017-01-01

A significant proportion of children have unexplained problems acquiring proficient linguistic skills despite adequate intelligence and opportunity. Developmental language disorders are highly heritable with substantial societal impact. Molecular studies have begun to identify candidate loci, but much of the underlying genetic architecture remains undetermined. We performed whole-exome sequencing of 43 unrelated probands affected by severe specific language impairment, followed by independent validations with Sanger sequencing, and analyses of segregation patterns in parents and siblings, to shed new light on aetiology. By first focusing on a pre-defined set of known candidates from the literature, we identified potentially pathogenic variants in genes already implicated in diverse language-related syndromes, including ERC1, GRIN2A, and SRPX2. Complementary analyses suggested novel putative candidates carrying validated variants which were predicted to have functional effects, such as OXR1, SCN9A and KMT2D. We also searched for potential “multiple-hit” cases; one proband carried a rare AUTS2 variant in combination with a rare inherited haplotype affecting STARD9, while another carried a novel nonsynonymous variant in SEMA6D together with a rare stop-gain in SYNPR. On broadening scope to all rare and novel variants throughout the exomes, we identified biological themes that were enriched for such variants, including microtubule transport and cytoskeletal regulation. PMID:28440294
Ataxia telangiectasia presenting as dopa-responsive cervical dystonia

PubMed Central

Mohire, Mahavir D.; Schneider, Susanne A.; Stamelou, Maria; Wood, Nicholas W.; Bhatia, Kailash P.

2013-01-01

Objective: To identify the cause of cervical dopa-responsive dystonia (DRD) in a Muslim Indian family inherited in an apparently autosomal recessive fashion, as previously described in this journal. Methods: Previous testing for mutations in the genes known to cause DRD (GCH1, TH, and SPR) had been negative. Whole exome sequencing was performed on all 3 affected individuals for whom DNA was available to identify potentially pathogenic shared variants. Genotyping data obtained for all 3 affected individuals using the OmniExpress single nucleotide polymorphism chip (Illumina, San Diego, CA) were used to perform linkage analysis, autozygosity mapping, and copy number variation analysis. Sanger sequencing was used to confirm all variants. Results: After filtering of the variants, exome sequencing revealed 2 genes harboring potentially pathogenic compound heterozygous variants (ATM and LRRC16A). Of these, the variants in ATM segregated perfectly with the cervical DRD. Both mutations detected in ATM have been shown to be pathogenic, and α-fetoprotein, a marker of ataxia telangiectasia, was increased in all affected individuals. Conclusion: Biallelic mutations in ATM can cause DRD, and mutations in this gene should be considered in the differential diagnosis of unexplained DRD, particularly if the dystonia is cervical and if there is a recessive family history. ATM has previously been reported to cause isolated cervical dystonia, but never, to our knowledge, DRD. Individuals with dystonia related to ataxia telangiectasia may benefit from a trial of levodopa. PMID:23946315
Inner retinal dystrophy in a patient with biallelic sequence variants in BRAT1.

PubMed

Oatts, Julius T; Duncan, Jacque L; Hoyt, Creig S; Slavotinek, Anne M; Moore, Anthony T

2017-12-01

Mutations in the BRCA1-associated protein required for the ataxia telangiectasia mutated (ATM) activation-1 (BRAT1) gene cause lethal neonatal rigidity and multifocal seizure syndrome characterized by rigidity and intractable seizures and a milder phenotype with intellectual disability, seizures, nonprogressive cerebellar ataxia or dyspraxia, and cerebellar atrophy. To date, nystagmus, cortical visual impairment, impairment of central vision, optic nerve hypoplasia, and optic atrophy have been described in this condition. This article describes the retinal findings in a patient with biallelic deleterious sequence variants in BRAT1. Case report of a child with biallelic sequence variants in the BRAT1 gene. This patient had developmental delay, microcephaly, nystagmus, and esotropia, and full-field electroretinography (ERG) revealed an inner retinal dystrophy. She was found on exome sequencing to have compound heterozygous sequence variants in the BRAT1 gene: one maternally inherited frameshift variant (c.294dupA, predicting p.Leu99Thrfs*92), which has previously been reported, and one paternally inherited novel missense variant (c.803G>A, p.Arg268His), which is likely to affect protein function. Biallelic sequence variants in BRAT1 have been reported to cause a variety of ocular and systemic manifestations, but to our knowledge, this is the first report of inner retinal dysfunction manifest as selective loss of full-field ERG scotopic and photopic b-wave amplitudes.
Two missense mutations in melanocortin 1 receptor (MC1R) are strongly associated with dark ventral coat color in reindeer (Rangifer tarandus).

PubMed

Våge, D I; Nieminen, M; Anderson, D G; Røed, K H

2014-10-01

The protein-coding region of melanocortin 1 receptor (MC1R) was sequenced to identify potential variation affecting coat color in reindeer (Rangifer tarandus). A T→C sequence variation at nucleotide position 218 (c.218T>C) causing an amino acid (aa) change from methionine to threonine at aa position 73 (p.Met73Thr) was identified. In addition, a T→G sequence variation was found at nucleotide position 839 (c.839T>G), causing phenylalanine to be exchanged by cysteine at aa position 280 (p.Phe280Cys). The two sequence variants (c.218C and c.839G) were found to be closely associated with a darker belly coat compared with animals not having any of these two variants. The aa change p.Met73Thr affects the same position as p.Met73Lys previously reported to give constitutive activation of MC1R in black sheep (Ovis aries), whereas p.Phe280Cys is identical to one of two variants previously reported to be associated with dark coat color in Arctic fox (Alopex lagopus), supporting that the two variants found in reindeer are functional. The complete absence of Thr73 and Cys280 among the 51 wild reindeer analyzed provides some evidence that these variants are more common in the domestic herds. © 2014 Stichting International Foundation for Animal Genetics.
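The two substitutions reported above can be traced through the standard genetic code. In the sketch below, the reference codon for Met73 must be ATG, since methionine has a single codon; the abstract does not give the reference codon for Phe280, so both phenylalanine codons (TTT and TTC) are tried as an assumption. The function name is hypothetical.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def apply_substitution(ref_codon: str, cdna_position: int, alt_base: str) -> str:
    """Replace the base of ref_codon that corresponds to a 1-based coding position."""
    offset = (cdna_position - 1) % 3  # 0, 1, or 2 within the codon
    return ref_codon[:offset] + alt_base + ref_codon[offset + 1:]


# c.218T>C in codon 73: ATG (Met) becomes ACG.
met_variant = apply_substitution("ATG", 218, "C")
print(met_variant, CODON_TABLE[met_variant])

# c.839T>G in codon 280: either Phe codon gives a Cys codon.
for ref in ("TTT", "TTC"):
    alt = apply_substitution(ref, 839, "G")
    print(ref, "->", alt, CODON_TABLE[alt])
```

Both positions fall on the second base of their codons, so the T→C change turns ATG into ACG (threonine) and the T→G change turns TTT or TTC into TGT or TGC (cysteine), in agreement with p.Met73Thr and p.Phe280Cys.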
Whole-genome sequencing reveals a coding non-pathogenic variant tagging a non-coding pathogenic hexanucleotide repeat expansion in C9orf72 as cause of amyotrophic lateral sclerosis.

PubMed

Herdewyn, Sarah; Zhao, Hui; Moisse, Matthieu; Race, Valérie; Matthijs, Gert; Reumers, Joke; Kusters, Benno; Schelhaas, Helenius J; van den Berg, Leonard H; Goris, An; Robberecht, Wim; Lambrechts, Diether; Van Damme, Philip

2012-06-01

Motor neuron degeneration in amyotrophic lateral sclerosis (ALS) has a familial cause in 10% of patients. Despite significant advances in the genetics of the disease, many families remain unexplained. We performed whole-genome sequencing in five family members from a pedigree with autosomal-dominant classical ALS. A family-based elimination approach was used to identify novel coding variants segregating with the disease. This list of variants was effectively shortened by genotyping these variants in 2 additional unaffected family members and 1500 unrelated population-specific controls. A novel rare coding variant in SPAG8 on chromosome 9p13.3 segregated with the disease and was not observed in controls. Mutations in SPAG8 were not encountered in 34 other unexplained ALS pedigrees, including 1 with linkage to chromosome 9p13.2-23.3. The shared haplotype containing the SPAG8 variant in this small pedigree was 22.7 Mb and overlapped with the core 9p21 linkage locus for ALS and frontotemporal dementia. Based on differences in coverage depth of known variable tandem repeat regions between affected and non-affected family members, the shared haplotype was found to contain an expanded hexanucleotide (GGGGCC)n repeat in C9orf72 in the affected members. Our results demonstrate that rare coding variants identified by whole-genome sequencing can tag a shared haplotype containing a non-coding pathogenic mutation and that changes in coverage depth can be used to reveal tandem repeat expansions. It also confirms (GGGGCC)n repeat expansions in C9orf72 as a cause of familial ALS.
Association Between Germline Mutation in VSIG10L and Familial Barrett Neoplasia.

PubMed

Fecteau, Ryan E; Kong, Jianping; Kresak, Adam; Brock, Wendy; Song, Yeunjoo; Fujioka, Hisashi; Elston, Robert; Willis, Joseph E; Lynch, John P; Markowitz, Sanford D; Guda, Kishore; Chak, Amitabh

2016-10-01

Esophageal adenocarcinoma and its precursor lesion Barrett esophagus have seen a dramatic increase in incidence over the past 4 decades yet marked genetic heterogeneity of this disease has precluded advances in understanding its pathogenesis and improving treatment. To identify novel disease susceptibility variants in a familial syndrome of esophageal adenocarcinoma and Barrett esophagus, termed familial Barrett esophagus, by using high-throughput sequencing in affected individuals from a large, multigenerational family. We performed whole exome sequencing (WES) from peripheral lymphocyte DNA on 4 distant relatives from our multiplex, multigenerational familial Barrett esophagus family to identify candidate disease susceptibility variants. Gene variants were filtered, verified, and segregation analysis performed to identify a single candidate variant. Gene expression analysis was done with both quantitative real-time polymerase chain reaction and in situ RNA hybridization. A 3-dimensional organotypic cell culture model of esophageal maturation was utilized to determine the phenotypic effects of our gene variant. We used electron microscopy on esophageal mucosa from an affected family member carrying the gene variant to assess ultrastructural changes. Identification of a novel, germline disease susceptibility variant in a previously uncharacterized gene. A multiplex, multigenerational family with 14 members affected (3 members with esophageal adenocarcinoma and 11 with Barrett esophagus) was identified, and whole-exome sequencing identified a germline mutation (S631G) at a highly conserved serine residue in the uncharacterized gene VSIG10L that segregated in affected members. Transfection of S631G variant into a 3-dimensional organotypic culture model of normal esophageal squamous cells dramatically inhibited epithelial maturation compared with the wild-type. 
VSIG10L exhibited high expression in normal squamous esophagus with marked loss of expression in Barrett-associated lesions. Electron microscopy of squamous esophageal mucosa harboring the S631G variant revealed dilated intercellular spaces and reduced desmosomes. This study presents VSIG10L as a candidate familial Barrett esophagus susceptibility gene, with a putative role in maintaining normal esophageal homeostasis. Further research assessing VSIG10L function may reveal pathways important for esophageal maturation and the pathogenesis of Barrett esophagus and esophageal adenocarcinoma.
Association Between Germline Mutation in VSIG10L and Familial Barrett Neoplasia

PubMed Central

Fecteau, Ryan E.; Kong, Jianping; Kresak, Adam; Brock, Wendy; Song, Yeunjoo; Fujioka, Hisashi; Elston, Robert; Willis, Joseph E.; Lynch, John P.; Markowitz, Sanford D.; Guda, Kishore; Chak, Amitabh

2016-01-01

IMPORTANCE Esophageal adenocarcinoma and its precursor lesion Barrett esophagus have seen a dramatic increase in incidence over the past 4 decades yet marked genetic heterogeneity of this disease has precluded advances in understanding its pathogenesis and improving treatment. OBJECTIVE To identify novel disease susceptibility variants in a familial syndrome of esophageal adenocarcinoma and Barrett esophagus, termed familial Barrett esophagus, by using high-throughput sequencing in affected individuals from a large, multigenerational family. DESIGN, SETTING, AND PARTICIPANTS We performed whole exome sequencing (WES) from peripheral lymphocyte DNA on 4 distant relatives from our multiplex, multigenerational familial Barrett esophagus family to identify candidate disease susceptibility variants. Gene variants were filtered, verified, and segregation analysis performed to identify a single candidate variant. Gene expression analysis was done with both quantitative real-time polymerase chain reaction and in situ RNA hybridization. A 3-dimensional organotypic cell culture model of esophageal maturation was utilized to determine the phenotypic effects of our gene variant. We used electron microscopy on esophageal mucosa from an affected family member carrying the gene variant to assess ultrastructural changes. MAIN OUTCOMES AND MEASURES Identification of a novel, germline disease susceptibility variant in a previously uncharacterized gene. RESULTS A multiplex, multigenerational family with 14 members affected (3 members with esophageal adenocarcinoma and 11 with Barrett esophagus) was identified, and whole-exome sequencing identified a germline mutation (S631G) at a highly conserved serine residue in the uncharacterized gene VSIG10L that segregated in affected members. Transfection of S631G variant into a 3-dimensional organotypic culture model of normal esophageal squamous cells dramatically inhibited epithelial maturation compared with the wild-type. 
VSIG10L exhibited high expression in normal squamous esophagus with marked loss of expression in Barrett-associated lesions. Electron microscopy of squamous esophageal mucosa harboring the S631G variant revealed dilated intercellular spaces and reduced desmosomes. CONCLUSIONS AND RELEVANCE This study presents VSIG10L as a candidate familial Barrett esophagus susceptibility gene, with a putative role in maintaining normal esophageal homeostasis. Further research assessing VSIG10L function may reveal pathways important for esophageal maturation and the pathogenesis of Barrett esophagus and esophageal adenocarcinoma. PMID:27467440
Myopathy With SQSTM1 and TIA1 Variants: Clinical and Pathological Features.

PubMed

Niu, Zhiyv; Pontifex, Carly Sabine; Berini, Sarah; Hamilton, Leslie E; Naddaf, Elie; Wieben, Eric; Aleff, Ross A; Martens, Kristina; Gruber, Angela; Engel, Andrew G; Pfeffer, Gerald; Milone, Margherita

2018-01-01

The aim of this study is to identify the molecular defect of three unrelated individuals with late-onset predominant distal myopathy; to describe the spectrum of phenotype resulting from the contributing role of two variants in genes located on two different chromosomes; and to highlight the underappreciated complex forms of genetic myopathies. Clinical and laboratory data of three unrelated probands with predominantly distal weakness manifesting in the sixth-seventh decade of life, and available affected and unaffected family members were reviewed. Next-generation sequencing panel, whole exome sequencing, and targeted analyses of family members were performed to elucidate the genetic etiology of the myopathy. Genetic analyses detected two contributing variants located on different chromosomes in three unrelated probands: a heterozygous pathogenic mutation in SQSTM1 (c.1175C>T, p.Pro392Leu) and a heterozygous variant in TIA1 (c.1070A>G, p.Asn357Ser). The affected fraternal twin of one proband also carries both variants, while the unaffected family members harbor one or none. Two unrelated probands (family 1, II.3, and family 3, II.1) have a distal myopathy with rimmed vacuoles that manifested with index extensor weakness; the other proband (family 2, I.1) has myofibrillar myopathy manifesting with hypercapnic respiratory insufficiency and distal weakness. The findings indicate that all the affected individuals have a myopathy associated with both variants in SQSTM1 and TIA1, respectively, suggesting that the two variants determine the phenotype and likely functionally interact. We speculate that the TIA1 variant is a modifier of the SQSTM1 mutation. 
We identify the combination of SQSTM1 and TIA1 variants as a novel genetic defect associated with myofibrillar myopathy and suggest sequencing both genes in the molecular investigation of myopathy with rimmed vacuoles and myofibrillar myopathy, although additional studies are needed to investigate the digenic nature of the disease.
Anticipation in a family with primary familial brain calcification caused by an SLC20A2 variant.

PubMed

Konno, Takuya; Blackburn, Patrick R; Rozen, Todd D; van Gerpen, Jay A; Ross, Owen A; Atwal, Paldeep S; Wszolek, Zbigniew K

2018-04-11

To describe a family with primary familial brain calcification (PFBC) due to an SLC20A2 variant showing possible genetic anticipation. We conducted historical, genealogical, clinical, and radiologic studies of a family with PFBC. Clinical evaluations including neurological examination and head computed tomography (CT) scans of a proband and her father were performed. They provided additional information regarding other family members. To identify a causative gene variant, we performed whole-exome sequencing for the proband followed by segregation analysis in other affected members using direct sequencing. In this family, nine affected members were identified over four generations. The proband suffered from chronic daily headache including thunderclap headache. We identified an SLC20A2 (c.509delT, p.(Leu170*)) variant in three affected members over three generations. Interestingly, the age of onset became younger as the disease passed through successive generations, suggestive of genetic anticipation. For clinical purposes, it is important to consider thunderclap headache and genetic anticipation in PFBC caused by SLC20A2 variants. Further investigation is required to validate our observation. Copyright © 2018 Polish Neurological Society. Published by Elsevier Urban & Partner Sp. z o.o. All rights reserved.
High-throughput sequencing of mGluR signaling pathway genes reveals enrichment of rare variants in autism.

PubMed

Kelleher, Raymond J; Geigenmüller, Ute; Hovhannisyan, Hayk; Trautman, Edwin; Pinard, Robert; Rathmell, Barbara; Carpenter, Randall; Margulies, David

2012-01-01

Identification of common molecular pathways affected by genetic variation in autism is important for understanding disease pathogenesis and devising effective therapies. Here, we test the hypothesis that rare genetic variation in the metabotropic glutamate-receptor (mGluR) signaling pathway contributes to autism susceptibility. Single-nucleotide variants in genes encoding components of the mGluR signaling pathway were identified by high-throughput multiplex sequencing of pooled samples from 290 non-syndromic autism cases and 300 ethnically matched controls on two independent next-generation platforms. This analysis revealed significant enrichment of rare functional variants in the mGluR pathway in autism cases. Higher burdens of rare, potentially deleterious variants were identified in autism cases for three pathway genes previously implicated in syndromic autism spectrum disorder, TSC1, TSC2, and SHANK3, suggesting that genetic variation in these genes also contributes to risk for non-syndromic autism. In addition, our analysis identified HOMER1, which encodes a postsynaptic density-localized scaffolding protein that interacts with Shank3 to regulate mGluR activity, as a novel autism-risk gene. Rare, potentially deleterious HOMER1 variants identified uniquely in the autism population affected functionally important protein regions or regulatory sequences and co-segregated closely with autism among children of affected families. We also identified rare ASD-associated coding variants predicted to have damaging effects on components of the Ras/MAPK cascade. Collectively, these findings suggest that altered signaling downstream of mGluRs contributes to the pathogenesis of non-syndromic autism.
High-Throughput Sequencing of mGluR Signaling Pathway Genes Reveals Enrichment of Rare Variants in Autism

PubMed Central

Hovhannisyan, Hayk; Trautman, Edwin; Pinard, Robert; Rathmell, Barbara; Carpenter, Randall; Margulies, David

2012-01-01

Identification of common molecular pathways affected by genetic variation in autism is important for understanding disease pathogenesis and devising effective therapies. Here, we test the hypothesis that rare genetic variation in the metabotropic glutamate-receptor (mGluR) signaling pathway contributes to autism susceptibility. Single-nucleotide variants in genes encoding components of the mGluR signaling pathway were identified by high-throughput multiplex sequencing of pooled samples from 290 non-syndromic autism cases and 300 ethnically matched controls on two independent next-generation platforms. This analysis revealed significant enrichment of rare functional variants in the mGluR pathway in autism cases. Higher burdens of rare, potentially deleterious variants were identified in autism cases for three pathway genes previously implicated in syndromic autism spectrum disorder, TSC1, TSC2, and SHANK3, suggesting that genetic variation in these genes also contributes to risk for non-syndromic autism. In addition, our analysis identified HOMER1, which encodes a postsynaptic density-localized scaffolding protein that interacts with Shank3 to regulate mGluR activity, as a novel autism-risk gene. Rare, potentially deleterious HOMER1 variants identified uniquely in the autism population affected functionally important protein regions or regulatory sequences and co-segregated closely with autism among children of affected families. We also identified rare ASD-associated coding variants predicted to have damaging effects on components of the Ras/MAPK cascade. Collectively, these findings suggest that altered signaling downstream of mGluRs contributes to the pathogenesis of non-syndromic autism. PMID:22558107
Sequencing of sporadic Attention-Deficit Hyperactivity Disorder (ADHD) identifies novel and potentially pathogenic de novo variants and excludes overlap with genes associated with autism spectrum disorder.

PubMed

Kim, Daniel Seung; Burt, Amber A; Ranchalis, Jane E; Wilmot, Beth; Smith, Joshua D; Patterson, Karynne E; Coe, Bradley P; Li, Yatong K; Bamshad, Michael J; Nikolas, Molly; Eichler, Evan E; Swanson, James M; Nigg, Joel T; Nickerson, Deborah A; Jarvik, Gail P

2017-06-01

Attention-Deficit Hyperactivity Disorder (ADHD) has high heritability; however, studies of common variation account for <5% of ADHD variance. Using data from affected participants without a family history of ADHD, we sought to identify de novo variants that could account for sporadic ADHD. Considering a total of 128 families, two analyses were conducted in parallel: first, in 11 unaffected parent/affected proband trios (or quads with the addition of an unaffected sibling) we completed exome sequencing. Six de novo missense variants at highly conserved bases were identified and validated from four of the 11 families: the brain-expressed genes TBC1D9, DAGLA, QARS, CSMD2, TRPM2, and WDR83. Separately, in 117 unrelated probands with sporadic ADHD, we sequenced a panel of 26 genes implicated in intellectual disability (ID) and autism spectrum disorder (ASD) to evaluate whether variation in ASD/ID-associated genes was also present in participants with ADHD. Only one putative deleterious variant (Gln600STOP) in CHD1L was identified; this was found in a single proband. Notably, no other nonsense, splice, frameshift, or highly conserved missense variants in the 26 gene panel were identified and validated. These data suggest that de novo variant analysis in families with independently adjudicated sporadic ADHD diagnosis can identify novel genes implicated in ADHD pathogenesis. Moreover, that only one of the 128 cases (0.8%, 11 exome, and 117 MIP sequenced participants) had putative deleterious variants within our data in 26 genes related to ID and ASD suggests significant independence in the genetic pathogenesis of ADHD as compared to ASD and ID phenotypes. © 2017 Wiley Periodicals, Inc.
Molecular characterization of canine parvovirus variants (CPV-2a, CPV-2b, and CPV-2c) based on the VP2 gene in affected domestic dogs in Ecuador.

PubMed

De la Torre, David; Mafla, Eulalia; Puga, Byron; Erazo, Linda; Astolfi-Ferreira, Claudete; Ferreira, Antonio Piantino

2018-04-01

The objective of this study was to determine the presence of the variants of canine parvovirus (CPV)-2 in the city of Quito, Ecuador, due to the high domestic and street-type canine population, and to identify possible mutations at a genetic level that could be causing structural changes in the virus with a consequent influence on the immune response of the hosts. Thirty-five stool samples from different puppies with characteristic signs of the disease and positive for CPV through immunochromatography kits were collected from different veterinary clinics of the city. Polymerase chain reaction and DNA sequencing were used to determine the mutations in residue 426 of the VP2 gene, which determines the variants of CPV-2; in addition, four samples were chosen for complete sequencing of the VP2 gene to identify all possible mutations in the circulating strains in this region of the country. The results revealed the presence of the three variants of CPV-2 with a prevalence of 57.1% (20/35) for CPV-2a, 8.5% (3/35) for CPV-2b, and 34.3% (12/35) for CPV-2c. In addition, complete sequencing of the VP2 gene showed amino acid substitutions in residues 87, 101, 139, 219, 297, 300, 305, 322, 324, 375, 386, 426, 440, and 514 of the three Ecuadorian variants when compared with the original CPV-2 sequence. This study describes the detection of CPV variants in the city of Quito, Ecuador. Variants of CPV-2 (2a, 2b, and 2c) have been reported in South America, and there are cases in Ecuador where CPV-2 is affecting even vaccinated puppies.
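The prevalence figures in the record above follow directly from the reported counts. The short Python sketch below recomputes them; the variable and function names are illustrative only and are not taken from the study.

```python
# Counts of each CPV-2 variant among the 35 positive samples, as reported above.
counts = {"CPV-2a": 20, "CPV-2b": 3, "CPV-2c": 12}
total = sum(counts.values())


def prevalence(count, total):
    """Return the share of samples carrying a variant, in percent."""
    return 100.0 * count / total


prevalences = {name: prevalence(n, total) for name, n in counts.items()}

for name, pct in prevalences.items():
    print(f"{name}: {counts[name]}/{total} = {pct:.2f}%")
```

Note that 3/35 is about 8.57%, so the reported 8.5% appears to be truncated rather than rounded.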

Cystinuria Associated with Different SLC7A9 Gene Variants in the Cat

PubMed Central

Raj, Karthik; Osborne, Carl; Giger, Urs

2016-01-01

Cystinuria is a classical inborn error of metabolism characterized by a selective proximal renal tubular defect affecting cystine, ornithine, lysine, and arginine (COLA) reabsorption, which can lead to uroliths and urinary obstruction. In humans, dogs and mice, cystinuria is caused by variants in one of two genes, SLC3A1 and SLC7A9, which encode the rBAT and b0,+AT subunits of the b0,+ basic amino acid transporter system, respectively. In this study, exons and flanking regions of the SLC3A1 and SLC7A9 genes were sequenced from genomic DNA of cats (Felis catus) with COLAuria and cystine calculi. Relative to the Felis catus-6.2 reference genome sequence, DNA sequences from these affected cats revealed 3 unique homozygous SLC7A9 missense variants: one in exon 5 (p.Asp236Asn) from a non-purpose-bred medium-haired cat, one in exon 7 (p.Val294Glu) in a Maine Coon and a Sphynx cat, and one in exon 10 (p.Thr392Met) from a non-purpose-bred long-haired cat. A genotyping assay subsequently identified another cystinuric domestic medium-haired cat that was homozygous for the variant originally identified in the purebred cats. These missense variants result in deleterious amino acid substitutions of highly conserved residues in the b0,+AT protein. A limited population survey supported that the variants found were likely causative. The remaining 2 sequenced domestic short-haired cats had a heterozygous variant at a splice donor site in intron 10 and a homozygous single nucleotide variant at a branchpoint in intron 11 of SLC7A9, respectively. This study identifies the first SLC7A9 variants causing feline cystinuria and reveals that, as in humans and dogs, this disease is genetically heterogeneous in cats. PMID:27404572
Rare and Coding Region Genetic Variants Associated With Risk of Ischemic Stroke: The NHLBI Exome Sequence Project.

PubMed

Auer, Paul L; Nalls, Mike; Meschia, James F; Worrall, Bradford B; Longstreth, W T; Seshadri, Sudha; Kooperberg, Charles; Burger, Kathleen M; Carlson, Christopher S; Carty, Cara L; Chen, Wei-Min; Cupples, L Adrienne; DeStefano, Anita L; Fornage, Myriam; Hardy, John; Hsu, Li; Jackson, Rebecca D; Jarvik, Gail P; Kim, Daniel S; Lakshminarayan, Kamakshi; Lange, Leslie A; Manichaikul, Ani; Quinlan, Aaron R; Singleton, Andrew B; Thornton, Timothy A; Nickerson, Deborah A; Peters, Ulrike; Rich, Stephen S

2015-07-01

Stroke is the second leading cause of death and the third leading cause of years of life lost. Genetic factors contribute to stroke prevalence, and candidate gene and genome-wide association studies (GWAS) have identified variants associated with ischemic stroke risk. These variants often have small effects without obvious biological significance. Exome sequencing may discover predicted protein-altering variants with a potentially large effect on ischemic stroke risk. To investigate the contribution of rare and common genetic variants to ischemic stroke risk by targeting the protein-coding regions of the human genome. The National Heart, Lung, and Blood Institute (NHLBI) Exome Sequencing Project (ESP) analyzed approximately 6000 participants from numerous cohorts of European and African ancestry. For discovery, 365 cases of ischemic stroke (small-vessel and large-vessel subtypes) and 809 European ancestry controls were sequenced; for replication, 47 affected sibpairs concordant for stroke subtype and an African American case-control series were sequenced, with 1672 cases and 4509 European ancestry controls genotyped. The ESP's exome sequencing and genotyping started on January 1, 2010, and continued through June 30, 2012. Analyses were conducted on the full data set between July 12, 2012, and July 13, 2013. Discovery of new variants or genes contributing to ischemic stroke risk and subtype (primary analysis) and determination of support for protein-coding variants contributing to risk in previously published candidate genes (secondary analysis). 
We identified 2 novel genes associated with an increased risk of ischemic stroke: a protein-coding variant in PDE4DIP (rs1778155; odds ratio, 2.15; P = 2.63 × 10(-8)) with an intracellular signal transduction mechanism and in ACOT4 (rs35724886; odds ratio, 2.04; P = 1.24 × 10(-7)) with a fatty acid metabolism; confirmation of PDE4DIP was observed in affected sibpair families with large-vessel stroke subtype and in African Americans. Replication of protein-coding variants in candidate genes was observed for 2 previously reported GWAS associations: ZFHX3 (cardioembolic stroke) and ABCA1 (large-vessel stroke). Exome sequencing discovered 2 novel genes and mechanisms, PDE4DIP and ACOT4, associated with increased risk for ischemic stroke. In addition, ZFHX3 and ABCA1 were discovered to have protein-coding variants associated with ischemic stroke. These results suggest that genetic variation in novel pathways contributes to ischemic stroke risk and serves as a target for prediction, prevention, and therapy.
Novel genes and mutations in patients affected by recurrent pregnancy loss.

PubMed

Quintero-Ronderos, Paula; Mercier, Eric; Fukuda, Michiko; González, Ronald; Suárez, Carlos Fernando; Patarroyo, Manuel Alfonso; Vaiman, Daniel; Gris, Jean-Christophe; Laissue, Paul

2017-01-01

Recurrent pregnancy loss is a frequently occurring human infertility-related disease affecting ~1% of women. It has been estimated that the cause remains unexplained in >50% of cases, which strongly suggests that genetic factors may contribute towards the phenotype. Concerning its molecular aetiology, numerous studies have had limited success in identifying the disease's genetic causes. This might have been due to the fact that hundreds of genes are involved in each physiological step necessary for guaranteeing reproductive success in mammals. In such a scenario, next generation sequencing provides a potentially interesting tool for research into recurrent pregnancy loss causative mutations. The present study involved whole-exome sequencing and an innovative bioinformatics analysis, for the first time, in 49 unrelated women affected by recurrent pregnancy loss. We identified 27 coding variants (22 genes) potentially related to the phenotype (41% of patients). The affected genes, which were enriched by potentially deleterious sequence variants, belonged to distinct molecular cascades playing key roles in implantation/pregnancy biology. Using a quantum chemical approach, we established that mutations in MMP-10 and FGA proteins led to substantial energetic modifications, suggesting an impact on their functions and/or stability. The next generation sequencing and bioinformatics approaches presented here represent an efficient way to find mutations, having potentially moderate/strong functional effects, associated with recurrent pregnancy loss aetiology. We consider that some of these variants (and genes) represent probable future biomarkers for recurrent pregnancy loss.
A sequence variant associating with educational attainment also affects childhood cognition.

PubMed

Gunnarsson, Bjarni; Jónsdóttir, Guðrún A; Björnsdóttir, Gyða; Konte, Bettina; Sulem, Patrick; Kristmundsdóttir, Snædís; Kehr, Birte; Gústafsson, Ómar; Helgason, Hannes; Iordache, Paul D; Ólafsson, Sigurgeir; Frigge, Michael L; Þorleifsson, Guðmar; Arnarsdóttir, Sunna; Stefánsdóttir, Berglind; Giegling, Ina; Djurovic, Srdjan; Sundet, Kjetil S; Espeseth, Thomas; Melle, Ingrid; Hartmann, Annette M; Thorsteinsdottir, Unnur; Kong, Augustine; Guðbjartsson, Daníel F; Ettinger, Ulrich; Andreassen, Ole A; Rujescu, Dan; Halldórsson, Jónas G; Stefánsson, Hreinn; Halldórsson, Bjarni V; Stefánsson, Kári

2016-11-04

Only a few common variants in the sequence of the genome have been shown to impact cognitive traits. Here we demonstrate that polygenic scores of educational attainment predict specific aspects of childhood cognition, as measured with IQ. Recently, three sequence variants were shown to associate with educational attainment, a confluence phenotype of genetic and environmental factors contributing to academic success. We show that one of these variants associating with educational attainment, rs4851266-T, also associates with Verbal IQ in dyslexic children (P = 4.3 × 10(-4), β = 0.16 s.d.). The effect of 0.16 s.d. corresponds to 1.4 IQ points for heterozygotes and 2.8 IQ points for homozygotes. We verified this association in independent samples consisting of adults (P = 8.3 × 10(-5), β = 0.12 s.d., combined P = 2.2 × 10(-7), β = 0.14 s.d.). Childhood cognition is unlikely to be affected by education attained later in life, and the variant explains a greater fraction of the variance in verbal IQ than in educational attainment (0.7% vs 0.12%, P = 1.0 × 10(-5)).
Novel pathogenic variant (c.3178G>A) in the SMC1A gene in a family with Cornelia de Lange syndrome identified by exome sequencing.

PubMed

Jang, Mi Ae; Lee, Chang Woo; Kim, Jin Kyung; Ki, Chang Seok

2015-11-01

Cornelia de Lange syndrome (CdLS) is a clinically and genetically heterogeneous congenital anomaly. Mutations in the NIPBL gene account for a half of the affected individuals. We describe a family with CdLS carrying a novel pathogenic variant of the SMC1A gene identified by exome sequencing. The proband was a 3-yr-old boy presenting with a developmental delay. He had distinctive facial features without major structural anomalies and tested negative for the NIPBL gene. His younger sister, mother, and maternal grandmother presented with mild mental retardation. By exome sequencing of the proband, a novel SMC1A variant, c.3178G>A, was identified, which was expected to cause an amino acid substitution (p.Glu1060Lys) in the highly conserved coiled-coil domain of the SMC1A protein. Sanger sequencing confirmed that the three female relatives with mental retardation also carry this variant. Our results reveal that SMC1A gene defects are associated with milder phenotypes of CdLS. Furthermore, we showed that exome sequencing could be a useful tool to identify pathogenic variants in patients with CdLS.
Whole exome sequencing in an Italian family with isolated maxillary canine agenesis and canine eruption anomalies.

PubMed

Barbato, Ersilia; Traversa, Alice; Guarnieri, Rosanna; Giovannetti, Agnese; Genovesi, Maria Luce; Magliozzi, Maria Rosa; Paolacci, Stefano; Ciolfi, Andrea; Pizzi, Simone; Di Giorgio, Roberto; Tartaglia, Marco; Pizzuti, Antonio; Caputo, Viviana

2018-07-01

The aim of this study was the clinical and molecular characterization of a family segregating a trait consisting of a phenotype specifically involving the maxillary canines, including agenesis, impaction and ectopic eruption, characterized by incomplete penetrance and variable expressivity. Clinical standardized assessment of 14 family members and a whole-exome sequencing (WES) of three affected subjects were performed. WES data analyses (sequence alignment, variant calling, annotation and prioritization) were carried out using an in-house implemented pipeline. Variant filtering retained coding and splice-site high quality private and rare variants. Variant prioritization was performed taking into account both the disruptive impact and the biological relevance of individual variants and genes. Sanger sequencing was performed to validate the variants of interest and to carry out segregation analysis. Prioritization of variants "by function" allowed the identification of multiple variants contributing to the trait, including two concomitant heterozygous variants in EDARADD (c.308C>T, p.Ser103Phe) and COL5A1 (c.1588G>A, p.Gly530Ser), specifically associated with a more severe phenotype (i.e. canine agenesis). Differently, heterozygous variants in genes encoding proteins with a role in the WNT pathway were shared by subjects showing a phenotype of impacted/ectopic erupted canines. This study characterized the genetic contribution underlying a complex trait consisting of isolated canine anomalies in a medium-sized family, highlighting the role of WNT and EDA cell signaling pathways in tooth development. Copyright © 2018 Elsevier Ltd. All rights reserved.
Whole-Exome Sequencing to Decipher the Genetic Heterogeneity of Hearing Loss in a Chinese Family with Deaf by Deaf Mating

PubMed Central

Qing, Jie; Yan, Denise; Zhou, Yuan; Liu, Qiong; Wu, Weijing; Xiao, Zian; Liu, Yuyuan; Liu, Jia; Du, Lilin; Xie, Dinghua; Liu, Xue Zhong

2014-01-01

Inherited deafness has been shown to have high genetic heterogeneity. For many decades, linkage analysis and candidate gene approaches have been the main tools to elucidate the genetics of hearing loss. However, this associated study design is costly, time-consuming, and unsuitable for small families. This is mainly due to the inadequate numbers of available affected individuals, locus heterogeneity, and assortative mating. Exome sequencing has now become technically feasible and a cost-effective method for detection of disease variants underlying Mendelian disorders due to the recent advances in next-generation sequencing (NGS) technologies. In the present study, we have combined both the Deafness Gene Mutation Detection Array and exome sequencing to identify deafness causative variants in a large Chinese composite family with deaf by deaf mating. The simultaneous screening of the 9 common deafness mutations using the allele-specific PCR based universal array, resulted in the identification of the 1555A>G in the mitochondrial DNA (mtDNA) 12S rRNA in affected individuals in one branch of the family. We then subjected the mutation-negative cases to exome sequencing and identified novel causative variants in the MYH14 and WFS1 genes. This report confirms the effective use of a NGS technique to detect pathogenic mutations in affected individuals who were not candidates for classical genetic studies. PMID:25289672
Short communication: Validation of 4 candidate causative trait variants in 2 cattle breeds using targeted sequence imputation.

PubMed

Pausch, Hubert; Wurmser, Christine; Reinhardt, Friedrich; Emmerling, Reiner; Fries, Ruedi

2015-06-01

Most association studies for pinpointing trait-associated variants are performed within breed. The availability of sequence data from key ancestors of several cattle breeds now enables immediate assessment of the frequency of trait-associated variants in populations different from the mapping population and their imputation into large validation populations. The objective of this study was to validate the effects of 4 putatively causative variants on milk production traits, male fertility, and stature in German Fleckvieh and Holstein-Friesian animals using targeted sequence imputation. We used whole-genome sequence data of 456 animals to impute 4 missense mutations in DGAT1, GHR, PRLR, and PROP1 into 10,363 Fleckvieh and 8,812 Holstein animals. The accuracy of the imputed genotypes exceeded 95% for all variants. Association testing with imputed variants revealed consistent antagonistic effects of the DGAT1 p.A232K and GHR p.F279Y variants on milk yield and protein and fat contents, respectively, in both breeds. The allele frequency of both polymorphisms has changed considerably in the past 20 yr, indicating that they were targets of recent selection for milk production traits. The PRLR p.S18N variant was associated with yield traits in Fleckvieh but not in Holstein, suggesting that it may be in linkage disequilibrium with a mutation affecting yield traits rather than being causal. The reported effects of the PROP1 p.H173R variant on milk production, male fertility, and stature could not be confirmed. Our results demonstrate that population-wide imputation of candidate causal variants from sequence data is feasible, enabling their rapid validation in large independent populations. Copyright © 2015 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Clinical analysis of genome next-generation sequencing data using the Omicia platform

PubMed Central

Coonrod, Emily M; Margraf, Rebecca L; Russell, Archie; Voelkerding, Karl V; Reese, Martin G

2013-01-01

Aims: Next-generation sequencing is being implemented in the clinical laboratory environment for the purposes of candidate causal variant discovery in patients affected with a variety of genetic disorders. The successful implementation of this technology for diagnosing genetic disorders requires a rapid, user-friendly method to annotate variants and generate short lists of clinically relevant variants of interest. This report describes Omicia’s Opal platform, a new software tool designed for variant discovery and interpretation in a clinical laboratory environment. The software allows clinical scientists to process, analyze, interpret and report on personal genome files. Materials & Methods: To demonstrate the software, the authors describe the interactive use of the system for the rapid discovery of disease-causing variants using three cases. Results & Conclusion: Here, the authors show the features of the Opal system and their use in uncovering variants of clinical significance. PMID:23895124
Variants in the PRPF8 Gene are Associated with Glaucoma.

PubMed

Micheal, Shazia; Hogewind, Barend F; Khan, Muhammad Imran; Siddiqui, Sorath Noorani; Zafar, Saemah Nuzhat; Akhtar, Farah; Qamar, Raheel; Hoyng, Carel B; den Hollander, Anneke I

2018-05-01

Glaucoma is a leading cause of irreversible blindness worldwide. Mutations in six genes have been associated with juvenile- and adult-onset familial primary open angle glaucoma (POAG) prior to this report, but they explain only a small proportion of the genetic load. The aim of the study was to identify novel genetic causes of POAG in families with adult-onset glaucoma. Whole exome sequencing (WES) was performed on DNA of two affected individuals, and predicted pathogenic variants were evaluated for segregation in four affected and three unaffected Dutch family members by Sanger sequencing. We identified a pathogenic variant (p.Val956Gly) in the PRPF8 gene, which segregates with the disease in the Dutch family. Targeted Sanger sequencing of PRPF8 in a panel of 40 POAG families (18 Pakistani and 22 Dutch) revealed two additional nonsynonymous variants (p.Pro13Leu and p.Met25Thr), which segregate with the disease in two other Pakistani families. Both variants were then analyzed in a case-control cohort consisting of 320 Pakistani POAG cases and 250 matched controls. The p.Pro13Leu and p.Met25Thr variants were identified in 14 and 20 cases, respectively, while they were not detected in controls (p values 0.0004 and 0.0001, respectively). Previously, PRPF8 mutations have been associated with autosomal dominant retinitis pigmentosa (RP). The PRPF8 variants associated with POAG are located at the N-terminus, while all RP-associated mutations cluster at the C-terminus, indicating a clear genotype-phenotype correlation.
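The case-control comparison in the record above (carriers among 320 cases versus 250 controls) can be illustrated with a small significance test. The abstract does not say which test the authors used, so the Python sketch below assumes a two-sided Fisher's exact test; the function name and the table layout are illustrative choices, not the authors' method.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test for the 2x2 table [[a, b], [c, d]].

    Rows are cases and controls; columns are carriers and non-carriers.
    This is an assumed test; the abstract does not name the one used.
    """
    cases, controls = a + b, c + d
    carriers = a + c
    n = cases + controls

    # Unnormalised hypergeometric weight of seeing k carriers among the cases.
    def weight(k):
        return comb(cases, k) * comb(controls, carriers - k)

    k_min = max(0, carriers - controls)
    k_max = min(carriers, cases)
    observed = weight(a)
    # Sum every table that is no more likely than the observed one.
    extreme = sum(
        w for w in (weight(k) for k in range(k_min, k_max + 1)) if w <= observed
    )
    return extreme / comb(n, carriers)


# Carrier counts from the abstract: 14 and 20 of 320 cases, 0 of 250 controls.
p_pro13leu = fisher_exact_two_sided(14, 320 - 14, 0, 250)
p_met25thr = fisher_exact_two_sided(20, 320 - 20, 0, 250)
print(f"p.Pro13Leu: p = {p_pro13leu:.2g}")
print(f"p.Met25Thr: p = {p_met25thr:.2g}")
```

Because the test is an assumption, the printed values need not reproduce the reported p values exactly; a different test or a reporting floor would give different figures.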
A candidate gene for autoimmune myasthenia gravis

PubMed Central

Landouré, Guida; Knight, Melanie A.; Stanescu, Horia; Taye, Addis A.; Shi, Yijun; Diallo, Oumarou; Johnson, Janel O.; Hernandez, Dena; Traynor, Bryan J.; Biesecker, Leslie G.; Elkahloun, Abdel; Rinaldi, Carlo; Vincent, Angela; Willcox, Nick; Kleta, Robert; Fischbeck, Kenneth H.

2012-01-01

Objective: We sought to identify a causative mutation in a previously reported kindred with parental consanguinity and 5 of 10 siblings with adult-onset autoimmune myasthenia gravis. Methods: We performed genome-wide homozygosity mapping, and sequenced all known genes in the one region of extended homozygosity. Quantitative and allele-specific reverse transcriptase PCR (RT-PCR) were performed on a candidate gene to determine the RNA expression level in affected siblings and controls and the relative abundance of the wild-type and mutant alleles in a heterozygote. Results: A region of shared homozygosity at chromosome 13q13.3–13q14.11 was found in 4 affected siblings and 1 unaffected sibling. A homozygous single nucleotide variant was found in the 3′-untranslated region of the ecto-NADH oxidase 1 gene (ENOX1). No other variants likely to be pathogenic were found in genes in this region or elsewhere. The ENOX1 sequence variant was not found in 764 controls. Quantitative RT-PCR showed that expression of ENOX1 decreased to about 20% of normal levels in lymphoblastoid cells from individuals homozygous for the variant and to about 50% in 2 unaffected heterozygotes. Allele-specific RT-PCR showed a 55%–60% reduction in the level of the variant transcript in heterozygous cells due to reduced mRNA stability. Conclusion: These results indicate that this sequence variant in ENOX1 may contribute to the familial autoimmune myasthenia in these patients. PMID:22744667
Variants in SKP1, PROB1, and IL17B genes at keratoconus 5q31.1–q35.3 susceptibility locus identified by whole-exome sequencing

PubMed Central

Karolak, Justyna A; Gambin, Tomasz; Pitarque, Jose A; Molinari, Andrea; Jhangiani, Shalini; Stankiewicz, Pawel; Lupski, James R; Gajecka, Marzena

2017-01-01

Keratoconus (KTCN) is a protrusion and thinning of the cornea, resulting in impairment of visual function. The extreme genetic heterogeneity makes it difficult to discover factors unambiguously influencing the KTCN phenotype. In this study, we used whole-exome sequencing (WES) and Sanger sequencing to reduce the number of candidate genes at the 5q31.1–q35.3 locus and to prioritize other potentially relevant variants in an Ecuadorian family with KTCN. We applied WES in two affected KTCN individuals from the Ecuadorian family that showed a suggestive linkage between the KTCN phenotype and the 5q31.1–q35.3 locus. Putative variants identified by WES were further evaluated in this family using Sanger sequencing. Exome capture discovered a total of 173 rare (minor allele frequency <0.001 in control population) nonsynonymous variants in both affected individuals. Among them, 16 SNVs were selected for further evaluation. Segregation analysis revealed that variants c.475T>G in SKP1, c.671G>A in PROB1, and c.527G>A in IL17B in the 5q31.1–q35.3 linkage region, and c.850G>A in HKDC1 in the 10q22 locus completely segregated with the phenotype in the studied KTCN family. We demonstrate that a combination of various techniques significantly narrowed the studied genomic region and reduced the list of the putative exonic variants. Moreover, since this locus overlapped two other chromosomal regions previously recognized in distinct KTCN studies, our findings suggest that this 5q31.1–q35.3 locus might be linked with KTCN. PMID:27703147
Functional Assessment of Disease-Associated Regulatory Variants In Vivo Using a Versatile Dual Colour Transgenesis Strategy in Zebrafish

PubMed Central

Bhatia, Shipra; Gordon, Christopher T.; Foster, Robert G.; Melin, Lucie; Abadie, Véronique; Baujat, Geneviève; Vazquez, Marie-Paule; Amiel, Jeanne; Lyonnet, Stanislas; van Heyningen, Veronica; Kleinjan, Dirk A.

2015-01-01

Disruption of gene regulation by sequence variation in non-coding regions of the genome is now recognised as a significant cause of human disease and disease susceptibility. Sequence variants in cis-regulatory elements (CREs), the primary determinants of spatio-temporal gene regulation, can alter transcription factor binding sites. While technological advances have led to easy identification of disease-associated CRE variants, robust methods for discerning functional CRE variants from background variation are lacking. Here we describe an efficient dual-colour reporter transgenesis approach in zebrafish, simultaneously allowing detailed in vivo comparison of spatio-temporal differences in regulatory activity between putative CRE variants and assessment of altered transcription factor binding potential of the variant. We validate the method on known disease-associated elements regulating SHH, PAX6 and IRF6 and subsequently characterise novel, ultra-long-range SOX9 enhancers implicated in the craniofacial abnormality Pierre Robin Sequence. The method provides a highly cost-effective, fast and robust approach for simultaneously unravelling in a single assay whether, where and when in embryonic development a disease-associated CRE-variant is affecting its regulatory function. PMID:26030420
ACSS2 gene variant associated with cleft lip and palate in two independent Hispanic populations.

PubMed

Dodhia, Sonam; Celis, Katrina; Aylward, Alana; Cai, Yi; Fontana, Maria E; Trespalacios, Alberto; Hoffman, David C; Alfonso, Henry Ostos; Eisig, Sidney B; Su, Gloria H; Chung, Wendy K; Haddad, Joseph

2017-10-01

A candidate variant (p.Val496Ala) of the ACSS2 gene (T > C missense, rs59088485 variant at chr20: bp37 33509608) was previously found to consistently segregate with nonsyndromic cleft lip and/or palate (NSCLP) in three Honduran families. Objectives of this study were 1) to investigate the frequency of this ACSS2 variant in Honduran unrelated NSCLP patients and unrelated unaffected controls and 2) to investigate the frequency of this variant in Colombian unrelated affected NSCLP patients and unrelated unaffected controls. Study design: case-control studies. Sanger sequencing of 99 unrelated Honduran NSCLP patients and 215 unrelated unaffected controls for the p.Val496Ala ACSS2 variant was used to determine the carrier frequency in NSCLP patients and controls. Sanger sequencing of 230 unrelated Colombian NSCLP patients and 146 unrelated unaffected controls for the p.Val496Ala ACSS2 variant was used to determine the carrier frequency in NSCLP patients and controls. In the Honduran population, the odds ratio of having NSCLP among carriers of the p.Val496Ala ACSS2 variant was 4.0 (P = .03), with a carrier frequency of seven of 99 (7.1%) in unrelated affected and four of 215 (1.9%) in unrelated unaffected individuals. In the Colombian population, the odds ratio of having NSCLP among carriers of the p.Val496Ala ACSS2 variant was 2.6 (P = .04), with a carrier frequency of 23 of 230 (10.0%) in unrelated affected and six of 146 (4.1%) in unrelated unaffected individuals. These findings support the role of ACSS2 in NSCLP in two independent Hispanic populations from Honduras and Colombia. Level of evidence: NA. Laryngoscope, 127:E336-E339, 2017. © 2017 The American Laryngological, Rhinological and Otological Society, Inc.
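The odds ratios quoted in this abstract can be checked directly from the carrier counts. The sketch below is a minimal worked example; the function name is hypothetical, and it computes only the point estimate, not the P values.

```python
def odds_ratio(carriers_cases, n_cases, carriers_controls, n_controls):
    """Odds ratio from a 2x2 table of carrier status by case/control status."""
    a = carriers_cases                  # carriers among cases
    b = n_cases - carriers_cases        # non-carriers among cases
    c = carriers_controls               # carriers among controls
    d = n_controls - carriers_controls  # non-carriers among controls
    return (a * d) / (b * c)

# Counts as reported in the abstract
or_honduras = odds_ratio(7, 99, 4, 215)
or_colombia = odds_ratio(23, 230, 6, 146)
print(f"Honduras: {or_honduras:.2f}")
print(f"Colombia: {or_colombia:.2f}")
```

Rounded to one decimal place, these agree with the reported values of 4.0 and 2.6.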
Whole-exome sequencing identified a variant in EFTUD2 gene in establishing a genetic diagnosis.

PubMed

Rengasamy Venugopalan, S; Farrow, E G; Lypka, M

2017-06-01

Craniofacial anomalies are complex and have an overlapping phenotype. Mandibulofacial Dysostosis and Oculo-Auriculo-Vertebral Spectrum are conditions that share common craniofacial phenotype and present a challenge in arriving at a diagnosis. In this report, we present a case of female proband who was given a differential diagnosis of Treacher Collins syndrome or Hemifacial Microsomia without certainty. Prior genetic testing reported negative for 22q deletion and FGFR screenings. The objective of this study was to demonstrate the critical role of whole-exome sequencing in establishing a genetic diagnosis of the proband. The participants were 14½-year-old affected female proband/parent trio. Proband/parent trio were enrolled in the study. Surgical tissue sample from the proband and parental blood samples were collected and prepared for whole-exome sequencing. Illumina HiSeq 2500 instrument was used for sequencing (125 nucleotide reads/84X coverage). Analyses of variants were performed using custom-developed software, RUNES and VIKING. Variant analyses following whole-exome sequencing identified a heterozygous de novo pathogenic variant, c.259C>T (p.Gln87*), in EFTUD2 (NM_004247.3) gene in the proband. Previous studies have reported that the variants in EFTUD2 gene were associated with Mandibulofacial Dysostosis with Microcephaly. Patients with facial asymmetry, micrognathia, choanal atresia and microcephaly should be analyzed for variants in EFTUD2 gene. Next-generation sequencing techniques, such as whole-exome sequencing offer great promise to improve the understanding of etiologies of sporadic genetic diseases. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
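The notation c.259C>T (p.Gln87*) can be sanity-checked with simple coordinate arithmetic. The sketch below maps a coding-sequence position to its codon and shows that a C>T change at the first base of a glutamine codon yields a stop codon. The abstract does not say which glutamine codon is involved, so both are tried. All names are illustrative.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}
GLN_CODONS = ("CAA", "CAG")  # the two glutamine codons in the standard genetic code

def codon_position(cds_pos):
    """Return (codon number, position within codon), both 1-based,
    for a 1-based coding-sequence coordinate."""
    return (cds_pos - 1) // 3 + 1, (cds_pos - 1) % 3 + 1

def substitute(codon, pos_in_codon, alt):
    """Replace the base at a 1-based position within a codon."""
    return codon[:pos_in_codon - 1] + alt + codon[pos_in_codon:]

codon_number, pos_in_codon = codon_position(259)
print(codon_number, pos_in_codon)

for codon in GLN_CODONS:
    mutant = substitute(codon, pos_in_codon, "T")
    print(codon, "->", mutant, "stop" if mutant in STOP_CODONS else "not stop")
```

Position 259 falls on the first base of codon 87, consistent with the protein-level description of a nonsense change at Gln87.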
Enhancing genomic prediction with genome-wide association studies in multiparental maize populations

USDA-ARS's Scientific Manuscript database

Genome-wide association mapping using dense marker sets has identified some nucleotide variants affecting complex traits which have been validated with fine-mapping and functional analysis. Many sequence variants associated with complex traits in maize have small effects and low repeatability, howev...
Whole-exome Sequence Analysis Implicates Rare IL17REL Variants in Familial and Sporadic Inflammatory Bowel Disease.

PubMed

Sasaki, Mark M; Skol, Andrew D; Hungate, Eric A; Bao, Riyue; Huang, Lei; Kahn, Stacy A; Allan, James M; Brant, Steven R; McGovern, Dermot P B; Peter, Inga; Silverberg, Mark S; Cho, Judy H; Kirschner, Barbara S; Onel, Kenan

2016-01-01

Rare variants (<1%) likely contribute significantly to risk for common diseases such as inflammatory bowel disease (IBD) in specific patient subsets, such as those with high familiality. They are, however, extraordinarily challenging to identify. To discover candidate rare variants associated with IBD, we performed whole-exome sequencing on 6 members of a pediatric-onset IBD family with multiple affected individuals. To determine whether the variants discovered in this family are also associated with nonfamilial IBD, we investigated their influence on disease in 2 large case-control (CC) series. We identified 2 rare variants, rs142430606 and rs200958270, both in the established IBD-susceptibility gene IL17REL, carried by all 4 affected family members and their obligate carrier parents. We then demonstrated that both variants are associated with sporadic ulcerative colitis (UC) in 2 independent data sets. For UC in CC 1: rs142430606 (odds ratio [OR] = 2.99, Padj = 0.028; minor allele frequency [MAF]cases = 0.0063, MAFcontrols = 0.0021); rs200958270 (OR = 2.61, Padj = 0.082; MAFcases = 0.0045, MAFcontrols = 0.0017). For UC in CC 2: rs142430606 (OR = 1.94, P = 0.0056; MAFcases = 0.0071, MAFcontrols = 0.0045); rs200958270 (OR = 2.08, P = 0.0028; MAFcases = 0.0071, MAFcontrols = 0.0042). We discover in a family and replicate in 2 CC data sets 2 rare susceptibility variants for IBD, both in IL17REL. Our results illustrate that whole-exome sequencing performed on disease-enriched families to guide association testing can be an efficient strategy for the discovery of rare disease-associated variants. We speculate that rare variants identified in families and confirmed in the general population may be important modifiers of disease risk for patients with a family history, and that genetic testing of these variants may be warranted in this patient subset.
Molecular characterization of canine parvovirus strains in Argentina: Detection of the pathogenic variant CPV2c in vaccinated dogs.

PubMed

Calderon, Marina Gallo; Mattion, Nora; Bucafusco, Danilo; Fogel, Fernando; Remorini, Patricia; La Torre, Jose

2009-08-01

PCR amplification with sequence-specific primers was used to detect canine parvovirus (CPV) DNA in 38 rectal swabs from Argentine domestic dogs with symptoms compatible with parvovirus disease. Twenty-seven out of 38 samples analyzed were CPV positive. The classical CPV2 strain was not detected in any of the samples, but nine samples were identified as CPV2a variant and 18 samples as CPV2b variant. Further sequence analysis revealed a mutation at amino acid 426 of the VP2 gene (Asp426Glu), characteristic of the CPV2c variant, in 14 out of 18 of the samples identified initially by PCR as CPV2b. The appearance of CPV2c variant in Argentina might be dated at least to the year 2003. Three different pathogenic CPV variants circulating currently in the Argentine domestic dog population were identified, with CPV2c being the only variant affecting vaccinated and unvaccinated dogs during the year 2008.
Characterization of Coconut cadang-cadang viroid variants from oil palm affected by orange spotting disease in Malaysia.

PubMed

Wu, Y H; Cheong, L C; Meon, S; Lau, W H; Kong, L L; Joseph, H; Vadamalai, G

2013-06-01

A 246-nt variant of Coconut cadang-cadang viroid (CCCVd) has been identified and described from oil palms with orange spotting symptoms in Malaysia. Compared with the 246-nt form of CCCVd from coconut, the oil palm variant substituted C(31)→U in the pathogenicity domain and G(70)→C in the central conserved domain. This is the first sequence reported for a 246-nt variant of CCCVd in oil palms expressing orange spotting symptoms.
Molecular characterization of canine parvovirus variants (CPV-2a, CPV-2b, and CPV-2c) based on the VP2 gene in affected domestic dogs in Ecuador

PubMed Central

la Torre, David De; Mafla, Eulalia; Puga, Byron; Erazo, Linda; Astolfi-Ferreira, Claudete; Ferreira, Antonio Piantino

2018-01-01

Aim: The objective of this study was to determine the presence of the variants of canine parvovirus (CPV)-2 in the city of Quito, Ecuador, due to the high domestic and street-type canine population, and to identify possible mutations at a genetic level that could be causing structural changes in the virus with a consequent influence on the immune response of the hosts. Materials and Methods: Thirty-five stool samples from different puppies with characteristic signs of the disease and positives for CPV through immunochromatography kits were collected from different veterinarian clinics of the city. Polymerase chain reaction and DNA sequencing were used to determine the mutations in residue 426 of the VP2 gene, which determines the variants of CPV-2; in addition, four samples were chosen for complete sequencing of the VP2 gene to identify all possible mutations in the circulating strains in this region of the country. Results: The results revealed the presence of the three variants of CPV-2 with a prevalence of 57.1% (20/35) for CPV-2a, 8.5% (3/35) for CPV-2b, and 34.3% (12/35) for CPV-2c. In addition, complete sequencing of the VP2 gene showed amino acid substitutions in residues 87, 101, 139, 219, 297, 300, 305, 322, 324, 375, 386, 426, 440, and 514 of the three Ecuadorian variants when compared with the original CPV-2 sequence. Conclusion: This study describes the detection of CPV variants in the city of Quito, Ecuador. Variants of CPV-2 (2a, 2b, and 2c) have been reported in South America, and there are cases in Ecuador where CPV-2 is affecting even vaccinated puppies. PMID:29805214
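Typing by VP2 residue 426, as described in the two parvovirus records above, amounts to a small lookup. The sketch below is illustrative only: the Asp (2b) and Glu (2c) assignments follow the Asp426Glu change named in the Argentine abstract, while Asn for CPV-2a is an assumption drawn from the wider CPV literature rather than from these records. The sample list is hypothetical and simply mirrors the reported Quito counts.

```python
from collections import Counter

# Residue 426 of VP2 -> variant label. Asp/Glu follow the abstracts above;
# Asn for CPV-2a is an assumption taken from the wider CPV literature.
RESIDUE_426_TO_VARIANT = {"N": "CPV-2a", "D": "CPV-2b", "E": "CPV-2c"}

def classify_cpv2(residue_426):
    """Map a one-letter amino acid at VP2 position 426 to a CPV-2 variant."""
    return RESIDUE_426_TO_VARIANT.get(residue_426.upper(), "unclassified")

def variant_counts(residues):
    """Count variant labels across a collection of residue-426 calls."""
    return Counter(classify_cpv2(r) for r in residues)

# Hypothetical input mirroring the reported counts (20, 3, and 12 of 35)
sample_residues = ["N"] * 20 + ["D"] * 3 + ["E"] * 12
counts = variant_counts(sample_residues)
for variant, n in sorted(counts.items()):
    print(variant, n, f"{n / len(sample_residues):.3f}")
```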

Pan-cancer analysis reveals technical artifacts in TCGA germline variant calls.

PubMed

Buckley, Alexandra R; Standish, Kristopher A; Bhutani, Kunal; Ideker, Trey; Lasken, Roger S; Carter, Hannah; Harismendy, Olivier; Schork, Nicholas J

2017-06-12

Cancer research to date has largely focused on somatically acquired genetic aberrations. In contrast, the degree to which germline, or inherited, variation contributes to tumorigenesis remains unclear, possibly due to a lack of accessible germline variant data. Here we called germline variants on 9618 cases from The Cancer Genome Atlas (TCGA) database representing 31 cancer types. We identified batch effects affecting loss of function (LOF) variant calls that can be traced back to differences in the way the sequence data were generated both within and across cancer types. Overall, LOF indel calls were more sensitive to technical artifacts than LOF Single Nucleotide Variant (SNV) calls. In particular, whole genome amplification of DNA prior to sequencing led to an artificially increased burden of LOF indel calls, which confounded association analyses relating germline variants to tumor type despite stringent indel filtering strategies. The samples affected by these technical artifacts include all acute myeloid leukemia and practically all ovarian cancer samples. We demonstrate how technical artifacts induced by whole genome amplification of DNA can lead to false positive germline-tumor type associations and suggest TCGA whole genome amplified samples be used with caution. This study draws attention to the need to be sensitive to problems associated with a lack of uniformity in data generation in TCGA data.
A structural variant in the 5’-flanking region of the TWIST2 gene affects melanocyte development in belted cattle

PubMed Central

Drögemüller, Cord; Jagannathan, Vidhya; Keller, Irene; Wüthrich, Daniel; Bruggmann, Rémy; Schütz, Ekkehard; Demmel, Steffi; Moser, Simon; Signer-Hasler, Heidi; Pieńkowska-Schelling, Aldona; Schelling, Claude; Sande, Marcos; Rongen, Ronald

2017-01-01

Belted cattle have a circular belt of unpigmented hair and skin around their midsection. The belt is inherited as a monogenic autosomal dominant trait. We mapped the causative variant to a 37 kb segment on bovine chromosome 3. Whole genome sequence data of 2 belted and 130 control cattle yielded only one private genetic variant in the critical interval in the two belted animals. The belt-associated variant was a copy number variant (CNV) involving the quadruplication of a 6 kb non-coding sequence located approximately 16 kb upstream of the TWIST2 gene. Increased copy numbers at this CNV were strongly associated with the belt phenotype in a cohort of 333 cases and 1322 controls. We hypothesized that the CNV causes aberrant expression of TWIST2 during neural crest development, which might negatively affect melanoblasts. Functional studies showed that ectopic expression of bovine TWIST2 in neural crest in transgenic zebrafish led to a decrease in melanocyte numbers. Our results thus implicate an unsuspected involvement of TWIST2 in regulating pigmentation and reveal a non-coding CNV underlying a captivating Mendelian character. PMID:28658273
Autosomal recessive Noonan syndrome associated with biallelic LZTR1 variants.

PubMed

Johnston, Jennifer J; van der Smagt, Jasper J; Rosenfeld, Jill A; Pagnamenta, Alistair T; Alswaid, Abdulrahman; Baker, Eva H; Blair, Edward; Borck, Guntram; Brinkmann, Julia; Craigen, William; Dung, Vu Chi; Emrick, Lisa; Everman, David B; van Gassen, Koen L; Gulsuner, Suleyman; Harr, Margaret H; Jain, Mahim; Kuechler, Alma; Leppig, Kathleen A; McDonald-McGinn, Donna M; Can, Ngoc Thi Bich; Peleg, Amir; Roeder, Elizabeth R; Rogers, R Curtis; Sagi-Dain, Lena; Sapp, Julie C; Schäffer, Alejandro A; Schanze, Denny; Stewart, Helen; Taylor, Jenny C; Verbeek, Nienke E; Walkiewicz, Magdalena A; Zackai, Elaine H; Zweier, Christiane; Zenker, Martin; Lee, Brendan; Biesecker, Leslie G

2018-02-22

Purpose: To characterize the molecular genetics of autosomal recessive Noonan syndrome. Methods: Families underwent phenotyping for features of Noonan syndrome in children and their parents. Two multiplex families underwent linkage analysis. Exome, genome, or multigene panel sequencing was used to identify variants. The molecular consequences of observed splice variants were evaluated by reverse-transcription polymerase chain reaction. Results: Twelve families with a total of 23 affected children with features of Noonan syndrome were evaluated. The phenotypic range included mildly affected patients, but it was lethal in some, with cardiac disease and leukemia. All of the parents were unaffected. Linkage analysis using a recessive model supported a candidate region in chromosome 22q11, which includes LZTR1, previously shown to harbor mutations in patients with Noonan syndrome inherited in a dominant pattern. Sequencing analyses of 21 live-born patients and a stillbirth identified biallelic pathogenic variants in LZTR1, including putative loss-of-function, missense, and canonical and noncanonical splicing variants in the affected children, with heterozygous, clinically unaffected parents and heterozygous or normal genotypes in unaffected siblings. Conclusion: These clinical and genetic data confirm the existence of a form of Noonan syndrome that is inherited in an autosomal recessive pattern and identify biallelic mutations in LZTR1. Genet Med advance online publication, 22 February 2018; doi:10.1038/gim.2017.249.
Using sheep genomes from diverse U.S. breeds to identify missense variants in genes affecting fecundity

USDA-ARS's Scientific Manuscript database

Background: Access to sheep genome sequences significantly improves the chances of identifying genes that may influence the health, welfare, and productivity of these animals. Methods: A public, searchable DNA sequence resource for U.S. sheep was created with whole genome sequence (WGS) of 96 rams. ...
Potentially pathogenic germline CHEK2 c.319+2T>A among multiple early-onset cancer families.

PubMed

Dominguez-Valentin, Mev; Nakken, Sigve; Tubeuf, Hélène; Vodak, Daniel; Ekstrøm, Per Olaf; Nissen, Anke M; Morak, Monika; Holinski-Feder, Elke; Martins, Alexandra; Møller, Pål; Hovig, Eivind

2018-01-01

To study the potential contribution of genes other than BRCA1/2, PTEN, and TP53 to the biological and clinical characteristics of multiple early-onset cancers in Norwegian families, including early-onset breast cancer, Cowden-like and Li-Fraumeni-like syndromes (BC, CSL and LFL, respectively). The Hereditary Cancer Biobank from the Norwegian Radium Hospital was used to identify early-onset BC, CSL or LFL for whom no pathogenic variants in BRCA1/2, PTEN, or TP53 had been found in routine diagnostic DNA sequencing. Forty-four cancer susceptibility genes were selected and analyzed by our in-house designed TruSeq amplicon-based assay for targeted sequencing. Protein- and RNA splicing-dedicated in silico analyses were performed for all variants of unknown significance (VUS). Variants predicted as the more likely to affect splicing were experimentally analyzed by minigene assay. We identified a CSL individual carrying a variant in CHEK2 (c.319+2T>A, IVS2), here considered as likely pathogenic. Out of the five VUS (BRCA2, CDH1, CHEK2, MAP3K1, NOTCH3) tested in the minigene splicing assay, only NOTCH3 c.14090C>T (p.Ser497Leu) showed a significant effect on RNA splicing, notably by inducing partial skipping of exon 9. Among 13 early-onset BC, CSL and LFL patients, gene panel sequencing identified a potentially pathogenic variant in CHEK2 that affects a canonical RNA splicing signal. Our study provides new information on genetic loci that may affect the risk of developing cancer in these patients and their families, demonstrating that genes presently not routinely tested in molecular diagnostic settings may be important for capturing cancer predisposition in these families.
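The abstract describes c.319+2T>A as affecting a canonical RNA splicing signal. In HGVS coding notation, an intronic offset of +1 or +2 marks the splice donor dinucleotide and -1 or -2 marks the acceptor. The sketch below shows how such a variant string might be flagged. The regular expression covers only simple intronic substitutions, and the function name is hypothetical.

```python
import re

# Simple intronic substitutions only, e.g. "c.319+2T>A" or "c.100-1G>A".
INTRONIC_SUBSTITUTION = re.compile(r"^c\.(\d+)([+-])(\d+)([ACGT])>([ACGT])$")

def canonical_splice_site(hgvs_c):
    """Return 'donor' or 'acceptor' if the variant hits the canonical
    +1/+2 or -1/-2 intronic positions, otherwise None."""
    match = INTRONIC_SUBSTITUTION.match(hgvs_c)
    if match is None:
        return None  # exonic variant or unsupported notation
    sign, offset = match.group(2), int(match.group(3))
    if offset not in (1, 2):
        return None  # intronic, but outside the canonical dinucleotide
    return "donor" if sign == "+" else "acceptor"

print(canonical_splice_site("c.319+2T>A"))
```

Variants outside the canonical dinucleotides can still disturb splicing, which is why the study relied on in silico prediction and minigene assays rather than position alone.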
A Splice Defect in the EDA Gene in Dogs with an X-Linked Hypohidrotic Ectodermal Dysplasia (XLHED) Phenotype.

PubMed

Waluk, Dominik P; Zur, Gila; Kaufmann, Ronnie; Welle, Monika M; Jagannathan, Vidhya; Drögemüller, Cord; Müller, Eliane J; Leeb, Tosso; Galichet, Arnaud

2016-09-08

X-linked hypohidrotic ectodermal dysplasia (XLHED) caused by variants in the EDA gene represents the most common ectodermal dysplasia in humans. We investigated three male mixed-breed dogs with an ectodermal dysplasia phenotype characterized by marked hypotrichosis and multifocal complete alopecia, almost complete absence of sweat and sebaceous glands, and altered dentition with missing and abnormally shaped teeth. Analysis of SNP chip genotypes and whole genome sequence data from the three affected dogs revealed that the affected dogs shared the same haplotype on a large segment of the X-chromosome, including the EDA gene. Unexpectedly, the whole genome sequence data did not reveal any nonsynonymous EDA variant in the affected dogs. We therefore performed an RNA-seq experiment on skin biopsies to search for changes in the transcriptome. This analysis revealed that the EDA transcript in the affected dogs lacked 103 nucleotides encoded by exon 2. We speculate that this exon skipping is caused by a genetic variant located in one of the large introns flanking this exon, which was missed by whole genome sequencing with the Illumina short read technology. The altered EDA transcript splicing most likely causes the observed ectodermal dysplasia in the affected dogs. These dogs thus offer an excellent opportunity to gain insights into the complex splicing processes required for expression of the EDA gene, and other genes with large introns. Copyright © 2016 Waluk et al.
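One way to see why the loss of 103 nucleotides is likely to be disruptive is to check whether the missing length is a multiple of three. The sketch below does only that arithmetic. It assumes the missing nucleotides lie entirely within the coding sequence, which the abstract does not state explicitly, and the function name is hypothetical.

```python
def reading_frame_effect(n_missing_nucleotides):
    """Classify a deletion from a transcript as in-frame or frameshift,
    assuming all missing nucleotides are coding."""
    if n_missing_nucleotides % 3 == 0:
        return "in-frame"
    return "frameshift"

# 103 nucleotides missing from the EDA transcript, as reported above
print(reading_frame_effect(103))
```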
Integrating 400 million variants from 80,000 human samples with extensive annotations: towards a knowledge base to analyze disease cohorts.

PubMed

Hakenberg, Jörg; Cheng, Wei-Yi; Thomas, Philippe; Wang, Ying-Chih; Uzilov, Andrew V; Chen, Rong

2016-01-08

Data from a plethora of high-throughput sequencing studies is readily available to researchers, providing genetic variants detected in a variety of healthy and disease populations. While each individual cohort helps gain insights into polymorphic and disease-associated variants, a joint perspective can be more powerful in identifying polymorphisms, rare variants, disease-associations, genetic burden, somatic variants, and disease mechanisms. We have set up a Reference Variant Store (RVS) containing variants observed in a number of large-scale sequencing efforts, such as 1000 Genomes, ExAC, Scripps Wellderly, UK10K; various genotyping studies; and disease association databases. RVS holds extensive annotations pertaining to affected genes, functional impacts, disease associations, and population frequencies. RVS currently stores 400 million distinct variants observed in more than 80,000 human samples. RVS facilitates cross-study analysis to discover novel genetic risk factors, gene-disease associations, potential disease mechanisms, and actionable variants. Due to its large reference populations, RVS can also be employed for variant filtration and gene prioritization. A web interface to public datasets and annotations in RVS is available at https://rvs.u.hpc.mssm.edu/.
Exome sequence analysis and follow up genotyping implicates rare ULK1 variants to be involved in susceptibility to schizophrenia

PubMed Central

Al Eissa, Mariam M.; Fiorentino, Alessia; Sharp, Sally I.; O'Brien, Niamh L.; Wolfe, Kate; Giaroli, Giovanni; Curtis, David; Bass, Nicholas J.

2017-01-01

Summary: Schizophrenia (SCZ) is a severe, highly heritable psychiatric disorder. Elucidation of the genetic architecture of the disorder will facilitate greater understanding of the altered underlying neurobiological mechanisms. The aim of this study was to identify likely aetiological variants in subjects affected with SCZ. Exome sequence data from a SCZ case–control sample from Sweden was analysed for likely aetiological variants using a weighted burden test. Suggestive evidence implicated the UNC‐51‐like kinase (ULK1) gene, and it was observed that four rare variants that were more common in the Swedish SCZ cases were also more common in UK10K SCZ cases, as compared to obesity cases. These three missense variants and one intronic variant were genotyped in the University College London cohort of 1304 SCZ cases and 1348 ethnically matched controls. All four variants were more common in the SCZ cases than controls and combining them produced a result significant at P = 0.02. The results presented here demonstrate the importance of following up exome sequencing studies using additional datasets. The roles of ULK1 in autophagy and mTOR signalling strengthen the case that these pathways may be important in the pathophysiology of SCZ. The findings reported here await independent replication. PMID:29148569
Chitayat-Hall and Schaaf-Yang syndromes: a common aetiology: expanding the phenotype of MAGEL2-related disorders.

PubMed

Jobling, Rebekah; Stavropoulos, Dimitri James; Marshall, Christian R; Cytrynbaum, Cheryl; Axford, Michelle M; Londero, Vanessa; Moalem, Sharon; Orr, Jennifer; Rossignol, Francis; Lopes, Fatima Daniela; Gauthier, Julie; Alos, Nathalie; Rupps, Rosemarie; McKinnon, Margaret; Adam, Shelin; Nowaczyk, Malgorzata J M; Walker, Susan; Scherer, Stephen W; Nassif, Christina; Hamdan, Fadi F; Deal, Cheri L; Soucy, Jean-François; Weksberg, Rosanna; Macleod, Patrick; Michaud, Jacques L; Chitayat, David

2018-05-01

Chitayat-Hall syndrome, initially described in 1990, is a rare condition characterised by distal arthrogryposis, intellectual disability, dysmorphic features and hypopituitarism, in particular growth hormone deficiency. The genetic aetiology has not been identified. We identified three unrelated families with a total of six affected patients with the clinical manifestations of Chitayat-Hall syndrome. Through whole exome or whole genome sequencing, pathogenic variants in the MAGEL2 gene were identified in all affected patients. All disease-causing sequence variants detected are predicted to result in a truncated protein, including one complex variant that comprised a deletion and inversion. Chitayat-Hall syndrome is caused by pathogenic variants in MAGEL2 and shares a common aetiology with the recently described Schaaf-Yang syndrome. The phenotype of MAGEL2-related disorders is expanded to include growth hormone deficiency as an important and treatable complication. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2018. All rights reserved. No commercial use is permitted unless otherwise expressly granted.
Screening of the Filamin C Gene in a Large Cohort of Hypertrophic Cardiomyopathy Patients.

PubMed

Gómez, Juan; Lorca, Rebeca; Reguero, Julian R; Morís, César; Martín, María; Tranche, Salvador; Alonso, Belén; Iglesias, Sara; Alvarez, Victoria; Díaz-Molina, Beatriz; Avanzas, Pablo; Coto, Eliecer

2017-04-01

Recent exome sequencing studies identified filamin C (FLNC) as a candidate gene for hypertrophic cardiomyopathy (HCM). Our aim was to determine the rate of FLNC candidate variants in a large cohort of HCM patients who were also sequenced for the main sarcomere genes. A total of 448 HCM patients were next generation-sequenced (semiconductor chip technology) for the MYH7, MYBPC3, TNNT2, TNNI3, ACTC1, TNNC1, MYL2, MYL3, TPM1, and FLNC genes. We also sequenced 450 healthy controls from the same population. Based on the reported population frequencies, bioinformatic criteria, and familial segregation, we identified 20 FLNC candidate variants (13 new; 1 nonsense; and 19 missense) in 22 patients. Compared with the patients, only 1 of the control's missense variants was nonreported (P = 0.007; Fisher exact probability test). Based on the familial segregation and the reported functional studies, 6 of the candidate variants (in 7 patients) were finally classified as likely pathogenic, 10 as variants of uncertain significance, and 4 as likely benign. We provide compelling evidence of the involvement of FLNC in the development of HCM. Most of the FLNC variants were associated with mild forms of HCM and a reduced penetrance, with few affected in the families to confirm the segregation. Our work, together with others who found FLNC variants among patients with dilated and restrictive cardiomyopathies, pointed to this gene as an important cause of structural cardiomyopathies. © 2017 American Heart Association, Inc.
Mutations Affecting Expression of the rosy Locus in Drosophila melanogaster

PubMed Central

Lee, Chong Sung; Curtis, Daniel; McCarron, Margaret; Love, Carol; Gray, Mark; Bender, Welcome; Chovnick, Arthur

1987-01-01

The rosy locus in Drosophila melanogaster codes for the enzyme xanthine dehydrogenase (XDH). Previous studies defined a "control element" near the 5' end of the gene, where variant sites affected the amount of rosy mRNA and protein produced. We have determined the DNA sequence of this region from both genomic and cDNA clones, and from the ry+10 underproducer strain. This variant strain had many sequence differences, so that the site of the regulatory change could not be fixed. A mutagenesis was also undertaken to isolate new regulatory mutations. We induced 376 new mutations with 1-ethyl-1-nitrosourea (ENU) and screened them to isolate those that reduced the amount of XDH protein produced, but did not change the properties of the enzyme. Genetic mapping was used to find mutations located near the 5' end of the gene. DNA from each of seven mutants was cloned and sequenced through the 5' region. Mutant base changes were identified in all seven; they appear to affect splicing and translation of the rosy mRNA. In a related study (T. P. Keith et al. 1987), the genomic and cDNA sequences are extended through the 3' end of the gene; the combined sequences define the processing pattern of the rosy transcript and predict the amino acid sequence of XDH. PMID:3036645
Information Topics of Greatest Interest for Return of Genome Sequencing Results among Women Diagnosed with Breast Cancer at a Young Age.

PubMed

Seo, Joann; Ivanovich, Jennifer; Goodman, Melody S; Biesecker, Barbara B; Kaphingst, Kimberly A

2017-06-01

We investigated what information women diagnosed with breast cancer at a young age would want to learn when genome sequencing results are returned. We conducted 60 semi-structured interviews with women diagnosed with breast cancer at age 40 or younger. We examined what specific information participants would want to learn across result types and for each type of result, as well as how much information they would want. Genome sequencing was not offered to participants as part of the study. Two coders independently coded interview transcripts; analysis was conducted using NVivo10. Across result types, participants wanted to learn about health implications, risk and prevalence in quantitative terms, causes of variants, and causes of diseases. Participants wanted to learn actionable information for variants affecting risk of preventable or treatable disease, medication response, and carrier status. The amount of desired information differed for variants affecting risk of unpreventable or untreatable disease, with uncertain significance, and not health-related. Women diagnosed with breast cancer at a young age recognize the value of genome sequencing results in identifying potential causes and effective treatments and expressed interest in using the information to help relatives and to further understand their other health risks. Our findings can inform the development of effective feedback strategies for genome sequencing that meet patients' information needs and preferences.
SUGAR: graphical user interface-based data refiner for high-throughput DNA sequencing.

PubMed

Sato, Yukuto; Kojima, Kaname; Nariai, Naoki; Yamaguchi-Kabata, Yumi; Kawai, Yosuke; Takahashi, Mamoru; Mimori, Takahiro; Nagasaki, Masao

2014-08-08

Next-generation sequencers (NGSs) have become one of the main tools for current biology. To obtain useful insights from the NGS data, it is essential to control low-quality portions of the data affected by technical errors such as air bubbles in sequencing fluidics. We developed the software SUGAR (subtile-based GUI-assisted refiner), which can handle ultra-high-throughput data with a user-friendly graphical user interface (GUI) and interactive analysis capability. SUGAR generates high-resolution quality heatmaps of the flowcell, enabling users to find possible signals of technical errors during the sequencing. The sequencing data generated from the error-affected regions of a flowcell can be selectively removed by automated analysis or GUI-assisted operations implemented in SUGAR. The automated data-cleaning function based on sequence read quality (Phred) scores was applied to public whole human genome sequencing data, and we showed that the overall mapping quality was improved. The detailed data evaluation and cleaning enabled by SUGAR would reduce technical problems in sequence read mapping, improving subsequent variant analysis that requires high-quality sequence data and mapping results. Therefore, the software will be especially useful to control the quality of variant calls for low-population cells, e.g., cancers, in a sample with technical errors of sequencing procedures.
[Genetic analysis of two children patients affected with CHARGE syndrome].

PubMed

Li, Guoqiang; Li, Niu; Xu, Yufei; Li, Juan; Ding, Yu; Shen, Yiping; Wang, Xiumin; Wang, Jian

2018-04-10

To analyze two Chinese pediatric patients with multiple malformations and growth and developmental delay. Both patients were subjected to targeted gene sequencing, and the results were analyzed with Ingenuity Variant Analysis software. Suspected pathogenic variations were verified by Sanger sequencing. High-throughput sequencing showed that both patients carried heterozygous variants of the CHD7 gene. Patient 1 carried a nonsense mutation in exon 36 (c.7957C>T, p.Arg2653*), while patient 2 carried a nonsense mutation in exon 2 (c.718C>T, p.Gln240*). Sanger sequencing confirmed the above mutations in both patients, while their parents were wild-type at the corresponding sites, indicating that the two mutations occurred de novo. Both patients were diagnosed with CHARGE syndrome by high-throughput sequencing.
Large-scale identification and characterization of alternative splicing variants of human gene transcripts using 56 419 completely sequenced and manually annotated full-length cDNAs

PubMed Central

Takeda, Jun-ichi; Suzuki, Yutaka; Nakao, Mitsuteru; Barrero, Roberto A.; Koyanagi, Kanako O.; Jin, Lihua; Motono, Chie; Hata, Hiroko; Isogai, Takao; Nagai, Keiichi; Otsuki, Tetsuji; Kuryshev, Vladimir; Shionyu, Masafumi; Yura, Kei; Go, Mitiko; Thierry-Mieg, Jean; Thierry-Mieg, Danielle; Wiemann, Stefan; Nomura, Nobuo; Sugano, Sumio; Gojobori, Takashi; Imanishi, Tadashi

2006-01-01

We report the first genome-wide identification and characterization of alternative splicing in human gene transcripts based on analysis of the full-length cDNAs. Applying both manual and computational analyses for 56 419 completely sequenced and precisely annotated full-length cDNAs selected for the H-Invitational human transcriptome annotation meetings, we identified 6877 alternative splicing genes with 18 297 different alternative splicing variants. A total of 37 670 exons were involved in these alternative splicing events. The encoded protein sequences were affected in 6005 of the 6877 genes. Notably, alternative splicing affected protein motifs in 3015 genes, subcellular localizations in 2982 genes and transmembrane domains in 1348 genes. We also identified interesting patterns of alternative splicing, in which two distinct genes seemed to be bridged, nested or having overlapping protein coding sequences (CDSs) of different reading frames (multiple CDS). In these cases, completely unrelated proteins are encoded by a single locus. Genome-wide annotations of alternative splicing, relying on full-length cDNAs, should lay firm groundwork for exploring in detail the diversification of protein function, which is mediated by the fast expanding universe of alternative splicing variants. PMID:16914452
Application of a 5-tiered scheme for standardized classification of 2,360 unique mismatch repair gene variants in the InSiGHT locus-specific database.

PubMed

Thompson, Bryony A; Spurdle, Amanda B; Plazzer, John-Paul; Greenblatt, Marc S; Akagi, Kiwamu; Al-Mulla, Fahd; Bapat, Bharati; Bernstein, Inge; Capellá, Gabriel; den Dunnen, Johan T; du Sart, Desiree; Fabre, Aurelie; Farrell, Michael P; Farrington, Susan M; Frayling, Ian M; Frebourg, Thierry; Goldgar, David E; Heinen, Christopher D; Holinski-Feder, Elke; Kohonen-Corish, Maija; Robinson, Kristina Lagerstedt; Leung, Suet Yi; Martins, Alexandra; Moller, Pal; Morak, Monika; Nystrom, Minna; Peltomaki, Paivi; Pineda, Marta; Qi, Ming; Ramesar, Rajkumar; Rasmussen, Lene Juel; Royer-Pokora, Brigitte; Scott, Rodney J; Sijmons, Rolf; Tavtigian, Sean V; Tops, Carli M; Weber, Thomas; Wijnen, Juul; Woods, Michael O; Macrae, Finlay; Genuardi, Maurizio

2014-02-01

The clinical classification of hereditary sequence variants identified in disease-related genes directly affects clinical management of patients and their relatives. The International Society for Gastrointestinal Hereditary Tumours (InSiGHT) undertook a collaborative effort to develop, test and apply a standardized classification scheme to constitutional variants in the Lynch syndrome-associated genes MLH1, MSH2, MSH6 and PMS2. Unpublished data submission was encouraged to assist in variant classification and was recognized through microattribution. The scheme was refined by multidisciplinary expert committee review of the clinical and functional data available for variants, applied to 2,360 sequence alterations, and disseminated online. Assessment using validated criteria altered classifications for 66% of 12,006 database entries. Clinical recommendations based on transparent evaluation are now possible for 1,370 variants that were not obviously protein truncating from nomenclature. This large-scale endeavor will facilitate the consistent management of families suspected to have Lynch syndrome and demonstrates the value of multidisciplinary collaboration in the curation and classification of variants in public locus-specific databases.
Application of a five-tiered scheme for standardized classification of 2,360 unique mismatch repair gene variants lodged on the InSiGHT locus-specific database

PubMed Central

Plazzer, John-Paul; Greenblatt, Marc S.; Akagi, Kiwamu; Al-Mulla, Fahd; Bapat, Bharati; Bernstein, Inge; Capellá, Gabriel; den Dunnen, Johan T.; du Sart, Desiree; Fabre, Aurelie; Farrell, Michael P.; Farrington, Susan M.; Frayling, Ian M.; Frebourg, Thierry; Goldgar, David E.; Heinen, Christopher D.; Holinski-Feder, Elke; Kohonen-Corish, Maija; Robinson, Kristina Lagerstedt; Leung, Suet Yi; Martins, Alexandra; Moller, Pal; Morak, Monika; Nystrom, Minna; Peltomaki, Paivi; Pineda, Marta; Qi, Ming; Ramesar, Rajkumar; Rasmussen, Lene Juel; Royer-Pokora, Brigitte; Scott, Rodney J.; Sijmons, Rolf; Tavtigian, Sean V.; Tops, Carli M.; Weber, Thomas; Wijnen, Juul; Woods, Michael O.; Macrae, Finlay; Genuardi, Maurizio

2015-01-01

Clinical classification of sequence variants identified in hereditary disease genes directly affects clinical management of patients and their relatives. The International Society for Gastrointestinal Hereditary Tumours (InSiGHT) undertook a collaborative effort to develop, test and apply a standardized classification scheme to constitutional variants in the Lynch Syndrome genes MLH1, MSH2, MSH6 and PMS2. Unpublished data submission was encouraged to assist variant classification, and recognized by microattribution. The scheme was refined by multidisciplinary expert committee review of clinical and functional data available for variants, applied to 2,360 sequence alterations, and disseminated online. Assessment using validated criteria altered classifications for 66% of 12,006 database entries. Clinical recommendations based on transparent evaluation are now possible for 1,370 variants not obviously protein-truncating from nomenclature. This large-scale endeavor will facilitate consistent management of suspected Lynch Syndrome families, and demonstrates the value of multidisciplinary collaboration for curation and classification of variants in public locus-specific databases. PMID:24362816
A Genome-Wide Linkage Study for Chronic Obstructive Pulmonary Disease in a Dutch Genetic Isolate Identifies Novel Rare Candidate Variants.

PubMed

Nedeljkovic, Ivana; Terzikhan, Natalie; Vonk, Judith M; van der Plaat, Diana A; Lahousse, Lies; van Diemen, Cleo C; Hobbs, Brian D; Qiao, Dandi; Cho, Michael H; Brusselle, Guy G; Postma, Dirkje S; Boezen, H M; van Duijn, Cornelia M; Amin, Najaf

2018-01-01

Chronic obstructive pulmonary disease (COPD) is a complex and heritable disease, associated with multiple genetic variants. Specific familial types of COPD may be explained by rare variants, which have not been widely studied. We aimed to discover rare genetic variants underlying COPD through a genome-wide linkage scan. Affected-only analysis was performed using the 6K Illumina Linkage IV Panel in 142 cases clustered in 27 families from a genetic isolate, the Erasmus Rucphen Family (ERF) study. Potential causal variants were identified by searching for shared rare variants in the exome-sequence data of the affected members of the families contributing most to the linkage peak. The identified rare variants were then tested for association with COPD in a large meta-analysis of several cohorts. Significant evidence for linkage was observed on chromosomes 15q14-15q25 [logarithm of the odds (LOD) score = 5.52], 11p15.4-11q14.1 (LOD = 3.71) and 5q14.3-5q33.2 (LOD = 3.49). In the chromosome 15 peak, which harbors the known COPD locus for nicotinic receptors, and in the chromosome 5 peak we could not identify shared variants. In the chromosome 11 locus, we identified four rare (minor allele frequency (MAF) <0.02), predicted pathogenic, missense variants. These were shared among the affected family members. The identified variants localize to genes including neuroblast differentiation-associated protein (AHNAK), previously associated with blood biomarkers in COPD, phospholipase C Beta 3 (PLCB3), shown to increase airway hyper-responsiveness, solute carrier family 22-A11 (SLC22A11), involved in amino acid metabolism and ion transport, and metallothionein-like protein 5 (MTL5), involved in nicotinate and nicotinamide metabolism. Associations of SLC22A11 and MTL5 variants were confirmed in the meta-analysis of 9,888 cases and 27,060 controls. In conclusion, we have identified novel rare variants in plausible genes related to COPD. Further studies utilizing large sample whole-genome sequencing should further confirm the associations at chromosome 11 and investigate the chromosome 15 and 5 linked regions.
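The LOD scores quoted in the record above can be read as base-10 logarithms of a likelihood ratio, so a score converts directly into odds in favor of linkage. The following is a minimal sketch of that conversion, assuming the standard definition of a LOD score; the function names are illustrative, and the affected-only method used by the authors may compute its scores differently.

```python
import math


def lod_from_likelihoods(l_linked: float, l_unlinked: float) -> float:
    """LOD = log10 of the likelihood under linkage over the likelihood under no linkage."""
    return math.log10(l_linked / l_unlinked)


def lod_to_odds(lod: float) -> float:
    """Convert a LOD score back into odds (X:1) in favor of linkage."""
    return 10.0 ** lod


# LOD scores as reported in the abstract; region labels copied from the text.
peaks = {
    "15q14-15q25": 5.52,
    "11p15.4-11q14.1": 3.71,
    "5q14.3-5q33.2": 3.49,
}

for region, lod in peaks.items():
    print(f"{region}: LOD {lod:.2f} corresponds to odds of about {lod_to_odds(lod):,.0f}:1")
```

Under this reading, each additional LOD unit multiplies the odds in favor of linkage by ten, which is why the chromosome 15 peak stands well apart from the other two.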
Peptidomimetic Escape Mechanisms Arise via Genetic Diversity in the Ligand-Binding Site of the Hepatitis C Virus NS3/4A Serine Protease

PubMed Central

Welsch, Christoph; Shimakami, Tetsuro; Hartmann, Christoph; Yang, Yan; Domingues, Francisco S.; Lengauer, Thomas; Zeuzem, Stefan; Lemon, Stanley M.

2011-01-01

Background & Aims It is a challenge to develop direct-acting antiviral agents (DAAs) that target the NS3/4A protease of hepatitis C virus (HCV) because resistant variants develop. Ketoamide compounds, designed to mimic the natural protease substrate, have been developed as inhibitors. However, clinical trials have revealed rapid selection of resistant mutants, most of which are considered to be pre-existing variants. Methods We identified residues near the ketoamide-binding site in X-ray structures of the genotype 1a protease, co-crystallized with boceprevir or a telaprevir-like ligand, and then identified variants at these positions in 219 genotype 1 sequences from a public database. We used side-chain modeling to assess the potential effects of these variants on the interaction between ketoamide and the protease, and compared these results with the phenotypic effects on ketoamide resistance, RNA replication capacity, and infectious virus yields in a cell culture model of infection. Results Thirteen natural binding-site variants with potential for ketoamide resistance were identified at 10 residues in the protease, near the ketoamide binding site. Rotamer analysis of amino acid side-chain conformations indicated that 2 variants (R155K and D168G) could affect binding of telaprevir more than boceprevir. Measurements of antiviral susceptibility in cell culture studies were consistent with this observation. Four variants (Q41H, I132V, R155K, and D168G) caused low-to-moderate levels of ketoamide resistance; 3 of these were highly fit (Q41H, I132V, and R155K). Conclusions Using a comprehensive sequence and structure-based analysis, we showed how natural variation in the HCV protease NS3/4A sequences might affect susceptibility to first-generation DAAs. These findings increase our understanding of the molecular basis of ketoamide resistance among naturally existing viral variants. PMID:22155364
Chitayat syndrome: hyperphalangism, characteristic facies, hallux valgus and bronchomalacia results from a recurrent c.266A>G p.(Tyr89Cys) variant in the ERF gene.

PubMed

Balasubramanian, M; Lord, H; Levesque, S; Guturu, H; Thuriot, F; Sillon, G; Wenger, A M; Sureka, D L; Lester, T; Johnson, D S; Bowen, J; Calhoun, A R; Viskochil, D H; Bejerano, G; Bernstein, J A; Chitayat, D

2017-03-01

In 1993, Chitayat et al. reported a newborn with hyperphalangism, facial anomalies, and bronchomalacia. We identified three additional families with similar findings. Features include bilateral accessory phalanx resulting in shortened index fingers; hallux valgus; distinctive face; respiratory compromise. To identify the genetic aetiology of Chitayat syndrome and identify a unifying cause for this specific form of hyperphalangism. Through ongoing collaboration, we had collected patients with a strikingly similar phenotype. Trio-based exome sequencing was first performed in Patient 2 through the Deciphering Developmental Disorders study. Proband-only exome sequencing had previously been independently performed in Patient 4. Following identification of a candidate gene variant in Patient 2, the same variant was subsequently confirmed from exome data in Patient 4. Sanger sequencing was used to validate this variant in Patients 1 and 3 and to confirm paternal inheritance in Patient 5. A recurrent, novel variant NM_006494.2:c.266A>G p.(Tyr89Cys) in ERF was identified in five affected individuals: de novo (patient 1, 2 and 3) and inherited from an affected father (patient 4 and 5). p.Tyr89Cys is an aromatic polar neutral to polar neutral amino acid substitution, at a highly conserved position and lies within the functionally important ETS-domain of the protein. The recurrent ERF c.266A>G p.(Tyr89Cys) variant causes Chitayat syndrome. ERF variants have previously been associated with complex craniosynostosis. In contrast, none of the patients with the c.266A>G p.(Tyr89Cys) variant have craniosynostosis. We report the molecular aetiology of Chitayat syndrome and discuss potential mechanisms for this distinctive phenotype associated with the p.Tyr89Cys substitution in ERF. Published by the BMJ Publishing Group Limited.

Integrating multiple genomic data to predict disease-causing nonsynonymous single nucleotide variants in exome sequencing studies.

PubMed

Wu, Jiaxin; Li, Yanda; Jiang, Rui

2014-03-01

Exome sequencing has been widely used in detecting pathogenic nonsynonymous single nucleotide variants (SNVs) for human inherited diseases. However, traditional statistical genetics methods are ineffective in analyzing exome sequencing data, due to such facts as the large number of sequenced variants, the presence of non-negligible fraction of pathogenic rare variants or de novo mutations, and the limited size of affected and normal populations. Indeed, prevalent applications of exome sequencing have been appealing for an effective computational method for identifying causative nonsynonymous SNVs from a large number of sequenced variants. Here, we propose a bioinformatics approach called SPRING (Snv PRioritization via the INtegration of Genomic data) for identifying pathogenic nonsynonymous SNVs for a given query disease. Based on six functional effect scores calculated by existing methods (SIFT, PolyPhen2, LRT, MutationTaster, GERP and PhyloP) and five association scores derived from a variety of genomic data sources (gene ontology, protein-protein interactions, protein sequences, protein domain annotations and gene pathway annotations), SPRING calculates the statistical significance that an SNV is causative for a query disease and hence provides a means of prioritizing candidate SNVs. With a series of comprehensive validation experiments, we demonstrate that SPRING is valid for diseases whose genetic bases are either partly known or completely unknown and effective for diseases with a variety of inheritance styles. In applications of our method to real exome sequencing data sets, we show the capability of SPRING in detecting causative de novo mutations for autism, epileptic encephalopathies and intellectual disability. We further provide an online service, the standalone software and genome-wide predictions of causative SNVs for 5,080 diseases at http://bioinfo.au.tsinghua.edu.cn/spring.
Autosomal dominant retinitis pigmentosa with macular involvement associated with a disease haplotype that included a novel PRPH2 variant (p.Cys250Gly).

PubMed

Katagiri, Satoshi; Hayashi, Takaaki; Mizobuchi, Kei; Yoshitake, Kazutoshi; Iwata, Takeshi; Nakano, Tadashi

2018-06-01

It is known that PRPH2 variants appear to be rare causes of retinitis pigmentosa (RP) in the Japanese population. The purpose of this study was to describe clinical and genetic features in autosomal dominant RP (adRP) patients with a novel disease-causing variant in the PRPH2 gene. A total of 57 unrelated Japanese probands with adRP were investigated in this study. Comprehensive ophthalmic examinations included fundus photography, fundus autofluorescence imaging, spectral-domain optical coherence tomography, and electroretinography. Whole exome sequencing or Sanger sequencing for 25 targeted exons of multiple genes causing adRP was performed to identify disease-causing variants. Co-segregation and haplotype analyses were performed to determine a disease-causing gene variant and its haplotype. Genetic analysis identified a novel heterozygous PRPH2 variant (c.748T>G, p.Cys250Gly) as disease causing in four probands from four families. The variant co-segregated with the RP phenotype in the eight affected patients in all families. At least three of the four families shared the same haplotype for the variant allele. Clinically, seven of the eight affected patients exhibited typical RP presentation, as well as variable macular involvement including cystoid macular change, vitelliform-like appearance, choroidal neovascularization, and macular atrophy. The same disease haplotype that included a novel PRPH2 variant (p.Cys250Gly) was identified in three of the four Japanese families with adRP, suggesting a founder effect. Our clinical findings indicate that adRP caused by the p.Cys250Gly variant may accompany macular involvement with high frequency.
Incidence and Carrier Frequency of CFTR Gene Mutations in Pregnancies With Echogenic Bowel in Nova Scotia and Prince Edward Island.

PubMed

Miller, Michelle E; Allen, Victoria M; Brock, Jo-Ann K

2018-03-01

Fetal echogenic bowel (echogenic bowel) is associated with cystic fibrosis (CF), with a reported incidence ranging from 1% to 13%. Prenatal testing for CF in the setting of echogenic bowel can be done by screening parental or fetal samples for pathogenic CFTR variants. If only one pathogenic variant is identified, sequencing of the CFTR gene can be undertaken, to identify a second pathogenic variant not covered in the standard screening panel. Full gene sequencing, however, also introduces the potential to identify variants of uncertain significance (VUSs) that can create counselling challenges and cause parental anxiety. To provide accurate counselling for families in the study population, the incidence of CF associated with echogenic bowel and the carrier frequency of CFTR variants were investigated. All pregnancies for which CF testing was undertaken for the indication of echogenic bowel (from Nova Scotia and Prince Edward Island) were identified (January 2007-July 2017). The CFTR screening and sequencing results were reviewed, and fetal outcomes related to CF were assessed. A total of 463 pregnancies with echogenic bowel were tested. Four were confirmed to be affected with CF, giving an incidence of 0.9% in this cohort. The carrier frequency of CF among all parents in the cohort was 5.0% (1 in 20); however, when excluding parents of affected fetuses, the carrier frequency for the population was estimated at 4.1% (1 in 25). CFTR gene sequencing identified an additional VUS in two samples. The incidence of CF in pregnancies with echogenic bowel in Nova Scotia and Prince Edward Island is 0.9%, with an estimated population carrier frequency of 4.1%. These results provide the basis for improved counselling to assess the risk of CF in the pregnancy, after parental carrier screening, using Bayesian probability. Counselling regarding VUSs should be undertaken before gene sequencing. Copyright © 2017 Society of Obstetricians and Gynaecologists of Canada. Published by Elsevier Inc. All rights reserved.
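The incidence and carrier-frequency figures in the record above follow from simple proportions, and the Bayesian counselling step it mentions can be illustrated with a residual-risk calculation. The sketch below reproduces the reported incidence from the stated counts and then shows one possible Bayesian update; the panel sensitivity of 0.90 is a hypothetical value chosen for illustration and does not come from the abstract, and the function names are illustrative as well.

```python
def percent(events: int, total: int) -> float:
    """Proportion of events among total, expressed as a percentage."""
    return 100.0 * events / total


def residual_carrier_risk(prior: float, sensitivity: float) -> float:
    """Probability of still being a carrier after a negative screening result.

    Bayes' rule: P(carrier | negative) =
        prior * (1 - sensitivity) / (prior * (1 - sensitivity) + (1 - prior)),
    assuming a non-carrier never tests positive.
    """
    missed = prior * (1.0 - sensitivity)
    return missed / (missed + (1.0 - prior))


# Counts reported in the abstract: 4 affected fetuses among 463 pregnancies tested.
incidence_pct = percent(4, 463)
print(f"CF incidence with echogenic bowel: {incidence_pct:.1f}%")

# Carrier frequencies reported in the abstract, expressed as "1 in N".
for freq in (0.050, 0.041):
    print(f"carrier frequency {freq:.1%} is about 1 in {1.0 / freq:.1f}")

# Hypothetical panel sensitivity (assumption, not from the source).
ASSUMED_SENSITIVITY = 0.90
risk = residual_carrier_risk(prior=0.041, sensitivity=ASSUMED_SENSITIVITY)
print(f"residual carrier risk after a negative screen: {risk:.4f}")
```

A carrier frequency of 4.1% works out to roughly 1 in 24.4, which the abstract reports as 1 in 25.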
Functional SNP associated with birth weight in independent populations identified with a permutation step added to GBLUP-GWAS

USDA-ARS's Scientific Manuscript database

This study was conducted as an initial assessment of a newly available genotyping assay containing about 34,000 common SNP included on previous SNP chips, and 199,000 sequence variants predicted to affect gene function. Objectives were to identify functional variants associated with birth weight in...
Exome Sequencing Identified a Splice Site Mutation in FHL1 that Causes Uruguay Syndrome, an X-Linked Disorder With Skeletal Muscle Hypertrophy and Premature Cardiac Death.

PubMed

Xue, Yuan; Schoser, Benedikt; Rao, Aliz R; Quadrelli, Roberto; Vaglio, Alicia; Rupp, Verena; Beichler, Christine; Nelson, Stanley F; Schapacher-Tilp, Gudrun; Windpassinger, Christian; Wilcox, William R

2016-04-01

Previously, we reported a rare X-linked disorder, Uruguay syndrome, in a single family. The main features are pugilistic facies, skeletal deformities, and muscular hypertrophy despite a lack of exercise and cardiac ventricular hypertrophy leading to premature death. An ≈19 Mb critical region on the X chromosome was identified through identity-by-descent analysis of 3 affected males. Exome sequencing was conducted on one affected male to identify the disease-causing gene and variant. A splice site variant (c.502-2A>G) in the FHL1 gene was highly suspicious among other candidate genes and variants. FHL1A is the predominant isoform of FHL1 in cardiac and skeletal muscle. Sequencing cDNA showed the splice site variant led to skipping of exon 6 of the FHL1A isoform, equivalent to the FHL1C isoform. Targeted analysis showed that this splice site variant cosegregated with disease in the family. Western blot and immunohistochemical analysis of muscle from the proband showed a significant decrease in protein expression of FHL1A. Real-time polymerase chain reaction analysis of different isoforms of FHL1 demonstrated that FHL1C is markedly increased. Mutations in the FHL1 gene have been reported in disorders with skeletal and cardiac myopathy but none has the skeletal or facial phenotype seen in patients with Uruguay syndrome. Our data suggest that a novel FHL1 splice site variant results in the absence of FHL1A and the abundance of FHL1C, which may contribute to the complex and severe phenotype. Mutation screening of the FHL1 gene should be considered for patients with uncharacterized myopathies and cardiomyopathies. © 2016 American Heart Association, Inc.
THAP1/DYT6 sequence variants in non-DYT1 early-onset primary dystonia in China and their effects on RNA expression.

PubMed

Cheng, Fu Bo; Ozelius, Laurie J; Wan, Xin Hua; Feng, Jia Chun; Ma, Ling Yan; Yang, Ying Mai; Wang, Lin

2012-02-01

Mutations in the THAP1 gene were recently identified as the cause of DYT6 primary dystonia. More than 40 mutations in this gene have been described in different populations. However, no previous report has identified sequence variations that affect the transcript process of the THAP1 gene. In addition, the mutation frequency in Chinese early-onset primary dystonia has not been well characterized. One hundred and two unrelated patients with non-DYT1 early-onset primary dystonia (age at onset <26 years), family members of participants with mutations, and 200 neurologically normal controls were screened for THAP1 gene mutations. The effects of the identified mutations on RNA expression were analyzed using semi-quantitative real-time PCR. Seven sequence variants (c.63_66del TTTC, c.161G>T, c.224A>T, c.267G>A, c.339T>C, c.449A>C, and c.539T>C) were identified in this group of patients (6.9%). In this cohort, 15 subjects (seven unrelated patients and eight family members) were found to have THAP1 sequence variants. Among these 15 subjects, 11 were symptomatic (penetrance of DYT6 was 73.3%) and seven presented with craniocervical involvement (63.6%). However, one patient manifested paroxysmal headshake, and one presented with essential hand tremor. Semi-quantitative real-time PCR indicated that a novel silent mutation (c.267G>A) decreased the expression of THAP1 in human lymphocytes. Our findings indicated that THAP1 sequence variants are not common in non-DYT1 early-onset primary dystonia in China and that the clinical manifestation may vary. One silent mutation (c.267G>A) was shown to affect THAP1 expression.
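The percentages in the record above can be rechecked from the counts it states. The following minimal sketch recomputes them; the variable names are illustrative, and the choice of denominators (102 patients, 15 variant carriers, 11 symptomatic carriers) is inferred from the wording of the abstract.

```python
def percent(part: int, whole: int) -> float:
    """Return part/whole as a percentage."""
    return 100.0 * part / whole


unrelated_patients = 102      # non-DYT1 early-onset primary dystonia patients screened
patients_with_variant = 7     # unrelated patients carrying a THAP1 sequence variant
carriers = 15                 # seven patients plus eight family members
symptomatic_carriers = 11     # carriers who showed symptoms
craniocervical = 7            # symptomatic carriers with craniocervical involvement

variant_frequency = percent(patients_with_variant, unrelated_patients)
penetrance = percent(symptomatic_carriers, carriers)
craniocervical_share = percent(craniocervical, symptomatic_carriers)

print(f"variant frequency among patients: {variant_frequency:.1f}%")
print(f"penetrance among carriers: {penetrance:.1f}%")
print(f"craniocervical involvement among symptomatic carriers: {craniocervical_share:.1f}%")
```

Penetrance here is simply the share of variant carriers who are symptomatic, so it depends on how many unaffected relatives were tested.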
Two Novel Variants Affecting CDKL5 Transcript Associated with Epileptic Encephalopathy.

PubMed

Neupauerová, Jana; Štěrbová, Katalin; Vlčková, Markéta; Sebroňová, Věra; Maříková, Tat'ána; Krůtová, Marcela; David, Staněk; Kršek, Pavel; Žaliová, Markéta; Seeman, Pavel; Laššuthová, Petra

2017-10-01

Variants in the human X-linked cyclin-dependent kinase-like 5 (CDKL5) gene have been reported as being etiologically associated with early infantile epileptic encephalopathy type 2 (EIEE2). We report on two patients, a boy and a girl, with EIEE2 who presented with early onset epilepsy, hypotonia, severe intellectual disability, and poor eye contact. Massively parallel sequencing (MPS) of a custom-designed gene panel for epilepsy and epileptic encephalopathy containing 112 epilepsy-related genes was performed. Sanger sequencing was used to confirm the novel variants. For confirmation of the functional consequence of an intronic CDKL5 variant in patient 2, an RNA study was done. DNA sequencing revealed de novo variants in CDKL5, a c.2578C>T (p.Gln860*) present in a hemizygous state in a 3-year-old boy, and a potential splice site variant c.463+5G>A in a heterozygous state in a 5-year-old girl. Multiple in silico splicing algorithms predicted a highly reduced splice site score for c.463+5G>A. A subsequent mRNA study confirmed an aberrant shorter transcript lacking exon 7. Our data confirmed that variants in CDKL5 are associated with EIEE2. There is credible evidence that the novel identified variants are pathogenic and, therefore, are likely the cause of the disease in the presented patients. In one of the patients a stop codon variant is predicted to produce a truncated protein, and in the other patient an intronic variant results in aberrant splicing.
Targeted next generation sequencing identified a novel mutation in MYO7A causing Usher syndrome type 1 in an Iranian consanguineous pedigree.

PubMed

Kooshavar, Daniz; Razipour, Masoumeh; Movasat, Morteza; Keramatipour, Mohammad

2018-01-01

Usher syndrome (USH) is characterized by congenital hearing loss and retinitis pigmentosa (RP) with a later onset. It is an autosomal recessive trait with clinical and genetic heterogeneity, which makes the molecular diagnosis difficult. In this study, we introduce a pedigree with two affected members with USH type 1 and present a cost- and time-effective approach for genetic diagnosis of USH as a genetically heterogeneous disorder. Target region capture in the genes of interest, followed by next generation sequencing (NGS), was used to determine the causative mutations in one of the probands. Then segregation analysis in the pedigree was conducted using PCR-Sanger sequencing. Targeted NGS detected a novel homozygous nonsense variant c.4513G>T (p.Glu1505Ter) in MYO7A. The variant segregates in the pedigree with an autosomal recessive pattern. In this study, a novel stop-gained variant c.4513G>T (p.Glu1505Ter) in MYO7A was found in an Iranian pedigree with two affected members with USH type 1. Bioinformatic as well as pedigree segregation analyses were in line with the pathogenic nature of this variant. The targeted NGS panel was shown to be an efficient method for mutation detection in hereditary disorders with locus heterogeneity. Copyright © 2017 Elsevier B.V. All rights reserved.
Post-mortem testing; germline BRCA1/2 variant detection using archival FFPE non-tumor tissue. A new paradigm in genetic counseling.

PubMed

Petersen, Annabeth Høgh; Aagaard, Mads Malik; Nielsen, Henriette Roed; Steffensen, Karina Dahl; Waldstrøm, Marianne; Bojesen, Anders

2016-08-01

Accurate estimation of cancer risk in HBOC families often requires BRCA1/2 testing, but this may be impossible in deceased family members. Previously, testing archival formalin-fixed, paraffin-embedded (FFPE) tissue for germline BRCA1/2 variants was unsuccessful, except for the Jewish founder mutations. A high-throughput method to systematically test for variants in all coding regions of BRCA1/2 in archival FFPE samples of non-tumor tissue is described, using HaloPlex target enrichment and next-generation sequencing. In a validation study, correct identification of variants or wild-type was possible in 25 out of 30 (83%) FFPE samples (age range 1-14 years), with a known variant status in BRCA1/2. No false positive was found. Unsuccessful identification was due to highly degraded DNA or presence of large intragenic deletions. In clinical use, a total of 201 FFPE samples (aged 0-43 years) were processed. Thirty-six samples were rejected because of highly degraded DNA or failed library preparation. Fifteen samples were investigated to search for a known variant. In the remaining 150 samples (aged 0-38 years), three variants known to affect function and one variant likely to affect function in BRCA1, six variants known to affect function and one variant likely to affect function in BRCA2, as well as four variants of unknown significance (VUS) in BRCA1 and three VUS in BRCA2 were discovered. It is now possible to test for germline BRCA1/2 variants in deceased persons, using archival FFPE samples from non-tumor tissue. Accurate genetic counseling is achievable in families where variant testing would otherwise be impossible.
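The sample accounting in the record above can be verified with a few lines of arithmetic. This is a minimal sketch using only the counts stated in the abstract; the dictionary keys are illustrative labels, and the tally counts variants rather than distinct samples, since the abstract does not say how variants were distributed across samples.

```python
# Validation study: samples with known BRCA1/2 status that were called correctly.
validated, tested = 25, 30
validation_rate = 100.0 * validated / tested

# Clinical use: samples processed, rejected, and used for known-variant searches.
processed, rejected, known_variant_search = 201, 36, 15
screened = processed - rejected - known_variant_search

# Variants reported in the screened samples, by gene and classification.
findings = {
    ("BRCA1", "known to affect function"): 3,
    ("BRCA1", "likely to affect function"): 1,
    ("BRCA2", "known to affect function"): 6,
    ("BRCA2", "likely to affect function"): 1,
    ("BRCA1", "VUS"): 4,
    ("BRCA2", "VUS"): 3,
}
total_variants = sum(findings.values())

print(f"validation rate: {validation_rate:.0f}%")
print(f"samples screened for unknown variants: {screened}")
print(f"variants reported: {total_variants}")
```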
Post-mortem testing; germline BRCA1/2 variant detection using archival FFPE non-tumor tissue. A new paradigm in genetic counseling

PubMed Central

Petersen, Annabeth Høgh; Aagaard, Mads Malik; Nielsen, Henriette Roed; Steffensen, Karina Dahl; Waldstrøm, Marianne; Bojesen, Anders

2016-01-01

Accurate estimation of cancer risk in HBOC families often requires BRCA1/2 testing, but this may be impossible in deceased family members. Previously, testing archival formalin-fixed, paraffin-embedded (FFPE) tissue for germline BRCA1/2 variants was unsuccessful, except for the Jewish founder mutations. A high-throughput method to systematically test for variants in all coding regions of BRCA1/2 in archival FFPE samples of non-tumor tissue is described, using HaloPlex target enrichment and next-generation sequencing. In a validation study, correct identification of variants or wild-type was possible in 25 out of 30 (83%) FFPE samples (age range 1–14 years), with a known variant status in BRCA1/2. No false positives were found. Unsuccessful identification was due to highly degraded DNA or presence of large intragenic deletions. In clinical use, a total of 201 FFPE samples (aged 0–43 years) were processed. Thirty-six samples were rejected because of highly degraded DNA or failed library preparation. Fifteen samples were investigated to search for a known variant. In the remaining 150 samples (aged 0–38 years), three variants known to affect function and one variant likely to affect function in BRCA1, six variants known to affect function and one variant likely to affect function in BRCA2, as well as four variants of unknown significance (VUS) in BRCA1 and three VUS in BRCA2 were discovered. It is now possible to test for germline BRCA1/2 variants in deceased persons, using archival FFPE samples from non-tumor tissue. Accurate genetic counseling is achievable in families where variant testing would otherwise be impossible. PMID:26733283
Burden of rare variants in ALS genes influences survival in familial and sporadic ALS.

PubMed

Pang, Shirley Yin-Yu; Hsu, Jacob Shujui; Teo, Kay-Cheong; Li, Yan; Kung, Michelle H W; Cheah, Kathryn S E; Chan, Danny; Cheung, Kenneth M C; Li, Miaoxin; Sham, Pak-Chung; Ho, Shu-Leong

2017-10-01

Genetic variants are implicated in the development of amyotrophic lateral sclerosis (ALS), but it is unclear whether the burden of rare variants in ALS genes has an effect on survival. We performed whole genome sequencing on 8 familial ALS (FALS) patients with superoxide dismutase 1 (SOD1) mutation and whole exome sequencing on 46 sporadic ALS (SALS) patients living in Hong Kong and found that 67% had at least 1 rare variant in the exons of 40 ALS genes; 22% had 2 or more. Patients with 2 or more rare variants had lower probability of survival than patients with 0 or 1 variant (p = 0.001). After adjusting for other factors, each additional rare variant increased the risk of respiratory failure or death by 60% (p = 0.0098). The presence of the rare variant was associated with the risk of ALS (Odds ratio 1.91, 95% confidence interval 1.03-3.61, p = 0.03), and ALS patients had higher rare variant burden than controls (MB, p = 0.004). Our findings support an oligogenic basis with the burden of rare variants affecting the development and survival of ALS. Copyright © 2017 The Author(s). Published by Elsevier Inc. All rights reserved.
STAG3 truncating variant as the cause of primary ovarian insufficiency

PubMed Central

Le Quesne Stabej, Polona; Williams, Hywel J; James, Chela; Tekman, Mehmet; Stanescu, Horia C; Kleta, Robert; Ocaka, Louise; Lescai, Francesco; Storr, Helen L; Bitner-Glindzicz, Maria; Bacchelli, Chiara; Conway, Gerard S

2016-01-01

Primary ovarian insufficiency (POI) is a distressing cause of infertility in young women. POI is heterogeneous with only a few causative genes having been discovered so far. Our objective was to determine the genetic cause of POI in a consanguineous Lebanese family with two affected sisters presenting with primary amenorrhoea and an absence of any pubertal development. Multipoint parametric linkage analysis was performed. Whole-exome sequencing was done on the proband. Linkage analysis identified a locus on chromosome 7 where exome sequencing successfully identified a homozygous two base pair duplication (c.1947_48dupCT), leading to a truncated protein p.(Y650Sfs*22) in the STAG3 gene, confirming it as the cause of POI in this family. Exome sequencing combined with linkage analyses offers a powerful tool to efficiently find novel genetic causes of rare, heterogeneous disorders, even in small single families. This is only the second report of a STAG3 variant; the first STAG3 variant was recently described in a phenotypically similar family with extreme POI. Identification of an additional family highlights the importance of STAG3 in POI pathogenesis and suggests it should be evaluated in families affected with POI. PMID:26059840
Single nucleotide variants and InDels identified from whole-genome re-sequencing of Guzerat, Gyr, Girolando and Holstein cattle breeds.

PubMed

Stafuzza, Nedenia Bonvino; Zerlotini, Adhemar; Lobo, Francisco Pereira; Yamagishi, Michel Eduardo Beleza; Chud, Tatiane Cristina Seleguim; Caetano, Alexandre Rodrigues; Munari, Danísio Prado; Garrick, Dorian J; Machado, Marco Antonio; Martins, Marta Fonseca; Carvalho, Maria Raquel; Cole, John Bruce; Barbosa da Silva, Marcos Vinicius Gualberto

2017-01-01

Whole-genome re-sequencing, alignment and annotation analyses were undertaken for 12 sires representing four important cattle breeds in Brazil: Guzerat (multi-purpose), Gyr, Girolando and Holstein (dairy production). A total of approximately 4.3 billion reads from an Illumina HiSeq 2000 sequencer generated 10.7- to 16.4-fold genome coverage for each animal. A total of 27,441,279 single nucleotide variations (SNVs) and 3,828,041 insertions/deletions (InDels) were detected in the samples, of which 2,557,670 SNVs and 883,219 InDels were novel. The submission of these genetic variants to the dbSNP database significantly increased the number of known variants, particularly for the indicine genome. The concordance rate between genotypes obtained using the Bovine HD BeadChip array and the same variants identified by sequencing was about 99.05%. The annotation of variants identified numerous non-synonymous SNVs and frameshift InDels which could affect phenotypic variation. Functional enrichment analysis revealed that variants in the olfactory transduction pathway were overrepresented in all four cattle breeds, while the ECM-receptor interaction pathway was overrepresented in the Girolando and Guzerat breeds, the ABC transporters pathway was overrepresented only in the Holstein breed, and metabolic pathways were overrepresented only in the Gyr breed. The genetic variants discovered here provide a rich resource to help identify potential genomic markers and their associated molecular mechanisms that impact economically important traits for Gyr, Girolando, Guzerat and Holstein breeding programs.
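As a rough illustration of the array-versus-sequencing concordance rate quoted above, the sketch below compares genotype calls at sites typed on both platforms. The function name `genotype_concordance`, the site keys, and the dictionary layout are hypothetical and not taken from the study; genotypes are treated as unordered allele pairs, which is an assumption of this sketch.

```python
def genotype_concordance(array_calls, seq_calls):
    """Fraction of sites called on both platforms whose genotypes agree.

    Both inputs map a site key, e.g. ("1", 12345), to a genotype given as
    a pair of alleles. Allele order is ignored; sites that are missing
    from either platform, or whose call is None, are skipped.
    """
    compared = 0
    matching = 0
    for site, array_gt in array_calls.items():
        seq_gt = seq_calls.get(site)
        if array_gt is None or seq_gt is None:
            continue
        compared += 1
        if sorted(array_gt) == sorted(seq_gt):
            matching += 1
    if compared == 0:
        raise ValueError("no sites were called on both platforms")
    return matching / compared
```

The denominator counts only sites with a call on both platforms, so missing data lowers the number of comparisons rather than the rate itself.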
Single nucleotide variants and InDels identified from whole-genome re-sequencing of Guzerat, Gyr, Girolando and Holstein cattle breeds

PubMed Central

Lobo, Francisco Pereira; Yamagishi, Michel Eduardo Beleza; Chud, Tatiane Cristina Seleguim; Caetano, Alexandre Rodrigues; Munari, Danísio Prado; Garrick, Dorian J.; Machado, Marco Antonio; Martins, Marta Fonseca; Carvalho, Maria Raquel; Cole, John Bruce; Barbosa da Silva, Marcos Vinicius Gualberto

2017-01-01

Whole-genome re-sequencing, alignment and annotation analyses were undertaken for 12 sires representing four important cattle breeds in Brazil: Guzerat (multi-purpose), Gyr, Girolando and Holstein (dairy production). A total of approximately 4.3 billion reads from an Illumina HiSeq 2000 sequencer generated 10.7- to 16.4-fold genome coverage for each animal. A total of 27,441,279 single nucleotide variations (SNVs) and 3,828,041 insertions/deletions (InDels) were detected in the samples, of which 2,557,670 SNVs and 883,219 InDels were novel. The submission of these genetic variants to the dbSNP database significantly increased the number of known variants, particularly for the indicine genome. The concordance rate between genotypes obtained using the Bovine HD BeadChip array and the same variants identified by sequencing was about 99.05%. The annotation of variants identified numerous non-synonymous SNVs and frameshift InDels which could affect phenotypic variation. Functional enrichment analysis revealed that variants in the olfactory transduction pathway were overrepresented in all four cattle breeds, while the ECM-receptor interaction pathway was overrepresented in the Girolando and Guzerat breeds, the ABC transporters pathway was overrepresented only in the Holstein breed, and metabolic pathways were overrepresented only in the Gyr breed. The genetic variants discovered here provide a rich resource to help identify potential genomic markers and their associated molecular mechanisms that impact economically important traits for Gyr, Girolando, Guzerat and Holstein breeding programs. PMID:28323836
Genotype–phenotype correlations in individuals with pathogenic RERE variants

PubMed Central

Jordan, Valerie K.; Fregeau, Brieana; Ge, Xiaoyan; Giordano, Jessica; Wapner, Ronald J.; Balci, Tugce B.; Carter, Melissa T.; Bernat, John A.; Moccia, Amanda N.; Srivastava, Anshika; Martin, Donna M.; Bielas, Stephanie L.; Pappas, John; Svoboda, Melissa D.; Rio, Marlène; Boddaert, Nathalie; Cantagrel, Vincent; Lewis, Andrea M.; Scaglia, Fernando; Kohler, Jennefer N.; Bernstein, Jonathan A.; Dries, Annika M.; Rosenfeld, Jill A.; DeFilippo, Colette; Thorson, Willa; Yang, Yaping; Sherr, Elliott H.; Bi, Weimin; Scott, Daryl A.

2018-01-01

Heterozygous variants in the arginine-glutamic acid dipeptide repeats gene (RERE) have been shown to cause neurodevelopmental disorder with or without anomalies of the brain, eye, or heart (NEDBEH). Here, we report nine individuals with NEDBEH who carry partial deletions or deleterious sequence variants in RERE. These variants were found to be de novo in all cases in which parental samples were available. An analysis of data from individuals with NEDBEH suggests that point mutations affecting the Atrophin-1 domain of RERE are associated with an increased risk of structural eye defects, congenital heart defects, renal anomalies, and sensorineural hearing loss when compared with loss-of-function variants that are likely to lead to haploinsufficiency. A high percentage of RERE pathogenic variants affect a histidine-rich region in the Atrophin-1 domain. We have also identified a recurrent two-amino-acid duplication in this region that is associated with the development of a CHARGE syndrome-like phenotype. We conclude that mutations affecting RERE result in a spectrum of clinical phenotypes. Genotype–phenotype correlations exist and can be used to guide medical decision making. Consideration should also be given to screening for RERE variants in individuals who fulfill diagnostic criteria for CHARGE syndrome but do not carry pathogenic variants in CHD7. PMID:29330883
Genotype-phenotype correlations in individuals with pathogenic RERE variants.

PubMed

Jordan, Valerie K; Fregeau, Brieana; Ge, Xiaoyan; Giordano, Jessica; Wapner, Ronald J; Balci, Tugce B; Carter, Melissa T; Bernat, John A; Moccia, Amanda N; Srivastava, Anshika; Martin, Donna M; Bielas, Stephanie L; Pappas, John; Svoboda, Melissa D; Rio, Marlène; Boddaert, Nathalie; Cantagrel, Vincent; Lewis, Andrea M; Scaglia, Fernando; Kohler, Jennefer N; Bernstein, Jonathan A; Dries, Annika M; Rosenfeld, Jill A; DeFilippo, Colette; Thorson, Willa; Yang, Yaping; Sherr, Elliott H; Bi, Weimin; Scott, Daryl A

2018-05-01

Heterozygous variants in the arginine-glutamic acid dipeptide repeats gene (RERE) have been shown to cause neurodevelopmental disorder with or without anomalies of the brain, eye, or heart (NEDBEH). Here, we report nine individuals with NEDBEH who carry partial deletions or deleterious sequence variants in RERE. These variants were found to be de novo in all cases in which parental samples were available. An analysis of data from individuals with NEDBEH suggests that point mutations affecting the Atrophin-1 domain of RERE are associated with an increased risk of structural eye defects, congenital heart defects, renal anomalies, and sensorineural hearing loss when compared with loss-of-function variants that are likely to lead to haploinsufficiency. A high percentage of RERE pathogenic variants affect a histidine-rich region in the Atrophin-1 domain. We have also identified a recurrent two-amino-acid duplication in this region that is associated with the development of a CHARGE syndrome-like phenotype. We conclude that mutations affecting RERE result in a spectrum of clinical phenotypes. Genotype-phenotype correlations exist and can be used to guide medical decision making. Consideration should also be given to screening for RERE variants in individuals who fulfill diagnostic criteria for CHARGE syndrome but do not carry pathogenic variants in CHD7. © 2018 Wiley Periodicals, Inc.
Next generation sequencing in women affected by nonsyndromic premature ovarian failure displays new potential causative genes and mutations.

PubMed

Fonseca, Dora Janeth; Patiño, Liliana Catherine; Suárez, Yohjana Carolina; de Jesús Rodríguez, Asid; Mateus, Heidi Eliana; Jiménez, Karen Marcela; Ortega-Recalde, Oscar; Díaz-Yamal, Ivonne; Laissue, Paul

2015-07-01

To identify new molecular actors involved in nonsyndromic premature ovarian failure (POF) etiology. This is a retrospective case-control cohort study. University research group and IVF medical center. Twelve women affected by nonsyndromic POF. The control group included 176 women whose menopause had occurred after age 50 and had no antecedents regarding gynecological disease. A further 345 women from the same ethnic origin (general population group) were also recruited to assess allele frequency for potentially deleterious sequence variants. Next generation sequencing (NGS), Sanger sequencing, and bioinformatics analysis. The complete coding regions of 70 candidate genes were massively sequenced, via NGS, in POF patients. Bioinformatics and genetics were used to confirm NGS results and to identify potential sequence variants related to the disease pathogenesis. We have identified mutations in two novel genes, ADAMTS19 and BMPR2, that are potentially related to POF origin. LHCGR mutations, which might have contributed to the phenotype, were also detected. We thus recommend NGS as a powerful tool for identifying new molecular actors in POF and for future diagnostic/prognostic purposes. Copyright © 2015 American Society for Reproductive Medicine. Published by Elsevier Inc. All rights reserved.
Germline pathogenic variants in PALB2 and other cancer-predisposing genes in families with hereditary diffuse gastric cancer without CDH1 mutation: a whole-exome sequencing study.

PubMed

Fewings, Eleanor; Larionov, Alexey; Redman, James; Goldgraben, Mae A; Scarth, James; Richardson, Susan; Brewer, Carole; Davidson, Rosemarie; Ellis, Ian; Evans, D Gareth; Halliday, Dorothy; Izatt, Louise; Marks, Peter; McConnell, Vivienne; Verbist, Louis; Mayes, Rebecca; Clark, Graeme R; Hadfield, James; Chin, Suet-Feung; Teixeira, Manuel R; Giger, Olivier T; Hardwick, Richard; di Pietro, Massimiliano; O'Donovan, Maria; Pharoah, Paul; Caldas, Carlos; Fitzgerald, Rebecca C; Tischkowitz, Marc

2018-04-26

Germline pathogenic variants in the E-cadherin gene (CDH1) are strongly associated with the development of hereditary diffuse gastric cancer. There is a paucity of data to guide risk assessment and management of families with hereditary diffuse gastric cancer that do not carry a CDH1 pathogenic variant, making it difficult to make informed decisions about surveillance and risk-reducing surgery. We aimed to identify new candidate genes associated with predisposition to hereditary diffuse gastric cancer in affected families without pathogenic CDH1 variants. We did whole-exome sequencing on DNA extracted from the blood of 39 individuals (28 individuals diagnosed with hereditary diffuse gastric cancer and 11 unaffected first-degree relatives) in 22 families without pathogenic CDH1 variants. Genes with loss-of-function variants were prioritised using gene-interaction analysis to identify clusters of genes that could be involved in predisposition to hereditary diffuse gastric cancer. Protein-affecting germline variants were identified in probands from six families with hereditary diffuse gastric cancer; variants were found in genes known to predispose to cancer and in lesser-studied DNA repair genes. A frameshift deletion in PALB2 was found in one member of a family with a history of gastric and breast cancer. Two different MSH2 variants were identified in two unrelated affected individuals, including one frameshift insertion and one previously described start-codon loss. One family had a unique combination of variants in the DNA repair genes ATR and NBN. Two variants in the DNA repair gene RECQL5 were identified in two unrelated families: one missense variant and a splice-acceptor variant. The results of this study suggest a role for the known cancer predisposition gene PALB2 in families with hereditary diffuse gastric cancer and no detected pathogenic CDH1 variants. We also identified new candidate genes associated with disease risk in these families. 
Funding: UK Medical Research Council (Sackler programme), European Research Council under the European Union's Seventh Framework Programme (2007-13), National Institute for Health Research Cambridge Biomedical Research Centre, Experimental Cancer Medicine Centres, and Cancer Research UK. Copyright © 2018 The Author(s). Published by Elsevier Ltd. This is an open access article under the CC BY 4.0 license. All rights reserved.
GM2 Gangliosidosis in Shiba Inu Dogs with an In-Frame Deletion in HEXB.

PubMed

Kolicheski, A; Johnson, G S; Villani, N A; O'Brien, D P; Mhlanga-Mutangadura, T; Wenger, D A; Mikoloski, K; Eagleson, J S; Taylor, J F; Schnabel, R D; Katz, M L

2017-09-01

Consistent with a tentative diagnosis of neuronal ceroid lipofuscinosis (NCL), autofluorescent cytoplasmic storage bodies were found in neurons from the brains of 2 related Shiba Inu dogs with a young-adult onset, progressive neurodegenerative disease. Unexpectedly, no potentially causal NCL-related variants were identified in a whole-genome sequence generated with DNA from 1 of the affected dogs. Instead, the whole-genome sequence contained a homozygous 3 base pair (bp) deletion in a coding region of HEXB. The other affected dog also was homozygous for this 3-bp deletion. Mutations in the human HEXB ortholog cause Sandhoff disease, a type of GM2 gangliosidosis. Thin-layer chromatography confirmed that GM2 ganglioside had accumulated in an affected Shiba Inu brain. Enzymatic analysis confirmed that the GM2 gangliosidosis resulted from a deficiency in the HEXB encoded protein and not from a deficiency in products from HEXA or GM2A, which are known alternative causes of GM2 gangliosidosis. We conclude that the homozygous 3-bp deletion in HEXB is the likely cause of the Shiba Inu neurodegenerative disease and that whole-genome sequencing can lead to the early identification of potentially disease-causing DNA variants thereby refocusing subsequent diagnostic analyses toward confirming or refuting candidate variant causality. Copyright © 2017 The Authors. Journal of Veterinary Internal Medicine published by Wiley Periodicals, Inc. on behalf of the American College of Veterinary Internal Medicine.
Identification of Inherited Retinal Disease-Associated Genetic Variants in 11 Candidate Genes.

PubMed

Astuti, Galuh D N; van den Born, L Ingeborgh; Khan, M Imran; Hamel, Christian P; Bocquet, Béatrice; Manes, Gaël; Quinodoz, Mathieu; Ali, Manir; Toomes, Carmel; McKibbin, Martin; El-Asrag, Mohammed E; Haer-Wigman, Lonneke; Inglehearn, Chris F; Black, Graeme C M; Hoyng, Carel B; Cremers, Frans P M; Roosing, Susanne

2018-01-10

Inherited retinal diseases (IRDs) display an enormous genetic heterogeneity. Whole exome sequencing (WES) recently identified genes that were mutated in a small proportion of IRD cases. Consequently, finding a second case or family carrying pathogenic variants in the same candidate gene often is challenging. In this study, we searched for novel candidate IRD gene-associated variants in isolated IRD families, assessed their causality, and searched for novel genotype-phenotype correlations. Whole exome sequencing was performed in 11 probands affected with IRDs. Homozygosity mapping data was available for five cases. Variants with minor allele frequencies ≤ 0.5% in public databases were selected as candidate disease-causing variants. These variants were ranked based on their: (a) presence in a gene that was previously implicated in IRD; (b) minor allele frequency in the Exome Aggregation Consortium database (ExAC); (c) in silico pathogenicity assessment using the combined annotation dependent depletion (CADD) score; and (d) interaction of the corresponding protein with known IRD-associated proteins. Twelve unique variants were found in 11 different genes in 11 IRD probands. Novel autosomal recessive and dominant inheritance patterns were found for variants in Small Nuclear Ribonucleoprotein U5 Subunit 200 (SNRNP200) and Zinc Finger Protein 513 (ZNF513), respectively. Using our pathogenicity assessment, a variant in DEAH-Box Helicase 32 (DHX32) was the top ranked novel candidate gene to be associated with IRDs, followed by eight medium and lower ranked candidate genes. The identification of candidate disease-associated sequence variants in 11 single families underscores the notion that the previously identified IRD-associated genes collectively carry > 90% of the defects implicated in IRDs. To identify multiple patients or families with variants in the same gene and thereby provide extra proof for pathogenicity, worldwide data sharing is needed.
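The filtering and ranking steps described above can be sketched roughly as follows. The 0.5% frequency cutoff comes from the abstract; everything else, including the `Variant` record, the `rank_candidates` function, and especially the order in which the four criteria are combined, is a hypothetical choice for illustration, since the abstract does not say how the criteria were weighted.

```python
from dataclasses import dataclass

MAF_CUTOFF = 0.005  # 0.5%, the threshold stated in the abstract


@dataclass
class Variant:
    gene: str
    maf: float                # minor allele frequency in a public database
    cadd: float               # CADD score (higher = predicted more deleterious)
    known_ird_gene: bool      # gene previously implicated in IRD
    interacts_with_ird: bool  # protein interacts with a known IRD protein


def rank_candidates(variants):
    """Keep rare variants and order them by a hypothetical priority key."""
    rare = [v for v in variants if v.maf <= MAF_CUTOFF]
    return sorted(
        rare,
        key=lambda v: (
            not v.known_ird_gene,      # known IRD genes first
            not v.interacts_with_ird,  # then interaction partners
            -v.cadd,                   # then higher CADD scores
            v.maf,                     # then rarer alleles
        ),
    )
```

A lexicographic key keeps the sketch easy to read; a real pipeline might instead combine the criteria into a weighted score.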

One Size Doesn't Fit All - RefEditor: Building Personalized Diploid Reference Genome to Improve Read Mapping and Genotype Calling in Next Generation Sequencing Studies

PubMed Central

Yuan, Shuai; Johnston, H. Richard; Zhang, Guosheng; Li, Yun; Hu, Yi-Juan; Qin, Zhaohui S.

2015-01-01

With the rapid decline in sequencing cost, researchers today rush to embrace whole genome sequencing (WGS) or whole exome sequencing (WES) as the next powerful tool for relating genetic variants to human diseases and phenotypes. A fundamental step in analyzing WGS and WES data is mapping short sequencing reads back to the reference genome. This is an important issue because incorrectly mapped reads affect the downstream variant discovery, genotype calling and association analysis. Although many read mapping algorithms have been developed, the majority of them use the universal reference genome and do not take sequence variants into consideration. Given that genetic variants are ubiquitous, it is highly desirable to factor them into the read mapping procedure. In this work, we developed a novel strategy that utilizes genotypes obtained a priori to customize the universal haploid reference genome into a personalized diploid reference genome. The new strategy is implemented in a program named RefEditor. When applying RefEditor to real data, we achieved encouraging improvements in read mapping, variant discovery and genotype calling. Compared to standard approaches, RefEditor can significantly increase genotype calling consistency (from 43% to 61% at 4X coverage; from 82% to 92% at 20X coverage) and reduce Mendelian inconsistency across various sequencing depths. Because many WGS and WES studies are conducted on cohorts that have been genotyped using array-based genotyping platforms previously or concurrently, we believe the proposed strategy will be of high value in practice; it can also be applied to the scenario where multiple NGS experiments are conducted on the same cohort. The RefEditor sources are available at https://github.com/superyuan/refeditor. PMID:26267278
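To make the idea of turning a haploid reference into a personalized diploid reference concrete, here is a minimal sketch. It is not RefEditor's implementation: the function `personalize_reference` and its input layout are hypothetical, it assumes the genotypes are already phased, and it handles only single-base substitutions so that coordinates stay unchanged.

```python
def personalize_reference(reference, phased_snvs):
    """Return two haplotype sequences built from a haploid reference.

    reference   : str, the haploid reference sequence
    phased_snvs : iterable of (position, allele_hap1, allele_hap2) with
                  0-based positions and single-base alleles
    """
    hap1 = list(reference)
    hap2 = list(reference)
    for pos, allele1, allele2 in phased_snvs:
        if not 0 <= pos < len(reference):
            raise IndexError(f"position {pos} is outside the reference")
        if len(allele1) != 1 or len(allele2) != 1:
            raise ValueError("this sketch only handles single-base alleles")
        hap1[pos] = allele1
        hap2[pos] = allele2
    return "".join(hap1), "".join(hap2)
```

Reads could then be mapped against both haplotype sequences instead of the universal reference; supporting insertions and deletions would additionally require a coordinate map back to the original reference.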
A FRMD7 variant in a Japanese family causes congenital nystagmus.

PubMed

Kohmoto, Tomohiro; Okamoto, Nana; Satomura, Shigeko; Naruto, Takuya; Komori, Takahide; Hashimoto, Toshiaki; Imoto, Issei

2015-01-01

Idiopathic congenital nystagmus (ICN) is a genetically heterogeneous eye movement disorder that causes a large proportion of childhood visual impairment. Here we describe a missense variant (p.L292P) within a mutation-rich region of FRMD7 detected in three affected male siblings in a Japanese family with X-linked ICN. Combining sequence analysis and results from structural and functional predictions, we report p.L292P as a variant potentially disrupting FRMD7 function associated with X-linked ICN.
A FRMD7 variant in a Japanese family causes congenital nystagmus

PubMed Central

Kohmoto, Tomohiro; Okamoto, Nana; Satomura, Shigeko; Naruto, Takuya; Komori, Takahide; Hashimoto, Toshiaki; Imoto, Issei

2015-01-01

Idiopathic congenital nystagmus (ICN) is a genetically heterogeneous eye movement disorder that causes a large proportion of childhood visual impairment. Here we describe a missense variant (p.L292P) within a mutation-rich region of FRMD7 detected in three affected male siblings in a Japanese family with X-linked ICN. Combining sequence analysis and results from structural and functional predictions, we report p.L292P as a variant potentially disrupting FRMD7 function associated with X-linked ICN. PMID:27081518
Feline hypersomatotropism and acromegaly tumorigenesis: a potential role for the AIP gene.

PubMed

Scudder, C J; Niessen, S J; Catchpole, B; Fowkes, R C; Church, D B; Forcada, Y

2017-04-01

Acromegaly in humans is usually sporadic; however, up to 20% of familial isolated pituitary adenomas are caused by germline sequence variants of the aryl-hydrocarbon-receptor interacting protein (AIP) gene. Feline acromegaly has similarities to human acromegalic families with AIP mutations. The aim of this study was to sequence the feline AIP gene, identify sequence variants and compare the AIP gene sequence between feline acromegalic and control cats, and in acromegalic siblings. The feline AIP gene was amplified through PCR using whole blood genomic DNA from 10 acromegalic and 10 control cats, and 3 sibling pairs affected by acromegaly. PCR products were sequenced and compared with the published predicted feline AIP gene. A single nonsynonymous SNP was identified in exon 1 (AIP:c.9T > G) of two acromegalic cats and none of the control cats, as well as both members of one sibling pair. The region of this SNP is considered essential for the interaction of the AIP protein with its receptor. This sequence variant has not previously been reported in humans. Two additional synonymous sequence variants were identified (AIP:c.481C > T and AIP:c.826C > T). This is the first molecular study to investigate a potential genetic cause of feline acromegaly, and it identified a nonsynonymous AIP single nucleotide polymorphism in 20% of the acromegalic cat population evaluated, as well as in one of the sibling pairs evaluated. Copyright © 2016 Elsevier Inc. All rights reserved.
Identification of novel mutations and sequence variants in the SOX2 and CHX10 genes in patients with anophthalmia/microphthalmia

PubMed Central

Zhou, Jie; Kherani, Femida; Bardakjian, Tanya M.; Katowitz, James; Hughes, Nkecha; Schimmenti, Lisa A.; Schneider, Adele

2008-01-01

Purpose Mutations in the SOX2 and CHX10 genes have been reported in patients with anophthalmia and/or microphthalmia. In this study, we evaluated 34 anophthalmic/microphthalmic patient DNA samples (two sets of siblings included) for mutations and sequence variants in SOX2 and CHX10. Methods Conformational sensitive gel electrophoresis (CSGE) was used for the initial SOX2 and CHX10 screening of 34 affected individuals (two sets of siblings), five unaffected family members, and 80 healthy controls. Patient samples containing heteroduplexes were selected for sequence analysis. Base pair changes in SOX2 and CHX10 were confirmed by sequencing bidirectionally in patient samples. Results Two novel heterozygous mutations and two sequence variants (one known) in SOX2 were identified in this cohort. Mutation c.310 G>T (p. Glu104X), found in one patient, was in the region encoding the high mobility group (HMG) DNA-binding domain and resulted in a change from glutamic acid to a stop codon. The second mutation, noted in two affected siblings, was a single nucleotide deletion c.549delC (p. Pro184ArgfsX19) in the region encoding the activation domain, resulting in a frameshift and premature termination of the coding sequence. The shortened protein products may result in the loss of function. In addition, a novel nucleotide substitution c.*557G>A was identified in the 3′-untranslated region in one patient. The relationship between the nucleotide change and the protein function is indeterminate. A known single nucleotide polymorphism (c. *469 C>A, SNP rs11915160) was also detected in 2 of the 34 patients. Screening of CHX10 identified two synonymous sequence variants, c.471 C>T (p.Ser157Ser, rs35435463) and c.579 G>A (p. Gln193Gln, novel SNP), and one non-synonymous sequence variant, c.871 G>A (p. Asp291Asn, novel SNP). The non-synonymous polymorphism was also present in healthy controls, suggesting non-causality. 
Conclusions These results support the role of SOX2 in ocular development. Loss of SOX2 function results in severe eye malformation. CHX10 was not implicated in microphthalmia/anophthalmia in our patient cohort. PMID:18385794
Positional bias in variant calls against draft reference assemblies.

PubMed

Briskine, Roman V; Shimizu, Kentaro K

2017-03-28

Whole genome resequencing projects may implement variant calling using draft reference genomes assembled de novo from short-read libraries. Despite the lower quality of such assemblies, they have allowed researchers to extend a wide range of population genetic and genome-wide association analyses to non-model species. As variant calling pipelines are complex and involve many software packages, it is important to understand inherent biases and limitations at each step of the analysis. In this article, we report a positional bias present in variant calling performed against draft reference assemblies constructed from de Bruijn or string overlap graphs. We assessed how frequently variants appeared at each position counted from the ends of a contig or scaffold sequence, and discovered an unexpectedly high number of variants at the positions related to the length of either k-mers or reads used for the assembly. We detected the bias both in publicly available draft assemblies from the Assemblathon 2 competition and in the assemblies we generated from our simulated short-read data. Simulations confirmed that the bias-causing variants are predominantly false positives induced by reads from spatially distant repeated sequences. The bias is particularly strong in contig assemblies. Scaffolding does not eliminate the bias but tends to mitigate it because of the changes in variants' relative positions and alterations in read alignments. The bias can be effectively reduced by filtering out the variants that reside in repetitive elements. Draft genome sequences generated by several popular assemblers appear to be susceptible to the positional bias, potentially affecting many resequencing projects in non-model species. The bias is inherent to the assembly algorithms and arises from their particular handling of repeated sequences. It is recommended to reduce the bias by filtering, especially if a higher-quality genome assembly cannot be achieved.
Our findings can help other researchers to improve the quality of their variant data sets and reduce artefactual findings in downstream analyses.
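One way to look for the positional bias described in this abstract is to tabulate variants by their distance from the contig ends. The sketch below is a simplified illustration, not the authors' pipeline: `end_distance_histogram` and its inputs are hypothetical, and measuring distance to the nearest end (rather than to each end separately) is an assumption made here.

```python
from collections import Counter


def end_distance_histogram(contig_lengths, variants):
    """Count variants by 1-based distance from the nearest contig end.

    contig_lengths : dict mapping contig name -> length in bases
    variants       : iterable of (contig name, 1-based position)
    """
    histogram = Counter()
    for contig, pos in variants:
        length = contig_lengths[contig]
        if not 1 <= pos <= length:
            raise ValueError(f"{contig}:{pos} lies outside the contig")
        # Position 1 and position `length` both have distance 1.
        distance = min(pos, length - pos + 1)
        histogram[distance] += 1
    return histogram
```

A spike in the histogram at a distance close to the k-mer size or read length used for assembly would be the kind of signal the abstract reports.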
Non-coding variants contribute to the clinical heterogeneity of TTR amyloidosis.

PubMed

Iorio, Andrea; De Lillo, Antonella; De Angelis, Flavio; Di Girolamo, Marco; Luigetti, Marco; Sabatelli, Mario; Pradotto, Luca; Mauro, Alessandro; Mazzeo, Anna; Stancanelli, Claudia; Perfetto, Federico; Frusconi, Sabrina; My, Filomena; Manfellotto, Dario; Fuciarelli, Maria; Polimanti, Renato

2017-09-01

Coding mutations in TTR gene cause a rare hereditary form of systemic amyloidosis, which has a complex genotype-phenotype correlation. We investigated the role of non-coding variants in regulating TTR gene expression and consequently amyloidosis symptoms. We evaluated the genotype-phenotype correlation considering the clinical information of 129 Italian patients with TTR amyloidosis. Then, we conducted a re-sequencing of TTR gene to investigate how non-coding variants affect TTR expression and, consequently, phenotypic presentation in carriers of amyloidogenic mutations. Polygenic scores for genetically determined TTR expression were constructed using data from our re-sequencing analysis and the GTEx (Genotype-Tissue Expression) project. We confirmed a strong phenotypic heterogeneity across coding mutations causing TTR amyloidosis. Considering the effects of non-coding variants on TTR expression, we identified three patient clusters with specific expression patterns associated with certain phenotypic presentations, including late onset, autonomic neurological involvement, and gastrointestinal symptoms. This study provides novel data regarding the role of non-coding variation and the gene expression profiles in patients affected by TTR amyloidosis, also putting forth an approach that could be used to investigate the mechanisms at the basis of the genotype-phenotype correlation of the disease.
svviz: a read viewer for validating structural variants.

PubMed

Spies, Noah; Zook, Justin M; Salit, Marc; Sidow, Arend

2015-12-15

Visualizing read alignments is the most effective way to validate candidate structural variants (SVs) with existing data. We present svviz, a sequencing read visualizer for SVs that sorts and displays only reads relevant to a candidate SV. svviz works by searching the input BAM file(s) for potentially relevant reads, realigning them against the inferred sequence of the putative variant allele as well as the reference allele, and identifying reads that match one allele better than the other. Separate views of the two alleles are then displayed in a scrollable web browser view, enabling a more intuitive visualization of each allele than the single reference genome-based view common to most current read browsers. The browser view facilitates examining the evidence for or against a putative variant, estimating zygosity, visualizing affected genomic annotations and manually refining breakpoints. svviz supports data from most modern sequencing platforms. svviz is implemented in Python and freely available from http://svviz.github.io/. Published by Oxford University Press 2015. This work is written by US Government employees and is in the public domain in the US.
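The allele-assignment idea in this abstract (score each read against both alleles and keep the better match) can be sketched as follows. This is a simplified stand-in, not svviz's implementation: it replaces true realignment with ungapped match counting, and the function names, the `min_margin` threshold and the toy sequences are assumptions made for illustration.

```python
def best_ungapped_score(read, allele):
    """Highest number of matching bases over all ungapped placements of read in allele."""
    best = 0
    for start in range(len(allele) - len(read) + 1):
        window = allele[start:start + len(read)]
        matches = sum(a == b for a, b in zip(read, window))
        best = max(best, matches)
    return best

def assign_read(read, ref_allele, alt_allele, min_margin=2):
    """Label a read 'ref', 'alt' or 'ambiguous' by comparing its two scores."""
    ref_score = best_ungapped_score(read, ref_allele)
    alt_score = best_ungapped_score(read, alt_allele)
    if ref_score - alt_score >= min_margin:
        return "ref"
    if alt_score - ref_score >= min_margin:
        return "alt"
    return "ambiguous"

# Toy deletion (made-up sequences): the alt allele lacks the central G run.
left, deleted, right = "ACGTACGATC", "GGGGGGGG", "TTCAGTCAAC"
ref_allele = left + deleted + right
alt_allele = left + right

spanning_alt = assign_read("ACGATCTTCAGT", ref_allele, alt_allele)   # crosses the deletion breakpoint
spanning_ref = assign_read("GATCGGGGGGGG", ref_allele, alt_allele)   # enters the deleted segment
flank_only = assign_read("ACGTACGATC", ref_allele, alt_allele)       # lies entirely in the left flank
```

Reads that lie entirely in a flank score equally on both alleles and are therefore uninformative, which is why a margin is required before a read is counted as evidence for either allele.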
MACARON: A python framework to identify and re-annotate multi-base affected codons in whole genome/exome sequence data.

PubMed

Khan, Waqasuddin; Saripella, Ganapathi Varma-; Ludwig, Thomas; Cuppens, Tania; Thibord, Florian; Génin, Emmanuelle; Deleuze, Jean-Francois; Trégouët, David-Alexandre

2018-05-03

Predicted deleteriousness of coding variants is a frequently used criterion to filter out variants detected in next-generation sequencing projects and to select candidates impacting the risk of human diseases. Most available dedicated tools implement a base-to-base annotation approach that could be biased in the presence of several variants in the same genetic codon. We propose here the MACARON program that, from a standard VCF file, identifies, re-annotates and predicts the amino acid change resulting from multiple single nucleotide variants (SNVs) within the same genetic codon. Applied to the whole exome dataset of 573 individuals, MACARON identifies 114 situations where multiple SNVs within a genetic codon induce an amino acid change that is different from that predicted by a standard single-SNV annotation tool. Such events are not uncommon and deserve to be studied in sequencing projects with inconclusive findings. MACARON is written in Python with code available on the GENMED website (www.genmed.fr). Contact: david-alexandre.tregouet@inserm.fr. Supplementary data are available at Bioinformatics online.
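The situation this abstract describes, where annotating each SNV separately gives a different amino acid than annotating both together, can be shown with a small worked example. The sketch below is not MACARON's code: the helper names and the example codon are hypothetical, and it assumes the SNVs lie on the same haplotype (in cis) and uses the standard genetic code.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def apply_snvs(codon, snvs):
    """Return the codon after applying (offset, alt_base) substitutions, offset in 0..2."""
    bases = list(codon)
    for offset, alt in snvs:
        bases[offset] = alt
    return "".join(bases)

def annotate(codon, snvs):
    """Return (reference aa, aa predicted for each SNV alone, aa for all SNVs together)."""
    single = [CODON_TABLE[apply_snvs(codon, [snv])] for snv in snvs]
    joint = CODON_TABLE[apply_snvs(codon, snvs)]
    return CODON_TABLE[codon], single, joint

# Hypothetical example: reference codon GAA (Glu) carrying two SNVs.
result = annotate("GAA", [(0, "A"), (1, "G")])
```

Here the first SNV alone gives AAA (Lys) and the second alone gives GGA (Gly), while both together give AGA (Arg), so a base-by-base annotation would report neither the correct change nor the correct codon.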
A dominant variant in the PDE1C gene is associated with nonsyndromic hearing loss.

PubMed

Wang, Li; Feng, Yong; Yan, Denise; Qin, Litao; Grati, M'hamed; Mittal, Rahul; Li, Tao; Sundhari, Abhiraami Kannan; Liu, Yalan; Chapagain, Prem; Blanton, Susan H; Liao, Shixiu; Liu, Xuezhong

2018-06-02

Identification of genes with variants causing non-syndromic hearing loss (NSHL) is challenging due to genetic heterogeneity. The difficulty is compounded by technical limitations that in the past prevented comprehensive gene identification. Recent advances in technology, using targeted capture and next-generation sequencing (NGS), are changing the face of gene identification and making it possible to rapidly and cost-effectively sequence the whole human exome. Here, we characterize a five-generation Chinese family with progressive, postlingual autosomal dominant nonsyndromic hearing loss (ADNSHL). By combining population-specific mutation arrays, a targeted deafness gene panel, and whole exome sequencing (WES), we identified PDE1C (Phosphodiesterase 1C) c.958G>T (p.A320S) as the disease-associated variant. Structural modeling insights into p.A320S strongly suggest that the sequence alteration will likely affect the substrate-binding pocket of PDE1C. By whole-mount immunofluorescence on postnatal day 3 mouse cochlea, we show its expression in the cytosol of outer (OHC) and inner (IHC) hair cells, co-localizing with Lamp-1 in lysosomes. Furthermore, we provide evidence that the variant alters the PDE1C hydrolytic activity for both cyclic adenosine monophosphate (cAMP) and cyclic guanosine monophosphate (cGMP). Collectively, our findings indicate that the c.958G>T variant in PDE1C may disrupt the cross talk between cGMP-signaling and cAMP pathways in Ca2+ homeostasis.
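As a small worked illustration of how the coding position and the protein position in this record relate, the sketch below converts a coding-DNA position into a codon number and a position within that codon. It assumes HGVS-style numbering in which c.1 is the A of the ATG start codon and the position lies within the coding sequence; the function name is hypothetical.

```python
def codon_position(cdna_pos):
    """Map a 1-based coding-DNA position to (codon number, base position within codon)."""
    codon_number = (cdna_pos - 1) // 3 + 1
    base_in_codon = (cdna_pos - 1) % 3 + 1
    return codon_number, base_in_codon

# c.958 from the abstract above.
pde1c = codon_position(958)
```

For c.958 this gives codon 320, first base, which agrees with the reported p.A320S: a G>T change at the first base of an alanine codon (GCN) yields a serine codon (TCN).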
A Nonsense Variant in the ACADVL Gene in German Hunting Terriers with Exercise Induced Metabolic Myopathy.

PubMed

Lepori, Vincent; Mühlhause, Franziska; Sewell, Adrian C; Jagannathan, Vidhya; Janzen, Nils; Rosati, Marco; Alves de Sousa, Filipe Miguel Maximiano; Tschopp, Aurélie; Schüpbach, Gertraud; Matiasek, Kaspar; Tipold, Andrea; Leeb, Tosso; Kornberg, Marion

2018-05-04

Several enzymes are involved in fatty acid oxidation, which is a key process in mitochondrial energy production. Inherited defects affecting any step of fatty acid oxidation can result in clinical disease. We present here an extended family of German Hunting Terriers with 10 dogs affected by clinical signs of exercise induced weakness, muscle pain, and suspected rhabdomyolysis. The combination of clinical signs, muscle histopathology and acylcarnitine analysis with an elevated tetradecenoylcarnitine (C14:1) peak suggested a possible diagnosis of acyl-CoA dehydrogenase very long chain deficiency (ACADVLD). Whole genome sequence analysis of one affected dog and 191 controls revealed a nonsense variant in the ACADVL gene encoding acyl-CoA dehydrogenase very long chain, c.1728C>A or p.(Tyr576*). The variant showed perfect association with the phenotype in the 10 affected and more than 500 control dogs of various breeds. Pathogenic variants in the ACADVL gene have been reported in humans with similar myopathic phenotypes. We therefore considered the detected variant to be the most likely candidate causative variant for the observed exercise induced myopathy. To our knowledge, this is the first description of this disease in dogs, which we propose to name exercise induced metabolic myopathy (EIMM), and the identification of the first canine pathogenic ACADVL variant. Our findings provide a large animal model for a known human disease and will enable genetic testing to avoid the unintentional breeding of affected offspring. Copyright © 2018 Lepori et al.
Exome sequencing is an efficient tool for variant late-infantile neuronal ceroid lipofuscinosis molecular diagnosis.

PubMed

Patiño, Liliana Catherine; Battu, Rajani; Ortega-Recalde, Oscar; Nallathambi, Jeyabalan; Anandula, Venkata Ramana; Renukaradhya, Umashankar; Laissue, Paul

2014-01-01

The neuronal ceroid-lipofuscinoses (NCL) are a group of neurodegenerative disorders characterized by epilepsy, visual failure, progressive mental and motor deterioration, myoclonus, dementia and reduced life expectancy. Classically, NCL-affected individuals have been classified into six categories, which have been mainly defined by the clinical onset of symptoms. However, some patients cannot be easily included in a specific group because of significant variation in the age of onset and disease progression. Molecular genetics has emerged in recent years as a useful tool for enhancing NCL subtype classification. Fourteen NCL genetic forms (CLN1 to CLN14) have been described to date. The variant late-infantile form of the disease has been linked to CLN5, CLN6, CLN7 (MFSD8) and CLN8 mutations. Despite advances in the diagnosis of neurodegenerative disorders, mutations in these genes may cause similar phenotypes, which makes accurate candidate gene selection for direct sequencing difficult. Three siblings who were affected by variant late-infantile NCL are reported in the present study. We used whole-exome sequencing, direct sequencing and in silico approaches to identify the molecular basis of the disease. We identified the novel c.1219T>C (p.Trp407Arg) and c.1361T>C (p.Met454Thr) MFSD8 pathogenic mutations. Our results highlight next generation sequencing as a novel and powerful methodological approach for the rapid determination of the molecular diagnosis of NCL. They also provide information regarding the phenotypic and molecular spectrum of CLN7 disease.
Exome sequencing identifies a novel FOXP3 mutation in a 2-generation family with inflammatory bowel disease.

PubMed

Okou, David T; Mondal, Kajari; Faubion, William A; Kobrynski, Lisa J; Denson, Lee A; Mulle, Jennifer G; Ramachandran, Dhanya; Xiong, Yuning; Svingen, Phyllis; Patel, Viren; Bose, Promita; Waters, Jon P; Prahalad, Sampath; Cutler, David J; Zwick, Michael E; Kugathasan, Subra

2014-05-01

Inflammatory bowel disease (IBD) is heritable, but a total of 163 variants commonly implicated in IBD pathogenesis account for only 25% of the heritability. Rare, highly penetrant genetic variants may also explain mendelian forms of IBD and some of the missing heritability. To test the hypothesis that rare loss-of-function mutations can be causative, we performed whole exome sequencing (WES) on 5 members of a 2-generation family of European ancestry presenting with an early-onset and atypical form of IBD. WES was performed for all of the 5 family members; the mother and 3 male offspring were affected, whereas the father was unaffected. Mapping, annotation, and filtering criteria were used to reduce candidate variants. For functional testing we performed forkhead box P3 (FOXP3) staining and a T-cell suppression assay. We identified a novel missense variant in exon 6 of the X-linked FOXP3 gene. The c.694A>C substitution in FOXP3 results in a cysteine-to-glycine change at the protein position 232 that is completely conserved among all vertebrates. This variant (heterozygous in the mother and hemizygous in all 3 affected sons) did not impair FOXP3 protein expression, but significantly reduced the ability of the host's T regulatory cells to suppress an inappropriate autoimmune response. The variant results in a milder immune dysregulation, polyendocrinopathy, enteropathy, and X-linked phenotype with early-onset IBD. Our study illustrates the successful application of WES for making a definitive molecular diagnosis in a case of multiply affected families, with atypical IBD-like phenotype. Our results also have important implications for disease biology and disease-directed therapeutic development.
The phenotypic spectrum of ARHGEF9 includes intellectual disability, focal epilepsy and febrile seizures.

PubMed

Klein, Karl Martin; Pendziwiat, Manuela; Eilam, Anda; Gilad, Ronit; Blatt, Ilan; Rosenow, Felix; Kanaan, Moien; Helbig, Ingo; Afawi, Zaid

2017-07-01

Mutations or structural genomic alterations of the X-chromosomal gene ARHGEF9 have been described in male and female patients with intellectual disability. Hyperekplexia and epilepsy were observed to a variable degree, but incompletely described. Here, we expand the phenotypic spectrum of ARHGEF9 by describing a large Ethiopian-Jewish family with epilepsy and intellectual disability. The four affected male siblings, their unaffected parents and two unaffected female siblings were recruited and phenotyped. Parametric linkage analysis was performed using SNP microarrays. Variants from exome sequencing in two affected individuals were confirmed by Sanger sequencing. All affected male siblings had febrile seizures from age 2-3 years and intellectual disability. Three developed afebrile seizures between age 7-17 years. Three showed focal seizure semiology. None had hyperekplexia. A novel ARHGEF9 variant (c.967G>A, p.G323R, NM_015185.2) was hemizygous in all affected male siblings and heterozygous in the mother. This family reveals that the phenotypic spectrum of ARHGEF9 is broader than commonly assumed and includes febrile seizures and focal epilepsy with intellectual disability in the absence of hyperekplexia or other clinically distinguishing features. Our findings suggest that pathogenic variants in ARHGEF9 may be more common than previously assumed in patients with intellectual disability and mild epilepsy.
Identification of pathogenic gene variants in small families with intellectually disabled siblings by exome sequencing.

PubMed

Schuurs-Hoeijmakers, Janneke H M; Vulto-van Silfhout, Anneke T; Vissers, Lisenka E L M; van de Vondervoort, Ilse I G M; van Bon, Bregje W M; de Ligt, Joep; Gilissen, Christian; Hehir-Kwa, Jayne Y; Neveling, Kornelia; del Rosario, Marisol; Hira, Gausiya; Reitano, Santina; Vitello, Aurelio; Failla, Pinella; Greco, Donatella; Fichera, Marco; Galesi, Ornella; Kleefstra, Tjitske; Greally, Marie T; Ockeloen, Charlotte W; Willemsen, Marjolein H; Bongers, Ernie M H F; Janssen, Irene M; Pfundt, Rolph; Veltman, Joris A; Romano, Corrado; Willemsen, Michèl A; van Bokhoven, Hans; Brunner, Han G; de Vries, Bert B A; de Brouwer, Arjan P M

2013-12-01

Intellectual disability (ID) is a common neurodevelopmental disorder affecting 1-3% of the general population. Mutations in more than 10% of all human genes are considered to be involved in this disorder, although the majority of these genes are still unknown. We investigated 19 small non-consanguineous families with two to five affected siblings in order to identify pathogenic gene variants in known, novel and potential ID candidate genes. Non-consanguineous families have been largely ignored in gene identification studies as small family size precludes prior mapping of the genetic defect. Using exome sequencing, we identified pathogenic mutations in three genes, DDHD2, SLC6A8, and SLC9A6, of which the latter two have previously been implicated in X-linked ID phenotypes. In addition, we identified potentially pathogenic mutations in BCORL1 on the X-chromosome and in MCM3AP, PTPRT, SYNE1, and ZNF528 on autosomes. We show that potentially pathogenic gene variants can be identified in small, non-consanguineous families with as few as two affected siblings, thus emphasising their value in the identification of syndromic and non-syndromic ID genes.
HBS1L-MYB intergenic variants modulate fetal hemoglobin via long-range MYB enhancers

PubMed Central

Stadhouders, Ralph; Aktuna, Suleyman; Thongjuea, Supat; Aghajanirefah, Ali; Pourfarzad, Farzin; van IJcken, Wilfred; Lenhard, Boris; Rooks, Helen; Best, Steve; Menzel, Stephan; Grosveld, Frank; Thein, Swee Lay; Soler, Eric

2014-01-01

Genetic studies have identified common variants within the intergenic region (HBS1L-MYB) between GTP-binding elongation factor HBS1L and myeloblastosis oncogene MYB on chromosome 6q that are associated with elevated fetal hemoglobin (HbF) levels and alterations of other clinically important human erythroid traits. It is unclear how these noncoding sequence variants affect multiple erythrocyte characteristics. Here, we determined that several HBS1L-MYB intergenic variants affect regulatory elements that are occupied by key erythroid transcription factors within this region. These elements interact with MYB, a critical regulator of erythroid development and HbF levels. We found that several HBS1L-MYB intergenic variants reduce transcription factor binding, affecting long-range interactions with MYB and MYB expression levels. These data provide a functional explanation for the genetic association of HBS1L-MYB intergenic polymorphisms with human erythroid traits and HbF levels. Our results further designate MYB as a target for therapeutic induction of HbF to ameliorate sickle cell and β-thalassemia disease severity. PMID:24614105
Identification of a novel aviadenovirus, designated pigeon adenovirus 2 in domestic pigeons (Columba livia).

PubMed

Teske, L; Rubbenstroth, D; Meixner, M; Liere, K; Bartels, H; Rautenschlein, S

2017-01-02

The young pigeon disease syndrome (YPDS) affects mainly young pigeons of less than one year of age and leads to crop stasis, vomitus, diarrhea, anorexia and occasionally death. This disease is internationally a major health problem because of its seasonal appearance during competitions such as homing pigeon races or exhibitions of ornamental birds. While the etiology of YPDS is still unclear, adenoviruses are frequently discussed as potential causative agents. Electron microscopy of feces from a YPDS outbreak revealed massive shedding of adenovirus-like particles. Whole genome sequencing of this sample identified a novel adenovirus tentatively named pigeon adenovirus 2 (PiAdV-2). Phylogenetic and comparative genome analyses suggest that PiAdV-2 belongs to a new species within the genus Aviadenovirus, for which we propose the name Pigeon aviadenovirus B. The PiAdV-2 genome shares 54.9% nucleotide sequence identity with pigeon adenovirus 1 (PiAdV-1). In a screening of further YPDS-affected flocks, two variants of PiAdV-2 (variant A and B) were detected which shared 97.6% nucleotide identity of partial polymerase sequences, but only 79.7% nucleotide identity of partial hexon sequences. The distribution of both PiAdV-2 variants was further investigated in fecal samples collected between 2008 and 2015 from healthy or YPDS-affected racing pigeons of different lofts. Independent of their health status, approximately 20% of young and 13% of adult pigeon flocks harbored PiAdV-2 variants. Birds were free of PiAdV-1 or other aviadenoviruses as determined by PCRs targeting the aviadenovirus polymerase or the PiAdV-1 fiber gene, respectively. In conclusion, there is no indication of a correlation between YPDS outbreaks and the presence of PiAdV-2 or other aviadenoviruses, arguing against a causative role in this disease complex. Copyright © 2016 Elsevier B.V. All rights reserved.
Molecular and geographic analyses of vampire bat-transmitted cattle rabies in central Brazil

PubMed Central

Kobayashi, Yuki; Sato, Go; Mochizuki, Nobuyuki; Hirano, Shinji; Itou, Takuya; Carvalho, Adolorata AB; Albas, Avelino; Santos, Hamilton P; Ito, Fumio H; Sakai, Takeo

2008-01-01

Background: Vampire bats are important rabies virus vectors, causing critical problems in both the livestock industry and public health sector in Latin America. In order to assess the epidemiological characteristics of vampire bat-transmitted rabies, the authors conducted phylogenetic and geographical analyses using sequence data of a large number of cattle rabies isolates collected from a wide geographical area in Brazil. Methods: Partial nucleoprotein genes of rabies viruses isolated from 666 cattle and 18 vampire bats between 1987 and 2006 were sequenced and used for phylogenetic analysis. The genetic variants were plotted on topographical maps of Brazil. Results: In this study, 593 samples consisting of 24 genetic variants were analyzed. Regional localization of variants was observed, with the distribution of several variants found to be delimited by mountain ranges which served as geographic boundaries. The geographical distributions of vampire-bat and cattle isolates that were classified as the identical phylogenetic group were found to overlap with high certainty. Most of the samples analyzed in this study were isolated from adjacent areas linked by rivers. Conclusion: This study revealed the existence of several dozen regional variants associated with vampire bats in Brazil, with the distribution patterns of these variants found to be affected by mountain ranges and rivers. These results suggest that epidemiological characteristics of vampire bat-related rabies appear to be associated with the topographical and geographical characteristics of areas where cattle are maintained, and the factors affecting vampire bat ecology. PMID:18983685
Application of Whole Exome Sequencing in Six Families with an Initial Diagnosis of Autosomal Dominant Retinitis Pigmentosa: Lessons Learned

PubMed Central

Fernandez-San Jose, Patricia; Liu, Yichuan; March, Michael; Pellegrino, Renata; Golhar, Ryan; Corton, Marta; Blanco-Kelly, Fiona; López-Molina, Maria Isabel; García-Sandoval, Blanca; Guo, Yiran; Tian, Lifeng; Liu, Xuanzhu; Guan, Liping; Zhang, Jianguo; Keating, Brendan; Xu, Xun

2015-01-01

This study aimed to identify the genetics underlying dominant forms of inherited retinal dystrophies using whole exome sequencing (WES) in six families extensively screened for known mutations or genes. Thirty-eight individuals were subjected to WES. Causative variants were searched among single nucleotide variants (SNVs) and insertion/deletion variants (indels) and whenever no potential candidate emerged, copy number variant (CNV) analysis was performed. Variants or regions harboring a candidate variant were prioritized and segregation of the variant with the disease was further assessed using Sanger sequencing in case of SNVs and indels, and quantitative PCR (qPCR) for CNVs. SNV and indel analysis led to the identification of a previously reported mutation in PRPH2. Two additional mutations linked to different forms of retinal dystrophies were identified in two families: a known frameshift deletion in RPGR, a gene responsible for X-linked retinitis pigmentosa and p.Ser163Arg in C1QTNF5 associated with Late-Onset Retinal Degeneration. A novel heterozygous deletion spanning the entire region of PRPF31 was also identified in the affected members of a fourth family, which was confirmed with qPCR. This study allowed the identification of the genetic cause of the retinal dystrophy and the establishment of a correct diagnosis in four families, including a large heterozygous deletion in PRPF31, typically considered one of the pitfalls of this method. Since all findings in this study are restricted to known genes, we propose that targeted sequencing using gene-panel is an optimal first approach for the genetic screening and that once known genetic causes are ruled out, WES might be used to uncover new genes involved in inherited retinal dystrophies. PMID:26197217
Rare variants in FBN1 and FBN2 are associated with severe adolescent idiopathic scoliosis

PubMed Central

Buchan, Jillian G.; Alvarado, David M.; Haller, Gabe E.; Cruchaga, Carlos; Harms, Matthew B.; Zhang, Tianxiao; Willing, Marcia C.; Grange, Dorothy K.; Braverman, Alan C.; Miller, Nancy H.; Morcuende, Jose A.; Tang, Nelson Leung-Sang; Lam, Tsz-Ping; Ng, Bobby Kin-Wah; Cheng, Jack Chun-Yiu; Dobbs, Matthew B.; Gurnett, Christina A.

2014-01-01

Adolescent idiopathic scoliosis (AIS) causes spinal deformity in 3% of children. Despite a strong genetic basis, few genes have been associated with AIS and the pathogenesis remains poorly understood. In a genome-wide rare variant burden analysis using exome sequence data, we identified fibrillin-1 (FBN1) as the most significantly associated gene with AIS. Based on these results, FBN1 and a related gene, fibrillin-2 (FBN2), were sequenced in a total of 852 AIS cases and 669 controls. In individuals of European ancestry, rare variants in FBN1 and FBN2 were enriched in severely affected AIS cases (7.6%) compared with in-house controls (2.4%) (OR = 3.5, P = 5.46 × 10^-4) and Exome Sequencing Project controls (2.3%) (OR = 3.5, P = 1.48 × 10^-6). Scoliosis severity in AIS cases was associated with FBN1 and FBN2 rare variants (P = 0.0012) and replicated in an independent Han Chinese cohort (P = 0.0376), suggesting that rare variants may be useful as predictors of curve progression. Clinical evaluations revealed that the majority of AIS cases with rare FBN1 variants do not meet diagnostic criteria for Marfan syndrome, though variants are associated with tall stature (P = 0.0035) and upregulation of the transforming growth factor beta pathway. Overall, these results expand our definition of fibrillin-related disorders to include AIS and open up new strategies for diagnosing and treating severe AIS. PMID:24833718

mirVAFC: A Web Server for Prioritizations of Pathogenic Sequence Variants from Exome Sequencing Data via Classifications.

PubMed

Li, Zhongshan; Liu, Zhenwei; Jiang, Yi; Chen, Denghui; Ran, Xia; Sun, Zhong Sheng; Wu, Jinyu

2017-01-01

Exome sequencing has been widely used to identify the genetic variants underlying human genetic disorders for clinical diagnoses, but the identification of pathogenic sequence variants among the huge numbers of benign ones is complicated and challenging. Here, we describe a new Web server named mirVAFC for prioritization of pathogenic sequence variants from clinical exome sequencing (CES) variant data of a single individual or family. mirVAFC is able to comprehensively annotate sequence variants, filter out most irrelevant variants using custom criteria, classify variants into different categories according to estimated pathogenicity, and lastly prioritize pathogenic variants based on classifications and mutation effects. Case studies using different types of datasets for different diseases from publications and our in-house data have revealed that mirVAFC can efficiently identify the right pathogenic candidates, as in the original work in each case. Overall, the Web server mirVAFC is specifically developed for pathogenic sequence variant identification from family-based CES variants using classification-based prioritization. The mirVAFC Web server is freely accessible at https://www.wzgenomics.cn/mirVAFC/. © 2016 WILEY PERIODICALS, INC.
Whole-exome sequencing identifies novel candidate predisposition genes for familial polycythemia vera.

PubMed

Hirvonen, Elina A M; Pitkänen, Esa; Hemminki, Kari; Aaltonen, Lauri A; Kilpivaara, Outi

2017-04-20

Polycythemia vera (PV), characterized by massive production of erythrocytes, is one of the myeloproliferative neoplasms. Most patients carry a somatic gain-of-function mutation in JAK2, c.1849G>T (p.Val617Phe), leading to constitutive activation of the JAK-STAT signaling pathway. Familial clustering is also observed occasionally, but high-penetrance predisposition genes for PV have remained unidentified. We studied the predisposition to PV by exome sequencing (three cases) in a Finnish PV family with four patients. The 12 shared variants (maximum allowed minor allele frequency <0.001 in the Finnish population in the ExAC database) that were predicted damaging in silico and absent in an additional control set of over 500 Finns were further validated by Sanger sequencing in a fourth affected family member. Three novel predisposition candidate variants were identified: c.1254C>G (p.Phe418Leu) in ZXDC, c.1931C>G (p.Pro644Arg) in ATN1, and c.701G>A (p.Arg234Gln) in LRRC3. We also observed a rare, predicted benign germline variant c.2912C>G (p.Ala971Gly) in BCORL1 in all four patients. Somatic mutations in BCORL1 have been reported in myeloid malignancies. We further screened the variants in eight PV patients in six other Finnish families, but no other carriers were found. Exome sequencing provides a powerful tool for identifying novel variants and understanding familial predisposition to diseases. This is the first report on Finnish familial PV cases, and we identified three novel candidate variants that may predispose to the disease.
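The filtering logic summarized in this abstract (variants shared by all sequenced cases, minor allele frequency below 0.001, predicted damaging, absent from controls) can be sketched as a simple set operation. This is an illustration only, not the authors' pipeline: the function name, the annotation fields and the variant labels v1 to v5 are made-up placeholders.

```python
def shared_rare_candidates(case_variants, annotations, control_variants, max_maf=0.001):
    """Return variants carried by every case that are rare, damaging and absent in controls."""
    shared = set.intersection(*case_variants)
    candidates = []
    for variant in sorted(shared):
        info = annotations[variant]
        is_rare = info["maf"] < max_maf
        if is_rare and info["damaging"] and variant not in control_variants:
            candidates.append(variant)
    return candidates

# Toy data (hypothetical): three sequenced cases and per-variant annotations.
case_variants = [
    {"v1", "v2", "v3", "v4"},
    {"v1", "v2", "v3", "v5"},
    {"v1", "v2", "v3"},
]
annotations = {
    "v1": {"maf": 0.0002, "damaging": True},
    "v2": {"maf": 0.05, "damaging": True},     # too common
    "v3": {"maf": 0.0001, "damaging": True},   # rare, but seen in controls
    "v4": {"maf": 0.0003, "damaging": True},   # not shared by all cases
    "v5": {"maf": 0.0001, "damaging": False},  # not shared by all cases
}
control_variants = {"v3"}

candidates = shared_rare_candidates(case_variants, annotations, control_variants)
```

In this toy example only v1 survives all filters. In the study, variants that passed the equivalent filters were then checked by Sanger sequencing in a fourth affected family member, a step this sketch does not model.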
Novel mutations in LRP6 highlight the role of WNT signaling in tooth agenesis

PubMed Central

Ludwig, Kerstin U.; Sullivan, Robert; van Rooij, Iris A.L.M.; Thonissen, Michelle; Swinnen, Steven; Phan, Milien; Conte, Federica; Ishorst, Nina; Gilissen, Christian; RoaFuentes, Laury; van de Vorst, Maartje; Henkes, Arjen; Steehouwer, Marloes; van Beusekom, Ellen; Bloemen, Marjon; Vankeirsbilck, Bruno; Bergé, Stefaan; Hens, Greet; Schoenaers, Joseph; Poorten, Vincent Vander; Roosenboom, Jasmien; Verdonck, An; Devriendt, Koen; Roeleveldt, Nel; Jhangiani, Shalini N.; Vissers, Lisenka E.L.M.; Lupski, James R.; de Ligt, Joep; Von den Hoff, Johannes W.; Pfundt, Rolph; Brunner, Han G.; Zhou, Huiqing; Dixon, Jill; Mangold, Elisabeth; van Bokhoven, Hans; Dixon, Michael J.; Kleefstra, Tjitske

2016-01-01

Purpose: Here we aimed to identify a novel genetic cause of tooth agenesis (TA) and/or orofacial clefting (OFC) by combining whole exome sequencing (WES) and targeted re-sequencing in a large cohort of TA and OFC patients. Methods: WES was performed in two unrelated patients, one with severe TA and OFC and another with severe TA only. After identifying deleterious mutations in a gene encoding the low density lipoprotein receptor-related protein 6 (LRP6), all its exons were re-sequenced with molecular inversion probes in 67 patients with TA, 1,072 patients with OFC and 706 controls. Results: We identified a frameshift (c.4594delG, p.Cys1532fs) and a canonical splice site mutation (c.3398-2A>C, p.?) in LRP6, in the patient with TA and OFC and in the patient with severe TA only, respectively. The targeted re-sequencing showed significant enrichment of unique LRP6 variants in TA patients, but not in nonsyndromic OFC. Of the 5 variants in patients with TA, 2 affect the canonical splice site and 3 were missense variants; all variants segregated with the dominant phenotype and in 1 case the missense mutation occurred de novo. Conclusion: Mutations in LRP6 cause tooth agenesis in man. PMID:26963285
Next-generation sequencing reveals a novel NDP gene mutation in a Chinese family with Norrie disease.

PubMed

Huang, Xiaoyan; Tian, Mao; Li, Jiankang; Cui, Ling; Li, Min; Zhang, Jianguo

2017-11-01

Norrie disease (ND) is a rare X-linked genetic disorder, the main symptoms of which are congenital blindness and white pupils. It has been reported that ND is caused by mutations in the NDP gene. Although many mutations in NDP have been reported, the genetic cause for many patients remains unknown. In this study, the aim is to investigate the genetic defect in a five-generation family with typical symptoms of ND. To identify the causative gene, next-generation sequencing based target capture sequencing was performed. Segregation analysis of the candidate variant was performed in additional family members using Sanger sequencing. We identified a novel missense variant (c.314C>A) located within the NDP gene. The mutation cosegregated within all affected individuals in the family and was not found in unaffected members. By happenstance, in this family, we also detected a known pathogenic variant of retinitis pigmentosa in a healthy individual. c.314C>A mutation of NDP gene is a novel mutation and broadens the genetic spectrum of ND.
Next-generation sequencing reveals a novel NDP gene mutation in a Chinese family with Norrie disease

PubMed Central

Huang, Xiaoyan; Tian, Mao; Li, Jiankang; Cui, Ling; Li, Min; Zhang, Jianguo

2017-01-01

Purpose: Norrie disease (ND) is a rare X-linked genetic disorder, the main symptoms of which are congenital blindness and white pupils. It has been reported that ND is caused by mutations in the NDP gene. Although many mutations in NDP have been reported, the genetic cause for many patients remains unknown. In this study, the aim is to investigate the genetic defect in a five-generation family with typical symptoms of ND. Methods: To identify the causative gene, next-generation sequencing based target capture sequencing was performed. Segregation analysis of the candidate variant was performed in additional family members using Sanger sequencing. Results: We identified a novel missense variant (c.314C>A) located within the NDP gene. The mutation cosegregated within all affected individuals in the family and was not found in unaffected members. By happenstance, in this family, we also detected a known pathogenic variant of retinitis pigmentosa in a healthy individual. Conclusion: c.314C>A mutation of NDP gene is a novel mutation and broadens the genetic spectrum of ND. PMID:29133643
Rapid differentiation of citrus Hop stunt viroid variants by real-time RT-PCR and high resolution melting analysis.

PubMed

Loconsole, Giuliana; Onelge, Nuket; Yokomi, Raymond K; Kubaa, Raied Abou; Savino, Vito; Saponari, Maria

2013-01-01

The RNA genomes of pathogenic and non-pathogenic variants of citrus Hop stunt viroid (HSVd) differ by five to six nucleotides located within the variable (V) domain referred to as the "cachexia expression motif". Sensitive hosts such as mandarin and its hybrids are seriously affected by cachexia disease. Current methods to differentiate HSVd variants rely on lengthy greenhouse biological indexing on Parson's Special mandarin and/or direct nucleotide sequence analysis of amplicons from RT-PCR of HSVd-infected plants. Two independent high throughput assays to segregate HSVd variants by real-time RT-PCR and High-Resolution Melting Temperature (HRM) analysis were developed: one based on EVAGreen dye; the other based on TaqMan probes. Primers for both assays targeted three differentiating nucleotides in the V domain which separated HSVd variants into three clusters by distinct melting temperatures with a confidence level higher than 98%. The accuracy of the HRM assays was validated by nucleotide sequencing of representative samples within each HRM cluster and by testing 45 HSVd-infected field trees from California, Italy, Spain, Syria and Turkey. To our knowledge, this is the first report of a rapid and sensitive approach to detect and differentiate HSVd variants associated with different biological behaviors. Although HSVd is found in several crops including citrus, cachexia variants are restricted to some citrus-growing areas, particularly the Mediterranean Region. Rapid diagnosis for cachexia and non-cachexia variants is, thus, important for the management of HSVd in citrus and reduces the need for bioindexing and sequencing analysis. Copyright © 2013 Elsevier Ltd. All rights reserved.
Whole exome sequencing using Ion Proton system enables reliable genetic diagnosis of inherited retinal dystrophies

PubMed Central

Riera, Marina; Navarro, Rafael; Ruiz-Nogales, Sheila; Méndez, Pilar; Burés-Jelstrup, Anniken; Corcóstegui, Borja; Pomares, Esther

2017-01-01

Inherited retinal dystrophies (IRD) comprise a wide group of clinically and genetically complex diseases that progressively affect the retina. Over recent years, the development of next-generation sequencing (NGS) methods has transformed our ability to diagnose heterogeneous diseases. In this work, we have evaluated the implementation of whole exome sequencing (WES) for the molecular diagnosis of IRD. Using the Ion Proton™ system, we simultaneously analyzed 212 genes that are responsible for more than 25 syndromic and non-syndromic IRD. This approach was used to evaluate 59 unrelated families, with the pathogenic variant(s) successfully identified in 71.18% of cases. Interestingly, the mutation detection rate varied substantially depending on the IRD subtype. Overall, we found 63 different mutations (21 novel) in 29 distinct genes, and performed in vivo functional studies to determine the deleterious impact of variants identified in MERTK, CDH23, and RPGRIP1. In addition, we provide evidence that supports CDHR1 as a gene responsible for autosomal recessive retinitis pigmentosa with early macular affectation, and present data regarding the disease mechanism of this gene. Altogether, these results demonstrate that targeted WES of all IRD genes is a reliable, hypothesis-free approach, and a cost- and time-effective strategy for the routine genetic diagnosis of retinal dystrophies. PMID:28181551
Gitelman syndrome in a South African family presenting with hypokalaemia and unusual food cravings.

PubMed

van der Merwe, Pieter Du Toit; Rensburg, Megan A; Haylett, William L; Bardien, Soraya; Davids, M Razeen

2017-01-26

Gitelman syndrome (GS) is an autosomal recessive renal tubular disorder characterised by renal salt wasting with hypokalaemia, metabolic alkalosis, hypomagnesaemia and hypocalciuria. It is caused by mutations in SLC12A3 encoding the sodium-chloride cotransporter on the apical membrane of the distal convoluted tubule. We report a South African family with five affected individuals presenting with hypokalaemia and unusual food cravings. The affected individuals and two unaffected first degree relatives were enrolled into the study. Phenotypes were evaluated through history, physical examination and biochemical analysis of blood and urine. Mutation screening was performed by sequencing of SLC12A3, and determining the allele frequencies of the sequence variants found in this family in 117 ethnically matched controls. The index patient, her sister, father and two aunts had a history of severe salt cravings, fatigue and tetanic episodes, leading to consumption of large quantities of salt and vinegar. All affected individuals demonstrated hypokalaemia with renal potassium wasting. Genetic analysis revealed that the pseudo-dominant pattern of inheritance was due to compound heterozygosity with two novel mutations: a S546G substitution in exon 13, and insertion of AGCCCC at c.1930 in exon 16. These variants were present in the five affected individuals, but only one variant each in the unaffected family members. Neither variant was found in any of the controls. The diagnosis of GS was established in five members of a South African family through clinical assessment, biochemical analysis and mutation screening of the SLC12A3 gene, which identified two novel putative pathogenic mutations.
Epigenetic and genetic components of height regulation.

PubMed

Benonisdottir, Stefania; Oddsson, Asmundur; Helgason, Agnar; Kristjansson, Ragnar P; Sveinbjornsson, Gardar; Oskarsdottir, Arna; Thorleifsson, Gudmar; Davidsson, Olafur B; Arnadottir, Gudny A; Sulem, Gerald; Jensson, Brynjar O; Holm, Hilma; Alexandersson, Kristjan F; Tryggvadottir, Laufey; Walters, G Bragi; Gudjonsson, Sigurjon A; Ward, Lucas D; Sigurdsson, Jon K; Iordache, Paul D; Frigge, Michael L; Rafnar, Thorunn; Kong, Augustine; Masson, Gisli; Helgason, Hannes; Thorsteinsdottir, Unnur; Gudbjartsson, Daniel F; Sulem, Patrick; Stefansson, Kari

2016-11-16

Adult height is a highly heritable trait. Here we identified 31.6 million sequence variants by whole-genome sequencing of 8,453 Icelanders and tested them for association with adult height by imputing them into 88,835 Icelanders. We discovered 13 novel height associations by testing four different models including parent-of-origin (|β|=0.4-10.6 cm). The minor alleles of three parent-of-origin signals associate with less height only when inherited from the father and are located within imprinted regions (IGF2-H19 and DLK1-MEG3). We also examined the association of these sequence variants in a set of 12,645 Icelanders with birth length measurements. Two of the novel variants (IGF2-H19 and TET1) show significant association with both adult height and birth length, indicating a role in early growth regulation. Among the parent-of-origin signals, we observed opposing parental effects, raising questions about underlying mechanisms. These findings demonstrate that common variations affect human growth by parental imprinting.
Pancreatic islet enhancer clusters enriched in type 2 diabetes risk-associated variants.

PubMed

Pasquali, Lorenzo; Gaulton, Kyle J; Rodríguez-Seguí, Santiago A; Mularoni, Loris; Miguel-Escalada, Irene; Akerman, İldem; Tena, Juan J; Morán, Ignasi; Gómez-Marín, Carlos; van de Bunt, Martijn; Ponsa-Cobas, Joan; Castro, Natalia; Nammo, Takao; Cebola, Inês; García-Hurtado, Javier; Maestro, Miguel Angel; Pattou, François; Piemonti, Lorenzo; Berney, Thierry; Gloyn, Anna L; Ravassard, Philippe; Skarmeta, José Luis Gómez; Müller, Ferenc; McCarthy, Mark I; Ferrer, Jorge

2014-02-01

Type 2 diabetes affects over 300 million people, causing severe complications and premature death, yet the underlying molecular mechanisms are largely unknown. Pancreatic islet dysfunction is central in type 2 diabetes pathogenesis, and understanding islet genome regulation could therefore provide valuable mechanistic insights. We have now mapped and examined the function of human islet cis-regulatory networks. We identify genomic sequences that are targeted by islet transcription factors to drive islet-specific gene activity and show that most such sequences reside in clusters of enhancers that form physical three-dimensional chromatin domains. We find that sequence variants associated with type 2 diabetes and fasting glycemia are enriched in these clustered islet enhancers and identify trait-associated variants that disrupt DNA binding and islet enhancer activity. Our studies illustrate how islet transcription factors interact functionally with the epigenome and provide systematic evidence that the dysregulation of islet enhancers is relevant to the mechanisms underlying type 2 diabetes.
Whole-exome sequencing identifies novel compound heterozygous mutations in USH2A in Spanish patients with autosomal recessive retinitis pigmentosa.

PubMed

Méndez-Vidal, Cristina; González-Del Pozo, María; Vela-Boza, Alicia; Santoyo-López, Javier; López-Domingo, Francisco J; Vázquez-Marouschek, Carmen; Dopazo, Joaquin; Borrego, Salud; Antiñolo, Guillermo

2013-01-01

Retinitis pigmentosa (RP) is an inherited retinal dystrophy characterized by extreme genetic and clinical heterogeneity. Thus, the diagnosis is not always easily performed due to phenotypic and genetic overlap. Current clinical practices have focused on the systematic evaluation of a set of known genes for each phenotype, but this approach may fail in patients with inaccurate diagnosis or infrequent genetic cause. In the present study, we investigated the genetic cause of autosomal recessive RP (arRP) in a Spanish family in which the causal mutation has not yet been identified with primer extension technology and resequencing. We designed a whole-exome sequencing (WES)-based approach using NimbleGen SeqCap EZ Exome V3 sample preparation kit and the SOLiD 5500×l next-generation sequencing platform. We sequenced the exomes of both unaffected parents and two affected siblings. Exome analysis resulted in the identification of 43,204 variants in the index patient. All variants passing filter criteria were validated with Sanger sequencing to confirm familial segregation and absence in the control population. In silico prediction tools were used to determine mutational impact on protein function and the structure of the identified variants. Novel Usher syndrome type 2A (USH2A) compound heterozygous mutations, c.4325T>C (p.F1442S) and c.15188T>G (p.L5063R), located in exons 20 and 70, respectively, were identified as probable causative mutations for RP in this family. Family segregation of the variants showed the presence of both mutations in all affected members and in two siblings who were apparently asymptomatic at the time of family ascertainment. Clinical reassessment confirmed the diagnosis of RP in these patients. Using WES, we identified two heterozygous novel mutations in USH2A as the most likely disease-causing variants in a Spanish family diagnosed with arRP in which the cause of the disease had not yet been identified with commonly used techniques. 
Our data reinforce the clinical role of WES in the molecular diagnosis of highly heterogeneous genetic diseases where conventional genetic approaches have previously failed in achieving a proper diagnosis.
Double Hits in Schizophrenia.

PubMed

Vorstman, Jacob A S; Olde Loohuis, Loes M; Kahn, René S; Ophoff, Roel A

2018-05-14

The co-occurrence of a Copy Number Variant (CNV) and a functional variant on the other allele may be a relevant genetic mechanism in schizophrenia. We hypothesized that the cumulative burden of such double hits - in particular those composed of a deletion and a coding single nucleotide variation (SNV) - is increased in patients with schizophrenia. We combined CNV data with coding variants data in 795 patients with schizophrenia and 474 controls. To limit false CNV detection, only CNVs called by two algorithms were included. CNV-affected genes were subsequently examined for coding SNVs, which we termed "CNV-SNVs". Correcting for total queried sequence, we assessed the CNV-SNV burden and the combined predicted deleterious effect. We estimated p-values by permutation of the phenotype. We detected 105 CNV-SNVs; 67 in duplicated and 38 in deleted genic sequence. While the difference in CNV-SNV rates was not significant, the combined deleteriousness inferred by CNV-SNVs in deleted sequence was almost fourfold higher in cases compared to controls (nominal p = 0.009). This effect may be driven by a higher number of CNV-SNVs and/or by a higher degree of predicted deleteriousness of CNV-SNVs. No such effect was observed for duplications. We provide early evidence that deletions co-occurring with a functional variant may be relevant, albeit of modest impact, for the genetic etiology of schizophrenia. Large-scale consortium studies are required to validate our findings. Sequence-based analyses would provide the best resolution for detection of CNVs as well as coding variants genome-wide.
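The abstract above estimates p-values by permuting the phenotype. The following is a minimal sketch of that general idea only, not the authors' pipeline: the function name, the one-sided difference-in-means statistic, and the toy burden scores are all assumptions made for illustration.

```python
import random
from statistics import mean


def permutation_p_value(scores, is_case, n_perm=2000, seed=0):
    """One-sided permutation p-value for mean(cases) - mean(controls).

    scores  : per-individual burden scores (hypothetical values)
    is_case : booleans, True for cases; permuted to break any association
    """
    def statistic(labels):
        cases = [s for s, c in zip(scores, labels) if c]
        controls = [s for s, c in zip(scores, labels) if not c]
        return mean(cases) - mean(controls)

    observed = statistic(is_case)
    rng = random.Random(seed)      # fixed seed so the sketch is reproducible
    labels = list(is_case)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(labels)        # permute the phenotype labels
        if statistic(labels) >= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly above zero.
    return (hits + 1) / (n_perm + 1)


# Toy data (invented for illustration): cases carry clearly higher scores.
toy_scores = [5, 6, 7, 8, 9, 10, 0, 1, 2, 1, 0, 2]
toy_labels = [True] * 6 + [False] * 6
p = permutation_p_value(toy_scores, toy_labels)
```

Because the labels are shuffled rather than the scores, the case/control ratio is preserved in every permutation, which is the usual reason for permuting the phenotype.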
Jannovar: a java library for exome annotation.

PubMed

Jäger, Marten; Wang, Kai; Bauer, Sebastian; Smedley, Damian; Krawitz, Peter; Robinson, Peter N

2014-05-01

Transcript-based annotation and pedigree analysis are two basic steps in the computational analysis of whole-exome sequencing experiments in genetic diagnostics and disease-gene discovery projects. Here, we present Jannovar, a stand-alone Java application as well as a Java library designed to be used in larger software frameworks for exome and genome analysis. Jannovar uses an interval tree to identify all transcripts affected by a given variant, and provides Human Genome Variation Society-compliant annotations both for variants affecting coding sequences and splice junctions as well as untranslated regions and noncoding RNA transcripts. Jannovar can also perform family-based pedigree analysis with Variant Call Format (VCF) files with data from members of a family segregating a Mendelian disorder. Using a desktop computer, Jannovar requires a few seconds to annotate a typical VCF file with exome data. Jannovar is freely available under the BSD2 license. Source code as well as the Java application and library file can be downloaded from http://compbio.charite.de (with tutorial) and https://github.com/charite/jannovar. © 2014 WILEY PERIODICALS, INC.
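The abstract above says Jannovar uses an interval tree to find all transcripts affected by a variant. Jannovar itself is written in Java; the Python sketch below only illustrates the general data structure (a centered interval tree with a point query) and does not reproduce Jannovar's classes or API. The `Transcript` and `IntervalNode` names, the coordinate convention, and the example transcripts are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Transcript:
    name: str   # hypothetical transcript identifier
    start: int  # 0-based, inclusive
    end: int    # 0-based, exclusive; must be greater than start


class IntervalNode:
    """Centered interval tree over a non-empty list of transcripts."""

    def __init__(self, transcripts):
        # Use the median endpoint as the center, so at least one interval
        # contains it and both subtrees are strictly smaller.
        points = sorted(p for t in transcripts for p in (t.start, t.end - 1))
        self.center = points[len(points) // 2]
        self.overlapping = [t for t in transcripts
                            if t.start <= self.center < t.end]
        left = [t for t in transcripts if t.end - 1 < self.center]
        right = [t for t in transcripts if t.start > self.center]
        self.left = IntervalNode(left) if left else None
        self.right = IntervalNode(right) if right else None

    def query(self, pos):
        """Return every transcript whose interval contains position pos."""
        hits = [t for t in self.overlapping if t.start <= pos < t.end]
        if pos < self.center and self.left is not None:
            hits += self.left.query(pos)
        elif pos > self.center and self.right is not None:
            hits += self.right.query(pos)
        return hits


# Invented example transcripts on a single chromosome.
tree = IntervalNode([
    Transcript("tx_a", 100, 200),
    Transcript("tx_b", 150, 300),
    Transcript("tx_c", 400, 500),
])
affected = sorted(t.name for t in tree.query(160))
```

A point query only descends into one subtree per node, which is why this kind of lookup stays fast even when a file contains many variants.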
Exome Sequencing Is an Efficient Tool for Variant Late-Infantile Neuronal Ceroid Lipofuscinosis Molecular Diagnosis

PubMed Central

Ortega-Recalde, Oscar; Nallathambi, Jeyabalan; Anandula, Venkata Ramana; Renukaradhya, Umashankar; Laissue, Paul

2014-01-01

The neuronal ceroid-lipofuscinoses (NCL) are a group of neurodegenerative disorders characterized by epilepsy, visual failure, progressive mental and motor deterioration, myoclonus, dementia and reduced life expectancy. Classically, NCL-affected individuals have been classified into six categories, which have been mainly defined regarding the clinical onset of symptoms. However, some patients cannot be easily included in a specific group because of significant variation in the age of onset and disease progression. Molecular genetics has emerged in recent years as a useful tool for enhancing NCL subtype classification. Fourteen NCL genetic forms (CLN1 to CLN14) have been described to date. The variant late-infantile form of the disease has been linked to CLN5, CLN6, CLN7 (MFSD8) and CLN8 mutations. Despite advances in the diagnosis of neurodegenerative disorders, mutations in these genes may cause similar phenotypes, which makes accurate candidate gene selection for direct sequencing difficult. Three siblings who were affected by variant late-infantile NCL are reported in the present study. We used whole-exome sequencing, direct sequencing and in silico approaches to identify the molecular basis of the disease. We identified the novel c.1219T>C (p.Trp407Arg) and c.1361T>C (p.Met454Thr) MFSD8 pathogenic mutations. Our results highlighted next generation sequencing as a novel and powerful methodological approach for the rapid determination of the molecular diagnosis of NCL. They also provide information regarding the phenotypic and molecular spectrum of CLN7 disease. PMID:25333361
Construction of an Exome-Wide Risk Score for Schizophrenia Based on a Weighted Burden Test.

PubMed

Curtis, David

2018-01-01

Polygenic risk scores obtained as a weighted sum of associated variants can be used to explore association in additional data sets and to assign risk scores to individuals. The methods used to derive polygenic risk scores from common SNPs are not suitable for variants detected in whole exome sequencing studies. Rare variants, which may have major effects, are seen too infrequently to judge whether they are associated and may not be shared between training and test subjects. A method is proposed whereby variants are weighted according to their frequency, their annotations and the genes they affect. A weighted sum across all variants provides an individual risk score. Scores constructed in this way are used in a weighted burden test and are shown to be significantly different between schizophrenia cases and controls using a five-way cross-validation procedure. This approach represents a first attempt to summarise exome sequence variation into a summary risk score, which could be combined with risk scores from common variants and from environmental factors. It is hoped that the method could be developed further. © 2017 John Wiley & Sons Ltd/University College London.
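The abstract above describes weighting each variant by its frequency, its annotation, and the gene it affects, then summing across variants to get an individual risk score. The sketch below shows only that arithmetic. The weighting function (rarer variants weighted more, multiplied by annotation and gene weights), the variant names, and every number are assumptions made for illustration and are not the published weighting scheme.

```python
def variant_weight(maf, annotation_weight, gene_weight):
    """Illustrative weight: rarer variants and more severe annotations count more.

    maf               : minor allele frequency, between 0 and 1
    annotation_weight : hypothetical weight for the predicted effect
    gene_weight       : hypothetical weight for the affected gene
    """
    frequency_weight = 1.0 - maf  # assumed form, chosen for simplicity
    return frequency_weight * annotation_weight * gene_weight


def risk_score(genotype, variant_info):
    """Weighted sum over variants of (weight x minor-allele count)."""
    return sum(
        variant_weight(*variant_info[variant]) * allele_count
        for variant, allele_count in genotype.items()
    )


# Invented variants: (maf, annotation_weight, gene_weight)
variant_info = {
    "common_synonymous": (0.5, 1.0, 1.0),
    "rare_lof": (0.001, 10.0, 2.0),
}
# One hypothetical individual: minor-allele counts per variant (0, 1 or 2).
genotype = {"common_synonymous": 2, "rare_lof": 1}
score = risk_score(genotype, variant_info)  # 2 * 0.5 + 1 * 0.999 * 20
```

Scores computed this way for cases and controls could then be compared in a burden test, which is the use the abstract describes.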
Whole-Genome Sequencing and Variant Analysis of Human Papillomavirus 16 Infections.

PubMed

van der Weele, Pascal; Meijer, Chris J L M; King, Audrey J

2017-10-01

Human papillomavirus (HPV) is a strongly conserved DNA virus, high-risk types of which can cause cervical cancer in persistent infections. The most common type found in HPV-attributable cancer is HPV16, which can be subdivided into four lineages (A to D) with different carcinogenic properties. Studies have shown HPV16 sequence diversity in different geographical areas, but only limited information is available regarding HPV16 diversity within a population, especially at the whole-genome level. We analyzed HPV16 major variant diversity and conservation in persistent infections and performed a single nucleotide polymorphism (SNP) comparison between persistent and clearing infections. Materials were obtained in the Netherlands from a cohort study with longitudinal follow-up for up to 3 years. Our analysis shows a remarkably large variant diversity in the population. Whole-genome sequences were obtained for 57 persistent and 59 clearing HPV16 infections, resulting in 109 unique variants. Interestingly, persistent infections were completely conserved through time. One reinfection event was identified where the initial and follow-up samples clustered differently. Non-A1/A2 variants seemed to clear preferentially ( P = 0.02). Our analysis shows that population-wide HPV16 sequence diversity is very large. In persistent infections, the HPV16 sequence was fully conserved. Sequencing can identify HPV16 reinfections, although occurrence is rare. SNP comparison identified no strongly acting effect of the viral genome affecting HPV16 infection clearance or persistence in up to 3 years of follow-up. These findings suggest the progression of an early HPV16 infection could be host related. IMPORTANCE Human papillomavirus 16 (HPV16) is the predominant type found in cervical cancer. 
Progression of initial infection to cervical cancer has been linked to sequence properties; however, knowledge of variants circulating in European populations, especially with longitudinal follow-up, is limited. By sequencing a number of infections with known follow-up for up to 3 years, we gained initial insights into the genetic diversity of HPV16 and the effects of the viral genome on the persistence of infections. A SNP comparison between sequences obtained from clearing and persistent infections did not identify strongly acting DNA variations responsible for these infection outcomes. In addition, we identified an HPV16 reinfection event where sequencing of initial and follow-up samples showed different HPV16 variants. Based on conventional genotyping, this infection would incorrectly be considered a persistent HPV16 infection. In the context of vaccine efficacy and monitoring studies, such infections could potentially cause reduced reported efficacy or efficiency. Copyright © 2017 van der Weele et al.
Variant of TREM2 Associated with the Risk of Alzheimer’s Disease

PubMed Central

Jonsson, Thorlakur; Stefansson, Hreinn; Steinberg, Stacy; Jonsdottir, Ingileif; Jonsson, Palmi V.; Snaedal, Jon; Bjornsson, Sigurbjorn; Huttenlocher, Johanna; Levey, Allan I.; Lah, James J.; Rujescu, Dan; Hampel, Harald; Giegling, Ina; Andreassen, Ole A.; Engedal, Knut; Ulstein, Ingun; Djurovic, Srdjan; Ibrahim-Verbaas, Carla; Hofman, Albert; Ikram, M. Arfan; van Duijn, Cornelia M; Thorsteinsdottir, Unnur; Kong, Augustine; Stefansson, Kari

2013-01-01

BACKGROUND Sequence variants, including the ε4 allele of apolipoprotein E, have been associated with the risk of the common late-onset form of Alzheimer’s disease. Few rare variants affecting the risk of late-onset Alzheimer’s disease have been found. METHODS We obtained the genome sequences of 2261 Icelanders and identified sequence variants that were likely to affect protein function. We imputed these variants into the genomes of patients with Alzheimer’s disease and control participants and then tested for an association with Alzheimer’s disease. We performed replication tests using case–control series from the United States, Norway, the Netherlands, and Germany. We also tested for a genetic association with cognitive function in a population of unaffected elderly persons. RESULTS A rare missense mutation (rs75932628-T) in the gene encoding the triggering receptor expressed on myeloid cells 2 (TREM2), which was predicted to result in an R47H substitution, was found to confer a significant risk of Alzheimer’s disease in Iceland (odds ratio, 2.92; 95% confidence interval [CI], 2.09 to 4.09; P = 3.42×10⁻¹⁰). The mutation had a frequency of 0.46% in controls 85 years of age or older. We observed the association in additional sample sets (odds ratio, 2.90; 95% CI, 2.16 to 3.91; P = 2.1×10⁻¹² in combined discovery and replication samples). We also found that carriers of rs75932628-T between the ages of 80 and 100 years without Alzheimer’s disease had poorer cognitive function than noncarriers (P = 0.003). CONCLUSIONS Our findings strongly implicate variant TREM2 in the pathogenesis of Alzheimer’s disease. Given the reported antiinflammatory role of TREM2 in the brain, the R47H substitution may lead to an increased predisposition to Alzheimer’s disease through impaired containment of inflammatory processes. (Funded by the National Institute on Aging and others.) PMID:23150908
Functional assessment of a novel COL4A5 splice region variant and immunostaining of plucked hair follicles as an alternative method of diagnosis in X-linked Alport syndrome.

PubMed

Malone, Andrew F; Funk, Steven D; Alhamad, Tarek; Miner, Jeffrey H

2017-06-01

Many COL4A5 splice region variants have been described in patients with X-linked Alport syndrome, but few have been confirmed by functional analysis to actually cause defective splicing. We sought to demonstrate that a novel COL4A5 splice region variant in a family with Alport syndrome is pathogenic using functional studies. We also describe an alternative method of diagnosis. Targeted next-generation sequencing results of an individual with Alport syndrome were analyzed and the results confirmed by Sanger sequencing in family members. A splicing reporter minigene assay was used to examine the variant's effect on splicing in transfected cells. Plucked hair follicles from patients and controls were examined for collagen IV proteins using immunofluorescence microscopy. A novel splice region mutation in COL4A5, c.1780-6T>G, was identified and segregated with disease in this family. This variant caused frequent skipping of exon 25, resulting in a frameshift and truncation of collagen α5(IV) protein. We also developed and validated a new approach to characterize the expression of collagen α5(IV) protein in the basement membranes of plucked hair follicles. Using this approach we demonstrated reduced collagen α5(IV) protein in affected male and female individuals in this family, supporting frequent failure of normal splicing. Differing normal to abnormal transcript ratios in affected individuals carrying splice region variants may contribute to variable disease severity observed in Alport families. Examination of plucked hair follicles in suspected X-linked Alport syndrome patients may offer a less invasive alternative method of diagnosis and serve as a pathogenicity test for COL4A5 variants of uncertain significance.
Biallelic variants in the ciliary gene TMEM67 cause RHYNS syndrome.

PubMed

Brancati, Francesco; Camerota, Letizia; Colao, Emma; Vega-Warner, Virginia; Zhao, Xiangzhong; Zhang, Ruixiao; Bottillo, Irene; Castori, Marco; Caglioti, Alfredo; Sangiuolo, Federica; Novelli, Giuseppe; Perrotti, Nicola; Otto, Edgar A

2018-06-11

A rare syndrome was first described in 1997 in a 17-year-old male patient presenting with Retinitis pigmentosa, HYpopituitarism, Nephronophthisis and Skeletal dysplasia (RHYNS). In the single reported familial case, two brothers were affected, arguing for X-linked or recessive mode of inheritance. Up to now, the underlying genetic basis of RHYNS syndrome remains unknown. Here we applied whole-exome sequencing in the originally described family with RHYNS to identify compound heterozygous variants in the ciliary gene TMEM67. Sanger sequencing confirmed a paternally inherited nonsense c.622A > T, p.(Arg208*) and a maternally inherited missense variant c.1289A > G, p.(Asp430Gly), which perturbs the correct splicing of exon 13. Overall, TMEM67 showed one of the widest clinical continuum observed in ciliopathies ranging from early lethality to adults with liver fibrosis. Our findings extend the spectrum of phenotypes/syndromes resulting from biallelic TMEM67 variants to now eight distinguishable clinical conditions including RHYNS syndrome.
Whole-exome sequencing of a rare case of familial childhood acute lymphoblastic leukemia reveals putative predisposing mutations in Fanconi anemia genes.

PubMed

Spinella, Jean-François; Healy, Jasmine; Saillour, Virginie; Richer, Chantal; Cassart, Pauline; Ouimet, Manon; Sinnett, Daniel

2015-07-23

Acute lymphoblastic leukemia (ALL) is the most common pediatric cancer. While the multi-step model of pediatric leukemogenesis suggests interplay between constitutional and somatic genomes, the role of inherited genetic variability remains largely undescribed. Nonsyndromic familial ALL, although extremely rare, provides the ideal setting to study inherited contributions to ALL. Toward this goal, we sequenced the exomes of a childhood ALL family consisting of mother, father and two non-twinned siblings diagnosed with concordant pre-B hyperdiploid ALL and previously shown to have inherited a rare form of PRDM9, a histone H3 methyltransferase involved in crossing-over at recombination hotspots and Holliday junctions. We postulated that inheritance of additional rare disadvantaging variants in predisposing cancer genes could affect genomic stability and lead to increased risk of hyperdiploid ALL within this family. Whole exomes were captured using Agilent's SureSelect kit and sequenced on the Life Technologies SOLiD System. We applied a data reduction strategy to identify candidate variants shared by both affected siblings. Under a recessive disease model, we focused on rare non-synonymous or frame-shift variants in leukemia predisposing pathways. Though the family was nonsyndromic, we identified a combination of rare variants in Fanconi anemia (FA) genes FANCP/SLX4 (compound heterozygote - rs137976282/rs79842542) and FANCA (rs61753269) and a rare homozygous variant in the Holliday junction resolvase GEN1 (rs16981869). These variants, predicted to affect protein function, were previously identified in familial breast cancer cases. Based on our in-house database of 369 childhood ALL exomes, the sibs were the only patients to carry this particularly rare combination and only a single hyperdiploid patient was heterozygote at both FANCP/SLX4 positions, while no FANCA variant allele carriers were identified. 
FANCA is the most commonly mutated gene in FA and is essential for resolving DNA interstrand cross-links during replication. FANCP/SLX4 and GEN1 are involved in the cleavage of Holliday junctions and their mutated forms, in combination with the rare allele of PRDM9, could alter Holliday junction resolution leading to nondisjunction of chromosomes and segregation defects. Taken together, these results suggest that concomitant inheritance of rare variants in FANCA, FANCP/SLX4 and GEN1 on the specific genetic background of this familial case, could lead to increased genomic instability, hematopoietic dysfunction, and higher risk of childhood leukemia.

Frequent and Rare HABP2 Variants Are Not Associated with Increased Susceptibility to Familial Nonmedullary Thyroid Carcinoma in the Spanish Population.

PubMed

de Randamie, Rajdee; Martos-Moreno, Gabriel Ángel; Lumbreras, César; Chueca, Maria; Donnay, Sergio; Luque, Manuel; Regojo, Rita María; Mendiola, Marta; Hardisson, David; Argente, Jesús; Moreno, José C

2018-06-12

A genomic HABP2 variant was proposed to be responsible for familial nonmedullary thyroid carcinoma (FNMTC). However, its involvement has been questioned in subsequent studies. We aimed to identify genetic HABP2 mutations in a series of FNMTC patients and investigate their involvement in the disease. HABP2 was sequenced from 6 index patients. Presence of the variants was investigated in all members of one family. Somatic BRAF and RAS "hotspot" mutations were investigated by the Idylla™ BRAF Mutation Test and/or Sanger sequencing. Two HABP2 variants (p.E393Q and p.G534E) were identified in the index patient from one family with papillary thyroid carcinoma (PTC) (follicular variant). The prevalence of p.E393Q in Spanish control alleles was 0.5% and that of p.G534E was 5.1%. However, neither change cosegregated with the phenotype in 3 affected members and 5 healthy members of the kindred. Interestingly, all 3 members affected by PTC harbored the p.V600E somatic mutation in BRAF. The variant p.G534E is prevalent in the Spanish population (5.1%); however, p.E393Q is rare (< 1%) and neither cosegregated with the FNMTC phenotype. The presence of the noninheritable V600E BRAF mutation in this family supports Knudson's "double-hit" hypothesis for cancer development and suggests the involvement of more than 1 gene in the clinical expression of FNMTC. © 2018 S. Karger AG, Basel.
Sequence of the toxic shock syndrome toxin gene (tstH) borne by strains of Staphylococcus aureus isolated from patients with Kawasaki syndrome.

PubMed Central

Deresiewicz, R L; Flaxenburg, J; Leng, K; Kasper, D L

1996-01-01

To explore whether a novel staphylococcal clone or structural variant of toxic shock syndrome toxin 1 is associated with Kawasaki syndrome, six toxigenic strains of Staphylococcus aureus from Kawasaki syndrome patients were studied. The strains were divisible into two groups based on phenotypic and genotypic characteristics and are therefore unequivocally not clonal. Portions of the tstH genes of each strain were sequenced. Three were sequenced in their entirety, while the remainder were sequenced from codon 66 to codon 137 of the mature protein only. Two of the former group differed slightly in the sequences of their signal peptides relative to the sequence published for the tstH signal peptide. Those differences did not affect toxin processing or secretion. The sequenced portions of the regions encoding mature toxic shock syndrome toxin 1 were identical in all six strains and corresponded exactly to the published sequence of tstH. No evidence was found for the existence of a structural variant of tstH uniquely associated with Kawasaki syndrome. PMID:8757881
Fast single-pass alignment and variant calling using sequencing data

USDA-ARS's Scientific Manuscript database

Sequencing research requires efficient computation. Few programs use already known information about DNA variants when aligning sequence data to the reference map. New program findmap.f90 reads the previous variant list before aligning sequence, calling variant alleles, and summing the allele counts...
Exome Sequencing Reveals Primary Immunodeficiencies in Children with Community-Acquired Pseudomonas aeruginosa Sepsis.

PubMed

Asgari, Samira; McLaren, Paul J; Peake, Jane; Wong, Melanie; Wong, Richard; Bartha, Istvan; Francis, Joshua R; Abarca, Katia; Gelderman, Kyra A; Agyeman, Philipp; Aebi, Christoph; Berger, Christoph; Fellay, Jacques; Schlapbach, Luregn J

2016-01-01

One out of three pediatric sepsis deaths in high income countries occur in previously healthy children. Primary immunodeficiencies (PIDs) have been postulated to underlie fulminant sepsis, but this concept remains to be confirmed in clinical practice. Pseudomonas aeruginosa (P. aeruginosa) is a common bacterium mostly associated with health care-related infections in immunocompromised individuals. However, in rare cases, it can cause sepsis in previously healthy children. We used exome sequencing and bioinformatic analysis to systematically search for genetic factors underpinning severe P. aeruginosa infection in the pediatric population. We collected blood samples from 11 previously healthy children, with no family history of immunodeficiency, who presented with severe sepsis due to community-acquired P. aeruginosa bacteremia. Genomic DNA was extracted from blood or tissue samples obtained intravitam or postmortem. We obtained high-coverage exome sequencing data and searched for rare loss-of-function variants. After rigorous filtrations, 12 potentially causal variants were identified. Two out of eight (25%) fatal cases were found to carry novel pathogenic variants in PID genes, including BTK and DNMT3B. This study demonstrates that exome sequencing allows the identification of rare, deleterious human genetic variants responsible for fulminant sepsis in apparently healthy children. Diagnosing PIDs in such patients is of high relevance to survivors and affected families. We propose that unusually severe and fatal sepsis cases in previously healthy children should be considered for exome/genome sequencing to search for underlying PIDs.
Exome Sequencing Reveals Primary Immunodeficiencies in Children with Community-Acquired Pseudomonas aeruginosa Sepsis

PubMed Central

Asgari, Samira; McLaren, Paul J.; Peake, Jane; Wong, Melanie; Wong, Richard; Bartha, Istvan; Francis, Joshua R.; Abarca, Katia; Gelderman, Kyra A.; Agyeman, Philipp; Aebi, Christoph; Berger, Christoph; Fellay, Jacques; Schlapbach, Luregn J.; Posfay-Barbe, Klara

2016-01-01

One out of three pediatric sepsis deaths in high income countries occur in previously healthy children. Primary immunodeficiencies (PIDs) have been postulated to underlie fulminant sepsis, but this concept remains to be confirmed in clinical practice. Pseudomonas aeruginosa (P. aeruginosa) is a common bacterium mostly associated with health care-related infections in immunocompromised individuals. However, in rare cases, it can cause sepsis in previously healthy children. We used exome sequencing and bioinformatic analysis to systematically search for genetic factors underpinning severe P. aeruginosa infection in the pediatric population. We collected blood samples from 11 previously healthy children, with no family history of immunodeficiency, who presented with severe sepsis due to community-acquired P. aeruginosa bacteremia. Genomic DNA was extracted from blood or tissue samples obtained intravitam or postmortem. We obtained high-coverage exome sequencing data and searched for rare loss-of-function variants. After rigorous filtrations, 12 potentially causal variants were identified. Two out of eight (25%) fatal cases were found to carry novel pathogenic variants in PID genes, including BTK and DNMT3B. This study demonstrates that exome sequencing allows the identification of rare, deleterious human genetic variants responsible for fulminant sepsis in apparently healthy children. Diagnosing PIDs in such patients is of high relevance to survivors and affected families. We propose that unusually severe and fatal sepsis cases in previously healthy children should be considered for exome/genome sequencing to search for underlying PIDs. PMID:27703454
X-Linked Glomerulopathy Due to COL4A5 Founder Variant.

PubMed

Barua, Moumita; John, Rohan; Stella, Lorenzo; Li, Weili; Roslin, Nicole M; Sharif, Bedra; Hack, Saidah; Lajoie-Starkell, Ginette; Schwaderer, Andrew L; Becknell, Brian; Wuttke, Matthias; Köttgen, Anna; Cattran, Daniel; Paterson, Andrew D; Pei, York

2018-03-01

Alport syndrome is a rare hereditary disorder caused by rare variants in 1 of 3 genes encoding for type IV collagen. Rare variants in COL4A5 on chromosome Xq22 cause X-linked Alport syndrome, which accounts for ∼80% of the cases. Alport syndrome has a variable clinical presentation, including progressive kidney failure, hearing loss, and ocular defects. Exome sequencing performed in 2 affected related males with an undefined X-linked glomerulopathy characterized by global and segmental glomerulosclerosis, mesangial hypercellularity, and vague basement membrane immune complex deposition revealed a COL4A5 sequence variant, a substitution of a thymine by a guanine at nucleotide 665 (c.T665G; rs281874761) of the coding DNA predicted to lead to a cysteine to phenylalanine substitution at amino acid 222, which was not seen in databases cataloguing natural human genetic variation, including dbSNP138, 1000 Genomes Project release version 01-11-2004, Exome Sequencing Project 21-06-2014, or ExAC 01-11-2014. Review of the literature identified 2 additional families with the same COL4A5 variant leading to similar atypical histopathologic features, suggesting a unique pathologic mechanism initiated by this specific rare variant. Homology modeling suggests that the substitution alters the structural and dynamic properties of the type IV collagen trimer. Genetic analysis comparing members of the 3 families indicated a distant relationship with a shared haplotype, implying a founder effect. Crown Copyright © 2017. Published by Elsevier Inc. All rights reserved.
Exploration of RNA Sequence Space in the Absence of a Replicase.

PubMed

Tirumalai, Madhan R; Tran, Quyen; Paci, Maxim; Chavan, Dimple; Marathe, Anuradha; Fox, George E

2018-05-11

It is generally considered that if an RNA World ever existed, it would have been driven by an RNA capable of RNA replication. Whether such a catalytic RNA could emerge in an RNA World or not, there would need to be prior routes to increasing complexity in order to produce it. It is hypothesized here that increasing sequence variety, if not complexity, can in fact readily emerge in response to a dynamic equilibrium between synthesis and degradation. A model system in which T4 RNA ligase catalyzes synthesis and Benzonase catalyzes degradation was constructed. An initial 20-mer served as a seed and was subjected to 180 min of simultaneous ligation and degradation. The seed RNA rapidly disappeared and was replaced by an increasing number and variety of both larger and smaller variants. Variants of 40-80 residues were consistently seen, typically representing 2-4% of the unique sequences. In a second experiment with four individual 9-mers, numerous variants were again produced. These included variants of the individual 9-mers as well as sequences that contained sequence segments from two or more 9-mers. In both cases, the RNA products lack large numbers of point mutations but instead incorporate additions and subtractions of fragments of the original RNAs. The system demonstrates that if such equilibrium were established in a prebiotic world it would result in significant exploration of RNA sequence space and likely increased complexity. It remains to be seen if the variety of products produced is affected by the presence of small peptide oligomers.
Ultradeep Sequencing for Detection of Quasispecies Variants in the Major Hydrophilic Region of Hepatitis B Virus in Indonesian Patients

PubMed Central

Yamani, Laura Navika; Utsumi, Takako; Juniastuti; Wandono, Hadi; Widjanarko, Doddy; Triantanoe, Ari; Wasityastuti, Widya; Liang, Yujiao; Okada, Rina; Tanahashi, Toshihito; Murakami, Yoshiki; Azuma, Takeshi; Soetjipto; Lusida, Maria Inge; Hayashi, Yoshitake

2015-01-01

Quasispecies of hepatitis B virus (HBV) with variations in the major hydrophilic region (MHR) of the HBV surface antigen (HBsAg) can evolve during infection, allowing HBV to evade neutralizing antibodies. These escape variants may contribute to chronic infections. In this study, we looked for MHR variants in HBV quasispecies using ultradeep sequencing and evaluated the relationship between these variants and clinical manifestations in infected patients. We enrolled 30 Indonesian patients with hepatitis B infection (11 with chronic hepatitis and 19 with advanced liver disease). The most common subgenotype/subtype of HBV was B3/adw (97%). The HBsAg titer was lower in patients with advanced liver disease than that in patients with chronic hepatitis. The MHR variants were grouped based on the percentage of the viral population affected: major, ≥20% of the total population; intermediate, 5% to <20%; and minor, 1% to <5%. The rates of MHR variation that were present in the major and intermediate viral population were significantly greater in patients with advanced liver disease than those in chronic patients. The most frequent MHR variants related to immune evasion in the major and intermediate populations were P120Q/T, T123A, P127T, Q129H/R, M133L/T, and G145R. The major population of MHR variants causing impaired HBsAg secretion (e.g., G119R, Q129R, T140I, and G145R) was detected only in advanced liver disease patients. This is the first study to use ultradeep sequencing for the detection of MHR variants of HBV quasispecies in Indonesian patients. We found that a greater number of MHR variations was related to disease severity and reduced likelihood of HBsAg titer. PMID:26202119
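The population classes described in the abstract above (major, intermediate, minor) can be written as a small threshold function. This is only an illustrative sketch in Python: the function name classify_mhr_variant and the "below reporting threshold" label for variants under 1% are assumptions, since the abstract does not say how such variants were handled.

```python
def classify_mhr_variant(percent: float) -> str:
    """Assign a variant to a population class from its share of the viral population.

    Thresholds follow the abstract: major >= 20%, intermediate 5% to < 20%,
    minor 1% to < 5%. The label for values below 1% is an assumption.
    """
    if not 0.0 <= percent <= 100.0:
        raise ValueError("percent must lie between 0 and 100")
    if percent >= 20.0:
        return "major"
    if percent >= 5.0:
        return "intermediate"
    if percent >= 1.0:
        return "minor"
    return "below reporting threshold"  # hypothetical label, not from the source


if __name__ == "__main__":
    # Made-up frequencies, for illustration only.
    for freq in (35.0, 7.5, 1.2, 0.3):
        print(freq, classify_mhr_variant(freq))
```

The boundaries are half-open, so a variant at exactly 20% counts as major and one at exactly 5% as intermediate, matching the "5% to <20%" wording.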
Small Deletion Variants Have Stable Breakpoints Commonly Associated with Alu Elements

PubMed Central

Coin, Lachlan J. M.; Steinfeld, Israel; Yakhini, Zohar; Sladek, Rob; Froguel, Philippe; Blakemore, Alexandra I. F.

2008-01-01

Copy number variants (CNVs) contribute significantly to human genomic variation, with over 5000 loci reported, covering more than 18% of the euchromatic human genome. Little is known, however, about the origin and stability of variants of different size and complexity. We investigated the breakpoints of 20 small, common deletions, representing a subset of those originally identified by array CGH, using Agilent microarrays, in 50 healthy French Caucasian subjects. By sequencing PCR products amplified using primers designed to span the deleted regions, we determined the exact size and genomic position of the deletions in all affected samples. For each deletion studied, all individuals carrying the deletion share identical upstream and downstream breakpoints at the sequence level, suggesting that the deletion event occurred just once and later became common in the population. This is supported by linkage disequilibrium (LD) analysis, which has revealed that most of the deletions studied are in moderate to strong LD with surrounding SNPs, and have conserved long-range haplotypes. Analysis of the sequences flanking the deletion breakpoints revealed an enrichment of microhomology at the breakpoint junctions. More significantly, we found an enrichment of Alu repeat elements, the overwhelming majority of which intersected deletion breakpoints at their poly-A tails. We found no enrichment of LINE elements or segmental duplications, in contrast to other reports. Sequence analysis revealed enrichment of a conserved motif in the sequences surrounding the deletion breakpoints, although whether this motif has any mechanistic role in the formation of some deletions has yet to be determined. Considered together with existing information on more complex inherited variant regions, and reports of de novo variants associated with autism, these data support the presence of different subgroups of CNV in the genome which may have originated through different mechanisms. PMID:18769679
Silent Tyrosinemia Type I Without Elevated Tyrosine or Succinylacetone Associated with Liver Cirrhosis and Hepatocellular Carcinoma.

PubMed

Blackburn, Patrick R; Hickey, Raymond D; Nace, Rebecca A; Giama, Nasra H; Kraft, Daniel L; Bordner, Andrew J; Chaiteerakij, Roongruedee; McCormick, Jennifer B; Radulovic, Maja; Graham, Rondell P; Torbenson, Michael S; Tortorelli, Silvia; Scott, C Ronald; Lindor, Noralane M; Milliner, Dawn S; Oglesbee, Devin; Al-Qabandi, Wafa'a; Grompe, Markus; Gavrilov, Dimitar K; El-Youssef, Mounif; Clark, Karl J; Atwal, Paldeep S; Roberts, Lewis R; Klee, Eric W; Ekker, Stephen C

2016-10-01

Tyrosinemia type I (TYRSN1, TYR I) is caused by fumarylacetoacetate hydrolase (FAH) deficiency and affects approximately one in 100,000 individuals worldwide. Pathogenic variants in FAH cause TYRSN1, which induces cirrhosis and can progress to hepatocellular carcinoma (HCC). TYRSN1 is characterized by the production of a pathognomonic metabolite, succinylacetone (SUAC) and is included in the Recommended Uniform Screening Panel for newborns. Treatment intervention is effective if initiated within the first month of life. Here, we describe a family with three affected children who developed HCC secondary to idiopathic hepatosplenomegaly and cirrhosis during infancy. Whole exome sequencing revealed a novel homozygous missense variant in FAH (Chr15(GRCh38):g.80162305A>G; NM_000137.2:c.424A>G; NP_000128.1:p.R142G). This novel variant involves the catalytic pocket of the enzyme, but does not result in increased SUAC or tyrosine, making the diagnosis of TYRSN1 problematic. Testing this novel variant using a rapid, in vivo somatic mouse model showed that this variant could not rescue FAH deficiency. In this case of atypical TYRSN1, we show how reliance on SUAC as a primary diagnostic test can be misleading in some patients with this disease. Augmentation of current screening for TYRSN1 with targeted sequencing of FAH is warranted in cases suggestive of the disorder. © 2016 The Authors. Human Mutation published by Wiley Periodicals, Inc.
Whole-genome sequencing reveals a potential causal mutation for dwarfism in the Miniature Shetland pony.

PubMed

Metzger, Julia; Gast, Alana Christina; Schrimpf, Rahel; Rau, Janina; Eikelberg, Deborah; Beineke, Andreas; Hellige, Maren; Distl, Ottmar

2017-04-01

The Miniature Shetland pony represents a horse breed with an extremely small body size. Clinical examination of a dwarf Miniature Shetland pony revealed a lowered size at the withers, malformed skull and brachygnathia superior. Computed tomography (CT) showed a shortened maxilla and a cleft of the hard and soft palate which protruded into the nasal passage leading to breathing difficulties. Pathological examination confirmed these findings but did not reveal histopathological signs of premature ossification in limbs or cranial sutures. Whole-genome sequencing of this dwarf Miniature Shetland pony and comparative sequence analysis using 26 reference equids from NCBI Sequence Read Archive revealed three probably damaging missense variants which could be exclusively found in the affected foal. Validation of these three missense mutations in 159 control horses from different horse breeds and five donkeys revealed only the aggrecan (ACAN)-associated g.94370258G>C variant as homozygous wild-type in all control samples. The dwarf Miniature Shetland pony had the homozygous mutant genotype C/C of the ACAN:g.94370258G>C variant and the normal parents were heterozygous G/C. An unaffected full sib and 3/5 unaffected half-sibs were heterozygous G/C for the ACAN:g.94370258G>C variant. In summary, we could demonstrate a dwarf phenotype in a miniature pony breed perfectly associated with a missense mutation within the ACAN gene.
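The segregation pattern reported above (affected foal C/C, unaffected parents and several sibs G/C, controls G/G) is what a fully penetrant recessive variant would produce. A minimal sketch of that consistency check in Python follows; the function name fits_recessive is hypothetical, and the G/G genotype given to one unaffected half-sib is an assumption, because the abstract does not state the genotypes of the non-heterozygous half-sibs.

```python
def fits_recessive(samples, alt="C"):
    """Return True if every affected animal is homozygous for `alt`
    and no unaffected animal is.

    `samples` is an iterable of (genotype, affected) pairs, with genotypes
    written as 'G/C'. Full penetrance is assumed.
    """
    for genotype, affected in samples:
        hom_alt = set(genotype.split("/")) == {alt}
        if affected != hom_alt:
            return False
    return True


if __name__ == "__main__":
    pedigree = [
        ("C/C", True),   # affected foal
        ("G/C", False),  # sire
        ("G/C", False),  # dam
        ("G/C", False),  # unaffected full sib
        ("G/G", False),  # unaffected half-sib (genotype assumed)
    ]
    print(fits_recessive(pedigree))
```

A check like this only tests consistency with recessive inheritance in the genotyped animals; it does not by itself establish causality.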
Next generation sequencing gives an insight into the characteristics of highly selected breeds versus non-breed horses in the course of domestication.

PubMed

Metzger, Julia; Tonda, Raul; Beltran, Sergi; Agueda, Lídia; Gut, Marta; Distl, Ottmar

2014-07-04

Domestication has shaped the horse and led to a group of many different types. Some have been under strong human selection while others developed in close relationship with nature. The aim of our study was to perform next generation sequencing of breed and non-breed horses to provide an insight into genetic influences on selective forces. Whole genome sequencing of five horses of four different populations revealed 10,193,421 single nucleotide polymorphisms (SNPs) and 1,361,948 insertion/deletion polymorphisms (indels). In comparison to horse variant databases and previous reports, we were able to identify 3,394,883 novel SNPs and 868,525 novel indels. We analyzed the distribution of individual variants and found significant enrichment of private mutations in coding regions of genes involved in primary metabolic processes, anatomical structures, morphogenesis and cellular components in non-breed horses and in contrast to that private mutations in genes affecting cell communication, lipid metabolic process, neurological system process, muscle contraction, ion transport, developmental processes of the nervous system and ectoderm in breed horses. Our next generation sequencing data constitute an important first step for the characterization of non-breed in comparison to breed horses and provide a large number of novel variants for future analyses. Functional annotations suggest specific variants that could play a role for the characterization of breed or non-breed horses.
Identification and description of three families with familial Alzheimer disease that segregate variants in the SORL1 gene.

PubMed

Thonberg, Håkan; Chiang, Huei-Hsin; Lilius, Lena; Forsell, Charlotte; Lindström, Anna-Karin; Johansson, Charlotte; Björkström, Jenny; Thordardottir, Steinunn; Sleegers, Kristel; Van Broeckhoven, Christine; Rönnbäck, Annica; Graff, Caroline

2017-06-09

Alzheimer disease (AD) is a progressive neurodegenerative disorder and the most common form of dementia. The majority of AD cases are sporadic, while up to 5% are families with an early onset AD (EOAD). Mutations in one of the three genes: amyloid beta precursor protein (APP), presenilin 1 (PSEN1) or presenilin 2 (PSEN2) can be disease causing. However, most EOAD families do not carry mutations in any of these three genes, and candidate genes, such as the sortilin-related receptor 1 (SORL1), have been suggested to be potentially causative. To identify AD causative variants, we performed whole-exome sequencing on five individuals from a family with EOAD and a missense variant, p.Arg1303Cys (c.3907C>T), was identified in SORL1 which segregated with disease and was further characterized with immunohistochemistry on two post mortem autopsy cases from the same family. In a targeted re-sequencing effort on independent index patients from 35 EOAD-families, a second SORL1 variant, c.3050-2A>G, was found which segregated with the disease in 3 affected and was absent in one unaffected family member. The c.3050-2A>G variant is located two nucleotides upstream of exon 22 and was shown to cause exon 22 skipping, resulting in a deletion of amino acids Gly1017-Glu1074 of SORL1. Furthermore, a third SORL1 variant, c.5195G>C, recently identified in a Swedish case control cohort included in the European Early-Onset Dementia (EU EOD) consortium study, was detected in two affected siblings in a third family with familial EOAD. The finding of three SORL1-variants that segregate with disease in three separate families with EOAD supports the involvement of SORL1 in AD pathology. The cause of these rare monogenic forms of EOAD has proven difficult to find and the use of exome and genome sequencing may be a successful route to target them.
Whole-Genome Sequencing Suggests Schizophrenia Risk Mechanisms in Humans with 22q11.2 Deletion Syndrome.

PubMed

Merico, Daniele; Zarrei, Mehdi; Costain, Gregory; Ogura, Lucas; Alipanahi, Babak; Gazzellone, Matthew J; Butcher, Nancy J; Thiruvahindrapuram, Bhooma; Nalpathamkalam, Thomas; Chow, Eva W C; Andrade, Danielle M; Frey, Brendan J; Marshall, Christian R; Scherer, Stephen W; Bassett, Anne S

2015-09-16

Chromosome 22q11.2 microdeletions impart a high but incomplete risk for schizophrenia. Possible mechanisms include genome-wide effects of DGCR8 haploinsufficiency. In a proof-of-principle study to assess the power of this model, we used high-quality, whole-genome sequencing of nine individuals with 22q11.2 deletions and extreme phenotypes (schizophrenia, or no psychotic disorder at age >50 years). The schizophrenia group had a greater burden of rare, damaging variants impacting protein-coding neurofunctional genes, including genes involved in neuron projection (nominal P = 0.02, joint burden of three variant types). Variants in the intact 22q11.2 region were not major contributors. Restricting to genes affected by a DGCR8 mechanism tended to amplify between-group differences. Damaging variants in highly conserved long intergenic noncoding RNA genes also were enriched in the schizophrenia group (nominal P = 0.04). The findings support the 22q11.2 deletion model as a threshold-lowering first hit for schizophrenia risk. If applied to a larger and thus better-powered cohort, this appears to be a promising approach to identify genome-wide rare variants in coding and noncoding sequence that perturb gene networks relevant to idiopathic schizophrenia. Similarly designed studies exploiting genetic models may prove useful to help delineate the genetic architecture of other complex phenotypes. Copyright © 2015 Merico et al.
Whole-Genome Sequencing Suggests Schizophrenia Risk Mechanisms in Humans with 22q11.2 Deletion Syndrome

PubMed Central

Merico, Daniele; Zarrei, Mehdi; Costain, Gregory; Ogura, Lucas; Alipanahi, Babak; Gazzellone, Matthew J.; Butcher, Nancy J.; Thiruvahindrapuram, Bhooma; Nalpathamkalam, Thomas; Chow, Eva W. C.; Andrade, Danielle M.; Frey, Brendan J.; Marshall, Christian R.; Scherer, Stephen W.; Bassett, Anne S.

2015-01-01

Chromosome 22q11.2 microdeletions impart a high but incomplete risk for schizophrenia. Possible mechanisms include genome-wide effects of DGCR8 haploinsufficiency. In a proof-of-principle study to assess the power of this model, we used high-quality, whole-genome sequencing of nine individuals with 22q11.2 deletions and extreme phenotypes (schizophrenia, or no psychotic disorder at age >50 years). The schizophrenia group had a greater burden of rare, damaging variants impacting protein-coding neurofunctional genes, including genes involved in neuron projection (nominal P = 0.02, joint burden of three variant types). Variants in the intact 22q11.2 region were not major contributors. Restricting to genes affected by a DGCR8 mechanism tended to amplify between-group differences. Damaging variants in highly conserved long intergenic noncoding RNA genes also were enriched in the schizophrenia group (nominal P = 0.04). The findings support the 22q11.2 deletion model as a threshold-lowering first hit for schizophrenia risk. If applied to a larger and thus better-powered cohort, this appears to be a promising approach to identify genome-wide rare variants in coding and noncoding sequence that perturb gene networks relevant to idiopathic schizophrenia. Similarly designed studies exploiting genetic models may prove useful to help delineate the genetic architecture of other complex phenotypes. PMID:26384369
Genetic Analyses of the NF1 Gene in Turkish Neurofibromatosis Type I Patients and Definition of three Novel Variants

PubMed Central

Ulusal, SD; Gürkan, H; Atlı, E; Özal, SA; Çiftdemir, M; Tozkır, H; Karal, Y; Güçlü, H; Eker, D; Görker, I

2017-01-01

Neurofibromatosis Type I (NF1) is a multisystemic autosomal dominant neurocutaneous disorder predisposing patients to have benign and/or malignant lesions predominantly of the skin, nervous system and bone. Loss of function mutations or deletions of the NF1 gene are responsible for NF1 disease. Involvement of various pathogenic variants, the size of the gene and presence of pseudogenes make it difficult to analyze. We aimed to report the results of 2 years of multiplex ligation-dependent probe amplification (MLPA) and next generation sequencing (NGS) for genetic diagnosis of NF1 applied at our genetic diagnosis center. The MLPA, semiconductor sequencing and Sanger sequencing were performed in genomic DNA samples from 24 unrelated patients and their affected family members referred to our center suspected of having NF1. In total, three novel and 12 known pathogenic variants and a whole gene deletion were determined. We suggest that next generation sequencing is a practical tool for genetic analysis of NF1. Deletion/duplication analysis with MLPA may also be helpful for patients who are clinically diagnosed with NF1 but do not have a detectable mutation in NGS. PMID:28924536
Genetic basis of arrhythmogenic cardiomyopathy.

PubMed

Karmouch, Jennifer; Protonotarios, Alexandros; Syrris, Petros

2018-05-01

To date 16 genes have been associated with arrhythmogenic cardiomyopathy (ACM). Mutations in these genes can lead to a broad spectrum of phenotypic expression ranging from disease affecting predominantly the right or left ventricle, to biventricular subtypes. Understanding the genetic causes of ACM is important in diagnosis and management of the disorder. This review summarizes recent advances in molecular genetics and discusses the application of next-generation sequencing technology in genetic testing in ACM. Use of next-generation sequencing methods has resulted in the identification of novel causative variants and genes for ACM. The involvement of filamin C in ACM demonstrates the genetic overlap between ACM and other types of cardiomyopathy. Putative pathogenic variants have been detected in cadherin 2 gene, a protein involved in cell adhesion. Large genomic rearrangements in desmosome genes have been systematically investigated in a cohort of ACM patients. Recent studies have identified novel causes of ACM providing new insights into the genetic spectrum of the disease and highlighting an overlapping phenotype between ACM and dilated cardiomyopathy. Next-generation sequencing is a useful tool for research and genetic diagnostic screening but interpretation of identified sequence variants requires caution and should be performed in specialized centres.
Breeding and Genetics Symposium: networks and pathways to guide genomic selection.

PubMed

Snelling, W M; Cushman, R A; Keele, J W; Maltecca, C; Thomas, M G; Fortes, M R S; Reverter, A

2013-02-01

Many traits affecting profitability and sustainability of meat, milk, and fiber production are polygenic, with no single gene having an overwhelming influence on observed variation. No knowledge of the specific genes controlling these traits has been needed to make substantial improvement through selection. Significant gains have been made through phenotypic selection enhanced by pedigree relationships and continually improving statistical methodology. Genomic selection, recently enabled by assays for dense SNP located throughout the genome, promises to increase selection accuracy and accelerate genetic improvement by emphasizing the SNP most strongly correlated to phenotype although the genes and sequence variants affecting phenotype remain largely unknown. These genomic predictions theoretically rely on linkage disequilibrium (LD) between genotyped SNP and unknown functional variants, but familial linkage may increase effectiveness when predicting individuals related to those in the training data. Genomic selection with functional SNP genotypes should be less reliant on LD patterns shared by training and target populations, possibly allowing robust prediction across unrelated populations. Although the specific variants causing polygenic variation may never be known with certainty, a number of tools and resources can be used to identify those most likely to affect phenotype. Associations of dense SNP genotypes with phenotype provide a 1-dimensional approach for identifying genes affecting specific traits; in contrast, associations with multiple traits allow defining networks of genes interacting to affect correlated traits. Such networks are especially compelling when corroborated by existing functional annotation and established molecular pathways. The SNP occurring within network genes, obtained from public databases or derived from genome and transcriptome sequences, may be classified according to expected effects on gene products. 
As illustrated by functionally informed genomic predictions being more accurate than naive whole-genome predictions of beef tenderness, coupling evidence from livestock genotypes, phenotypes, gene expression, and genomic variants with existing knowledge of gene functions and interactions may provide greater insight into the genes and genomic mechanisms affecting polygenic traits and facilitate functional genomic selection for economically important traits.
A de novo variant in the ASPRV1 gene in a dog with ichthyosis.

PubMed

Bauer, Anina; Waluk, Dominik P; Galichet, Arnaud; Timm, Katrin; Jagannathan, Vidhya; Sayar, Beyza S; Wiener, Dominique J; Dietschi, Elisabeth; Müller, Eliane J; Roosje, Petra; Welle, Monika M; Leeb, Tosso

2017-03-01

Ichthyoses are a heterogeneous group of inherited cornification disorders characterized by generalized dry skin, scaling and/or hyperkeratosis. Ichthyosis vulgaris is the most common form of ichthyosis in humans and caused by genetic variants in the FLG gene encoding filaggrin. Filaggrin is a key player in the formation of the stratum corneum, the uppermost layer of the epidermis and therefore crucial for barrier function. During terminal differentiation of keratinocytes, the precursor profilaggrin is cleaved by several proteases into filaggrin monomers and eventually processed into free amino acids contributing to the hydration of the cornified layer. We studied a German Shepherd dog with a novel form of ichthyosis. Comparing the genome sequence of the affected dog with 288 genomes from genetically diverse non-affected dogs we identified a private heterozygous variant in the ASPRV1 gene encoding "aspartic peptidase, retroviral-like 1", which is also known as skin aspartic protease (SASPase). The variant was absent in both parents and therefore due to a de novo mutation event. It was a missense variant, c.1052T>C, affecting a conserved residue close to an autoprocessing cleavage site, p.(Leu351Pro). ASPRV1 encodes a retroviral-like protease involved in profilaggrin-to-filaggrin processing. By immunofluorescence staining we showed that the filaggrin expression pattern was altered in the affected dog. Thus, our findings provide strong evidence that the identified de novo variant is causative for the ichthyosis in the affected dog and that ASPRV1 plays an essential role in skin barrier formation. ASPRV1 is thus a novel candidate gene for unexplained human forms of ichthyoses.
Comparison and evaluation of two exome capture kits and sequencing platforms for variant calling.

PubMed

Zhang, Guoqiang; Wang, Jianfeng; Yang, Jin; Li, Wenjie; Deng, Yutian; Li, Jing; Huang, Jun; Hu, Songnian; Zhang, Bing

2015-08-05

To promote the clinical application of next-generation sequencing, it is important to obtain accurate and consistent variants of target genomic regions at low cost. Ion Proton, the latest updated semiconductor-based sequencing instrument from Life Technologies, is designed to provide investigators with an inexpensive platform for human whole exome sequencing that achieves a rapid turnaround time. However, few studies have comprehensively compared and evaluated the accuracy of variant calling between Ion Proton and Illumina sequencing platforms such as HiSeq 2000, which is the most popular sequencing platform for the human genome. The Ion Proton sequencer combined with the Ion TargetSeq Exome Enrichment Kit makes up TargetSeq-Proton, whereas SureSelect-HiSeq is based on the Agilent SureSelect Human All Exon v4 Kit and the HiSeq 2000 sequencer. Here, we sequenced exonic DNA from four human blood samples using both TargetSeq-Proton and SureSelect-HiSeq. We then called variants in the exonic regions that overlapped between the two exome capture kits (33.6 Mb). The rates of shared variant loci called by the two sequencing platforms ranged from 68.0 to 75.3% in the four samples, whereas the concordance of co-detected variant loci reached 99%. Sanger sequencing validation revealed that the validation rate of concordant single nucleotide polymorphisms (SNPs) (91.5%) was higher than that of SNPs specific to TargetSeq-Proton (60.0%) or specific to SureSelect-HiSeq (88.3%). With regard to 1-bp small insertions and deletions (InDels), the Sanger-validated rates of concordant variants (100.0%) and SureSelect-HiSeq-specific variants (89.6%) were higher than that of TargetSeq-Proton-specific variants (15.8%). In the sequencing of exonic regions, combining the two sequencing strategies (SureSelect-HiSeq and TargetSeq-Proton) increased the variant calling specificity for concordant variant loci and the sensitivity for variant loci called by any one platform. However, for platform-specific variants, the accuracy of variant calling by HiSeq 2000 was higher than that of Ion Proton, specifically for InDel detection. Moreover, the variant calling software also influences the detection of SNPs and, specifically, InDels in Ion Proton exome sequencing.
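The two quantities compared in this abstract, the share of variant loci called by both platforms and the genotype concordance at co-detected loci, can be sketched as below. This is a minimal illustration on toy data: the abstract does not state the denominator of the shared rate, so the union of both call sets is assumed here, and the function name and loci are hypothetical.

```python
def compare_calls(a: dict, b: dict) -> dict:
    """Compare two call sets given as {(chrom, pos): genotype} dictionaries."""
    shared = a.keys() & b.keys()   # loci called by both platforms
    union = a.keys() | b.keys()    # loci called by at least one platform
    same = sum(1 for locus in shared if a[locus] == b[locus])
    return {
        # Assumption: shared rate is taken relative to the union of both call sets.
        "shared_fraction": len(shared) / len(union) if union else 0.0,
        # Fraction of co-detected loci where both platforms report the same genotype.
        "genotype_concordance": same / len(shared) if shared else 0.0,
        "only_a": len(a.keys() - b.keys()),
        "only_b": len(b.keys() - a.keys()),
    }

# Toy call sets (hypothetical loci, not data from the study).
proton = {("chr1", 100): "0/1", ("chr1", 200): "1/1", ("chr2", 50): "0/1"}
hiseq = {("chr1", 100): "0/1", ("chr1", 200): "0/1", ("chr3", 10): "1/1"}

result = compare_calls(proton, hiseq)
```

Platform-specific loci (`only_a`, `only_b`) are the ones for which the abstract reports the largest differences in Sanger validation rates.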

Comprehensive Rare Variant Analysis via Whole-Genome Sequencing to Determine the Molecular Pathology of Inherited Retinal Disease.

PubMed

Carss, Keren J; Arno, Gavin; Erwood, Marie; Stephens, Jonathan; Sanchis-Juan, Alba; Hull, Sarah; Megy, Karyn; Grozeva, Detelina; Dewhurst, Eleanor; Malka, Samantha; Plagnol, Vincent; Penkett, Christopher; Stirrups, Kathleen; Rizzo, Roberta; Wright, Genevieve; Josifova, Dragana; Bitner-Glindzicz, Maria; Scott, Richard H; Clement, Emma; Allen, Louise; Armstrong, Ruth; Brady, Angela F; Carmichael, Jenny; Chitre, Manali; Henderson, Robert H H; Hurst, Jane; MacLaren, Robert E; Murphy, Elaine; Paterson, Joan; Rosser, Elisabeth; Thompson, Dorothy A; Wakeling, Emma; Ouwehand, Willem H; Michaelides, Michel; Moore, Anthony T; Webster, Andrew R; Raymond, F Lucy

2017-01-05

Inherited retinal disease is a common cause of visual impairment and represents a highly heterogeneous group of conditions. Here, we present findings from a cohort of 722 individuals with inherited retinal disease, who have had whole-genome sequencing (n = 605), whole-exome sequencing (n = 72), or both (n = 45) performed, as part of the NIHR-BioResource Rare Diseases research study. We identified pathogenic variants (single-nucleotide variants, indels, or structural variants) for 404/722 (56%) individuals. Whole-genome sequencing gives unprecedented power to detect three categories of pathogenic variants in particular: structural variants, variants in GC-rich regions, which have significantly improved coverage compared to whole-exome sequencing, and variants in non-coding regulatory regions. In addition to previously reported pathogenic regulatory variants, we have identified a previously unreported pathogenic intronic variant in CHM in two males with choroideremia. We have also identified 19 genes not previously known to be associated with inherited retinal disease, which harbor biallelic predicted protein-truncating variants in unsolved cases. Whole-genome sequencing is an increasingly important comprehensive method with which to investigate the genetic causes of inherited retinal disease. Copyright © 2017. Published by Elsevier Inc.
Identification of a Latin American-specific BabA adhesin variant through whole genome sequencing of Helicobacter pylori patient isolates from Nicaragua

DOE PAGES

Thorell, Kaisa; Hosseini, Shaghayegh; Palacios Gonzales, Reyna Victoria Palacios; ...

2016-02-29

Helicobacter pylori (H. pylori) infection is one of the most common bacterial infections in humans and can lead to gastric ulcers and gastric cancer. H. pylori is one of the most genetically variable human pathogens, and the ability of the bacterium to bind to the host epithelium, as well as the presence of different virulence factors and genetic variants within these genes, have been associated with disease severity. Nicaragua has particularly high gastric cancer incidence and we therefore studied Nicaraguan clinical H. pylori isolates for factors that could contribute to cancer risk. The complete genomes of fifty-two Nicaraguan H. pylori isolates were sequenced and assembled de novo, and phylogenetic and virulence factor analyses were performed. The Nicaraguan isolates showed phylogenetic relationship with West African isolates in whole-genome sequence comparisons and with Western and urban South and Central American isolates using MLSA (multi-locus sequence analysis). A majority (77%) of the isolates carried the cancer-associated virulence gene cagA and also the s1/i1/m1 allele combination of the vacuolating cytotoxin gene vacA, which is linked to increased severity of disease. Specifically, we also found that Nicaraguan isolates have a blood group-binding adhesin (BabA) variant highly similar to previously reported BabA sequences from Latin America, including from isolates belonging to other phylogenetic groups. These BabA sequences were found to be under positive selection at several amino acid positions that differed from the global collection of isolates. In conclusion, the discovery of a Latin American BabA variant, independent of overall phylogenetic background, suggests hitherto unknown host or environmental factors within the Latin American population giving H. pylori isolates carrying this adhesin variant a selective advantage, which could affect pathogenesis and risk for sequelae through specific adherence properties.
Whole-exome sequencing, without prior linkage, identifies a mutation in LAMB3 as a cause of dominant hypoplastic amelogenesis imperfecta.

PubMed

Poulter, James A; El-Sayed, Walid; Shore, Roger C; Kirkham, Jennifer; Inglehearn, Chris F; Mighell, Alan J

2014-01-01

The conventional approach to identifying the defective gene in a family with an inherited disease is to find the disease locus through family studies. However, the rapid development and decreasing cost of next generation sequencing facilitates a more direct approach. Here, we report the identification of a frameshift mutation in LAMB3 as a cause of dominant hypoplastic amelogenesis imperfecta (AI). Whole-exome sequencing of three affected family members and subsequent filtering of shared variants, without prior genetic linkage, sufficed to identify the pathogenic variant. Simultaneous analysis of multiple family members confirms segregation, enhancing the power to filter the genetic variation found and leading to rapid identification of the pathogenic variant. LAMB3 encodes a subunit of Laminin-5, one of a family of basement membrane proteins with essential functions in cell growth, movement and adhesion. Homozygous LAMB3 mutations cause junctional epidermolysis bullosa (JEB) and enamel defects are seen in JEB cases. However, to our knowledge, this is the first report of dominant AI due to a LAMB3 mutation in the absence of JEB.
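The filtering step described above, keeping only variants shared by all sequenced affected relatives, amounts to a set intersection followed by a rarity filter. The sketch below illustrates that idea only: the variant identifiers, the allele-frequency table and the 1% cutoff are all hypothetical, and the abstract does not specify the filters actually applied.

```python
def shared_rare_variants(calls_by_person: dict, population_af: dict, max_af: float = 0.01) -> set:
    """Variants carried by every affected relative and rare in a reference population."""
    # Intersect the per-person variant sets: a dominant causal variant
    # is expected in all affected family members.
    shared = set.intersection(*(set(v) for v in calls_by_person.values()))
    # Assumption: variants missing from the frequency table are treated as novel (AF = 0).
    return {v for v in shared if population_af.get(v, 0.0) <= max_af}

# Toy data with made-up variant identifiers.
calls = {
    "affected_1": {"chr1:1000:A>AT", "chr2:500:G>A", "chr3:42:C>T"},
    "affected_2": {"chr1:1000:A>AT", "chr2:500:G>A"},
    "affected_3": {"chr1:1000:A>AT", "chr2:500:G>A", "chr4:7:T>C"},
}
allele_freq = {"chr2:500:G>A": 0.23}  # common polymorphism, filtered out

candidates = shared_rare_variants(calls, allele_freq)
```

Each additional affected relative shrinks the intersection, which is how sequencing several family members increases filtering power without prior linkage data.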
Genetic analyses of bone morphogenetic protein 2, 4 and 7 in congenital combined pituitary hormone deficiency.

PubMed

Breitfeld, Jana; Martens, Susanne; Klammt, Jürgen; Schlicke, Marina; Pfäffle, Roland; Krause, Kerstin; Weidle, Kerstin; Schleinitz, Dorit; Stumvoll, Michael; Führer, Dagmar; Kovacs, Peter; Tönjes, Anke

2013-12-01

The complex process of development of the pituitary gland is regulated by a number of signalling molecules and transcription factors. Mutations in these factors have been identified in rare cases of congenital hypopituitarism but for most subjects with combined pituitary hormone deficiency (CPHD) genetic causes are unknown. Bone morphogenetic proteins (BMPs) affect induction and growth of the pituitary primordium and thus represent plausible candidates for mutational screening of patients with CPHD. We sequenced BMP2, 4 and 7 in 19 subjects with CPHD. For validation purposes, novel genetic variants were genotyped in 1046 healthy subjects. Additionally, potential functional relevance for most promising variants has been assessed by phylogenetic analyses and prediction of effects on protein structure. Sequencing revealed two novel variants and confirmed 30 previously known polymorphisms and mutations in BMP2, 4 and 7. Although phylogenetic analyses indicated that these variants map within strongly conserved gene regions, there was no direct support for their impact on protein structure when applying predictive bioinformatics tools. A mutation in the BMP4 coding region resulting in an amino acid exchange (p.Arg300Pro) appeared most interesting among the identified variants. Further functional analyses are required to ultimately map the relevance of these novel variants in CPHD.
Genetic analyses of bone morphogenetic protein 2, 4 and 7 in congenital combined pituitary hormone deficiency

PubMed Central

2013-01-01

Background: The complex process of development of the pituitary gland is regulated by a number of signalling molecules and transcription factors. Mutations in these factors have been identified in rare cases of congenital hypopituitarism but for most subjects with combined pituitary hormone deficiency (CPHD) genetic causes are unknown. Bone morphogenetic proteins (BMPs) affect induction and growth of the pituitary primordium and thus represent plausible candidates for mutational screening of patients with CPHD. Methods: We sequenced BMP2, 4 and 7 in 19 subjects with CPHD. For validation purposes, novel genetic variants were genotyped in 1046 healthy subjects. Additionally, potential functional relevance for most promising variants has been assessed by phylogenetic analyses and prediction of effects on protein structure. Results: Sequencing revealed two novel variants and confirmed 30 previously known polymorphisms and mutations in BMP2, 4 and 7. Although phylogenetic analyses indicated that these variants map within strongly conserved gene regions, there was no direct support for their impact on protein structure when applying predictive bioinformatics tools. Conclusions: A mutation in the BMP4 coding region resulting in an amino acid exchange (p.Arg300Pro) appeared most interesting among the identified variants. Further functional analyses are required to ultimately map the relevance of these novel variants in CPHD. PMID:24289245
Exome analysis of a family with Wolff-Parkinson-White syndrome identifies a novel disease locus.

PubMed

Bowles, Neil E; Jou, Chuanchau J; Arrington, Cammon B; Kennedy, Brett J; Earl, Aubree; Matsunami, Norisada; Meyers, Lindsay L; Etheridge, Susan P; Saarel, Elizabeth V; Bleyl, Steven B; Yost, H Joseph; Yandell, Mark; Leppert, Mark F; Tristani-Firouzi, Martin; Gruber, Peter J

2015-12-01

Wolff-Parkinson-White (WPW) syndrome is a common cause of supraventricular tachycardia that carries a risk of sudden cardiac death. To date, mutations in only one gene, PRKAG2, which encodes the 5'-AMP-activated protein kinase subunit γ-2, have been identified as causative for WPW. DNA samples from five members of a family with WPW were analyzed by exome sequencing. We applied recently designed prioritization strategies (VAAST/pedigree VAAST) coupled with an ontology-based algorithm (Phevor) that reduced the number of potentially damaging variants to 10; a variant in KCNE2 previously associated with Long QT syndrome was also identified. Of these 11 variants, only MYH6 p.E1885K segregated with the WPW phenotype in all affected individuals and was absent in 10 unaffected family members. This variant was predicted to be damaging by in silico methods and is not present in the 1000 Genomes and NHLBI Exome Sequencing Project databases. Screening of a replication cohort of 47 unrelated WPW patients did not identify other likely causative variants in PRKAG2 or MYH6. MYH6 variants have been identified in patients with atrial septal defects, cardiomyopathies, and sick sinus syndrome. Our data highlight the pleiotropic nature of phenotypes associated with defects in this gene. © 2015 Wiley Periodicals, Inc.
Exome Analysis of a Family with Wolff–Parkinson–White Syndrome Identifies a Novel Disease Locus

PubMed Central

Bowles, Neil E.; Jou, Chuanchau J.; Arrington, Cammon B.; Kennedy, Brett J.; Earl, Aubree; Matsunami, Norisada; Meyers, Lindsay L.; Etheridge, Susan P.; Saarel, Elizabeth V.; Bleyl, Steven B.; Yost, H. Joseph; Yandell, Mark; Leppert, Mark F.; Tristani-Firouzi, Martin; Gruber, Peter J.

2016-01-01

Wolff–Parkinson–White (WPW) syndrome is a common cause of supraventricular tachycardia that carries a risk of sudden cardiac death. To date, mutations in only one gene, PRKAG2, which encodes the 5'-AMP-activated protein kinase subunit γ-2, have been identified as causative for WPW. DNA samples from five members of a family with WPW were analyzed by exome sequencing. We applied recently designed prioritization strategies (VAAST/pedigree VAAST) coupled with an ontology-based algorithm (Phevor) that reduced the number of potentially damaging variants to 10; a variant in KCNE2 previously associated with Long QT syndrome was also identified. Of these 11 variants, only MYH6 p.E1885K segregated with the WPW phenotype in all affected individuals and was absent in 10 unaffected family members. This variant was predicted to be damaging by in silico methods and is not present in the 1000 Genomes and NHLBI Exome Sequencing Project databases. Screening of a replication cohort of 47 unrelated WPW patients did not identify other likely causative variants in PRKAG2 or MYH6. MYH6 variants have been identified in patients with atrial septal defects, cardiomyopathies, and sick sinus syndrome. Our data highlight the pleiotropic nature of phenotypes associated with defects in this gene. PMID:26284702
Isolation and molecular characterization of newly emerging avian reovirus variants and novel strains in Pennsylvania, USA, 2011-2014.

PubMed

Lu, Huaguang; Tang, Yi; Dunn, Patricia A; Wallner-Pendleton, Eva A; Lin, Lin; Knoll, Eric A

2015-10-15

Avian reovirus (ARV) infections of broiler and turkey flocks have caused significant clinical disease and economic losses in Pennsylvania (PA) since 2011. Most of the ARV-infected birds suffered from severe arthritis, tenosynovitis, pericarditis and depressed growth or runting-stunting syndrome (RSS). A high morbidity (up to 20% to 40%) was observed in ARV-affected flocks, and the flock mortality was occasionally as high as 10%. ARV infections in turkeys were diagnosed for the first time in PA in 2011. From 2011 to 2014, a total of 301 ARV isolations were made from affected PA poultry. The molecular characterization of the Sigma C gene of 114 field isolates, representing most ARV outbreaks, revealed that only 21.93% of the 114 sequenced ARV isolates were in the same genotyping cluster (cluster 1) as the ARV vaccine strains (S1133, 1733, and 2048), whereas 78.07% of the sequenced isolates were in genotyping clusters 2, 3, 4, 5, and 6 (which were distinct from the vaccine strains) and represented newly emerging ARV variants. In particular, genotyping cluster 6 was a new ARV genotype that was identified for the first time in 10 novel PA ARV variants of field isolates.
regSNPs-splicing: a tool for prioritizing synonymous single-nucleotide substitution.

PubMed

Zhang, Xinjun; Li, Meng; Lin, Hai; Rao, Xi; Feng, Weixing; Yang, Yuedong; Mort, Matthew; Cooper, David N; Wang, Yue; Wang, Yadong; Wells, Clark; Zhou, Yaoqi; Liu, Yunlong

2017-09-01

While synonymous single-nucleotide variants (sSNVs) have largely been unstudied, since they do not alter protein sequence, mounting evidence suggests that they may affect RNA conformation, splicing, and the stability of nascent mRNAs to promote various diseases. Accurately prioritizing deleterious sSNVs from a pool of neutral ones can significantly improve our ability to select functional genetic variants identified from various genome-sequencing projects and, therefore, advance our understanding of disease etiology. In this study, we develop a computational algorithm to prioritize sSNVs based on their impact on mRNA splicing and protein function. In addition to genomic features that potentially affect splicing regulation, our proposed algorithm also includes dozens of structural features that characterize the effects of alternatively spliced exons on protein function. Our systematic evaluation of thousands of sSNVs suggests that several structural features, including intrinsic disorder protein scores, solvent accessible surface areas, protein secondary structures, and known and predicted protein family domains, show significant differences between disease-causing and neutral sSNVs. Our results suggest that the protein structure features offer an added dimension of information for distinguishing disease-causing from neutral synonymous variants. The inclusion of structural features increases the predictive accuracy for functional sSNV prioritization.
Functional assessment of a novel COL4A5 splice region variant and immunostaining of plucked hair follicles as an alternative method of diagnosis in X-linked Alport syndrome

PubMed Central

Malone, Andrew F.; Funk, Steven D.; Alhamad, Tarek; Miner, Jeffrey H.

2016-01-01

Introduction: Many COL4A5 splice region variants have been described in patients with X-linked Alport syndrome, but few have been confirmed by functional analysis to actually cause defective splicing. We sought to demonstrate that a novel COL4A5 splice region variant in a family with Alport syndrome is pathogenic using functional studies. We also describe an alternative method of diagnosis. Methods: We analyzed targeted next-generation sequencing results of an individual with Alport syndrome and confirmed results by Sanger sequencing in family members. A splicing reporter minigene assay was used to examine the variant’s effect on splicing in transfected cells. Plucked hair follicles from patients and controls were examined for collagen IV proteins using immunofluorescence microscopy. Results: A novel splice region mutation in COL4A5, c.1780-6T>G, was identified and segregated with disease in this family. This variant caused frequent skipping of exon 25, resulting in a frameshift and truncation of collagen α5(IV) protein. We also developed and validated a new approach to characterize the expression of collagen α5(IV) protein in the basement membranes of plucked hair follicles. We demonstrated reduced collagen α5(IV) protein in affected male and female individuals in this family, supporting frequent failure of normal splicing. Conclusions: Differing normal to abnormal transcript ratios in affected individuals carrying splice region variants may contribute to variable disease severity observed in Alport families. Examination of plucked hair follicles in suspected X-linked Alport syndrome patients may offer a less invasive alternative method of diagnosis and serve as a pathogenicity test for COL4A5 variants of uncertain significance. PMID:28013382
Novel homozygous missense variant of GRIN1 in two sibs with intellectual disability and autistic features without epilepsy.

PubMed

Rossi, Massimiliano; Chatron, Nicolas; Labalme, Audrey; Ville, Dorothée; Carneiro, Maryline; Edery, Patrick; des Portes, Vincent; Lemke, Johannes R; Sanlaville, Damien; Lesca, Gaetan

2017-02-01

We report on two sibs born to consanguineous parents and affected with severe intellectual disability and autistic features due to a homozygous missense variant of GRIN1. Massively parallel sequencing was performed using a gene panel including 450 genes related to intellectual disability and autism spectrum disorders. We found a homozygous missense variant of GRIN1 (c.679G>C; p.(Asp227His)) in the two affected sibs, which was inherited from both unaffected heterozygous parents. Heterozygous variants of GRIN1, encoding the GluN1 subunit of the NMDA receptor, have been reported in patients with neurodevelopmental disorders including epileptic encephalopathy, severe intellectual disability, and movement disorders. The p.(Asp227His) variant is located in the same amino-terminal protein domain as the recently published p.(Arg217Trp), which was found in the homozygous state in two patients with a similar phenotype of severe intellectual disability and autistic features but without epilepsy. In silico predictions were consistent with a deleterious effect. The present findings further expand the clinical spectrum of GRIN1 variants and support the existence of hypomorphic variants causing severe neurodevelopmental impairment with autosomal recessive inheritance.
A novel variant of aquaporin 3 is expressed in killifish (Fundulus heteroclitus) intestine

PubMed Central

Jung, Dawoon; Adamo, Meredith A.; Lehman, Rebecca M.; Barnaby, Roxanna; Jackson, Craig E.; Jackson, Brian P.; Shaw, Joseph R.; Stanton, Bruce A.

2015-01-01

Killifish (Fundulus heteroclitus) are euryhaline teleosts that are widely used in environmental and toxicological studies, and they are tolerant to arsenic, in part due to very low assimilation of arsenic from the environment. The mechanism of arsenic uptake by the intestine, a major route of arsenic uptake in humans, is unknown. Thus, the goal of this study was to determine if aquaglyceroporins (AQP), which transport water and other small molecules including arsenite across cell membranes, are expressed in the killifish intestine, and whether AQP expression is affected by osmotic stress. Through RT-PCR and sequence analysis of PCR amplicons, we demonstrated that the intestine expresses kfAQP3a and kfAQP3b, two previously identified variants, and also identified a novel variant of killifish AQP3 (kfAQP3c) in the intestine. The variants likely represent alternate splice forms. A BLAST search of the F. heteroclitus reference genome revealed that the AQP3 gene resides at a single locus, while an alignment of the AQP3 sequence among 384 individuals from eight populations ranging from Rhode Island to North Carolina revealed that its coding sequence was remarkably conserved, with no fixed polymorphism residing in the region that distinguishes these variants. We further demonstrate that the novel variant transports arsenite into HEK293T cells. Whereas kfAQP3a, which does not transport arsenite, was expressed in both freshwater (FW) and saltwater (SW) acclimated fish, kfAQP3b, an arsenic transporter, was expressed only in FW acclimated fish, and kfAQP3c was expressed only in SW acclimated fish. Thus, we have identified a novel, putative splice variant of kfAQP3, kfAQP3c, which transports arsenic and is expressed only in SW acclimated fish. PMID:25766383
Rare variant association analysis in case-parents studies by allowing for missing parental genotypes.

PubMed

Li, Yumei; Xiang, Yang; Xu, Chao; Shen, Hui; Deng, Hongwen

2018-01-15

The development of next-generation sequencing technologies has facilitated the identification of rare variants. Family-based design is commonly used to effectively control for population admixture and substructure, which is more prominent for rare variants. Case-parents studies, as typical strategies in family-based design, are widely used in rare variant-disease association analysis. Current methods in case-parents studies are based on complete case-parents data; however, parental genotypes may be missing in case-parents trios, and removing these data may lead to a loss in statistical power. The present study focuses on testing for rare variant-disease association in case-parents studies by allowing for missing parental genotypes. In this report, we extended the collapsing method for rare variant association analysis in case-parents studies to allow for missing parental genotypes, and investigated the performance of two methods by using the difference of genotypes between affected offspring and their corresponding "complements" in case-parent trios and the TDT framework. Using simulations, we showed that, compared with methods using only complete case-parents data, the proposed strategy allowing for missing parental genotypes, or even adding unrelated affected individuals, can greatly improve the statistical power while remaining unaffected by population stratification. We conclude that adding case-parents data with missing parental genotypes to the complete case-parents data set can greatly improve the power of our strategy for rare variant-disease association.
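The case-versus-complement comparison mentioned above can be illustrated for complete trios. The sketch below is not the authors' method: it assumes genotypes coded as minor-allele counts (0, 1, 2), collapses all rare sites in a region into a carrier indicator, and uses a McNemar-type statistic on discordant case/complement pairs. It does not handle missing parental genotypes, which is the extension the abstract describes, and all names and data are hypothetical.

```python
def complement(father: int, mother: int, child: int) -> int:
    """Minor-allele count of the pseudo-control built from the untransmitted parental alleles."""
    return father + mother - child

def collapsed(genotypes) -> int:
    """Collapsing step: 1 if any rare site carries a minor allele, else 0."""
    return int(any(g > 0 for g in genotypes))

def mcnemar_collapsed(trios):
    """Count discordant case/complement pairs and return (b, c, statistic)."""
    b = c = 0  # b: only the case is a carrier; c: only the complement is a carrier
    for father, mother, child in trios:
        comp = [complement(f, m, k) for f, m, k in zip(father, mother, child)]
        case_carrier, comp_carrier = collapsed(child), collapsed(comp)
        if case_carrier and not comp_carrier:
            b += 1
        elif comp_carrier and not case_carrier:
            c += 1
    stat = (b - c) ** 2 / (b + c) if (b + c) else 0.0
    return b, c, stat

# Toy trios: (father, mother, child) genotype vectors over two rare sites.
trios = [
    ([1, 0], [0, 0], [1, 0]),  # rare allele transmitted
    ([0, 1], [0, 0], [0, 1]),  # rare allele transmitted
    ([1, 0], [0, 0], [0, 0]),  # rare allele not transmitted
    ([0, 0], [0, 0], [0, 0]),  # uninformative
    ([0, 0], [0, 1], [0, 1]),  # rare allele transmitted
]
b, c, stat = mcnemar_collapsed(trios)
```

Because each case is compared with a pseudo-control drawn from its own parents, this kind of test is not confounded by population stratification, which matches the property the abstract emphasizes.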
Impact of genotyping errors on statistical power of association tests in genomic analyses: A case study

PubMed Central

Hou, Lin; Sun, Ning; Mane, Shrikant; Sayward, Fred; Rajeevan, Nallakkandi; Cheung, Kei-Hoi; Cho, Kelly; Pyarajan, Saiju; Aslan, Mihaela; Miller, Perry; Harvey, Philip D.; Gaziano, J. Michael; Concato, John; Zhao, Hongyu

2017-01-01

A key step in genomic studies is to assess high throughput measurements across millions of markers for each participant’s DNA, either using microarrays or sequencing techniques. Accurate genotype calling is essential for downstream statistical analysis of genotype-phenotype associations, and next generation sequencing (NGS) has recently become a more common approach in genomic studies. How the accuracy of variant calling in NGS-based studies affects downstream association analysis has not, however, been studied using empirical data in which both microarrays and NGS were available. In this article, we investigate the impact of variant calling errors on the statistical power to identify associations between single nucleotides and disease, and on associations between multiple rare variants and disease. Both differential and nondifferential genotyping errors are considered. Our results show that the power of burden tests for rare variants is strongly influenced by the specificity in variant calling, but is rather robust with regard to sensitivity. By using the variant calling accuracies estimated from a substudy of a Cooperative Studies Program project conducted by the Department of Veterans Affairs, we show that the power of association tests is mostly retained with commonly adopted variant calling pipelines. An R package, GWAS.PC, is provided to accommodate power analysis that takes account of genotyping errors (http://zhaocenter.org/software/). PMID:28019059
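One way to see why burden tests would be more sensitive to specificity than to sensitivity is the standard misclassification formula for an observed carrier frequency. The sketch below is an illustrative calculation under stated assumptions, not the GWAS.PC package: errors are nondifferential and act at the carrier level, and the true carrier frequencies (2% in cases, 1% in controls) are made up.

```python
def observed_carrier_freq(true_freq: float, sensitivity: float, specificity: float) -> float:
    """Carrier frequency after calling errors: true carriers detected plus false positives."""
    return sensitivity * true_freq + (1.0 - specificity) * (1.0 - true_freq)

def case_control_ratio(sensitivity: float, specificity: float,
                       p_case: float = 0.02, p_control: float = 0.01) -> float:
    """Ratio of observed carrier frequencies in cases versus controls."""
    return (observed_carrier_freq(p_case, sensitivity, specificity)
            / observed_carrier_freq(p_control, sensitivity, specificity))

perfect = case_control_ratio(sensitivity=1.0, specificity=1.0)
low_sensitivity = case_control_ratio(sensitivity=0.8, specificity=1.0)
low_specificity = case_control_ratio(sensitivity=0.95, specificity=0.99)
```

With perfect specificity, lower sensitivity scales both frequencies by the same factor and leaves the ratio unchanged, although fewer carriers are observed. When true carriers are rare, even a small false-positive rate adds a comparable number of spurious carriers to both groups and pulls the ratio toward 1.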
Comprehensive Cancer-Predisposition Gene Testing in an Adult Multiple Primary Tumor Series Shows a Broad Range of Deleterious Variants and Atypical Tumor Phenotypes.

PubMed

Whitworth, James; Smith, Philip S; Martin, Jose-Ezequiel; West, Hannah; Luchetti, Andrea; Rodger, Faye; Clark, Graeme; Carss, Keren; Stephens, Jonathan; Stirrups, Kathleen; Penkett, Chris; Mapeta, Rutendo; Ashford, Sofie; Megy, Karyn; Shakeel, Hassan; Ahmed, Munaza; Adlard, Julian; Barwell, Julian; Brewer, Carole; Casey, Ruth T; Armstrong, Ruth; Cole, Trevor; Evans, Dafydd Gareth; Fostira, Florentia; Greenhalgh, Lynn; Hanson, Helen; Henderson, Alex; Hoffman, Jonathan; Izatt, Louise; Kumar, Ajith; Kwong, Ava; Lalloo, Fiona; Ong, Kai Ren; Paterson, Joan; Park, Soo-Mi; Chen-Shtoyerman, Rakefet; Searle, Claire; Side, Lucy; Skytte, Anne-Bine; Snape, Katie; Woodward, Emma R; Tischkowitz, Marc D; Maher, Eamonn R

2018-06-12

Multiple primary tumors (MPTs) affect a substantial proportion of cancer survivors and can result from various causes, including inherited predisposition. Currently, germline genetic testing of MPT-affected individuals for variants in cancer-predisposition genes (CPGs) is mostly targeted by tumor type. We ascertained pre-assessed MPT individuals (with at least two primary tumors by age 60 years or at least three by 70 years) from genetics centers and performed whole-genome sequencing (WGS) on 460 individuals from 440 families. Despite previous negative genetic assessment and molecular investigations, pathogenic variants in moderate- and high-risk CPGs were detected in 67/440 (15.2%) probands. WGS detected variants that would not be (or were not) detected by targeted resequencing strategies, including low-frequency structural variants (6/440 [1.4%] probands). In most individuals with a germline variant assessed as pathogenic or likely pathogenic (P/LP), at least one of their tumor types was characteristic of variants in the relevant CPG. However, in 29 probands (42.2% of those with a P/LP variant), the tumor phenotype appeared discordant. The frequency of individuals with truncating or splice-site CPG variants and at least one discordant tumor type was significantly higher than in a control population (χ² = 43.642; p ≤ 0.0001). 2/67 (3%) probands with P/LP variants had evidence of multiple inherited neoplasia allele syndrome (MINAS) with deleterious variants in two CPGs. Together with variant detection rates from a previous series of similarly ascertained MPT-affected individuals, the present results suggest that first-line comprehensive CPG analysis in an MPT cohort referred to clinical genetics services would detect a deleterious variant in about a third of individuals. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
A double mutation in exon 6 of the β-hexosaminidase α subunit in a patient with the B1 variant of Tay-Sachs disease

DOE Office of Scientific and Technical Information (OSTI.GOV)

Ainsworth, P.J.; Coulter-Mackie, M.B.

1992-10-01

The B1 variant form of Tay-Sachs disease is enzymologically unique in that the causative mutation(s) appear to affect the active site in the α subunit of β-hexosaminidase A without altering its ability to associate with the β subunit. Most previously reported B1 variant mutations were found in exon 5 within codon 178. The coding sequence of the α subunit gene of a patient with the B1 variant form was examined with a combination of reverse transcription of mRNA to cDNA, PCR, and dideoxy sequencing. A double mutation in exon 6 has been identified: a G574→C transversion causing a val192→leu change and a G598→A transition resulting in a val200→met alteration. The amplified cDNAs were otherwise normal throughout their sequence. The 574 and 598 alterations have been confirmed by amplification directly from genomic DNA from the patient and her mother. Transient-expression studies of the two exon 6 mutations (singly or together) in COS-1 cells show that the G574→C change is sufficient to cause the loss of enzyme activity. The biochemical phenotype of the 574 alteration in transfection studies is consistent with that expected for a B1 variant mutation. As such, this mutation differs from previously reported B1 variant mutations, all of which occur in exon 5. 31 refs., 2 figs., 2 tabs.
VariantBam: filtering and profiling of next-generational sequencing data using region-specific rules.

PubMed

Wala, Jeremiah; Zhang, Cheng-Zhong; Meyerson, Matthew; Beroukhim, Rameen

2016-07-01

We developed VariantBam, a C++ read filtering and profiling tool for use with BAM, CRAM and SAM sequencing files. VariantBam provides a flexible framework for extracting sequencing reads or read-pairs that satisfy combinations of rules, defined by any number of genomic intervals or variant sites. We have implemented filters based on alignment data, sequence motifs, regional coverage and base quality. For example, VariantBam achieved a median size reduction ratio of 3.1:1 when applied to 10 lung cancer whole genome BAMs by removing large tags and selecting for only high-quality variant-supporting reads and reads matching a large dictionary of sequence motifs. Thus VariantBam enables efficient storage of sequencing data while preserving the most relevant information for downstream analysis. VariantBam and full documentation are available at github.com/jwalabroad/VariantBam. Contact: rameen@broadinstitute.org. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved.
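The idea of region-specific rules described in this abstract can be illustrated with a minimal Python sketch. This is not VariantBam's code or interface: the `Read` and `Rule` classes, their fields, and the thresholds are hypothetical names chosen for illustration, and only two of the filter types mentioned (alignment quality and base quality) are modelled.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Read:
    """Hypothetical, simplified aligned read (not a real BAM record)."""
    chrom: str
    start: int          # 0-based, inclusive
    end: int            # 0-based, exclusive
    mapq: int           # mapping quality
    mean_baseq: float   # mean base quality


@dataclass(frozen=True)
class Rule:
    """Hypothetical rule: a genomic interval plus quality thresholds."""
    chrom: str
    start: int
    end: int
    min_mapq: int = 0
    min_mean_baseq: float = 0.0

    def accepts(self, read: Read) -> bool:
        # A read passes if it overlaps the interval and meets both thresholds.
        overlaps = (
            read.chrom == self.chrom
            and read.start < self.end
            and read.end > self.start
        )
        return (
            overlaps
            and read.mapq >= self.min_mapq
            and read.mean_baseq >= self.min_mean_baseq
        )


def filter_reads(reads, rules):
    """Keep every read accepted by at least one rule."""
    return [r for r in reads if any(rule.accepts(r) for rule in rules)]


# Made-up example data.
reads = [
    Read("chr1", 100, 200, mapq=60, mean_baseq=35.0),  # overlaps, high quality
    Read("chr1", 150, 250, mapq=5, mean_baseq=35.0),   # overlaps, low MAPQ
    Read("chr2", 100, 200, mapq=60, mean_baseq=35.0),  # wrong chromosome
]
rules = [Rule("chr1", 120, 180, min_mapq=30, min_mean_baseq=20.0)]
kept = filter_reads(reads, rules)
```

Combining rules with a logical OR, as here, is one possible design; a real tool may allow richer combinations.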
Evolution of canine parvovirus in Argentina between years 2003 and 2010: CPV2c has become the predominant variant affecting the domestic dog population.

PubMed

Calderón, Marina Gallo; Romanutti, Carina; D'Antuono, Alejandra; Keller, Leticia; Mattion, Nora; La Torre, Jose

2011-04-01

The current frequency of Canine Parvovirus variants (CPV2a, CPV2b and CPV2c) in the Argentine dog population was investigated by PCR amplification of a 583 bp fragment in the VP2 gene. From a total of 79 rectal swab samples that have been submitted to our laboratory since 2008, 55 (69.6%) tested positive and were further analyzed by direct DNA sequencing. Fifty positive samples (91%) were characterized as the CPV2c variant, which appeared in Argentina in 2003 and has been the prevalent type since 2008, whereas CPV2a and CPV2b, still found in Argentine dogs, were represented in 3.6% and 5.4% of the population, respectively. Considering that CPV2c is spreading worldwide, and that this variant is also affecting vaccinated dogs, efforts should be made towards the development of new matched CPV vaccines. Copyright © 2011 Elsevier B.V. All rights reserved.
Whole-exome sequencing identifies novel compound heterozygous mutations in USH2A in Spanish patients with autosomal recessive retinitis pigmentosa

PubMed Central

Méndez-Vidal, Cristina; González-del Pozo, María; Vela-Boza, Alicia; Santoyo-López, Javier; López-Domingo, Francisco J.; Vázquez-Marouschek, Carmen; Dopazo, Joaquin; Borrego, Salud

2013-01-01

Purpose Retinitis pigmentosa (RP) is an inherited retinal dystrophy characterized by extreme genetic and clinical heterogeneity. Thus, the diagnosis is not always easily performed due to phenotypic and genetic overlap. Current clinical practices have focused on the systematic evaluation of a set of known genes for each phenotype, but this approach may fail in patients with inaccurate diagnosis or infrequent genetic cause. In the present study, we investigated the genetic cause of autosomal recessive RP (arRP) in a Spanish family in which the causal mutation has not yet been identified with primer extension technology and resequencing. Methods We designed a whole-exome sequencing (WES)-based approach using NimbleGen SeqCap EZ Exome V3 sample preparation kit and the SOLiD 5500xl next-generation sequencing platform. We sequenced the exomes of both unaffected parents and two affected siblings. Exome analysis resulted in the identification of 43,204 variants in the index patient. All variants passing filter criteria were validated with Sanger sequencing to confirm familial segregation and absence in the control population. In silico prediction tools were used to determine mutational impact on protein function and the structure of the identified variants. Results Novel Usher syndrome type 2A (USH2A) compound heterozygous mutations, c.4325T>C (p.F1442S) and c.15188T>G (p.L5063R), located in exons 20 and 70, respectively, were identified as probable causative mutations for RP in this family. Family segregation of the variants showed the presence of both mutations in all affected members and in two siblings who were apparently asymptomatic at the time of family ascertainment. Clinical reassessment confirmed the diagnosis of RP in these patients.
Conclusions Using WES, we identified two heterozygous novel mutations in USH2A as the most likely disease-causing variants in a Spanish family diagnosed with arRP in which the cause of the disease had not yet been identified with commonly used techniques. Our data reinforce the clinical role of WES in the molecular diagnosis of highly heterogeneous genetic diseases where conventional genetic approaches have previously failed in achieving a proper diagnosis. PMID:24227914
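The segregation logic behind a compound heterozygous finding like the one above (two heterozygous variants in the same gene, one from each unaffected parent, both present in every affected sibling) can be sketched as follows. This is an illustrative sketch only, not the authors' pipeline; the variant records, gene names, sample labels, and genotypes are invented, with genotypes encoded as the count of alternate alleles (0, 1, or 2).

```python
from itertools import combinations


def compound_het_pairs(variants, affected, father, mother):
    """Return (gene, id_a, id_b) for variant pairs consistent with
    compound heterozygosity: both heterozygous in all affected samples,
    one carried only by the father and the other only by the mother."""
    by_gene = {}
    for v in variants:
        by_gene.setdefault(v["gene"], []).append(v)

    def from_only(v, carrier, other):
        # Heterozygous in one parent and absent from the other.
        return v["gt"][carrier] == 1 and v["gt"][other] == 0

    pairs = []
    for gene, group in by_gene.items():
        for a, b in combinations(group, 2):
            in_all_affected = all(
                a["gt"][s] == 1 and b["gt"][s] == 1 for s in affected
            )
            in_trans = (
                from_only(a, father, mother) and from_only(b, mother, father)
            ) or (
                from_only(a, mother, father) and from_only(b, father, mother)
            )
            if in_all_affected and in_trans:
                pairs.append((gene, a["id"], b["id"]))
    return pairs


# Invented genotypes for a hypothetical family of two parents and two siblings.
variants = [
    {"id": "v1", "gene": "GENE1",
     "gt": {"father": 1, "mother": 0, "sib1": 1, "sib2": 1}},
    {"id": "v2", "gene": "GENE1",
     "gt": {"father": 0, "mother": 1, "sib1": 1, "sib2": 1}},
    {"id": "v3", "gene": "GENE2",
     "gt": {"father": 1, "mother": 0, "sib1": 1, "sib2": 0}},
]
pairs = compound_het_pairs(
    variants, affected=["sib1", "sib2"], father="father", mother="mother"
)
```

In practice such a filter would follow frequency and quality filtering, and candidates would still need confirmation, for example by Sanger sequencing as in the study.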
Combined mismatch repair and POLE/POLD1 defects explain unresolved suspected Lynch syndrome cancers

PubMed Central

Jansen, Anne ML; van Wezel, Tom; van den Akker, Brendy EWM; Ventayol Garcia, Marina; Ruano, Dina; Tops, Carli MJ; Wagner, Anja; Letteboer, Tom GW; Gómez-García, Encarna B; Devilee, Peter; Wijnen, Juul T; Hes, Frederik J; Morreau, Hans

2016-01-01

Many suspected Lynch Syndrome (sLS) patients who lack mismatch repair (MMR) germline gene variants and MLH1 or MSH2 hypermethylation are currently explained by somatic MMR gene variants or, occasionally, by germline POLE variants. To further investigate unexplained sLS patients, we analyzed leukocyte and tumor DNA of 62 sLS patients using gene panel sequencing including the POLE, POLD1 and MMR genes. Forty tumors showed either one, two or more somatic MMR variants predicted to affect function. Nine sLS tumors showed a likely ultramutated phenotype and were found to carry germline (n=2) or somatic variants (n=7) in the POLE/POLD1 exonuclease domain (EDM). Six of these POLE/POLD1-EDM mutated tumors also carried somatic MMR variants. Our findings suggest that faulty proofreading may result in loss of MMR and thereby in microsatellite instability. PMID:26648449

Missense-depleted regions in population exomes implicate ras superfamily nucleotide-binding protein alteration in patients with brain malformation

PubMed Central

Ge, Xiaoyan; Gong, Henry; Dumas, Kevin; Litwin, Jessica; Phillips, Joanna J; Waisfisz, Quinten; Weiss, Marjan M; Hendriks, Yvonne; Stuurman, Kyra E; Nelson, Stanley F; Grody, Wayne W; Lee, Hane; Kwok, Pui-Yan; Shieh, Joseph T C

2016-01-01

Genomic sequence interpretation can miss clinically relevant missense variants for several reasons. Rare missense variants are numerous in the exome and difficult to prioritise. Affected genes may also not have existing disease association. To improve variant prioritisation, we leverage population exome data to identify intragenic missense-depleted regions (MDRs) genome-wide that may be important in disease. We then use missense depletion analyses to help prioritise undiagnosed disease exome variants. We demonstrate application of this strategy to identify a novel gene association for human brain malformation. We identified de novo missense variants that affect the GDP/GTP-binding site of ARF1 in three unrelated patients. Corresponding functional analysis suggests ARF1 GDP/GTP-activation is affected by the specific missense mutations associated with heterotopia. These findings expand the genetic pathway underpinning neurologic disease that classically includes FLNA. ARF1 along with ARFGEF2 add further evidence implicating ARF/GEFs in the brain. Using functional ontology, top MDR-containing genes were highly enriched for nucleotide-binding function, suggesting these may be candidates for human disease. Routine consideration of MDR in the interpretation of exome data for rare diseases may help identify strong genetic factors for many severe conditions, infertility/reduction in reproductive capability, and embryonic conditions contributing to preterm loss. PMID:28868155
Autosomal-recessive SASH1 variants associated with a new genodermatosis with pigmentation defects, palmoplantar keratoderma and skin carcinoma

PubMed Central

Courcet, Jean-Benoît; Elalaoui, Siham Chafai; Duplomb, Laurence; Tajir, Mariam; Rivière, Jean-Baptiste; Thevenon, Julien; Gigot, Nadège; Marle, Nathalie; Aral, Bernard; Duffourd, Yannis; Sarasin, Alain; Naim, Valeria; Courcet-Degrolard, Emilie; Aubriot-Lorton, Marie-Hélène; Martin, Laurent; Abrid, Jamal Eddin; Thauvin, Christel; Sefiani, Abdelaziz; Vabres, Pierre; Faivre, Laurence

2015-01-01

SASH1 (SAM and SH3 domain-containing protein 1) is a tumor suppressor gene involved in the tumorigenesis of a spectrum of solid cancers. Heterozygous SASH1 variants are known to cause autosomal-dominant dyschromatosis. Homozygosity mapping and whole-exome sequencing were performed in a consanguineous Moroccan family with two affected siblings presenting an unclassified phenotype associating an abnormal pigmentation pattern (hypo- and hyperpigmented macules of the trunk and face and areas of reticular hypo- and hyperpigmentation of the extremities), alopecia, palmoplantar keratoderma, ungueal dystrophy and recurrent spinocellular carcinoma. We identified a homozygous variant in SASH1 (c.1849G>A; p.Glu617Lys) in both affected individuals. Wound-healing assay showed that the patient's fibroblasts were better able than control fibroblasts to migrate. Following the identification of SASH1 heterozygous variants in dyschromatosis, we used reverse phenotyping to show that autosomal-recessive variants of this gene could be responsible for an overlapping but more complex phenotype that affected skin appendages. SASH1 should be added to the list of genes responsible for autosomal-dominant and -recessive genodermatosis, with no phenotype in heterozygous patients in the recessive form, and to the list of genes responsible for a predisposition to skin cancer. PMID:25315659
Autosomal-recessive SASH1 variants associated with a new genodermatosis with pigmentation defects, palmoplantar keratoderma and skin carcinoma.

PubMed

Courcet, Jean-Benoît; Elalaoui, Siham Chafai; Duplomb, Laurence; Tajir, Mariam; Rivière, Jean-Baptiste; Thevenon, Julien; Gigot, Nadège; Marle, Nathalie; Aral, Bernard; Duffourd, Yannis; Sarasin, Alain; Naim, Valeria; Courcet-Degrolard, Emilie; Aubriot-Lorton, Marie-Hélène; Martin, Laurent; Abrid, Jamal Eddin; Thauvin, Christel; Sefiani, Abdelaziz; Vabres, Pierre; Faivre, Laurence

2015-07-01

SASH1 (SAM and SH3 domain-containing protein 1) is a tumor suppressor gene involved in the tumorigenesis of a spectrum of solid cancers. Heterozygous SASH1 variants are known to cause autosomal-dominant dyschromatosis. Homozygosity mapping and whole-exome sequencing were performed in a consanguineous Moroccan family with two affected siblings presenting an unclassified phenotype associating an abnormal pigmentation pattern (hypo- and hyperpigmented macules of the trunk and face and areas of reticular hypo- and hyperpigmentation of the extremities), alopecia, palmoplantar keratoderma, ungueal dystrophy and recurrent spinocellular carcinoma. We identified a homozygous variant in SASH1 (c.1849G>A; p.Glu617Lys) in both affected individuals. Wound-healing assay showed that the patient's fibroblasts were better able than control fibroblasts to migrate. Following the identification of SASH1 heterozygous variants in dyschromatosis, we used reverse phenotyping to show that autosomal-recessive variants of this gene could be responsible for an overlapping but more complex phenotype that affected skin appendages. SASH1 should be added to the list of genes responsible for autosomal-dominant and -recessive genodermatosis, with no phenotype in heterozygous patients in the recessive form, and to the list of genes responsible for a predisposition to skin cancer.
Sex is a moderator of the association between NOS1AP sequence variants and QTc in two long QT syndrome founder populations: a pedigree-based measured genotype association analysis.

PubMed

Winbo, Annika; Stattin, Eva-Lena; Westin, Ida Maria; Norberg, Anna; Persson, Johan; Jensen, Steen M; Rydberg, Annika

2017-07-18

Sequence variants in the NOS1AP gene have repeatedly been reported to influence QTc, albeit with moderate effect sizes. In the long QT syndrome (LQTS), this may contribute to the substantial QTc variance seen among carriers of identical pathogenic sequence variants. Here we assess three non-coding NOS1AP sequence variants, chosen for their previously reported strong association with QTc in normal and LQTS populations, for association with QTc in two Swedish LQT1 founder populations. This study included 312 individuals (58% females) from two LQT1 founder populations, whereof 227 genotype positive segregating either Y111C (n = 148) or R518* (n = 79) pathogenic sequence variants in the KCNQ1 gene, and 85 genotype negatives. All were genotyped for NOS1AP sequence variants rs12143842, rs16847548 and rs4657139, and tested for association with QTc length (effect size presented as mean difference between derived and wildtype, in ms), using a pedigree-based measured genotype association analysis. Mean QTc was obtained by repeated manual measurement (preferably in lead II) by one observer using coded 50 mm/s standard 12-lead ECGs. A substantial variance in mean QTc was seen in genotype positives 476 ± 36 ms (Y111C 483 ± 34 ms; R518* 462 ± 34 ms) and genotype negatives 433 ± 24 ms. Female sex was significantly associated with QTc prolongation in all genotype groups (p < 0.001). In a multivariable analysis including the entire study population and adjusted for KCNQ1 genotype, sex and age, NOS1AP sequence variants rs12143842 and rs16847548 (but not rs4657139) were significantly associated with QT prolongation, +18 ms (p = 0.0007) and +17 ms (p = 0.006), respectively. Significant sex-interactions were detected for both sequence variants (interaction term r = 0.892, p < 0.001 and r = 0.944, p < 0.001, respectively).
Notably, across the genotype groups, when stratified by sex neither rs12143842 nor rs16847548 were significantly associated with QTc in females (both p = 0.16) while in males, a prolongation of +19 ms and +8 ms (p = 0.002 and p = 0.02) was seen in multivariable analysis, explaining up to 23% of QTc variance in all males. Sex was identified as a moderator of the association between NOS1AP sequence variants and QTc in two LQT1 founder populations. This finding may contribute to QTc sex differences and affect the usefulness of NOS1AP as a marker for clinical risk stratification in LQTS.
Cloning and characterization of human immunodeficiency virus type 1 variants diminished in the ability to induce syncytium-independent cytolysis.

PubMed Central

Stevenson, M; Haggerty, S; Lamonica, C; Mann, A M; Meier, C; Wasiak, A

1990-01-01

The phenomenon of interference was exploited to isolate low-abundance noncytopathic human immunodeficiency virus type 1 (HIV-1) variants from a primary HIV-1 isolate from an asymptomatic HIV-1-seropositive hemophiliac. Successive rounds of virus infection of a cytolysis-susceptible CD4+ cell line and isolation of surviving cells resulted in selective amplification of an HIV-1 variant reduced in the ability to induce cytolysis. The presence of a PvuII polymorphism facilitated subsequent amplification and cloning of cytopathic and noncytopathic HIV-1 variants from the primary isolate. Cloned virus stocks from cytopathic and noncytopathic variants exhibited similar replication kinetics, infectivity, and syncytium induction in susceptible host cells. The noncytopathic HIV-1 variant was unable, however, to induce single-cell killing in susceptible host cells. Construction of viral hybrids in which regions of cytopathic and noncytopathic variants were exchanged indicated that determinants for the noncytopathic phenotype map to the envelope glycoprotein. Sequence analysis of the envelope coding regions indicated the absence of two highly conserved N-linked glycosylation sites in the noncytopathic HIV-1 variant, which accompanied differences in processing of precursor gp160 envelope glycoprotein. These results demonstrate that determinants for syncytium-independent single-cell killing are located within the envelope glycoprotein and suggest that single-cell killing is profoundly influenced by alterations in envelope sequence which affect posttranslational processing of HIV-1 envelope glycoprotein within the infected cell. PMID:1695254
NOTCH3 variants and risk of ischemic stroke.

PubMed

Ross, Owen A; Soto-Ortolaza, Alexandra I; Heckman, Michael G; Verbeeck, Christophe; Serie, Daniel J; Rayaprolu, Sruti; Rich, Stephen S; Nalls, Michael A; Singleton, Andrew; Guerreiro, Rita; Kinsella, Emma; Wszolek, Zbigniew K; Brott, Thomas G; Brown, Robert D; Worrall, Bradford B; Meschia, James F

2013-01-01

Mutations within the NOTCH3 gene cause cerebral autosomal dominant arteriopathy with subcortical infarcts and leukoencephalopathy (CADASIL). CADASIL mutations appear to be restricted to the first twenty-four exons, resulting in the gain or loss of a cysteine amino acid. The role of other exonic NOTCH3 variation not involving cysteine residues and mutations in exons 25-33 in ischemic stroke remains unresolved. All 33 exons of NOTCH3 were sequenced in 269 Caucasian probands from the Siblings With Ischemic Stroke Study (SWISS), a 70-center North American affected sibling pair study and 95 healthy Caucasian control subjects. Variants identified by sequencing in the SWISS probands were then tested for association with ischemic stroke using US Caucasian controls collected at the Mayo Clinic (n=654), and further assessed in a Caucasian (n=802) and African American (n=298) patient-control series collected through the Ischemic Stroke Genetics Study (ISGS). Sequencing of the 269 SWISS probands identified one (0.4%) with small vessel type stroke carrying a known CADASIL mutation (p.R558C; Exon 11). Of the 19 common NOTCH3 variants identified, the only variant significantly associated with ischemic stroke after multiple testing adjustment was p.R1560P (rs78501403; Exon 25) in the combined SWISS and ISGS Caucasian series (Odds Ratio [OR] 0.50, P=0.0022) where presence of the minor allele was protective against ischemic stroke. Although only significant prior to adjustment for multiple testing, p.T101T (rs3815188; Exon 3) was associated with an increased risk of small-vessel stroke (OR: 1.56, P=0.008) and p.P380P (rs61749020; Exon 7) was associated with decreased risk of large-vessel stroke (OR: 0.35, P=0.047) in Caucasians. No significant associations were observed in the small African American series. 
Cysteine-affecting NOTCH3 mutations are rare in patients with typical ischemic stroke; however, our observation that common NOTCH3 variants may be associated with risk of ischemic stroke warrants further study.
BlackOPs: increasing confidence in variant detection through mappability filtering.

PubMed

Cabanski, Christopher R; Wilkerson, Matthew D; Soloway, Matthew; Parker, Joel S; Liu, Jinze; Prins, Jan F; Marron, J S; Perou, Charles M; Hayes, D Neil

2013-10-01

Identifying variants using high-throughput sequencing data is currently a challenge because true biological variants can be indistinguishable from technical artifacts. One source of technical artifact results from incorrectly aligning experimentally observed sequences to their true genomic origin ('mismapping') and inferring differences in mismapped sequences to be true variants. We developed BlackOPs, an open-source tool that simulates experimental RNA-seq and DNA whole exome sequences derived from the reference genome, aligns these sequences by custom parameters, detects variants and outputs a blacklist of positions and alleles caused by mismapping. Blacklists contain thousands of artifact variants that are indistinguishable from true variants and, for a given sample, are expected to be almost completely false positives. We show that these blacklist positions are specific to the alignment algorithm and read length used, and BlackOPs allows users to generate a blacklist specific to their experimental setup. We queried the dbSNP and COSMIC variant databases and found numerous variants indistinguishable from mapping errors. We demonstrate how filtering against blacklist positions reduces the number of potential false variants using an RNA-seq glioblastoma cell line data set. In summary, accounting for mapping-caused variants tuned to experimental setups reduces false positives and, therefore, improves genome characterization by high-throughput sequencing.
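The final filtering step described in this abstract, removing variant calls whose position and allele appear on a mismapping blacklist, can be sketched in a few lines of Python. This is a hedged illustration and not BlackOPs itself: the function name, the record layout, and the coordinates are hypothetical, and the blacklist is assumed to have been generated beforehand for the same aligner and read length.

```python
def filter_against_blacklist(calls, blacklist):
    """Split variant calls into those kept and those flagged because
    their (chromosome, position, alternate allele) is blacklisted."""
    kept, flagged = [], []
    for call in calls:
        key = (call["chrom"], call["pos"], call["alt"])
        if key in blacklist:
            flagged.append(call)
        else:
            kept.append(call)
    return kept, flagged


# Made-up blacklist entries and variant calls.
blacklist = {("chr1", 12345, "T"), ("chr7", 55555, "A")}
calls = [
    {"chrom": "chr1", "pos": 12345, "ref": "C", "alt": "T"},  # blacklisted
    {"chrom": "chr1", "pos": 12345, "ref": "C", "alt": "G"},  # same site, other allele
    {"chrom": "chr2", "pos": 999, "ref": "A", "alt": "C"},
]
kept, flagged = filter_against_blacklist(calls, blacklist)
```

Keying on the allele as well as the position reflects the abstract's description of a blacklist of "positions and alleles"; a call at a blacklisted position with a different allele is retained in this sketch.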
Whole genome sequencing and integrative genomic analysis approach on two 22q11.2 deletion syndrome family trios for genotype to phenotype correlations

PubMed Central

Chung, Jonathan H.; Cai, Jinlu; Suskin, Barrie G.; Zhang, Zhengdong; Coleman, Karlene

2015-01-01

The 22q11.2 deletion syndrome (22q11DS) affects 1:4000 live births and presents with highly variable phenotype expressivity. In this study, we developed an analytical approach utilizing whole genome sequencing and integrative analysis to discover genetic modifiers. Our pipeline combined available tools in order to prioritize rare, predicted deleterious, coding and non-coding single nucleotide variants (SNVs) and insertion/deletions (INDELs) from whole genome sequencing (WGS). We sequenced two unrelated probands with 22q11DS, with contrasting clinical findings, and their unaffected parents. Proband P1 had cognitive impairment, psychotic episodes, anxiety, and tetralogy of Fallot (TOF); while proband P2 had juvenile rheumatoid arthritis but no other major clinical findings. In P1, we identified common variants in COMT and PRODH on 22q11.2 as well as rare potentially deleterious DNA variants in other behavioral/neurocognitive genes. We also identified a de novo SNV in ADNP2 (NM_014913.3:c.2243G>C), encoding a neuroprotective protein that may be involved in behavioral disorders. In P2, we identified a novel non-synonymous SNV in ZFPM2 (NM_012082.3:c.1576C>T), a known causative gene for TOF, which may act as a protective variant downstream of TBX1, haploinsufficiency of which is responsible for congenital heart disease in individuals with 22q11DS. PMID:25981510
A Missense Variant in PLEC Increases Risk of Atrial Fibrillation.

PubMed

Thorolfsdottir, Rosa B; Sveinbjornsson, Gardar; Sulem, Patrick; Helgadottir, Anna; Gretarsdottir, Solveig; Benonisdottir, Stefania; Magnusdottir, Audur; Davidsson, Olafur B; Rajamani, Sridharan; Roden, Dan M; Darbar, Dawood; Pedersen, Terje R; Sabatine, Marc S; Jonsdottir, Ingileif; Arnar, David O; Thorsteinsdottir, Unnur; Gudbjartsson, Daniel F; Holm, Hilma; Stefansson, Kari

2017-10-24

Genome-wide association studies (GWAS) have yielded variants at >30 loci that associate with atrial fibrillation (AF), including rare coding mutations in the sarcomere genes MYH6 and MYL4. The aim of this study was to search for novel AF associations and in doing so gain insights into the mechanisms whereby variants affect AF risk, using electrocardiogram (ECG) measurements. The authors performed a GWAS of 14,255 AF cases and 374,939 controls, using whole-genome sequence data from the Icelandic population, and tested novel signals in 2,002 non-Icelandic cases and 12,324 controls. They then tested the AF variants for effect on cardiac electrical function by using measurements in 289,297 ECGs from 62,974 individuals. The authors discovered 2 novel AF variants, the intergenic variant rs72700114, between the genes LINC01142 and METTL11B (risk allele frequency = 8.1%; odds ratio [OR]: 1.26; p = 3.1 × 10⁻¹⁸), and the missense variant p.Gly4098Ser in PLEC (frequency = 1.2%; OR: 1.55; p = 8.0 × 10⁻¹⁰), encoding plectin, a cytoskeletal cross-linking protein that contributes to integrity of cardiac tissue. The authors also confirmed 29 reported variants. p.Gly4098Ser in PLEC significantly affects various ECG measurements in the absence of AF. Other AF variants have diverse effects on the conduction system, ranging from none to extensive. The discovery of a missense variant in PLEC affecting AF combined with recent discoveries of variants in the sarcomere genes MYH6 and MYL4 points to an important role of myocardial structure in the pathogenesis of the disease. The diverse associations between AF variants and ECG measurements suggest fundamentally different categories of mechanisms contributing to the development of AF. Copyright © 2017 American College of Cardiology Foundation. Published by Elsevier Inc. All rights reserved.
Selecting sequence variants to improve genomic predictions for dairy cattle

USDA-ARS's Scientific Manuscript database

Millions of genetic variants have been identified by population-scale sequencing projects, but subsets are needed for routine genomic predictions or to include on genotyping arrays. Methods of selecting sequence variants were compared using both simulated sequence genotypes and actual data from run ...
Genetic high throughput screening in Retinitis Pigmentosa based on high resolution melting (HRM) analysis.

PubMed

Anasagasti, Ander; Barandika, Olatz; Irigoyen, Cristina; Benitez, Bruno A; Cooper, Breanna; Cruchaga, Carlos; López de Munain, Adolfo; Ruiz-Ederra, Javier

2013-11-01

Retinitis Pigmentosa (RP) involves a group of genetically determined retinal diseases caused by a large number of mutations that result in rod photoreceptor cell death followed by gradual death of cone cells. Most cases of RP are monogenic, with more than 80 associated genes identified so far. The high number of genes and variants involved in RP, among other factors, is making the molecular characterization of RP a real challenge for many patients. Although HRM has been used for the analysis of isolated variants or single RP genes, to our knowledge this is the first study that uses HRM analysis for a high-throughput screening of several RP genes. Our main goal was to test the suitability of HRM analysis as a genetic screening technique in RP, and to compare its performance with two of the most widely used NGS platforms, Illumina and PGM-Ion Torrent technologies. RP patients (n = 96) were clinically diagnosed at the Ophthalmology Department of Donostia University Hospital, Spain. We analyzed a total of 16 RP genes that met the following inclusion criteria: 1) size: genes with transcripts of less than 4 kb; 2) number of exons: genes with up to 22 exons; and 3) prevalence: genes reported to account for at least 0.4% of total RP cases worldwide. For comparison purposes, the RHO gene was also sequenced with Illumina (GAII; Illumina), Ion semiconductor technologies (PGM; Life Technologies) and Sanger sequencing (ABI 3130xl platform; Applied Biosystems). Detected variants were confirmed in all cases by Sanger sequencing and tested for co-segregation in the family of affected probands. We identified a total of 65 genetic variants, 15 of which (23%) were novel, in 49 out of 96 patients. Among them, 14 (4 novel) are probable disease-causing genetic variants in 7 RP genes, affecting 15 patients. Our HRM analysis-based study proved to be a cost-effective and rapid method that provides an accurate identification of genetic RP variants.
This approach is effective for medium sized (<4 kb transcript) RP genes, which constitute over 80% of the total of known RP genes.
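The three gene inclusion criteria stated in this abstract (transcript shorter than 4 kb, at most 22 exons, at least 0.4% of RP cases worldwide) amount to a simple filter. The sketch below shows that filter in Python under stated assumptions: the gene names and numbers are placeholders invented for illustration, not data from the study, and the field names are hypothetical.

```python
def meets_inclusion_criteria(gene):
    """Apply the three criteria listed in the abstract."""
    return (
        gene["transcript_kb"] < 4.0        # 1) transcript of less than 4 kb
        and gene["exons"] <= 22            # 2) up to 22 exons
        and gene["prevalence_pct"] >= 0.4  # 3) at least 0.4% of RP cases
    )


# Placeholder candidates; values are made up for the example.
candidates = [
    {"name": "GENE_A", "transcript_kb": 2.8, "exons": 5, "prevalence_pct": 1.0},
    {"name": "GENE_B", "transcript_kb": 6.3, "exons": 12, "prevalence_pct": 2.0},
    {"name": "GENE_C", "transcript_kb": 3.1, "exons": 30, "prevalence_pct": 0.9},
    {"name": "GENE_D", "transcript_kb": 1.9, "exons": 8, "prevalence_pct": 0.1},
]
selected = [g["name"] for g in candidates if meets_inclusion_criteria(g)]
```

Each placeholder gene other than `GENE_A` fails exactly one criterion, which makes it easy to see which condition excludes it.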
Genetic high throughput screening in retinitis pigmentosa based on high resolution melting (HRM) analysis.

PubMed

Anasagasti, Ander; Barandika, Olatz; Irigoyen, Cristina; Benitez, Bruno A; Cooper, Breanna; Cruchaga, Carlos; López de Munain, Adolfo; Ruiz-Ederra, Javier

2013-10-24

Retinitis Pigmentosa (RP) involves a group of genetically determined retinal diseases caused by a large number of mutations that result in rod photoreceptor cell death followed by gradual death of cone cells. Most cases of RP are monogenic, with more than 80 associated genes identified so far. The high number of genes and variants involved in RP, among other factors, is making the molecular characterization of RP a real challenge for many patients. Although HRM has been used for the analysis of isolated variants or single RP genes, to our knowledge this is the first study that uses HRM analysis for a high-throughput screening of several RP genes. Our main goal was to test the suitability of HRM analysis as a genetic screening technique in RP, and to compare its performance with two of the most widely used NGS platforms, Illumina and PGM-Ion Torrent technologies. RP patients (n = 96) were clinically diagnosed at the Ophthalmology Department of Donostia University Hospital, Spain. We analyzed a total of 16 RP genes that met the following inclusion criteria: 1) size: genes with transcripts of less than 4 kb; 2) number of exons: genes with up to 22 exons; and 3) prevalence: genes reported to account for at least 0.4% of total RP cases worldwide. For comparison purposes, the RHO gene was also sequenced with Illumina (GAII; Illumina), Ion semiconductor technologies (PGM; Life Technologies) and Sanger sequencing (ABI 3130xl platform; Applied Biosystems). Detected variants were confirmed in all cases by Sanger sequencing and tested for co-segregation in the family of affected probands. We identified a total of 65 genetic variants, 15 of which (23%) were novel, in 49 out of 96 patients. Among them, 14 (4 novel) are probable disease-causing genetic variants in 7 RP genes, affecting 15 patients. Our HRM analysis-based study proved to be a cost-effective and rapid method that provides an accurate identification of genetic RP variants.
This approach is effective for medium sized (<4 kb transcript) RP genes, which constitute over 80% of the total of known RP genes. © 2013 Published by Elsevier Ltd.
Description of HIV-1 Group M Molecular Epidemiology and Drug Resistance Prevalence in Equatorial Guinea from Migrants in Spain

PubMed Central

Yebra, Gonzalo; de Mulder, Miguel; Holguín, África

2013-01-01

Background The HIV epidemic is increasing in Equatorial Guinea (GQ), West Central Africa, but few studies have reported its HIV molecular epidemiology. We aimed to describe the HIV-1 group M (HIV-1M) variants and drug-resistance mutations in GQ using sequences sampled in this country and in Spain, a frequent destination of Equatoguinean migrants. Methods We collected 195 HIV-1M pol sequences from Equatoguinean subjects attending Spanish clinics during 1997-2011, and 83 additional sequences sampled in GQ in 1997 and 2008 from GenBank. All (n = 278) were re-classified using phylogeny and tested for drug-resistance mutations. To evaluate the origin of CRF02_AG in GQ, we analyzed 2,562 CRF02_AG sequences and applied Bayesian MCMC inference (BEAST program). Results Most Equatoguinean patients recruited in Spain were women (61.1%) or heterosexuals (87.7%). In the 278 sequences, the variants found were CRF02_AG (47.8%), A (13.7%), B (7.2%), C (5.8%), G (5.4%) and others (20.1%). We found 6 CRF02_AG clusters that emerged between 1983.9 and 2002.5 with an origin in GQ (5.5 sequences/cluster). The transmitted drug-resistance (TDR) rate among naïve patients attending clinics in Spain (n = 144) was 4.7%: 3.4% for PI (all with M46IL), 1.8% for NRTI (all with M184V) and 0.9% for NNRTI (Y188L). Among pre-treated patients, 9/31 (29%) presented any resistance, mainly affecting NNRTI (27.8%). Conclusions We report a low (<5%) TDR rate among naïve patients, with PI as the most affected class. Pre-treated patients also showed a low drug-resistance prevalence (29%), possibly related to insufficient treatment coverage in GQ. CRF02_AG was the prevalent HIV-1M variant and entered GQ through independent introductions at least since the early 1980s. PMID:23717585
Novel mutations in CRB1 gene identified in a Chinese pedigree with retinitis pigmentosa by targeted capture and next generation sequencing

PubMed Central

Lo, David; Weng, Jingning; Liu, Xiaohong; Yang, Juhua; He, Fen; Wang, Yun; Liu, Xuyang

2016-01-01

PURPOSE: To detect the disease-causing gene in a Chinese pedigree with autosomal-recessive retinitis pigmentosa (ARRP). METHODS: All subjects in this family underwent a complete ophthalmic examination. Targeted-capture next generation sequencing (NGS) was performed on the proband to detect variants. All variants were verified in the remaining family members by PCR amplification and Sanger sequencing. RESULTS: All the affected subjects in this pedigree were diagnosed with retinitis pigmentosa (RP). The compound heterozygous c.138delA (p.Asp47IlefsX24) and c.1841G>T (p.Gly614Val) mutations in the Crumbs homolog 1 (CRB1) gene were identified in all the affected patients but not in the unaffected individuals in this family. One mutation was inherited from each parent. CONCLUSION: The novel compound heterozygous mutations in CRB1 were identified in a Chinese pedigree with ARRP using targeted-capture next generation sequencing. After evaluating the significant heredity and impaired protein function, the compound heterozygous c.138delA (p.Asp47IlefsX24) and c.1841G>T (p.Gly614Val) mutations are the causal variants of early onset ARRP in this pedigree. To the best of our knowledge, there is no previous report regarding these compound mutations. PMID:27806333
Genetic analysis of a Chinese family with members affected with Usher syndrome type II and Waardenburg syndrome type IV.

PubMed

Wang, Xueling; Lin, Xiao-Jiang; Tang, Xiangrong; Chai, Yong-Chuan; Yu, De-Hong; Chen, Dong-Ye; Wu, Hao

2017-11-01

The purpose of this study was to identify the genetic causes of a family presenting with multiple symptoms overlapping Usher syndrome type II (USH2) and Waardenburg syndrome type IV (WS4). Targeted next-generation sequencing including the exon and flanking intron sequences of 79 deafness genes was performed on the proband. Co-segregation of the disease phenotype and the detected variants was confirmed in all family members by PCR amplification and Sanger sequencing. The affected members of this family had two different recessive disorders, USH2 and WS4. By targeted next-generation sequencing, we identified that USH2 was caused by a novel missense mutation, p.V4907D in GPR98, whereas WS4 was due to p.V185M in EDNRB. This is the first report of a homozygous p.V185M mutation in EDNRB in a patient with WS4. This study reported a Chinese family with multiple independent and overlapping phenotypes. In this setting, molecular-level analysis was efficient in identifying the causative variants p.V4907D in GPR98 and p.V185M in EDNRB, and was also helpful in confirming the clinical diagnosis of USH2 and WS4. Copyright © 2017 Elsevier B.V. All rights reserved.
FaStore - a space-saving solution for raw sequencing data.

PubMed

Roguski, Lukasz; Ochoa, Idoia; Hernaez, Mikel; Deorowicz, Sebastian

2018-03-29

The affordability of DNA sequencing has led to the generation of unprecedented volumes of raw sequencing data. These data must be stored, processed, and transmitted, which poses significant challenges. To facilitate this effort, we introduce FaStore, a specialized compressor for FASTQ files. FaStore does not use any reference sequences for compression, and permits the user to choose from several lossy modes to improve the overall compression ratio, depending on the specific needs. FaStore in the lossless mode achieves a significant improvement in compression ratio with respect to previously proposed algorithms. We perform an analysis on the effect that the different lossy modes have on variant calling, the most widely used application for clinical decision making, especially important in the era of precision medicine. We show that lossy compression can offer significant compression gains, while preserving the essential genomic information and without affecting the variant calling performance. FaStore can be downloaded from https://github.com/refresh-bio/FaStore. Contact: sebastian.deorowicz@polsl.pl. Supplementary data are available at Bioinformatics online.
Whole-genome sequencing and genetic variant analysis of a Quarter Horse mare.

PubMed

Doan, Ryan; Cohen, Noah D; Sawyer, Jason; Ghaffari, Noushin; Johnson, Charlie D; Dindot, Scott V

2012-02-17

The catalog of genetic variants in the horse genome originates from a few select animals, the majority originating from the Thoroughbred mare used for the equine genome sequencing project. The purpose of this study was to identify genetic variants, including single nucleotide polymorphisms (SNPs), insertion/deletion polymorphisms (INDELs), and copy number variants (CNVs) in the genome of an individual Quarter Horse mare sequenced by next-generation sequencing. Using massively parallel paired-end sequencing, we generated 59.6 Gb of DNA sequence from a Quarter Horse mare resulting in an average of 24.7X sequence coverage. Reads were mapped to approximately 97% of the reference Thoroughbred genome. Unmapped reads were de novo assembled resulting in 19.1 Mb of new genomic sequence in the horse. Using a stringent filtering method, we identified 3.1 million SNPs, 193 thousand INDELs, and 282 CNVs. Genetic variants were annotated to determine their impact on gene structure and function. Additionally, we genotyped this Quarter Horse for mutations of known diseases and for variants associated with particular traits. Functional clustering analysis of genetic variants revealed that most of the genetic variation in the horse's genome was enriched in sensory perception, signal transduction, and immunity and defense pathways. This is the first sequencing of a horse genome by next-generation sequencing and the first genomic sequence of an individual Quarter Horse mare. We have increased the catalog of genetic variants for use in equine genomics by the addition of novel SNPs, INDELs, and CNVs. The genetic variants described here will be a useful resource for future studies of genetic variation regulating performance traits and diseases in equids.
Global characterization of copy number variants in epilepsy patients from whole genome sequencing

PubMed Central

Meloche, Caroline; Andrade, Danielle M.; Lafreniere, Ron G.; Gravel, Micheline; Spiegelman, Dan; Dionne-Laporte, Alexandre; Boelman, Cyrus; Hamdan, Fadi F.; Michaud, Jacques L.; Rouleau, Guy; Minassian, Berge A.; Bourque, Guillaume; Cossette, Patrick

2018-01-01

Epilepsy will affect nearly 3% of people at some point during their lifetime. Previous copy number variant (CNV) studies of epilepsy have used array-based technology and were restricted to the detection of large or exonic events. In contrast, whole-genome sequencing (WGS) has the potential to more comprehensively profile CNVs but existing analytic methods suffer from limited accuracy. We show that this is in part due to the non-uniformity of read coverage, even after intra-sample normalization. To improve on this, we developed PopSV, an algorithm that uses multiple samples to control for technical variation and enables the robust detection of CNVs. Using WGS and PopSV, we performed a comprehensive characterization of CNVs in 198 individuals affected with epilepsy and 301 controls. For both large and small variants, we found an enrichment of rare exonic events in epilepsy patients, especially in genes with predicted loss-of-function intolerance. Notably, this genome-wide survey also revealed an enrichment of rare non-coding CNVs near previously known epilepsy genes. This enrichment was strongest for non-coding CNVs located within 100 Kbp of an epilepsy gene and in regions associated with changes in gene expression, such as expression QTLs or DNase I hypersensitive sites. Finally, we report on 21 potentially damaging events that could be associated with known or new candidate epilepsy genes. Our results suggest that comprehensive sequence-based profiling of CNVs could help explain a larger fraction of epilepsy cases. PMID:29649218
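The idea of using multiple samples to control for technical variation in read coverage can be illustrated with a small sketch. This is not PopSV's actual implementation; it only assumes, for illustration, that coverage has been summarized per genomic bin and that a test sample is compared with a set of reference samples through a per-bin z-score. The function name `coverage_z_scores` and all numbers below are hypothetical.

```python
from statistics import mean, stdev


def coverage_z_scores(test_bins, reference_bins):
    """Return one z-score per bin for a single test sample.

    test_bins: normalized coverage per bin for the test sample.
    reference_bins: list of per-sample coverage lists (same bin order).
    Illustrative assumption: a bin is scored against the spread seen
    in the reference samples, not against the sample's own average.
    """
    scores = []
    for i, value in enumerate(test_bins):
        ref = [sample[i] for sample in reference_bins]
        mu = mean(ref)
        sd = stdev(ref)
        # A bin with no spread in the reference set is left unscored (0.0).
        scores.append((value - mu) / sd if sd > 0 else 0.0)
    return scores


# Hypothetical data: bin 2 has systematically low coverage in every sample,
# bin 3 is low only in the test sample.
reference = [
    [1.00, 0.52, 1.01],
    [0.98, 0.48, 0.99],
    [1.02, 0.50, 1.00],
    [1.00, 0.50, 1.00],
]
test = [1.01, 0.50, 0.55]
z = coverage_z_scores(test, reference)
print([round(s, 2) for s in z])
```

In this toy input the second bin is not flagged, because its low coverage is shared by all reference samples, while the third bin receives a strongly negative score, which is the behavior a population-based comparison is meant to provide.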
Empowered genome community: leveraging a bioinformatics platform as a citizen-scientist collaboration tool.

PubMed

Wendelsdorf, Katherine; Shah, Sohela

2015-09-01

There is an ongoing effort in the biomedical research community to leverage Next Generation Sequencing (NGS) technology to identify genetic variants that affect our health. The main challenge facing researchers is getting enough samples from individuals, either sick or healthy, to be able to reliably identify the few variants that are causal for a phenotype among all other variants typically seen among individuals. At the same time, more and more individuals are having their genome sequenced either out of curiosity or to identify the cause of an illness. These individuals may benefit from a way to view and understand their data. QIAGEN's Ingenuity Variant Analysis is an online application that allows users with and without extensive bioinformatics training to incorporate information from published experiments, genetic databases, and a variety of statistical models to identify variants, from a long list of candidates, that are most likely causal for a phenotype as well as annotate variants with what is already known about them in the literature and databases. Ingenuity Variant Analysis is also an information sharing platform where users may exchange samples and analyses. The Empowered Genome Community (EGC) is a new program in which QIAGEN is making this online tool freely available to any individual who wishes to analyze their own genetic sequence. EGC members are then able to make their data available to other Ingenuity Variant Analysis users to be used in research. Here we present and describe the Empowered Genome Community in detail. We also present a preliminary, proof-of-concept study that utilizes the 200 genomes currently available through the EGC. The goal of this program is to allow individuals to access and understand their own data as well as facilitate citizen-scientist collaborations that can drive research forward and spur quality scientific dialogue in the general public.
Histone H3 Variants in Trichomonas vaginalis

PubMed Central

Zubáčová, Zuzana; Hostomská, Jitka

2012-01-01

The parabasalid protist Trichomonas vaginalis is a widespread parasite that affects humans, frequently causing vaginitis in infected women. Trichomonad mitosis is marked by the persistence of the nuclear membrane and the presence of an asymmetric extranuclear spindle with no obvious direct connection to the chromosomes. No centromeric markers have been described in T. vaginalis, which has prevented a detailed analysis of mitotic events in this organism. In other eukaryotes, nucleosomes of centromeric chromatin contain the histone H3 variant CenH3. The principal aim of this work was to identify a CenH3 homolog in T. vaginalis. We performed a screen of the T. vaginalis genome to retrieve sequences of canonical and variant H3 histones. Three variant histone H3 proteins were identified, and the subcellular localization of their epitope-tagged variants was determined. The localization of the variant TVAG_185390 could not be distinguished from that of the canonical H3 histone. The sequence of the variant TVAG_087830 closely resembled that of histone H3. The tagged protein colocalized with sites of active transcription, indicating that the variant TVAG_087830 represented H3.3 in T. vaginalis. The third H3 variant (TVAG_224460) was localized to 6 or 12 distinct spots at the periphery of the nucleus, corresponding to the number of chromosomes in G1 phase and G2 phase, respectively. We propose that this variant represents the centromeric marker CenH3 and thus can be employed as a tool to study mitosis in T. vaginalis. Furthermore, we suggest that the peripheral distribution of CenH3 within the nucleus results from the association of centromeres with the nuclear envelope throughout the cell cycle. PMID:22408228

Deep Sequencing of Three Loci Implicated in Large-Scale Genome-Wide Association Study Smoking Meta-Analyses.

PubMed

Clark, Shaunna L; McClay, Joseph L; Adkins, Daniel E; Aberg, Karolina A; Kumar, Gaurav; Nerella, Sri; Xie, Linying; Collins, Ann L; Crowley, James J; Quakenbush, Corey R; Hillard, Christopher E; Gao, Guimin; Shabalin, Andrey A; Peterson, Roseann E; Copeland, William E; Silberg, Judy L; Maes, Hermine; Sullivan, Patrick F; Costello, Elizabeth J; van den Oord, Edwin J

2016-05-01

Genome-wide association study meta-analyses have robustly implicated three loci that affect susceptibility for smoking: CHRNA5\CHRNA3\CHRNB4, CHRNB3\CHRNA6 and EGLN2\CYP2A6. Functional follow-up studies of these loci are needed to provide insight into biological mechanisms. However, these efforts have been hampered by a lack of knowledge about the specific causal variant(s) involved. In this study, we prioritized variants in terms of the likelihood they account for the reported associations. We employed targeted capture of the CHRNA5\CHRNA3\CHRNB4, CHRNB3\CHRNA6, and EGLN2\CYP2A6 loci and flanking regions followed by next-generation deep sequencing (mean coverage 78×) to capture genomic variation in 363 individuals. We performed single locus tests to determine if any single variant accounts for the association, and examined if sets of (rare) variants that overlapped with biologically meaningful annotations account for the associations. In total, we investigated 963 variants, of which 71.1% were rare (minor allele frequency < 0.01), 6.02% were insertion/deletions, and 51.7% were catalogued in dbSNP141. The single variant results showed that no variant fully accounts for the association in any region. In the variant set results, CHRNB4 accounts for most of the signal with significant sets consisting of directly damaging variants. CHRNA6 explains most of the signal in the CHRNB3\CHRNA6 locus with significant sets indicating a regulatory role for CHRNA6. Significant sets in CYP2A6 involved directly damaging variants while the significant variant sets suggested a regulatory role for EGLN2. We found that multiple variants implicating multiple processes explain the signal. Some variants can be prioritized for functional follow-up. © The Author 2015. Published by Oxford University Press on behalf of the Society for Research on Nicotine and Tobacco. All rights reserved.
Deep Sequencing of Three Loci Implicated in Large-Scale Genome-Wide Association Study Smoking Meta-Analyses

PubMed Central

McClay, Joseph L.; Adkins, Daniel E.; Aberg, Karolina A.; Kumar, Gaurav; Nerella, Sri; Xie, Linying; Collins, Ann L.; Crowley, James J.; Quakenbush, Corey R.; Hillard, Christopher E.; Gao, Guimin; Shabalin, Andrey A.; Peterson, Roseann E.; Copeland, William E.; Silberg, Judy L.; Maes, Hermine; Sullivan, Patrick F.; Costello, Elizabeth J.; van den Oord, Edwin J.

2016-01-01

Introduction: Genome-wide association study meta-analyses have robustly implicated three loci that affect susceptibility for smoking: CHRNA5\CHRNA3\CHRNB4, CHRNB3\CHRNA6 and EGLN2\CYP2A6. Functional follow-up studies of these loci are needed to provide insight into biological mechanisms. However, these efforts have been hampered by a lack of knowledge about the specific causal variant(s) involved. In this study, we prioritized variants in terms of the likelihood they account for the reported associations. Methods: We employed targeted capture of the CHRNA5\CHRNA3\CHRNB4, CHRNB3\CHRNA6, and EGLN2\CYP2A6 loci and flanking regions followed by next-generation deep sequencing (mean coverage 78×) to capture genomic variation in 363 individuals. We performed single locus tests to determine if any single variant accounts for the association, and examined if sets of (rare) variants that overlapped with biologically meaningful annotations account for the associations. Results: In total, we investigated 963 variants, of which 71.1% were rare (minor allele frequency < 0.01), 6.02% were insertion/deletions, and 51.7% were catalogued in dbSNP141. The single variant results showed that no variant fully accounts for the association in any region. In the variant set results, CHRNB4 accounts for most of the signal with significant sets consisting of directly damaging variants. CHRNA6 explains most of the signal in the CHRNB3\CHRNA6 locus with significant sets indicating a regulatory role for CHRNA6. Significant sets in CYP2A6 involved directly damaging variants while the significant variant sets suggested a regulatory role for EGLN2. Conclusions: We found that multiple variants implicating multiple processes explain the signal. Some variants can be prioritized for functional follow-up. PMID:26283763
New mutations in non-syndromic primary ovarian insufficiency patients identified via whole-exome sequencing.

PubMed

Patiño, Liliana Catherine; Beau, Isabelle; Carlosama, Carolina; Buitrago, July Constanza; González, Ronald; Suárez, Carlos Fernando; Patarroyo, Manuel Alfonso; Delemer, Brigitte; Young, Jacques; Binart, Nadine; Laissue, Paul

2017-07-01

Is it possible to identify new mutations potentially associated with non-syndromic primary ovarian insufficiency (POI) via whole-exome sequencing (WES)? WES is an efficient tool to study genetic causes of POI as we have identified new mutations, some of which lead to protein destabilization potentially contributing to the disease etiology. POI is a frequently occurring complex pathology leading to infertility. Mutations in only a few candidate genes, mainly identified by Sanger sequencing, have been definitively related to the pathogenesis of the disease. This is a retrospective cohort study performed on 69 women affected by POI. WES and an innovative bioinformatics analysis were used on non-synonymous sequence variants in a subset of 420 selected POI candidate genes. Mutations in BMPR1B and GREM1 were modeled by using fragment molecular orbital analysis. Fifty-five coding variants in 49 genes potentially related to POI were identified in 33 out of 69 patients (48%). These genes participate in key biological processes in the ovary, such as meiosis, follicular development, granulosa cell differentiation/proliferation and ovulation. The presence of at least two mutations in distinct genes in 42% of the patients argued in favor of a polygenic nature of POI. It is possible that regulatory regions, not analyzed in the present study, carry further variants related to POI. WES and the in silico analyses presented here represent an efficient approach for mapping variants associated with POI etiology. Sequence variants presented here represent potential future genetic biomarkers. This study was supported by the Universidad del Rosario and Colciencias (Grants CS/CIGGUR-ABN062-2016 and 672-2014). Colciencias supported Liliana Catherine Patiño's work (Fellowship: 617, 2013). The authors declare no conflict of interest. © The Author 2017. Published by Oxford University Press on behalf of the European Society of Human Reproduction and Embryology. All rights reserved.
Rapid molecular diagnostics of severe primary immunodeficiency determined by using targeted next-generation sequencing.

PubMed

Yu, Hui; Zhang, Victor Wei; Stray-Pedersen, Asbjørg; Hanson, Imelda Celine; Forbes, Lisa R; de la Morena, M Teresa; Chinn, Ivan K; Gorman, Elizabeth; Mendelsohn, Nancy J; Pozos, Tamara; Wiszniewski, Wojciech; Nicholas, Sarah K; Yates, Anne B; Moore, Lindsey E; Berge, Knut Erik; Sorte, Hanne; Bayer, Diana K; ALZahrani, Daifulah; Geha, Raif S; Feng, Yanming; Wang, Guoli; Orange, Jordan S; Lupski, James R; Wang, Jing; Wong, Lee-Jun

2016-10-01

Primary immunodeficiency diseases (PIDDs) are inherited disorders of the immune system. The most severe form, severe combined immunodeficiency (SCID), presents with profound deficiencies of T cells, B cells, or both at birth. If not treated promptly, affected patients usually do not live beyond infancy because of infections. Genetic heterogeneity of SCID frequently delays the diagnosis; a specific diagnosis is crucial for life-saving treatment and optimal management. We developed a next-generation sequencing (NGS)-based multigene-targeted panel for SCID and other severe PIDDs requiring rapid therapeutic actions in a clinical laboratory setting. The target gene capture/NGS assay provides an average read depth of approximately 1000×. The deep coverage facilitates simultaneous detection of single nucleotide variants and exonic copy number variants in one comprehensive assessment. Exons with insufficient coverage (<20× read depth) or high sequence homology (pseudogenes) are complemented by amplicon-based sequencing with specific primers to ensure 100% coverage of all targeted regions. Analysis of 20 patient samples with low T-cell receptor excision circle numbers on newborn screening or a positive family history or clinical suspicion of SCID or other severe PIDD identified deleterious mutations in 14 of them. Identified pathogenic variants included both single nucleotide variants and exonic copy number variants, such as hemizygous nonsense, frameshift, and missense changes in IL2RG; compound heterozygous changes in ATM, RAG1, and CIITA; homozygous changes in DCLRE1C and IL7R; and a heterozygous nonsense mutation in CHD7. High-throughput deep sequencing analysis with complete clinical validation greatly increases the diagnostic yield of severe primary immunodeficiency. Establishing a molecular diagnosis enables early immune reconstitution through prompt therapeutic intervention and guides management for improved long-term quality of life. 
Copyright © 2016 American Academy of Allergy, Asthma & Immunology. Published by Elsevier Inc. All rights reserved.
Evaluation of exome variants using the Ion Proton Platform to sequence error-prone regions.

PubMed

Seo, Heewon; Park, Yoomi; Min, Byung Joo; Seo, Myung Eui; Kim, Ju Han

2017-01-01

The Ion Proton sequencer from Thermo Fisher accurately determines sequence variants from target regions with a rapid turnaround time at a low cost. However, misleading variant-calling errors can occur. We performed a systematic evaluation and manual curation of read-level alignments for the 675 ultrarare variants that were reported by the Ion Proton sequencer from 27 whole-exome sequencing datasets but are not present in either the 1000 Genomes Project or the Exome Aggregation Consortium. We classified positive variant calls into 393 highly likely false positives, 126 likely false positives, and 156 likely true positives, which comprised 58.2%, 18.7%, and 23.1% of the variants, respectively. We identified four distinct error patterns of variant calling that may be bioinformatically corrected when using different strategies: simplicity region, SNV cluster, peripheral sequence read, and base inversion. Local de novo assembly successfully corrected 201 (38.7%) of the 519 highly likely or likely false positives. We also demonstrate that the two sequencing kits from Thermo Fisher (the Ion PI Sequencing 200 kit V3 and the Ion PI Hi-Q kit) exhibit different error profiles across different error types. A refined calling algorithm with better polymerase may improve the performance of the Ion Proton sequencing platform.
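As a quick arithmetic check of the proportions reported in this abstract, the short sketch below recomputes each share from the stated counts. The variable names are illustrative only; the counts are taken directly from the abstract.

```python
# Counts of curated variant calls as reported in the abstract.
counts = {
    "highly likely false positive": 393,
    "likely false positive": 126,
    "likely true positive": 156,
}

total = sum(counts.values())  # 675 ultrarare variants in total

# Share of each class, as a percentage rounded to one decimal place.
shares = {label: round(100 * n / total, 1) for label, n in counts.items()}

# False positives corrected by local de novo assembly (201 of 519).
false_positives = (
    counts["highly likely false positive"] + counts["likely false positive"]
)
corrected_share = round(100 * 201 / false_positives, 1)

print(total, shares, false_positives, corrected_share)
```

The recomputed values agree with the figures quoted in the abstract (58.2%, 18.7%, 23.1%, and 38.7% of 519).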
CCDC141 Mutations in Idiopathic Hypogonadotropic Hypogonadism.

PubMed

Turan, Ihsan; Hutchins, B Ian; Hacihamdioglu, Bulent; Kotan, L Damla; Gurbuz, Fatih; Ulubay, Ayca; Mengen, Eda; Yuksel, Bilgin; Wray, Susan; Topaloglu, A Kemal

2017-06-01

Gonadotropin-releasing hormone neurons originate outside the central nervous system in the olfactory placode and migrate into the central nervous system, becoming integral components of the hypothalamic-pituitary-gonadal axis. Failure of this migration can lead to idiopathic hypogonadotropic hypogonadism (IHH)/Kallmann syndrome (KS). We have previously shown that CCDC141 knockdown leads to impaired migration of GnRH neurons but not of olfactory receptor neurons. The aim of this study was to further describe the phenotype and prevalence of CCDC141 mutations in IHH/KS. Using autozygosity mapping, candidate gene screening, whole-exome sequencing, and Sanger sequencing, those individuals carrying deleterious CCDC141 variants and their phenotypes were determined in a cohort of 120 IHH/KS families. No interventions were made. Our studies revealed nine affected individuals from four independent families in which IHH/KS is associated with inactivating CCDC141 variants, revealing a prevalence of 3.3%. Affected individuals (with the exception of those from family 1 who concomitantly have FEZF1 mutations) have normal olfactory function and anatomically normal olfactory bulbs. Four affected individuals show evidence of clinical reversibility. In three of the families, there was at least one more potentially deleterious variant in other known puberty genes with evidence of allelic heterogeneity within respective pedigrees. These studies confirm that inactivating CCDC141 variants cause normosmic IHH but not KS. This is consistent with our previous in vitro experiments showing exclusively impaired embryonic migration of GnRH neurons upon CCDC141 knockdown. These studies expand the clinical and genetic spectrum of IHH and also attest to the complexity of phenotype and genotype in IHH. Copyright © 2017 by the Endocrine Society
Expression Variants of the Lipogenic AGPAT6 Gene Affect Diverse Milk Composition Phenotypes in Bos taurus

PubMed Central

Littlejohn, Mathew D.; Tiplady, Kathryn; Lopdell, Thomas; Law, Tania A.; Scott, Andrew; Harland, Chad; Sherlock, Ric; Henty, Kristen; Obolonkin, Vlad; Lehnert, Klaus; MacGibbon, Alistair; Spelman, Richard J.; Davis, Stephen R.; Snell, Russell G.

2014-01-01

Milk is composed of a complex mixture of lipids, proteins, carbohydrates and various vitamins and minerals as a source of nutrition for young mammals. The composition of milk varies between individuals, with lipid composition in particular being highly heritable. Recent reports have highlighted a region of bovine chromosome 27 harbouring variants affecting milk fat percentage and fatty acid content. We aimed to further investigate this locus in two independent cattle populations, consisting of a Holstein-Friesian x Jersey crossbreed pedigree of 711 F2 cows, and a collection of 32,530 mixed ancestry Bos taurus cows. Bayesian genome-wide association mapping using markers imputed from the Illumina BovineHD chip revealed a large quantitative trait locus (QTL) for milk fat percentage on chromosome 27, present in both populations. We also investigated a range of other milk composition phenotypes, and report additional associations at this locus for fat yield, protein percentage and yield, lactose percentage and yield, milk volume, and the proportions of numerous milk fatty acids. We then used mammary RNA sequence data from 212 lactating cows to assess the transcript abundance of genes located in the milk fat percentage QTL interval. This analysis revealed a strong eQTL for AGPAT6, demonstrating that high milk fat percentage genotype is also additively associated with increased expression of the AGPAT6 gene. Finally, we used whole genome sequence data from six F1 sires to target a panel of novel AGPAT6 locus variants for genotyping in the F2 crossbreed population. Association analysis of 58 of these variants revealed highly significant association for polymorphisms mapping to the 5′UTR exons and intron 1 of AGPAT6. Taken together, these data suggest that variants affecting the expression of AGPAT6 are causally involved in differential milk fat synthesis, with pleiotropic consequences for a diverse range of other milk components. PMID:24465687
De novo missense mutations in the NAA10 gene cause severe non-syndromic developmental delay in males and females

PubMed Central

Popp, Bernt; Støve, Svein I; Endele, Sabine; Myklebust, Line M; Hoyer, Juliane; Sticht, Heinrich; Azzarello-Burri, Silvia; Rauch, Anita; Arnesen, Thomas; Reis, André

2015-01-01

Recent studies revealed the power of whole-exome sequencing to identify mutations in sporadic cases with non-syndromic intellectual disability. We now identified de novo missense variants in NAA10 in two unrelated individuals, a boy and a girl, with severe global developmental delay but without any major dysmorphism by trio whole-exome sequencing. Both de novo variants were predicted to be deleterious, and we excluded other variants in this gene. This X-linked gene encodes N-alpha-acetyltransferase 10, the catalytic subunit of the NatA complex involved in multiple cellular processes. A single hypomorphic missense variant p.(Ser37Pro) was previously associated with Ogden syndrome in eight affected males from two different families. This rare disorder is characterized by a highly recognizable phenotype, global developmental delay and results in death during infancy. In an attempt to explain the discrepant phenotype, we used in vitro N-terminal acetylation assays which suggested that the severity of the phenotype correlates with the remaining catalytic activity. The variant in the Ogden syndrome patients exhibited a lower activity than the one seen in the boy with intellectual disability, while the variant in the girl was the most severe exhibiting only residual activity in the acetylation assays used. We propose that N-terminal acetyltransferase deficiency is clinically heterogeneous with the overall catalytic activity determining the phenotypic severity. PMID:25099252
Compound heterozygous alterations in intraflagellar transport protein CLUAP1 in a child with a novel Joubert and oral-facial-digital overlap syndrome.

PubMed

Johnston, Jennifer J; Lee, Chanjae; Wentzensen, Ingrid M; Parisi, Melissa A; Crenshaw, Molly M; Sapp, Julie C; Gross, Jeffrey M; Wallingford, John B; Biesecker, Leslie G

2017-07-01

Disruption of normal ciliary function results in a range of diseases collectively referred to as ciliopathies. Here we report a child with a phenotype that overlapped with Joubert, oral-facial-digital, and Pallister-Hall syndromes including brain, limb, and craniofacial anomalies. We performed exome-sequence analysis on a proband and both parents, filtered for putative causative variants, and Sanger-verified variants of interest. Identified variants in CLUAP1 were functionally analyzed in a Xenopus system to determine their effect on ciliary function. Two variants in CLUAP1 were identified through exome-sequence analysis, Chr16:g.3558407T>G, c.338T>G, p.(Met113Arg) and Chr16:g.3570011C>T, c.688C>T, p.(Arg230Ter). These variants were rare in the Exome Aggregation Consortium (ExAC) data set of 65,000 individuals (one and two occurrences, respectively). Transfection of mutant CLUAP1 constructs into Xenopus embryos showed reduced protein levels for p.(Arg230Ter) and reduced intraflagellar transport for p.(Met113Arg). The genetic data show that these variants are present in an affected child, are rare in the population, and result in reduced, but not absent, intraflagellar transport. We conclude that biallelic mutations in CLUAP1 resulted in this novel ciliopathy syndrome in the proband. © 2017 Johnston et al.; Published by Cold Spring Harbor Laboratory Press.
Compound heterozygous alterations in intraflagellar transport protein CLUAP1 in a child with a novel Joubert and oral–facial–digital overlap syndrome

PubMed Central

Johnston, Jennifer J.; Lee, Chanjae; Wentzensen, Ingrid M.; Parisi, Melissa A.; Crenshaw, Molly M.; Sapp, Julie C.; Gross, Jeffrey M.; Wallingford, John B.; Biesecker, Leslie G.

2017-01-01

Disruption of normal ciliary function results in a range of diseases collectively referred to as ciliopathies. Here we report a child with a phenotype that overlapped with Joubert, oral–facial–digital, and Pallister–Hall syndromes including brain, limb, and craniofacial anomalies. We performed exome-sequence analysis on a proband and both parents, filtered for putative causative variants, and Sanger-verified variants of interest. Identified variants in CLUAP1 were functionally analyzed in a Xenopus system to determine their effect on ciliary function. Two variants in CLUAP1 were identified through exome-sequence analysis, Chr16:g.3558407T>G, c.338T>G, p.(Met113Arg) and Chr16:g.3570011C>T, c.688C>T, p.(Arg230Ter). These variants were rare in the Exome Aggregation Consortium (ExAC) data set of 65,000 individuals (one and two occurrences, respectively). Transfection of mutant CLUAP1 constructs into Xenopus embryos showed reduced protein levels for p.(Arg230Ter) and reduced intraflagellar transport for p.(Met113Arg). The genetic data show that these variants are present in an affected child, are rare in the population, and result in reduced, but not absent, intraflagellar transport. We conclude that biallelic mutations in CLUAP1 resulted in this novel ciliopathy syndrome in the proband. PMID:28679688
Variants in EXOSC9 Disrupt the RNA Exosome and Result in Cerebellar Atrophy with Spinal Motor Neuronopathy.

PubMed

Burns, David T; Donkervoort, Sandra; Müller, Juliane S; Knierim, Ellen; Bharucha-Goebel, Diana; Faqeih, Eissa Ali; Bell, Stephanie K; AlFaifi, Abdullah Y; Monies, Dorota; Millan, Francisca; Retterer, Kyle; Dyack, Sarah; MacKay, Sara; Morales-Gonzalez, Susanne; Giunta, Michele; Munro, Benjamin; Hudson, Gavin; Scavina, Mena; Baker, Laura; Massini, Tara C; Lek, Monkol; Hu, Ying; Ezzo, Daniel; AlKuraya, Fowzan S; Kang, Peter B; Griffin, Helen; Foley, A Reghan; Schuelke, Markus; Horvath, Rita; Bönnemann, Carsten G

2018-05-03

The exosome is a conserved multi-protein complex that is essential for correct RNA processing. Recessive variants in exosome components EXOSC3, EXOSC8, and RBM7 cause various constellations of pontocerebellar hypoplasia (PCH), spinal muscular atrophy (SMA), and central nervous system demyelination. Here, we report on four unrelated affected individuals with recessive variants in EXOSC9 and the effect of the variants on the function of the RNA exosome in vitro in affected individuals' fibroblasts and skeletal muscle and in vivo in zebrafish. The clinical presentation was severe, early-onset, progressive SMA-like motor neuronopathy, cerebellar atrophy, and in one affected individual, congenital fractures of the long bones. Three affected individuals of different ethnicity carried the homozygous c.41T>C (p.Leu14Pro) variant, whereas one affected individual was compound heterozygous for c.41T>C (p.Leu14Pro) and c.481C>T (p.Arg161*). We detected reduced EXOSC9 in fibroblasts and skeletal muscle and observed a reduction of the whole multi-subunit exosome complex on blue-native polyacrylamide gel electrophoresis. RNA sequencing of fibroblasts and skeletal muscle detected significant >2-fold changes in genes involved in neuronal development and cerebellar and motor neuron degeneration, demonstrating the widespread effect of the variants. Morpholino oligonucleotide knockdown and CRISPR/Cas9-mediated mutagenesis of exosc9 in zebrafish recapitulated aspects of the human phenotype, as they have in other zebrafish models of exosomal disease. Specifically, portions of the cerebellum and hindbrain were absent, and motor neurons failed to develop and migrate properly. In summary, we show that variants in EXOSC9 result in a neurological syndrome combining cerebellar atrophy and spinal motoneuronopathy, thus expanding the list of human exosomopathies. Copyright © 2018 The Author(s). Published by Elsevier Inc. All rights reserved.
A novel pathogenic variant in an Iranian Ataxia telangiectasia family revealed by next-generation sequencing followed by in silico analysis.

PubMed

Tabatabaiefar, Mohammad Amin; Alipour, Paria; Pourahmadiyan, Azam; Fattahi, Najmeh; Shariati, Laleh; Golchin, Neda; Mohammadi-Asl, Javad

2017-08-15

Ataxia telangiectasia (A-T) is a neurodegenerative autosomal recessive disorder with the main characteristics of progressive cerebellar degeneration, sensitivity to ionizing radiation, immunodeficiency, telangiectasia, premature aging, recurrent sinopulmonary infections, and increased risk of malignancy, especially of lymphoid origin. The Ataxia Telangiectasia Mutated gene, ATM, the causative gene for A-T, encodes the ATM protein, which plays an important role in the activation of cell-cycle checkpoints and initiation of DNA repair in response to DNA damage. Targeted next-generation sequencing (NGS) was performed on an Iranian 5-year-old boy who presented with truncal and limb ataxia, telangiectasia of the eye, Hodgkin lymphoma, hyperpigmentation, total alopecia, hepatomegaly, and dysarthria. Sanger sequencing was used to confirm the candidate pathogenic variants. Computational docking was done using the HEX software to examine how this change affects the interactions of ATM with the upstream and downstream proteins. Three different variants were identified comprising two homozygous SNPs and one novel homozygous frameshift variant (c.8046_8047delTA, p.Thr2682ThrfsX5), which creates a stop codon in exon 57 leaving the protein truncated at its C-terminal portion. Therefore, the activation and phosphorylation of target proteins are lost. Moreover, the HEX software confirmed that the mutated protein lost its interaction with upstream and downstream proteins. The variant was classified as pathogenic based on the American College of Medical Genetics and Genomics guideline. This study expands the spectrum of ATM pathogenic variants in Iran and demonstrates the utility of targeted NGS in genetic diagnostics. Copyright © 2017. Published by Elsevier B.V.
OCA2 splice site variant in German Spitz dogs with oculocutaneous albinism.

PubMed

Caduff, Madleina; Bauer, Anina; Jagannathan, Vidhya; Leeb, Tosso

2017-01-01

We investigated a German Spitz family where the mating of a black male to a white female had yielded three puppies with an unexpected light brown coat color, lightly pigmented lips and noses, and blue eyes. Combined linkage and homozygosity analysis based on a fully penetrant monogenic autosomal recessive mode of inheritance identified a critical interval of 15 Mb on chromosome 3. We obtained whole genome sequence data from one affected dog, three wolves, and 188 control dogs. Filtering for private variants revealed a single variant with predicted high impact in the critical interval in LOC100855460 (XM_005618224.1:c.377+2T>G LT844587.1:c.-45+2T>G). The variant perfectly co-segregated with the phenotype in the family. We genotyped 181 control dogs with normal pigmentation from diverse breeds including 22 unrelated German Spitz dogs, which were all homozygous wildtype. Comparative sequence analyses revealed that LOC100855460 actually represents the 5'-end of the canine OCA2 gene. The CanFam 3.1 reference genome assembly is incorrect and separates the first two exons from the remaining exons of the OCA2 gene. We amplified a canine OCA2 cDNA fragment by RT-PCR and determined the correct full-length mRNA sequence (LT844587.1). Variants in the OCA2 gene cause oculocutaneous albinism type 2 (OCA2) in humans, pink-eyed dilution in mice, and similar phenotypes in corn snakes, medaka and Mexican cave tetra fish. We therefore conclude that the observed oculocutaneous albinism in German Spitz is most likely caused by the identified variant in the 5'-splice site of the first intron of the canine OCA2 gene.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Thorell, Kaisa; Hosseini, Shaghayegh; Palacios Gonzales, Reyna Victoria Palacios

Helicobacter pylori (H. pylori) infection is one of the most common bacterial infections in humans and can lead to gastric ulcers and gastric cancer. H. pylori is one of the most genetically variable human pathogens and the ability of the bacterium to bind to the host epithelium as well as the presence of different virulence factors and genetic variants within these genes have been associated with disease severity. Nicaragua has particularly high gastric cancer incidence and we therefore studied Nicaraguan clinical H. pylori isolates for factors that could contribute to cancer risk. The complete genomes of fifty-two Nicaraguan H. pylori isolates were sequenced and assembled de novo, and phylogenetic and virulence factor analyses were performed. The Nicaraguan isolates showed phylogenetic relationship with West African isolates in whole-genome sequence comparisons and with Western and urban South and Central American isolates using MLSA (multi-locus sequence analysis). A majority (77%) of the isolates carried the cancer-associated virulence gene cagA and also the s1/i1/m1 allele combination of the vacuolating cytotoxin gene vacA, which is linked to increased severity of disease. Specifically, we also found that Nicaraguan isolates have a blood group-binding adhesin (BabA) variant highly similar to previously reported BabA sequences from Latin America, including from isolates belonging to other phylogenetic groups. These BabA sequences were found to be under positive selection at several amino acid positions that differed from the global collection of isolates. In conclusion, the discovery of a Latin American BabA variant, independent of overall phylogenetic background, suggests hitherto unknown host or environmental factors within the Latin American population giving H. pylori isolates carrying this adhesin variant a selective advantage, which could affect pathogenesis and risk for sequelae through specific adherence properties.
A targeted genotyping approach enhances identification of variants in taste receptor and appetite/reward genes of potential functional importance for obesity-related porcine traits.

PubMed

Cirera, S; Clop, A; Jacobsen, M J; Guerin, M; Lesnik, P; Jørgensen, C B; Fredholm, M; Karlskov-Mortensen, P

2018-04-01

Taste receptors (TASRs) and appetite and reward (AR) mechanisms influence eating behaviour, which in turn affects food intake and risk of obesity. In a previous study, we used next generation sequencing to identify potentially functional mutations in TASR and AR genes and found indications for genetic associations between identified variants and growth and fat deposition in a subgroup of animals (n = 38) from the UNIK resource pig population. This population was created for studying obesity and obesity-related diseases. In the present study we validated results from our previous study by investigating genetic associations between 24 selected single nucleotide variants in TASR and AR gene variants and 35 phenotypes describing obesity and metabolism in the entire UNIK population (n = 564). Fifteen variants showed significant association with specific obesity-related phenotypes after Bonferroni correction. Six of the 15 genes, namely SIM1, FOS, TAS2R4, TAS2R9, MCHR2 and LEPR, showed good correlation between known biological function and associated phenotype. We verified a genetic association between potentially functional variants in TASR/AR genes and growth/obesity and conclude that the combination of identification of potentially functional variants by next generation sequencing followed by targeted genotyping and association studies is a powerful and cost-effective approach for increasing the power of genetic association studies. © 2018 Stichting International Foundation for Animal Genetics.
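The Bonferroni correction mentioned in this record divides the family-wise significance level by the number of tests performed. The abstract does not state how many tests were run, so the sketch below assumes, purely for illustration, that each of the 24 variants was tested against each of the 35 phenotypes at a family-wise level of 0.05; the function name `bonferroni_threshold` is hypothetical.

```python
def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Per-test significance threshold under Bonferroni correction."""
    if n_tests <= 0:
        raise ValueError("n_tests must be positive")
    return alpha / n_tests

# Assumption for illustration: 24 variants x 35 phenotypes, family-wise alpha of 0.05.
n_tests = 24 * 35
threshold = bonferroni_threshold(0.05, n_tests)
print(f"{n_tests} tests -> per-test threshold {threshold:.2e}")
```

Under these assumed numbers an association would need a p-value below roughly 6 × 10⁻⁵ to remain significant; the study's actual test count and threshold may differ.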
An efficient and scalable analysis framework for variant extraction and refinement from population-scale DNA sequence data.

PubMed

Jun, Goo; Wing, Mary Kate; Abecasis, Gonçalo R; Kang, Hyun Min

2015-06-01

The analysis of next-generation sequencing data is computationally and statistically challenging because of the massive volume of data and imperfect data quality. We present GotCloud, a pipeline for efficiently detecting and genotyping high-quality variants from large-scale sequencing data. GotCloud automates sequence alignment, sample-level quality control, variant calling, filtering of likely artifacts using machine-learning techniques, and genotype refinement using haplotype information. The pipeline can process thousands of samples in parallel and requires less computational resources than current alternatives. Experiments with whole-genome and exome-targeted sequence data generated by the 1000 Genomes Project show that the pipeline provides effective filtering against false positive variants and high power to detect true variants. Our pipeline has already contributed to variant detection and genotyping in several large-scale sequencing projects, including the 1000 Genomes Project and the NHLBI Exome Sequencing Project. We hope it will now prove useful to many medical sequencing studies. © 2015 Jun et al.; Published by Cold Spring Harbor Laboratory Press.
Principles and Recommendations for Standardizing the Use of the Next-Generation Sequencing Variant File in Clinical Settings.

PubMed

Lubin, Ira M; Aziz, Nazneen; Babb, Lawrence J; Ballinger, Dennis; Bisht, Himani; Church, Deanna M; Cordes, Shaun; Eilbeck, Karen; Hyland, Fiona; Kalman, Lisa; Landrum, Melissa; Lockhart, Edward R; Maglott, Donna; Marth, Gabor; Pfeifer, John D; Rehm, Heidi L; Roy, Somak; Tezak, Zivana; Truty, Rebecca; Ullman-Cullere, Mollie; Voelkerding, Karl V; Worthey, Elizabeth A; Zaranek, Alexander W; Zook, Justin M

2017-05-01

A national workgroup convened by the Centers for Disease Control and Prevention identified principles and made recommendations for standardizing the description of sequence data contained within the variant file generated during the course of clinical next-generation sequence analysis for diagnosing human heritable conditions. The specifications for variant files were initially developed to be flexible with regard to content representation to support a variety of research applications. This flexibility permits variation with regard to how sequence findings are described and this depends, in part, on the conventions used. For clinical laboratory testing, this poses a problem because these differences can compromise the capability to compare sequence findings among laboratories to confirm results and to query databases to identify clinically relevant variants. To provide for a more consistent representation of sequence findings described within variant files, the workgroup made several recommendations that considered alignment to a common reference sequence, variant caller settings, use of genomic coordinates, and gene and variant naming conventions. These recommendations were considered with regard to the existing variant file specifications presently used in the clinical setting. Adoption of these recommendations is anticipated to reduce the potential for ambiguity in describing sequence findings and facilitate the sharing of genomic data among clinical laboratories and other entities. Copyright © 2017 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.
regSNPs: a strategy for prioritizing regulatory single nucleotide substitutions

PubMed Central

Teng, Mingxiang; Ichikawa, Shoji; Padgett, Leah R.; Wang, Yadong; Mort, Matthew; Cooper, David N.; Koller, Daniel L.; Foroud, Tatiana; Edenberg, Howard J.; Econs, Michael J.; Liu, Yunlong

2012-01-01

Motivation: A fundamental goal of genetic studies is to identify functional DNA variants that are responsible for a disease or phenotype of interest. Results from large-scale genetics studies, such as genome-wide association studies (GWAS), and the availability of high-throughput sequencing technologies provide opportunities for identifying causal variants. Despite the technical advances, informatics methodologies need to be developed to prioritize thousands of variants for potential causative effects. Results: We present regSNPs, an informatics strategy that integrates several established bioinformatics tools, for prioritizing regulatory SNPs, i.e. the SNPs in the promoter regions that potentially affect phenotype through changing transcription of downstream genes. Compared with existing tools, regSNPs has two distinct features. It considers degenerative features of binding motifs by calculating the differences in binding affinity caused by the candidate variants and integrates potential phenotypic effects of various transcription factors. When tested by using the disease-causing variants documented in the Human Gene Mutation Database, regSNPs showed mixed performance on various diseases. regSNPs predicted three SNPs that can potentially affect bone density in a region detected in an earlier linkage study. Potential effects of one of the variants were validated using a luciferase reporter assay. Contact: yunliu@iupui.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:22611130
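To make the idea of a variant-induced change in binding affinity concrete, the sketch below scores a reference and an alternate site against a toy position probability matrix and reports the difference in log-odds score. This is a generic motif-scoring illustration, not the regSNPs implementation; the matrix values, the uniform background, and the function names are all assumptions invented for this example.

```python
import math

# Toy position probability matrix for a hypothetical 4-bp motif (one dict per position).
# These numbers are made up for illustration only.
MOTIF = [
    {"A": 0.7, "C": 0.1, "G": 0.1, "T": 0.1},
    {"A": 0.1, "C": 0.1, "G": 0.7, "T": 0.1},
    {"A": 0.1, "C": 0.7, "G": 0.1, "T": 0.1},
    {"A": 0.1, "C": 0.1, "G": 0.1, "T": 0.7},
]
BACKGROUND = 0.25  # uniform background base frequency (assumption)

def log_odds_score(site: str) -> float:
    """Sum of per-position log2(probability / background) for a site."""
    if len(site) != len(MOTIF):
        raise ValueError("site length must match motif length")
    return sum(math.log2(column[base] / BACKGROUND) for column, base in zip(MOTIF, site))

def affinity_change(ref_site: str, alt_site: str) -> float:
    """Score difference (alt minus ref); negative values suggest weaker predicted binding."""
    return log_odds_score(alt_site) - log_odds_score(ref_site)

# A C>T change at the third motif position lowers the score for this toy matrix.
print(affinity_change("AGCT", "AGTT"))
```

Because the score is additive over positions, a single-nucleotide change alters only one term, so substitutions at weakly constrained motif positions produce small differences and those at strongly constrained positions produce large ones.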
Sequence data and association statistics from 12,940 type 2 diabetes cases and controls.

PubMed

Flannick, Jason; Fuchsberger, Christian; Mahajan, Anubha; Teslovich, Tanya M; Agarwala, Vineeta; Gaulton, Kyle J; Caulkins, Lizz; Koesterer, Ryan; Ma, Clement; Moutsianas, Loukas; McCarthy, Davis J; Rivas, Manuel A; Perry, John R B; Sim, Xueling; Blackwell, Thomas W; Robertson, Neil R; Rayner, N William; Cingolani, Pablo; Locke, Adam E; Tajes, Juan Fernandez; Highland, Heather M; Dupuis, Josee; Chines, Peter S; Lindgren, Cecilia M; Hartl, Christopher; Jackson, Anne U; Chen, Han; Huyghe, Jeroen R; van de Bunt, Martijn; Pearson, Richard D; Kumar, Ashish; Müller-Nurasyid, Martina; Grarup, Niels; Stringham, Heather M; Gamazon, Eric R; Lee, Jaehoon; Chen, Yuhui; Scott, Robert A; Below, Jennifer E; Chen, Peng; Huang, Jinyan; Go, Min Jin; Stitzel, Michael L; Pasko, Dorota; Parker, Stephen C J; Varga, Tibor V; Green, Todd; Beer, Nicola L; Day-Williams, Aaron G; Ferreira, Teresa; Fingerlin, Tasha; Horikoshi, Momoko; Hu, Cheng; Huh, Iksoo; Ikram, Mohammad Kamran; Kim, Bong-Jo; Kim, Yongkang; Kim, Young Jin; Kwon, Min-Seok; Lee, Juyoung; Lee, Selyeong; Lin, Keng-Han; Maxwell, Taylor J; Nagai, Yoshihiko; Wang, Xu; Welch, Ryan P; Yoon, Joon; Zhang, Weihua; Barzilai, Nir; Voight, Benjamin F; Han, Bok-Ghee; Jenkinson, Christopher P; Kuulasmaa, Teemu; Kuusisto, Johanna; Manning, Alisa; Ng, Maggie C Y; Palmer, Nicholette D; Balkau, Beverley; Stančáková, Alena; Abboud, Hanna E; Boeing, Heiner; Giedraitis, Vilmantas; Prabhakaran, Dorairaj; Gottesman, Omri; Scott, James; Carey, Jason; Kwan, Phoenix; Grant, George; Smith, Joshua D; Neale, Benjamin M; Purcell, Shaun; Butterworth, Adam S; Howson, Joanna M M; Lee, Heung Man; Lu, Yingchang; Kwak, Soo-Heon; Zhao, Wei; Danesh, John; Lam, Vincent K L; Park, Kyong Soo; Saleheen, Danish; So, Wing Yee; Tam, Claudia H T; Afzal, Uzma; Aguilar, David; Arya, Rector; Aung, Tin; Chan, Edmund; Navarro, Carmen; Cheng, Ching-Yu; Palli, Domenico; Correa, Adolfo; Curran, Joanne E; Rybin, Dennis; Farook, Vidya S; Fowler, Sharon P; Freedman, Barry I; 
Griswold, Michael; Hale, Daniel Esten; Hicks, Pamela J; Khor, Chiea-Chuen; Kumar, Satish; Lehne, Benjamin; Thuillier, Dorothée; Lim, Wei Yen; Liu, Jianjun; Loh, Marie; Musani, Solomon K; Puppala, Sobha; Scott, William R; Yengo, Loïc; Tan, Sian-Tsung; Taylor, Herman A; Thameem, Farook; Wilson, Gregory; Wong, Tien Yin; Njølstad, Pål Rasmus; Levy, Jonathan C; Mangino, Massimo; Bonnycastle, Lori L; Schwarzmayr, Thomas; Fadista, João; Surdulescu, Gabriela L; Herder, Christian; Groves, Christopher J; Wieland, Thomas; Bork-Jensen, Jette; Brandslund, Ivan; Christensen, Cramer; Koistinen, Heikki A; Doney, Alex S F; Kinnunen, Leena; Esko, Tõnu; Farmer, Andrew J; Hakaste, Liisa; Hodgkiss, Dylan; Kravic, Jasmina; Lyssenko, Valeri; Hollensted, Mette; Jørgensen, Marit E; Jørgensen, Torben; Ladenvall, Claes; Justesen, Johanne Marie; Käräjämäki, Annemari; Kriebel, Jennifer; Rathmann, Wolfgang; Lannfelt, Lars; Lauritzen, Torsten; Narisu, Narisu; Linneberg, Allan; Melander, Olle; Milani, Lili; Neville, Matt; Orho-Melander, Marju; Qi, Lu; Qi, Qibin; Roden, Michael; Rolandsson, Olov; Swift, Amy; Rosengren, Anders H; Stirrups, Kathleen; Wood, Andrew R; Mihailov, Evelin; Blancher, Christine; Carneiro, Mauricio O; Maguire, Jared; Poplin, Ryan; Shakir, Khalid; Fennell, Timothy; DePristo, Mark; de Angelis, Martin Hrabé; Deloukas, Panos; Gjesing, Anette P; Jun, Goo; Nilsson, Peter; Murphy, Jacquelyn; Onofrio, Robert; Thorand, Barbara; Hansen, Torben; Meisinger, Christa; Hu, Frank B; Isomaa, Bo; Karpe, Fredrik; Liang, Liming; Peters, Annette; Huth, Cornelia; O'Rahilly, Stephen P; Palmer, Colin N A; Pedersen, Oluf; Rauramaa, Rainer; Tuomilehto, Jaakko; Salomaa, Veikko; Watanabe, Richard M; Syvänen, Ann-Christine; Bergman, Richard N; Bharadwaj, Dwaipayan; Bottinger, Erwin P; Cho, Yoon Shin; Chandak, Giriraj R; Chan, Juliana Cn; Chia, Kee Seng; Daly, Mark J; Ebrahim, Shah B; Langenberg, Claudia; Elliott, Paul; Jablonski, Kathleen A; Lehman, Donna M; Jia, Weiping; Ma, Ronald C W; Pollin, Toni I; 
Sandhu, Manjinder; Tandon, Nikhil; Froguel, Philippe; Barroso, Inês; Teo, Yik Ying; Zeggini, Eleftheria; Loos, Ruth J F; Small, Kerrin S; Ried, Janina S; DeFronzo, Ralph A; Grallert, Harald; Glaser, Benjamin; Metspalu, Andres; Wareham, Nicholas J; Walker, Mark; Banks, Eric; Gieger, Christian; Ingelsson, Erik; Im, Hae Kyung; Illig, Thomas; Franks, Paul W; Buck, Gemma; Trakalo, Joseph; Buck, David; Prokopenko, Inga; Mägi, Reedik; Lind, Lars; Farjoun, Yossi; Owen, Katharine R; Gloyn, Anna L; Strauch, Konstantin; Tuomi, Tiinamaija; Kooner, Jaspal Singh; Lee, Jong-Young; Park, Taesung; Donnelly, Peter; Morris, Andrew D; Hattersley, Andrew T; Bowden, Donald W; Collins, Francis S; Atzmon, Gil; Chambers, John C; Spector, Timothy D; Laakso, Markku; Strom, Tim M; Bell, Graeme I; Blangero, John; Duggirala, Ravindranath; Tai, E Shyong; McVean, Gilean; Hanis, Craig L; Wilson, James G; Seielstad, Mark; Frayling, Timothy M; Meigs, James B; Cox, Nancy J; Sladek, Rob; Lander, Eric S; Gabriel, Stacey; Mohlke, Karen L; Meitinger, Thomas; Groop, Leif; Abecasis, Goncalo; Scott, Laura J; Morris, Andrew P; Kang, Hyun Min; Altshuler, David; Burtt, Noël P; Florez, Jose C; Boehnke, Michael; McCarthy, Mark I

2017-12-19

To investigate the genetic basis of type 2 diabetes (T2D) to high resolution, the GoT2D and T2D-GENES consortia catalogued variation from whole-genome sequencing of 2,657 European individuals and exome sequencing of 12,940 individuals of multiple ancestries. Over 27M SNPs, indels, and structural variants were identified, including 99% of low-frequency (minor allele frequency [MAF] 0.1-5%) non-coding variants in the whole-genome sequenced individuals and 99.7% of low-frequency coding variants in the whole-exome sequenced individuals. Each variant was tested for association with T2D in the sequenced individuals, and, to increase power, most were tested in larger numbers of individuals (>80% of low-frequency coding variants in ~82 K Europeans via the exome chip, and ~90% of low-frequency non-coding variants in ~44 K Europeans via genotype imputation). The variants, genotypes, and association statistics from these analyses provide the largest reference to date of human genetic information relevant to T2D, for use in activities such as T2D-focused genotype imputation, functional characterization of variants or genes, and other novel analyses to detect associations between sequence variation and T2D.
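The low-frequency range quoted in this record (minor allele frequency 0.1-5%) can be illustrated with a short calculation from genotype counts. The sketch below is a minimal example for a biallelic site in diploid samples; the genotype counts are made up, and the "rare" and "common" labels outside the quoted range are assumed names, not terms taken from the abstract.

```python
def minor_allele_frequency(n_hom_ref: int, n_het: int, n_hom_alt: int) -> float:
    """MAF for a biallelic site from diploid genotype counts."""
    n_alleles = 2 * (n_hom_ref + n_het + n_hom_alt)
    if n_alleles == 0:
        raise ValueError("no genotypes supplied")
    alt_freq = (n_het + 2 * n_hom_alt) / n_alleles
    return min(alt_freq, 1.0 - alt_freq)

def frequency_class(maf: float) -> str:
    """Bin by the 0.1-5% low-frequency range quoted above; other labels are assumptions."""
    if maf < 0.001:
        return "rare"
    if maf <= 0.05:
        return "low-frequency"
    return "common"

# Made-up counts for 2,657 individuals: 2,600 hom-ref, 55 het, 2 hom-alt.
maf = minor_allele_frequency(2600, 55, 2)
print(round(maf, 4), frequency_class(maf))
```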
Sequence data and association statistics from 12,940 type 2 diabetes cases and controls

PubMed Central

Flannick, Jason; Fuchsberger, Christian; Mahajan, Anubha; Teslovich, Tanya M.; Agarwala, Vineeta; Gaulton, Kyle J.; Caulkins, Lizz; Koesterer, Ryan; Ma, Clement; Moutsianas, Loukas; McCarthy, Davis J.; Rivas, Manuel A.; Perry, John R. B.; Sim, Xueling; Blackwell, Thomas W.; Robertson, Neil R.; Rayner, N William; Cingolani, Pablo; Locke, Adam E.; Tajes, Juan Fernandez; Highland, Heather M.; Dupuis, Josee; Chines, Peter S.; Lindgren, Cecilia M.; Hartl, Christopher; Jackson, Anne U.; Chen, Han; Huyghe, Jeroen R.; van de Bunt, Martijn; Pearson, Richard D.; Kumar, Ashish; Müller-Nurasyid, Martina; Grarup, Niels; Stringham, Heather M.; Gamazon, Eric R.; Lee, Jaehoon; Chen, Yuhui; Scott, Robert A.; Below, Jennifer E.; Chen, Peng; Huang, Jinyan; Go, Min Jin; Stitzel, Michael L.; Pasko, Dorota; Parker, Stephen C. J.; Varga, Tibor V.; Green, Todd; Beer, Nicola L.; Day-Williams, Aaron G.; Ferreira, Teresa; Fingerlin, Tasha; Horikoshi, Momoko; Hu, Cheng; Huh, Iksoo; Ikram, Mohammad Kamran; Kim, Bong-Jo; Kim, Yongkang; Kim, Young Jin; Kwon, Min-Seok; Lee, Juyoung; Lee, Selyeong; Lin, Keng-Han; Maxwell, Taylor J.; Nagai, Yoshihiko; Wang, Xu; Welch, Ryan P.; Yoon, Joon; Zhang, Weihua; Barzilai, Nir; Voight, Benjamin F.; Han, Bok-Ghee; Jenkinson, Christopher P.; Kuulasmaa, Teemu; Kuusisto, Johanna; Manning, Alisa; Ng, Maggie C. Y.; Palmer, Nicholette D.; Balkau, Beverley; Stančáková, Alena; Abboud, Hanna E.; Boeing, Heiner; Giedraitis, Vilmantas; Prabhakaran, Dorairaj; Gottesman, Omri; Scott, James; Carey, Jason; Kwan, Phoenix; Grant, George; Smith, Joshua D.; Neale, Benjamin M.; Purcell, Shaun; Butterworth, Adam S.; Howson, Joanna M. M.; Lee, Heung Man; Lu, Yingchang; Kwak, Soo-Heon; Zhao, Wei; Danesh, John; Lam, Vincent K. L.; Park, Kyong Soo; Saleheen, Danish; So, Wing Yee; Tam, Claudia H. T.; Afzal, Uzma; Aguilar, David; Arya, Rector; Aung, Tin; Chan, Edmund; Navarro, Carmen; Cheng, Ching-Yu; Palli, Domenico; Correa, Adolfo; Curran, Joanne E.; Rybin, Dennis; Farook, Vidya S.; Fowler, Sharon P.; Freedman, Barry I.; Griswold, Michael; Hale, Daniel Esten; Hicks, Pamela J.; Khor, Chiea-Chuen; Kumar, Satish; Lehne, Benjamin; Thuillier, Dorothée; Lim, Wei Yen; Liu, Jianjun; Loh, Marie; Musani, Solomon K.; Puppala, Sobha; Scott, William R.; Yengo, Loïc; Tan, Sian-Tsung; Taylor, Herman A.; Thameem, Farook; Wilson, Gregory; Wong, Tien Yin; Njølstad, Pål Rasmus; Levy, Jonathan C.; Mangino, Massimo; Bonnycastle, Lori L.; Schwarzmayr, Thomas; Fadista, João; Surdulescu, Gabriela L.; Herder, Christian; Groves, Christopher J.; Wieland, Thomas; Bork-Jensen, Jette; Brandslund, Ivan; Christensen, Cramer; Koistinen, Heikki A.; Doney, Alex S. F.; Kinnunen, Leena; Esko, Tõnu; Farmer, Andrew J.; Hakaste, Liisa; Hodgkiss, Dylan; Kravic, Jasmina; Lyssenko, Valeri; Hollensted, Mette; Jørgensen, Marit E.; Jørgensen, Torben; Ladenvall, Claes; Justesen, Johanne Marie; Käräjämäki, Annemari; Kriebel, Jennifer; Rathmann, Wolfgang; Lannfelt, Lars; Lauritzen, Torsten; Narisu, Narisu; Linneberg, Allan; Melander, Olle; Milani, Lili; Neville, Matt; Orho-Melander, Marju; Qi, Lu; Qi, Qibin; Roden, Michael; Rolandsson, Olov; Swift, Amy; Rosengren, Anders H.; Stirrups, Kathleen; Wood, Andrew R.; Mihailov, Evelin; Blancher, Christine; Carneiro, Mauricio O.; Maguire, Jared; Poplin, Ryan; Shakir, Khalid; Fennell, Timothy; DePristo, Mark; de Angelis, Martin Hrabé; Deloukas, Panos; Gjesing, Anette P.; Jun, Goo; Nilsson, Peter; Murphy, Jacquelyn; Onofrio, Robert; Thorand, Barbara; Hansen, Torben; Meisinger, Christa; Hu, Frank B.; Isomaa, Bo; Karpe, Fredrik; Liang, Liming; Peters, Annette; Huth, Cornelia; O'Rahilly, Stephen P; Palmer, Colin N. A.; Pedersen, Oluf; Rauramaa, Rainer; Tuomilehto, Jaakko; Salomaa, Veikko; Watanabe, Richard M.; Syvänen, Ann-Christine; Bergman, Richard N.; Bharadwaj, Dwaipayan; Bottinger, Erwin P.; Cho, Yoon Shin; Chandak, Giriraj R.; Chan, Juliana CN; Chia, Kee Seng; Daly, Mark J.; Ebrahim, Shah B.; Langenberg, Claudia; Elliott, Paul; Jablonski, Kathleen A.; Lehman, Donna M.; Jia, Weiping; Ma, Ronald C. W.; Pollin, Toni I.; Sandhu, Manjinder; Tandon, Nikhil; Froguel, Philippe; Barroso, Inês; Teo, Yik Ying; Zeggini, Eleftheria; Loos, Ruth J. F.; Small, Kerrin S.; Ried, Janina S.; DeFronzo, Ralph A.; Grallert, Harald; Glaser, Benjamin; Metspalu, Andres; Wareham, Nicholas J.; Walker, Mark; Banks, Eric; Gieger, Christian; Ingelsson, Erik; Im, Hae Kyung; Illig, Thomas; Franks, Paul W.; Buck, Gemma; Trakalo, Joseph; Buck, David; Prokopenko, Inga; Mägi, Reedik; Lind, Lars; Farjoun, Yossi; Owen, Katharine R.; Gloyn, Anna L.; Strauch, Konstantin; Tuomi, Tiinamaija; Kooner, Jaspal Singh; Lee, Jong-Young; Park, Taesung; Donnelly, Peter; Morris, Andrew D.; Hattersley, Andrew T.; Bowden, Donald W.; Collins, Francis S.; Atzmon, Gil; Chambers, John C.; Spector, Timothy D.; Laakso, Markku; Strom, Tim M.; Bell, Graeme I.; Blangero, John; Duggirala, Ravindranath; Tai, E. Shyong; McVean, Gilean; Hanis, Craig L.; Wilson, James G.; Seielstad, Mark; Frayling, Timothy M.; Meigs, James B.; Cox, Nancy J.; Sladek, Rob; Lander, Eric S.; Gabriel, Stacey; Mohlke, Karen L.; Meitinger, Thomas; Groop, Leif; Abecasis, Goncalo; Scott, Laura J.; Morris, Andrew P.; Kang, Hyun Min; Altshuler, David; Burtt, Noël P.; Florez, Jose C.; Boehnke, Michael; McCarthy, Mark I.

2017-01-01

To investigate the genetic basis of type 2 diabetes (T2D) to high resolution, the GoT2D and T2D-GENES consortia catalogued variation from whole-genome sequencing of 2,657 European individuals and exome sequencing of 12,940 individuals of multiple ancestries. Over 27M SNPs, indels, and structural variants were identified, including 99% of low-frequency (minor allele frequency [MAF] 0.1–5%) non-coding variants in the whole-genome sequenced individuals and 99.7% of low-frequency coding variants in the whole-exome sequenced individuals. Each variant was tested for association with T2D in the sequenced individuals, and, to increase power, most were tested in larger numbers of individuals (>80% of low-frequency coding variants in ~82 K Europeans via the exome chip, and ~90% of low-frequency non-coding variants in ~44 K Europeans via genotype imputation). The variants, genotypes, and association statistics from these analyses provide the largest reference to date of human genetic information relevant to T2D, for use in activities such as T2D-focused genotype imputation, functional characterization of variants or genes, and other novel analyses to detect associations between sequence variation and T2D. PMID:29257133

A novel PTCH1 mutation underlies non-syndromic cleft lip and/or palate in a Han Chinese family.

PubMed

Zhao, Huaxiang; Zhong, Wenjie; Leng, Chuntao; Zhang, Jieni; Zhang, Mengqi; Huang, Wenbin; Zhang, Yunfan; Li, Weiran; Jia, Peizeng; Lin, Jiuxiang; Maimaitili, Gulibaha; Chen, Feng

2018-06-16

Cleft lip and/or palate (CL/P) is the most common craniofacial congenital disease, and it has a complex aetiology. This study aimed to identify the causative gene mutation of a Han Chinese family with CL/P. Whole exome sequencing was conducted on the proband and her mother, who exhibited the same phenotype. A Mendelian dominant inheritance model, allele frequency, mutation regions, functional prediction and literature review were used to screen and filter the variants. The candidate was validated by Sanger sequencing. Conservation analysis and homology modelling were conducted. A heterozygous missense mutation c.1175C>T in the PTCH1 gene predicting p.Ala392Val was identified. This variant has not been reported and was predicted to be deleterious. Sanger sequencing verified the variant and the dominant inheritance model in the family. The missense alteration affects an amino acid that is evolutionarily conserved in the first extracellular loop of the PTCH1 protein. The local structure of the mutant protein was significantly altered according to homology modelling. Our findings suggest that c.1175C>T in PTCH1 (NM_000264) may be the causative mutation of this pedigree. Our results add to the evidence that PTCH1 variants play a role in the pathogenesis of orofacial clefts. This article is protected by copyright. All rights reserved.
Evaluation of Bioinformatic Programmes for the Analysis of Variants within Splice Site Consensus Regions

PubMed Central

Tang, Rongying; Prosser, Debra O.; Love, Donald R.

2016-01-01

The increasing diagnostic use of gene sequencing has led to an expanding dataset of novel variants that lie within consensus splice junctions. The challenge for diagnostic laboratories is the evaluation of these variants in order to determine if they affect splicing or are merely benign. A common evaluation strategy is to use in silico analysis, and it is here that a number of programmes are available online; however, currently, there are no consensus guidelines on the selection of programmes or protocols to interpret the prediction results. Using a collection of 222 pathogenic mutations and 50 benign polymorphisms, we evaluated the sensitivity and specificity of four in silico programmes in predicting the effect of each variant on splicing. The programmes comprised Human Splice Finder (HSF), Max Entropy Scan (MES), NNSplice, and ASSP. The MES and ASSP programmes gave the highest performance based on Receiver Operator Curve analysis, with an optimal cut-off of score reduction of 10%. The study also showed that the sensitivity of prediction is affected by the level of conservation of individual positions, with in silico predictions for variants at positions −4 and +7 within consensus splice sites being largely uninformative. PMID:27313609
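The evaluation described in this record can be sketched as a simple calculation: a variant is called splice-affecting when its predicted splice-site score drops by at least a chosen fraction of the reference score, and sensitivity and specificity are then computed over known pathogenic and benign variants. The scores below are made up for illustration and are not data from the study; the function names and the restriction to positive reference scores are assumptions of this sketch rather than features of any of the four programmes.

```python
def score_reduction(ref_score: float, variant_score: float) -> float:
    """Relative drop in splice-site score caused by the variant (0.10 == 10%)."""
    if ref_score <= 0:
        raise ValueError("ref_score must be positive for a relative reduction")
    return (ref_score - variant_score) / ref_score

def sensitivity_specificity(pathogenic, benign, cutoff=0.10):
    """pathogenic/benign are lists of (ref_score, variant_score) pairs."""
    tp = sum(score_reduction(r, v) >= cutoff for r, v in pathogenic)
    tn = sum(score_reduction(r, v) < cutoff for r, v in benign)
    return tp / len(pathogenic), tn / len(benign)

# Made-up scores for illustration; not data from the study.
pathogenic = [(10.0, 2.0), (8.0, 7.9), (9.0, 4.5), (6.0, 5.0)]
benign = [(10.0, 9.8), (7.0, 6.0), (5.0, 5.0)]
print(sensitivity_specificity(pathogenic, benign))
```

Sweeping `cutoff` over a range of values and plotting sensitivity against one minus specificity would give the kind of receiver operating characteristic comparison the abstract refers to.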
L1-Associated Genomic Regions are Deleted in Somatic Cells of the Healthy Human Brain

PubMed Central

Erwin, Jennifer A.; Paquola, Apuã C.M.; Singer, Tatjana; Gallina, Iryna; Novotny, Mark; Quayle, Carolina; Bedrosian, Tracy; Ivanio, Francisco; Butcher, Cheyenne R.; Herdy, Joseph R.; Sarkar, Anindita; Lasken, Roger S.; Muotri, Alysson R.; Gage, Fred H.

2016-01-01

The healthy human brain is a mosaic of varied genomes. L1 retrotransposition is known to create mosaicism by inserting L1 sequences into new locations of somatic cell genomes. Using a machine learning-based, single-cell sequencing approach, we discovered that Somatic L1-Associated Variants (SLAVs) are actually composed of two classes: L1 retrotransposition insertions and retrotransposition-independent L1-associated variants. We demonstrate that a subset of SLAVs are, in fact, somatic deletions generated by L1 endonuclease cutting activity. Retrotransposition-independent rearrangements within inherited L1s resulted in the deletion of proximal genomic regions. These rearrangements were resolved by microhomology-mediated repair, which suggests that L1-associated genomic regions are hotspots for somatic copy number variants in the brain and therefore a heritable genetic contributor to somatic mosaicism. We demonstrate that SLAVs are present in crucial neural genes, such as DLG2/PSD93, and affect 44–63% of cells in the healthy brain. PMID:27618310
A novel synonymous variant in the AVP gene associated with adFNDI causes partial RNA missplicing.

PubMed

Kvistgaard, Helene; Christensen, Jane H; Johansson, Jan-Ove; Gregersen, Niels; Rittig, Charlotte; Rittig, Soeren; Corydon, Thomas Juhl

2018-06-27

Objective: Autosomal dominant familial neurohypophyseal diabetes insipidus (adFNDI) is characterized by severe polyuria and polydipsia and is caused by variations in the gene encoding the AVP prohormone. The study aimed to ascertain a correct diagnosis, to identify the underlying genetic cause of adFNDI in a Swedish kindred, and to test the hypothesis that the identified synonymous exonic variant in the AVP gene (c.324G>A) causes missplicing and endoplasmic reticulum (ER) retention of the prohormone. Three affected family members were admitted for a fluid deprivation test and a dDAVP challenge test. Direct sequencing of the AVP gene was performed in affected subjects, and genotyping of the identified variant was performed in family members. The variant was examined by expression of AVP minigenes containing the entire coding regions as well as intron 2 of AVP. Clinical tests revealed significant phenotypical variation with both complete and partial adFNDI phenotype. DNA analysis revealed a synonymous c.324G>A substitution in one allele of the AVP gene in affected family members only. Cellular studies revealed both normally spliced and misspliced pre-mRNA in cells transfected with the AVP c.324G>A minigene. Confocal laser scanning microscopy showed collective localization of the variant prohormone to ER and vesicular structures at the tip of cellular processes. We have identified a synonymous variant affecting the second nucleotide of exon 3 in the AVP gene (c.324G>A) in a kindred in which adFNDI segregates. Notably, we showed that this variant causes partial missplicing of pre-mRNA resulting in accumulation of variant prohormone in ER. Our study suggests that even a small amount of aberrant mRNA might be sufficient to disturb cellular function resulting in adFNDI. © 2018 S. Karger AG, Basel.
Possible role of rare variants in Trace amine associated receptor 1 in schizophrenia.

PubMed

John, Jibin; Kukshal, Prachi; Bhatia, Triptish; Chowdari, K V; Nimgaonkar, V L; Deshpande, S N; Thelma, B K

2017-11-01

Schizophrenia (SZ) is a chronic mental illness with behavioral abnormalities. Recent common variant based genome wide association studies and rare variant detection using next generation sequencing approaches have identified numerous variants that confer risk for SZ, but etiology remains unclear, propelling continuing investigations. Using whole exome sequencing, we identified a rare heterozygous variant (c.545G>T; p.Cys182Phe) in Trace amine associated receptor 1 gene (TAAR1 6q23.2) in three affected members in a small SZ family. The variant, predicted to be damaging by 15 prediction tools, causes breakage of a conserved disulfide bond in this G-protein-coupled receptor. On screening this intronless gene for additional variant(s) in ~800 sporadic SZ patients, we identified six rare protein altering variants (MAF<0.001) namely p.Ser47Cys, p.Phe51Leu, p.Tyr294Ter, p.Leu295Ser in four unrelated north Indian cases (n=475); p.Ala109Thr and p.Val250Ala in two independent Caucasian/African-American patients (n=310). Five of these variants were also predicted to be damaging. Besides, a rare synonymous variant was observed in SZ patients. These rare variants were absent in north Indian healthy controls (n=410) but significantly enriched in patients (p=0.036). Conversely, three common coding SNPs (rs8192621, rs8192620 and rs8192619) and a promoter SNP (rs60266355) tested for association with SZ in the north Indian cohort were not significant (P>0.05). TAAR1 is a modulator of monoaminergic pathways and interacts with AKT signaling pathways. Substantial animal model based pharmacological and functional data implying its relevance in SZ are also available. However, this is the first report suggestive of the likely contribution of rare variants in this gene to SZ. Copyright © 2017 Elsevier B.V. All rights reserved.
Isolation and molecular characterization of newly emerging avian reovirus variants and novel strains in Pennsylvania, USA, 2011–2014

PubMed Central

Lu, Huaguang; Tang, Yi; Dunn, Patricia A.; Wallner-Pendleton, Eva A.; Lin, Lin; Knoll, Eric A.

2015-01-01

Avian reovirus (ARV) infections of broiler and turkey flocks have caused significant clinical disease and economic losses in Pennsylvania (PA) since 2011. Most of the ARV-infected birds suffered from severe arthritis, tenosynovitis, pericarditis and depressed growth or runting-stunting syndrome (RSS). A high morbidity (up to 20% to 40%) was observed in ARV-affected flocks, and the flock mortality was occasionally as high as 10%. ARV infections in turkeys were diagnosed for the first time in PA in 2011. From 2011 to 2014, a total of 301 ARV isolations were made from affected PA poultry. The molecular characterization of the Sigma C gene of 114 field isolates, representing most ARV outbreaks, revealed that only 21.93% of the 114 sequenced ARV isolates were in the same genotyping cluster (cluster 1) as the ARV vaccine strains (S1133, 1733, and 2048), whereas 78.07% of the sequenced isolates were in genotyping clusters 2, 3, 4, 5, and 6 (which were distinct from the vaccine strains) and represented newly emerging ARV variants. In particular, genotyping cluster 6 was a new ARV genotype that was identified for the first time in 10 novel PA ARV variants of field isolates. PMID:26469681
Nonsense variant in COL7A1 causes recessive dystrophic epidermolysis bullosa in Central Asian Shepherd dogs.

PubMed

Niskanen, Julia; Dillard, Kati; Arumilli, Meharji; Salmela, Elina; Anttila, Marjukka; Lohi, Hannes; Hytönen, Marjo K

2017-01-01

A rare hereditary mechanobullous disorder called epidermolysis bullosa (EB) causes blistering in the skin and the mucosal membranes. To date, nineteen EB-related genes have been discovered in human and other species. We describe here a novel EB variant in dogs. Two newborn littermates of Central Asian Shepherd dogs with severe signs of skin blistering were brought to a veterinary clinic and euthanized due to poor prognosis. In post-mortem examination, the puppies were shown to have findings in the skin and the mucosal membranes characteristic of EB. A whole-genome sequencing of one of the affected puppies was performed to identify the genetic cause. The resequencing data were filtered under a recessive model against variants from 31 other dog genomes, revealing a homozygous case-specific nonsense variant in one of the known EB-causing genes, COL7A1 (c.4579C>T, p.R1527*). The variant results in a premature stop codon and likely absence of the functional protein in the basement membrane of the skin in the affected dogs. This was confirmed by immunohistochemistry using a COL7A1 antibody. Additional screening of the variant indicated full penetrance and breed specificity at ~28% carrier frequency. In summary, this study reveals a novel COL7A1 variant causing recessive dystrophic EB and provides a genetic test for the eradication of the disease from the breed.
A statistical method for the detection of variants from next-generation resequencing of DNA pools.

PubMed

Bansal, Vikas

2010-06-15

Next-generation sequencing technologies have enabled the sequencing of several human genomes in their entirety. However, the routine resequencing of complete genomes remains infeasible. The massive capacity of next-generation sequencers can be harnessed for sequencing specific genomic regions in hundreds to thousands of individuals. Sequencing-based association studies are currently limited by the low level of multiplexing offered by sequencing platforms. Pooled sequencing represents a cost-effective approach for studying rare variants in large populations. To utilize the power of DNA pooling, it is important to accurately identify sequence variants from pooled sequencing data. Detection of rare variants from pooled sequencing represents a different challenge than detection of variants from individual sequencing. We describe a novel statistical approach, CRISP [Comprehensive Read analysis for Identification of Single Nucleotide Polymorphisms (SNPs) from Pooled sequencing] that is able to identify both rare and common variants by using two approaches: (i) comparing the distribution of allele counts across multiple pools using contingency tables and (ii) evaluating the probability of observing multiple non-reference base calls due to sequencing errors alone. Information about the distribution of reads between the forward and reverse strands and the size of the pools is also incorporated within this framework to filter out false variants. Validation of CRISP on two separate pooled sequencing datasets generated using the Illumina Genome Analyzer demonstrates that it can detect 80-85% of SNPs identified using individual sequencing while achieving a low false discovery rate (3-5%). Comparison with previous methods for pooled SNP detection demonstrates the significantly lower false positive and false negative rates for CRISP. Implementation of this method is available at http://polymorphism.scripps.edu/~vbansal/software/CRISP/.
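The two ingredients named in the abstract can be illustrated with a small standard-library sketch. It is not the CRISP implementation: the function names are hypothetical, a plain Pearson chi-square statistic stands in for whatever contingency-table test CRISP applies, and a single uniform per-base error rate is assumed for the binomial tail.

```python
from math import comb


def prob_errors_alone(n_reads: int, n_alt: int, error_rate: float) -> float:
    """P(X >= n_alt) for X ~ Binomial(n_reads, error_rate).

    Probability of seeing at least n_alt non-reference base calls in one pool
    if every one of them were a sequencing error (assumed uniform error rate).
    """
    return sum(
        comb(n_reads, k) * error_rate**k * (1 - error_rate) ** (n_reads - k)
        for k in range(n_alt, n_reads + 1)
    )


def chi_square_statistic(pools) -> float:
    """Pearson chi-square statistic for a (pools x 2) contingency table.

    pools is a list of (reference_count, alternate_count) tuples, one per pool.
    A large value suggests allele counts differ between pools more than
    expected by chance.
    """
    total_ref = sum(ref for ref, _ in pools)
    total_alt = sum(alt for _, alt in pools)
    total = total_ref + total_alt
    stat = 0.0
    for ref, alt in pools:
        n = ref + alt
        for observed, column_total in ((ref, total_ref), (alt, total_alt)):
            expected = n * column_total / total
            if expected > 0:
                stat += (observed - expected) ** 2 / expected
    return stat


# Made-up read counts, for illustration only.
uniform_pools = [(90, 10), (90, 10)]     # same allele fraction in both pools
skewed_pools = [(100, 0), (50, 50)]      # alternate allele confined to one pool
stat_uniform = chi_square_statistic(uniform_pools)
stat_skewed = chi_square_statistic(skewed_pools)
p_error = prob_errors_alone(n_reads=200, n_alt=10, error_rate=0.01)
```

A rare variant present in only one pool tends to show up through the contingency-table comparison, whereas a variant at similar frequency in every pool has to be distinguished from error through the tail probability; that is one way to read why the abstract combines both approaches.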
Characterization of the two intra-individual sequence variants in the 18S rRNA gene in the plant parasitic nematode, Rotylenchulus reniformis.

PubMed

Nyaku, Seloame T; Sripathi, Venkateswara R; Kantety, Ramesh V; Gu, Yong Q; Lawrence, Kathy; Sharma, Govind C

2013-01-01

The 18S rRNA gene is fundamental to cellular and organismal protein synthesis and because of its stable persistence through generations it is also used in phylogenetic analysis among taxa. Sequence variation in this gene within a single species is rare, but it has been observed in a few metazoan organisms. More frequently, it has been reported in the non-transcribed spacer region. Here, we have identified two sequence variants within the near full coding region of 18S rRNA gene from a single reniform nematode (RN) Rotylenchulus reniformis labeled as reniform nematode variant 1 (RN_VAR1) and variant 2 (RN_VAR2). All sequences from three of the four isolates had both RN variants in their sequences; however, isolate 13B had only RN variant 2 sequence. Specific variable base sites (96 or 5.5%) were found within the 18S rRNA gene that can clearly distinguish the two 18S rDNA variants of RN, in 11 (25.0%) and 33 (75.0%) of the 44 RN clones, for RN_VAR1 and RN_VAR2, respectively. Neighbor-joining trees show that the RN_VAR1 is very similar to the previously existing R. reniformis sequence in GenBank, while the RN_VAR2 sequence is more divergent. This is the first report of the identification of two major variants of the 18S rRNA gene in the same single RN, and documents the specific base variation between the two variants, and hypothesizes on simultaneous co-existence of these two variants for this gene.
Characterization of the Two Intra-Individual Sequence Variants in the 18S rRNA Gene in the Plant Parasitic Nematode, Rotylenchulus reniformis

PubMed Central

Nyaku, Seloame T.; Sripathi, Venkateswara R.; Kantety, Ramesh V.; Gu, Yong Q.; Lawrence, Kathy; Sharma, Govind C.

2013-01-01

The 18S rRNA gene is fundamental to cellular and organismal protein synthesis and because of its stable persistence through generations it is also used in phylogenetic analysis among taxa. Sequence variation in this gene within a single species is rare, but it has been observed in a few metazoan organisms. More frequently, it has been reported in the non-transcribed spacer region. Here, we have identified two sequence variants within the near full coding region of 18S rRNA gene from a single reniform nematode (RN) Rotylenchulus reniformis labeled as reniform nematode variant 1 (RN_VAR1) and variant 2 (RN_VAR2). All sequences from three of the four isolates had both RN variants in their sequences; however, isolate 13B had only RN variant 2 sequence. Specific variable base sites (96 or 5.5%) were found within the 18S rRNA gene that can clearly distinguish the two 18S rDNA variants of RN, in 11 (25.0%) and 33 (75.0%) of the 44 RN clones, for RN_VAR1 and RN_VAR2, respectively. Neighbor-joining trees show that the RN_VAR1 is very similar to the previously existing R. reniformis sequence in GenBank, while the RN_VAR2 sequence is more divergent. This is the first report of the identification of two major variants of the 18S rRNA gene in the same single RN, and documents the specific base variation between the two variants, and hypothesizes on simultaneous co-existence of these two variants for this gene. PMID:23593343
A map of human microRNA variation uncovers unexpectedly high levels of variability

PubMed Central

2012-01-01

Background MicroRNAs (miRNAs) are key components of the gene regulatory network in many species. During the past few years, these regulatory elements have been shown to be involved in an increasing number and range of diseases. Consequently, the compilation of a comprehensive map of natural variability in a healthy population seems an obvious requirement for future research on miRNA-related pathologies. Methods Data on 14 populations from the 1000 Genomes Project were analyzed, along with new data extracted from 60 exomes of healthy individuals from a population from southern Spain, sequenced in the context of the Medical Genome Project, to derive an accurate map of miRNA variability. Results Despite the common belief that miRNAs are highly conserved elements, analysis of the sequences of the 1,152 individuals indicated that the observed level of variability is double what was expected. A total of 527 variants were found. Among these, 45 variants affected the recognition region of the corresponding miRNA and were found in 43 different miRNAs, 26 of which are known to be involved in 57 diseases. Different parts of the mature structure of the miRNA were affected to different degrees by variants, which suggests the existence of a selective pressure related to the relative functional impact of the change. Moreover, 41 variants showed a significant deviation from the Hardy-Weinberg equilibrium, which supports the existence of a selective process against some alleles. The average number of variants per individual in miRNAs was 28. Conclusions Despite an expectation that miRNAs would be highly conserved genomic elements, our study reports a level of variability comparable to that observed for coding genes. PMID:22906193
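For readers unfamiliar with the Hardy-Weinberg check mentioned above, the following is a minimal sketch of one common form of the test, a one-degree-of-freedom chi-square on genotype counts for a biallelic site. The abstract does not say which test the authors used, so this is an assumption, and the function name and genotype counts are hypothetical.

```python
from math import erfc, sqrt


def hwe_chi_square(n_hom_ref: int, n_het: int, n_hom_alt: int):
    """Chi-square test for deviation from Hardy-Weinberg equilibrium.

    Returns (statistic, p_value) for one biallelic variant, using the
    1-degree-of-freedom chi-square distribution.
    """
    n = n_hom_ref + n_het + n_hom_alt
    if n == 0:
        raise ValueError("no genotypes supplied")
    p = (2 * n_hom_ref + n_het) / (2 * n)  # reference allele frequency
    q = 1 - p                              # alternate allele frequency
    observed = (n_hom_ref, n_het, n_hom_alt)
    expected = (p * p * n, 2 * p * q * n, q * q * n)
    stat = sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)
    # Survival function of chi-square with 1 df: P(Z^2 > x) = erfc(sqrt(x / 2)).
    p_value = erfc(sqrt(stat / 2))
    return stat, p_value


# Made-up genotype counts, for illustration only.
stat_eq, p_eq = hwe_chi_square(25, 50, 25)     # matches HWE expectations exactly
stat_dev, p_dev = hwe_chi_square(50, 0, 50)    # no heterozygotes at all
```

A deficit of heterozygotes, as in the second example, is the kind of deviation that could be read as selection against an allele, although genotyping error and population structure produce the same signal.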
Guidelines for investigating causality of sequence variants in human disease

PubMed Central

MacArthur, D. G.; Manolio, T. A.; Dimmock, D. P.; Rehm, H. L.; Shendure, J.; Abecasis, G. R.; Adams, D. R.; Altman, R. B.; Antonarakis, S. E.; Ashley, E. A.; Barrett, J. C.; Biesecker, L. G.; Conrad, D. F.; Cooper, G. M.; Cox, N. J.; Daly, M. J.; Gerstein, M. B.; Goldstein, D. B.; Hirschhorn, J. N.; Leal, S. M.; Pennacchio, L. A.; Stamatoyannopoulos, J. A.; Sunyaev, S. R.; Valle, D.; Voight, B. F.; Winckler, W.; Gunter, C.

2014-01-01

The discovery of rare genetic variants is accelerating, and clear guidelines for distinguishing disease-causing sequence variants from the many potentially functional variants present in any human genome are urgently needed. Without rigorous standards we risk an acceleration of false-positive reports of causality, which would impede the translation of genomic research findings into the clinical diagnostic setting and hinder biological understanding of disease. Here we discuss the key challenges of assessing sequence variants in human disease, integrating both gene-level and variant-level support for causality. We propose guidelines for summarizing confidence in variant pathogenicity and highlight several areas that require further resource development. PMID:24759409
Guidelines for investigating causality of sequence variants in human disease.

PubMed

MacArthur, D G; Manolio, T A; Dimmock, D P; Rehm, H L; Shendure, J; Abecasis, G R; Adams, D R; Altman, R B; Antonarakis, S E; Ashley, E A; Barrett, J C; Biesecker, L G; Conrad, D F; Cooper, G M; Cox, N J; Daly, M J; Gerstein, M B; Goldstein, D B; Hirschhorn, J N; Leal, S M; Pennacchio, L A; Stamatoyannopoulos, J A; Sunyaev, S R; Valle, D; Voight, B F; Winckler, W; Gunter, C

2014-04-24

The discovery of rare genetic variants is accelerating, and clear guidelines for distinguishing disease-causing sequence variants from the many potentially functional variants present in any human genome are urgently needed. Without rigorous standards we risk an acceleration of false-positive reports of causality, which would impede the translation of genomic research findings into the clinical diagnostic setting and hinder biological understanding of disease. Here we discuss the key challenges of assessing sequence variants in human disease, integrating both gene-level and variant-level support for causality. We propose guidelines for summarizing confidence in variant pathogenicity and highlight several areas that require further resource development.
Reliable Detection of Herpes Simplex Virus Sequence Variation by High-Throughput Resequencing.

PubMed

Morse, Alison M; Calabro, Kaitlyn R; Fear, Justin M; Bloom, David C; McIntyre, Lauren M

2017-08-16

High-throughput sequencing (HTS) has resulted in data for a number of herpes simplex virus (HSV) laboratory strains and clinical isolates. The knowledge of these sequences has been critical for investigating viral pathogenicity. However, the assembly of complete herpesviral genomes, including HSV, is complicated due to the existence of large repeat regions and arrays of smaller reiterated sequences that are commonly found in these genomes. In addition, the inherent genetic variation in populations of isolates for viruses and other microorganisms presents an additional challenge to many existing HTS sequence assembly pipelines. Here, we evaluate two approaches for the identification of genetic variants in HSV1 strains using Illumina short read sequencing data. The first, a reference-based approach, identifies variants from reads aligned to a reference sequence and the second, a de novo assembly approach, identifies variants from reads aligned to de novo assembled consensus sequences. Of critical importance for both approaches is the reduction in the number of low complexity regions through the construction of a non-redundant reference genome. We compared variants identified in the two methods. Our results indicate that approximately 85% of variants are identified regardless of the approach. The reference-based approach to variant discovery captures an additional 15% representing variants divergent from the HSV1 reference possibly due to viral passage. Reference-based approaches are significantly less labor-intensive and identify variants across the genome where de novo assembly-based approaches are limited to regions where contigs have been successfully assembled. In addition, regions of poor quality assembly can lead to false variant identification in de novo consensus sequences. For viruses with a well-assembled reference genome, a reference-based approach is recommended.
Multilevel biological characterization of exomic variants at the protein level significantly improves the identification of their deleterious effects.

PubMed

Raimondi, Daniele; Gazzo, Andrea M; Rooman, Marianne; Lenaerts, Tom; Vranken, Wim F

2016-06-15

There are now many predictors capable of identifying the likely phenotypic effects of single nucleotide variants (SNVs) or short in-frame Insertions or Deletions (INDELs) on the increasing amount of genome sequence data. Most of these predictors focus on SNVs and use a combination of features related to sequence conservation, biophysical, and/or structural properties to link the observed variant to either neutral or disease phenotype. Despite notable successes, the mapping between genetic variants and their phenotypic effects is riddled with levels of complexity that are not yet fully understood and that are often not taken into account in the predictions, despite their promise of significantly improving the prediction of deleterious mutants. We present DEOGEN, a novel variant effect predictor that can handle both missense SNVs and in-frame INDELs. By integrating information from different biological scales and mimicking the complex mixture of effects that lead from the variant to the phenotype, we obtain significant improvements in the variant-effect prediction results. Next to the typical variant-oriented features based on the evolutionary conservation of the mutated positions, we added a collection of protein-oriented features that are based on functional aspects of the gene affected. We cross-validated DEOGEN on 36 825 polymorphisms, 20 821 deleterious SNVs, and 1038 INDELs from SwissProt. The multilevel contextualization of each (variant, protein) pair in DEOGEN provides a 10% improvement of MCC with respect to current state-of-the-art tools. The software and the data presented here are publicly available at http://ibsquare.be/deogen. Contact: wvranken@vub.ac.be. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
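The MCC quoted above is the Matthews correlation coefficient, a single-number summary of a binary classifier's confusion matrix. A minimal sketch of the standard formula follows; the function name and the counts are hypothetical and are not taken from the DEOGEN evaluation.

```python
from math import sqrt


def matthews_corrcoef(tp: int, tn: int, fp: int, fn: int) -> float:
    """Matthews correlation coefficient from confusion-matrix counts.

    Ranges from -1 (total disagreement) through 0 (no better than chance)
    to +1 (perfect prediction). Returns 0.0 when the denominator is zero,
    which is a common convention.
    """
    denominator = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denominator == 0:
        return 0.0
    return (tp * tn - fp * fn) / denominator


# Made-up counts, for illustration only.
perfect = matthews_corrcoef(tp=10, tn=10, fp=0, fn=0)
chance = matthews_corrcoef(tp=5, tn=5, fp=5, fn=5)
inverted = matthews_corrcoef(tp=0, tn=0, fp=10, fn=10)
```

Unlike plain accuracy, MCC stays informative when the classes are unbalanced, which matters for a dataset with many more polymorphisms than INDELs.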
Rare variant associations with waist-to-hip ratio in European-American and African-American women from the NHLBI-Exome Sequencing Project

PubMed Central

Kan, Mengyuan; Auer, Paul L; Wang, Gao T; Bucasas, Kristine L; Hooker, Stanley; Rodriguez, Alejandra; Li, Biao; Ellis, Jaclyn; Adrienne Cupples, L; Ida Chen, Yii-Der; Dupuis, Josée; Fox, Caroline S; Gross, Myron D; Smith, Joshua D; Heard-Costa, Nancy; Meigs, James B; Pankow, James S; Rotter, Jerome I; Siscovick, David; Wilson, James G; Shendure, Jay; Jackson, Rebecca; Peters, Ulrike; Zhong, Hua; Lin, Danyu; Hsu, Li; Franceschini, Nora; Carlson, Chris; Abecasis, Goncalo; Gabriel, Stacey; Bamshad, Michael J; Altshuler, David; Nickerson, Deborah A; North, Kari E; Lange, Leslie A; Reiner, Alexander P; Leal, Suzanne M

2016-01-01

Waist-to-hip ratio (WHR), a relative comparison of waist and hip circumferences, is an easily accessible measurement of body fat distribution, in particular central abdominal fat. A high WHR indicates more intra-abdominal fat deposition and is an established risk factor for cardiovascular disease and type 2 diabetes. Recent genome-wide association studies have identified numerous common genetic loci influencing WHR, but the contributions of rare variants have not been previously reported. We investigated rare variant associations with WHR in 1510 European-American and 1186 African-American women from the National Heart, Lung, and Blood Institute-Exome Sequencing Project. Association analysis was performed on the gene level using several rare variant association methods. The strongest association was observed for rare variants in IKBKB (P=4.0 × 10^−8) in European-Americans, where rare variants in this gene are predicted to decrease WHRs. The activation of the IKBKB gene is involved in inflammatory processes and insulin resistance, which may affect normal food intake and body weight and shape. Meanwhile, aggregation of rare variants in COBLL1, previously found to harbor common variants associated with WHR and fasting insulin, was nominally associated (P=2.23 × 10^−4) with higher WHR in European-Americans. However, these significant results are not shared between African-Americans and European-Americans, which may be due to differences in the allelic architecture of the two populations and the small sample sizes. Our study indicates that the combined effect of rare variants contributes to the inter-individual variation in fat distribution through the regulation of insulin response. PMID:26757982
Rare variant associations with waist-to-hip ratio in European-American and African-American women from the NHLBI-Exome Sequencing Project.

PubMed

Kan, Mengyuan; Auer, Paul L; Wang, Gao T; Bucasas, Kristine L; Hooker, Stanley; Rodriguez, Alejandra; Li, Biao; Ellis, Jaclyn; Adrienne Cupples, L; Ida Chen, Yii-Der; Dupuis, Josée; Fox, Caroline S; Gross, Myron D; Smith, Joshua D; Heard-Costa, Nancy; Meigs, James B; Pankow, James S; Rotter, Jerome I; Siscovick, David; Wilson, James G; Shendure, Jay; Jackson, Rebecca; Peters, Ulrike; Zhong, Hua; Lin, Danyu; Hsu, Li; Franceschini, Nora; Carlson, Chris; Abecasis, Goncalo; Gabriel, Stacey; Bamshad, Michael J; Altshuler, David; Nickerson, Deborah A; North, Kari E; Lange, Leslie A; Reiner, Alexander P; Leal, Suzanne M

2016-08-01

Waist-to-hip ratio (WHR), a relative comparison of waist and hip circumferences, is an easily accessible measurement of body fat distribution, in particular central abdominal fat. A high WHR indicates more intra-abdominal fat deposition and is an established risk factor for cardiovascular disease and type 2 diabetes. Recent genome-wide association studies have identified numerous common genetic loci influencing WHR, but the contributions of rare variants have not been previously reported. We investigated rare variant associations with WHR in 1510 European-American and 1186 African-American women from the National Heart, Lung, and Blood Institute-Exome Sequencing Project. Association analysis was performed on the gene level using several rare variant association methods. The strongest association was observed for rare variants in IKBKB (P=4.0 × 10^−8) in European-Americans, where rare variants in this gene are predicted to decrease WHRs. The activation of the IKBKB gene is involved in inflammatory processes and insulin resistance, which may affect normal food intake and body weight and shape. Meanwhile, aggregation of rare variants in COBLL1, previously found to harbor common variants associated with WHR and fasting insulin, was nominally associated (P=2.23 × 10^−4) with higher WHR in European-Americans. However, these significant results are not shared between African-Americans and European-Americans, which may be due to differences in the allelic architecture of the two populations and the small sample sizes. Our study indicates that the combined effect of rare variants contributes to the inter-individual variation in fat distribution through the regulation of insulin response.
Variant calling in low-coverage whole genome sequencing of a Native American population sample.

PubMed

Bizon, Chris; Spiegel, Michael; Chasse, Scott A; Gizer, Ian R; Li, Yun; Malc, Ewa P; Mieczkowski, Piotr A; Sailsbery, Josh K; Wang, Xiaoshu; Ehlers, Cindy L; Wilhelmsen, Kirk C

2014-01-30

The reduction in the cost of sequencing a human genome has led to the use of genotype sampling strategies in order to impute and infer the presence of sequence variants that can then be tested for associations with traits of interest. Low-coverage Whole Genome Sequencing (WGS) is a sampling strategy that overcomes some of the deficiencies seen in fixed content SNP array studies. Linkage-disequilibrium (LD) aware variant callers, such as the program Thunder, may provide a calling rate and accuracy that makes a low-coverage sequencing strategy viable. We examined the performance of an LD-aware variant calling strategy in a population of 708 low-coverage whole genome sequences from a community sample of Native Americans. We assessed variant calling through a comparison of the sequencing results to genotypes measured in 641 of the same subjects using a fixed content first generation exome array. The comparison was made using the variant calling routines GATK Unified Genotyper program and the LD-aware variant caller Thunder. Thunder was found to improve concordance in a coverage dependent fashion, while correctly calling nearly all of the common variants as well as a high percentage of the rare variants present in the sample. Low-coverage WGS is a strategy that appears to collect genetic information intermediate in scope between fixed content genotyping arrays and deep-coverage WGS. Our data suggests that low-coverage WGS is a viable strategy with a greater chance of discovering novel variants and associations than fixed content arrays for large sample association analyses.
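The comparison between sequencing calls and array genotypes described above comes down to a concordance rate over the sites and samples typed by both methods. A minimal sketch follows, assuming genotypes are coded as alternate-allele counts (0, 1, 2) and keyed by hypothetical (sample, site) pairs; the study's actual concordance definition, for example whether it is restricted to non-reference genotypes, is not given in the abstract.

```python
def genotype_concordance(sequencing_calls: dict, array_genotypes: dict) -> float:
    """Fraction of shared (sample, site) keys with identical genotype codes.

    Both inputs map (sample_id, site_id) to an alternate-allele count (0, 1, 2).
    Keys present in only one input are ignored.
    """
    shared = sequencing_calls.keys() & array_genotypes.keys()
    if not shared:
        raise ValueError("no overlapping genotypes to compare")
    matches = sum(
        1 for key in shared if sequencing_calls[key] == array_genotypes[key]
    )
    return matches / len(shared)


# Made-up genotypes, for illustration only.
seq = {("s1", "rs1"): 0, ("s1", "rs2"): 1, ("s2", "rs1"): 2}
arr = {("s1", "rs1"): 0, ("s1", "rs2"): 2, ("s2", "rs1"): 2, ("s3", "rs1"): 1}
concordance = genotype_concordance(seq, arr)
```

Computing this separately for bins of read depth would be one way to examine the coverage-dependent improvement the abstract attributes to the LD-aware caller.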
A machine learning model to determine the accuracy of variant calls in capture-based next generation sequencing.

PubMed

van den Akker, Jeroen; Mishne, Gilad; Zimmer, Anjali D; Zhou, Alicia Y

2018-04-17

Next generation sequencing (NGS) has become a common technology for clinical genetic tests. The quality of NGS calls varies widely and is influenced by features like reference sequence characteristics, read depth, and mapping accuracy. With recent advances in NGS technology and software tools, the majority of variants called using NGS alone are in fact accurate and reliable. However, a small subset of difficult-to-call variants that still do require orthogonal confirmation exist. For this reason, many clinical laboratories confirm NGS results using orthogonal technologies such as Sanger sequencing. Here, we report the development of a deterministic machine-learning-based model to differentiate between these two types of variant calls: those that do not require confirmation using an orthogonal technology (high confidence), and those that require additional quality testing (low confidence). This approach allows reliable NGS-based calling in a clinical setting by identifying the few important variant calls that require orthogonal confirmation. We developed and tested the model using a set of 7179 variants identified by a targeted NGS panel and re-tested by Sanger sequencing. The model incorporated several signals of sequence characteristics and call quality to determine if a variant was identified at high or low confidence. The model was tuned to eliminate false positives, defined as variants that were called by NGS but not confirmed by Sanger sequencing. The model achieved very high accuracy: 99.4% (95% confidence interval: +/- 0.03%). It categorized 92.2% (6622/7179) of the variants as high confidence, and 100% of these were confirmed to be present by Sanger sequencing. Among the variants that were categorized as low confidence, defined as NGS calls of low quality that are likely to be artifacts, 92.1% (513/557) were found to be not present by Sanger sequencing. 
This work shows that NGS data contains sufficient characteristics for a machine-learning-based model to differentiate low from high confidence variants. Additionally, it reveals the importance of incorporating site-specific features as well as variant call features in such a model.
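The percentages reported in this abstract can be cross-checked from the counts it gives. The short sketch below does that arithmetic. The last quantity is an assumption about how the 99.4% accuracy might be defined (high-confidence calls confirmed by Sanger plus low-confidence calls found absent, divided by all variants); the abstract does not spell out the definition, and the function name is hypothetical.

```python
def summarize_confirmation(total=7179, high_conf=6622, low_conf_absent=513):
    """Recompute the abstract's percentages from its reported counts."""
    low_conf = total - high_conf  # variants categorized as low confidence
    return {
        "low_conf": low_conf,
        # Share of all variants categorized as high confidence.
        "pct_high_conf": round(100 * high_conf / total, 1),
        # Share of low-confidence calls not confirmed by Sanger sequencing.
        "pct_low_conf_absent": round(100 * low_conf_absent / low_conf, 1),
        # Assumed accuracy definition: model category agrees with Sanger result
        # (all high-confidence calls were reported as confirmed).
        "pct_agreement": round(100 * (high_conf + low_conf_absent) / total, 1),
    }


summary = summarize_confirmation()
```

Under these assumptions the counts are internally consistent: 557 low-confidence variants, 92.2% high confidence, 92.1% of low-confidence calls absent, and 99.4% overall agreement.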
New COL6A6 variant detected by whole-exome sequencing is linked to break points in intron 4 and 3′-UTR, deleting exon 5 of RHO, and causing adRP

PubMed Central

de Sousa Dias, Miguel; Hernan, Imma; Delás, Barbara; Pascual, Beatriz; Borràs, Emma; Gamundi, Maria José; Mañé, Begoña; Fernández-San José, Patricia; Ayuso, Carmen

2015-01-01

Purpose This study aimed to test a newly devised cost-effective multiplex PCR assay for the molecular diagnosis of autosomal dominant retinitis pigmentosa (adRP), as well as the use of whole-exome sequencing (WES) to detect disease-causing mutations in adRP. Methods Genomic DNA was extracted from peripheral blood lymphocytes of index patients with adRP and their affected and unaffected family members. We used a newly devised multiplex PCR assay capable of amplifying the genetic loci of RHO, PRPH2, RP1, PRPF3, PRPF8, PRPF31, IMPDH1, NRL, CRX, KLHL7, and NR2E3 to molecularly diagnose 18 index patients with adRP. We also performed WES in affected and unaffected members of four families with adRP in whom a disease-causing mutation was previously not found. Results We identified five previously reported mutations (p.Arg677X in the RP1 gene, p.Asp133Val and p.Arg195Leu in the PRPH2 gene, and p.Pro171Leu and p.Pro215Leu in the RHO gene) and one novel mutation (p.Val345Gly in the RHO gene) representing 33% detection of causative mutations in our adRP cohort. Comparative WES analysis showed a new variant (p.Gly103Arg in the COL6A6 gene) that segregated with the disease in one family with adRP. As this variant was linked with the RHO locus, we sequenced the complete RHO gene, which revealed a deletion in intron 4 that encompassed all of exon 5 and 28 bp of the 3′-untranslated region (UTR). Conclusions The novel multiplex PCR assay with next-generation sequencing (NGS) proved effective for detecting most of the adRP-causing mutations. A WES approach led to identification of a deletion in RHO through detection of a new linked variant in COL6A6. No pathogenic variants were identified in the remaining three families. Moreover, NGS and WES were inefficient for detecting the complete deletion of exon 5 in the RHO gene in one family with adRP. Carriers of this deletion showed variable clinical status, and two of these carriers had not previously been diagnosed with RP. PMID:26321861

Exome sequencing reveals novel genetic loci influencing obesity-related traits in Hispanic children

USDA-ARS's Scientific Manuscript database

To perform whole exome sequencing in 928 Hispanic children and identify variants and genes associated with childhood obesity. Single-nucleotide variants (SNVs) were identified from Illumina whole exome sequencing data using integrated read mapping, variant calling, and an annotation pipeline (Mercury...
Novel sequence variants in the TMIE gene in families with autosomal recessive nonsyndromic hearing impairment

PubMed Central

Santos, Regie Lyn P.; El-Shanti, Hatem; Sikandar, Shaheen; Lee, Kwanghyuk; Bhatti, Attya; Yan, Kai; Chahrour, Maria H.; McArthur, Nathan; Pham, Thanh L.; Mahasneh, Amjad Abdullah; Ahmad, Wasim

2010-01-01

To date, 37 genes have been identified for nonsyndromic hearing impairment (NSHI). Identifying the functional sequence variants within these genes and knowing their population-specific frequencies is of public health value, in particular for genetic screening for NSHI. To determine putatively functional sequence variants in the transmembrane inner ear (TMIE) gene in Pakistani and Jordanian families with autosomal recessive (AR) NSHI, four Jordanian and 168 Pakistani families with ARNSHI that is not due to GJB2 (CX26) were submitted to a genome scan. Two-point and multipoint parametric linkage analyses were performed, and families with logarithmic odds (LOD) scores of 1.0 or greater within the TMIE region underwent further DNA sequencing. The evolutionary conservation and location in predicted protein domains of amino acid residues where sequence variants occurred were studied to elucidate the possible effects of these sequence variants on function. Of seven families that were screened for TMIE, putatively functional sequence variants were found to segregate with hearing impairment in four families but were not seen in at least 110 ethnically matched control chromosomes. The previously reported c.241C>T (p.R81C) variant was observed in two Pakistani families. Two novel variants, c.92A>G (p.E31G) and the splice site mutation c.212–2A>C, were identified in one Pakistani and one Jordanian family, respectively. The c.92A>G (p.E31G) variant occurred at a residue that is conserved in the mouse and is predicted to be extracellular. Conservation and potential functionality of previously published mutations were also examined. The prevalence of functional TMIE variants in Pakistani families is 1.7% [95% confidence interval (CI) 0.3–4.8]. Further studies on the spectrum, prevalence rates, and functional effect of sequence variants in the TMIE gene in other populations should demonstrate the true importance of this gene as a cause of hearing impairment. PMID:16389551
Polypeptide having or assisting in carbohydrate material degrading activity and uses thereof

DOEpatents

Schooneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; Los, Alrik Pieter

2016-02-16

The invention relates to a polypeptide which comprises the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 76% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 76% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
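The "at least 76% sequence identity" threshold can be illustrated with a small calculation. This is only a sketch: the abstract does not say how identity is computed, so the convention used here (matches divided by aligned columns without gaps) and the two toy sequences are assumptions, not the method defined in the patent.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over aligned columns in which neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == "-" or b == "-":
            continue  # assumed convention: gap columns are excluded
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Made-up, pre-aligned toy sequences (not SEQ ID NO: 2).
reference = "MKTAYIAKQR"
candidate = "MKTAHIAK-R"

identity = percent_identity(reference, candidate)
meets_threshold = identity >= 76.0  # threshold quoted in the abstract
print(f"{identity:.1f}% identity, meets threshold: {meets_threshold}")
```

Other conventions (for example, counting gap columns in the denominator) give different percentages, so the convention matters when a claim depends on a fixed cut-off.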
Polypeptide having beta-glucosidase activity and uses thereof

DOE Office of Scientific and Technical Information (OSTI.GOV)

Schoonneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; De Jong, Rene Marcel

The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having swollenin activity and uses thereof

DOEpatents

Schoonneveld-Bergmans, Margot Elizabeth Francoise; Heijne, Wilbert Herman Marie; Vlasie, Monica D; Damveld, Robbertus Antonius

2015-11-04

The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having beta-glucosidase activity and uses thereof

DOEpatents

Schooneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; De Jong, Rene Marcel; Damveld, Robbertus Antonius

2015-09-01

The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 70% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 70% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having cellobiohydrolase activity and uses thereof

DOEpatents

Sagt, Cornelis Maria Jacobus; Schooneveld-Bergmans, Margot Elisabeth Francoise; Roubos, Johannes Andries; Los, Alrik Pieter

2015-09-15

The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 93% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 93% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having acetyl xylan esterase activity and uses thereof

DOEpatents

Schoonneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; Los, Alrik Pieter

2015-10-20

The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 82% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 82% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having carbohydrate degrading activity and uses thereof

DOEpatents

Schooneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; Vlasie, Monica Diana; Damveld, Robbertus Antonius

2015-08-18

The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Co-occurring Down syndrome and SUCLA2-related mitochondrial depletion syndrome.

PubMed

Couser, Natario L; Marchuk, Daniel S; Smith, Laurie D; Arreola, Alexandra; Kaiser-Rogers, Kathleen A; Muenzer, Joseph; Pandya, Arti; Gucsavas-Calikoglu, Muge; Powell, Cynthia M

2017-10-01

Mitochondrial DNA depletion syndrome 5 (MIM 612073) is a rare autosomal recessive disorder caused by homozygous or compound heterozygous pathogenic variants in the beta subunit of the succinate-CoA ligase gene located within the 13q14 band. We describe two siblings of Hispanic descent with SUCLA2-related mitochondrial depletion syndrome (encephalomyopathic form with methylmalonic aciduria); the older sibling is additionally affected with trisomy 21. SUCLA2 sequencing identified homozygous p.Arg284Cys pathogenic variants in both patients. This mutation has previously been identified in four individuals of Italian and Caucasian descent. The older sibling with concomitant disease has a more severe phenotype than what is typically described in patients with either SUCLA2-related mitochondrial depletion syndrome or Down syndrome alone. The younger sibling, who has a normal female chromosome complement, is significantly less affected compared to her brother. While the clinical and molecular findings have been reported in about 50 patients affected with a deficiency of succinate-CoA ligase caused by pathogenic variants in SUCLA2, this report describes the first known individual affected with both a mitochondrial depletion syndrome and trisomy 21. © 2017 Wiley Periodicals, Inc.
Multiplexed resequencing analysis to identify rare variants in pooled DNA with barcode indexing using next-generation sequencer.

PubMed

Mitsui, Jun; Fukuda, Yoko; Azuma, Kyo; Tozaki, Hirokazu; Ishiura, Hiroyuki; Takahashi, Yuji; Goto, Jun; Tsuji, Shoji

2010-07-01

We have recently found that multiple rare variants of the glucocerebrosidase gene (GBA) confer a robust risk for Parkinson disease, supporting the 'common disease-multiple rare variants' hypothesis. To develop an efficient method of identifying rare variants in a large number of samples, we applied multiplexed resequencing using a next-generation sequencer to identification of rare variants of GBA. Sixteen sets of pooled DNAs from six pooled DNA samples were prepared. Each set of pooled DNAs was subjected to polymerase chain reaction to amplify the target gene (GBA) covering 6.5 kb, pooled into one tube with barcode indexing, and then subjected to extensive sequence analysis using the SOLiD System. Individual samples were also subjected to direct nucleotide sequence analysis. With the optimization of data processing, we were able to extract all the variants from 96 samples with acceptable rates of false-positive single-nucleotide variants.
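The pooling design above implies some simple arithmetic that explains why false-positive control matters. The sketch below assumes diploid samples contributing equal amounts of DNA to each pool, which the abstract does not state; the names are illustrative.

```python
from fractions import Fraction

pools = 16             # sets of pooled DNA, from the abstract
samples_per_pool = 6   # samples per pool, from the abstract
ploidy = 2             # assumption: autosomal locus in diploid samples

# Total number of individuals covered by the design.
total_samples = pools * samples_per_pool

# Assumption: equal DNA input per sample. A single heterozygous carrier
# then contributes one variant allele out of all alleles in the pool.
expected_allele_fraction = Fraction(1, samples_per_pool * ploidy)

print(total_samples)                    # 96
print(expected_allele_fraction)         # 1/12
print(float(expected_allele_fraction))  # about 0.083
```

A variant carried by one individual is therefore expected in roughly 8% of reads from that pool, so it has to be separated from sequencing errors at a comparable frequency.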
New insights into the genetics of glioblastoma multiforme by familial exome sequencing

PubMed Central

Backes, Christina; Harz, Christian; Fischer, Ulrike; Schmitt, Jana; Ludwig, Nicole; Petersen, Britt-Sabina; Mueller, Sabine C.; Kim, Yoo-Jin; Wolf, Nadine M.; Katus, Hugo A.; Meder, Benjamin; Furtwängler, Rhoikos; Franke, Andre; Bohle, Rainer; Henn, Wolfram; Graf, Norbert; Keller, Andreas; Meese, Eckart

2015-01-01

Glioblastoma multiforme (GBM) is the most aggressive and malignant subtype of human brain tumors. While a family clustering of GBM has long been acknowledged, relevant hereditary factors have remained elusive. Exome sequencing of families offers the option to discover respective genetic factors. We sequenced blood samples of one of the rare affected families: while both parents were healthy, both children were diagnosed with GBM. We report 85 homozygous non-synonymous single nucleotide variations (SNVs) in both siblings that were heterozygous in the parents. Beyond known key players for GBM such as ERBB2, PMS2, or CHI3L1, we identified over 50 genes that have not been associated with GBM so far. We also discovered three accumulative effects potentially adding to the tumorigenesis in the siblings: a clustering of multiple variants in single genes (e.g. PTPRB, CROCC), the aggregation of affected genes on specific molecular pathways (e.g. Focal adhesion or ECM receptor interaction) and genomic proximity (e.g. chr22.q12.2, chr1.p36.33). We found a striking accumulation of SNVs in specific genes for the daughter, who developed not only a GBM at the age of 12 years but was subsequently diagnosed with a pilocytic astrocytoma, a common acute lymphatic leukemia and a diffuse pontine glioma. The reported variants underline the relevance of genetic predisposition and cancer development in this family and demonstrate that GBM has a complex and heterogeneous genetic background. Sequencing of other affected families will help to further narrow down the driving genetic causes for this disease. PMID:25537509
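The segregation pattern described above (homozygous in both siblings, heterozygous in both parents) amounts to a simple genotype filter. The following is a hypothetical sketch, not the authors' pipeline: the family member labels, the VCF-style genotype strings, and the example variants are all made up for illustration.

```python
def is_candidate(genotypes: dict) -> bool:
    """True if both children are homozygous for the alternate allele
    and both parents are heterozygous carriers."""
    children_hom = all(genotypes[m] == "1/1" for m in ("child1", "child2"))
    parents_het = all(genotypes[m] in ("0/1", "1/0") for m in ("mother", "father"))
    return children_hom and parents_het


# Made-up genotype calls for three variants in one family of four.
variants = [
    {"id": "var_a", "gt": {"mother": "0/1", "father": "0/1", "child1": "1/1", "child2": "1/1"}},
    {"id": "var_b", "gt": {"mother": "0/1", "father": "0/0", "child1": "0/1", "child2": "0/1"}},
    {"id": "var_c", "gt": {"mother": "0/1", "father": "0/1", "child1": "1/1", "child2": "0/1"}},
]

candidates = [v["id"] for v in variants if is_candidate(v["gt"])]
print(candidates)  # ['var_a']
```

Only the first variant fits the recessive pattern; the other two fail because one parent is not a carrier or one child is not homozygous.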
Analysis of Sequence Variation and Risk Association of Human Papillomavirus 52 Variants Circulating in Korea

PubMed Central

Choi, Youn Jin; Ki, Eun Young; Zhang, Chuqing; Ho, Wendy C. S.; Lee, Sung-Jong; Jeong, Min Jin

2016-01-01

Introduction Human papillomavirus (HPV) 52 is a carcinogenic, high-risk genotype frequently detected in cervical cancer cases from East Asia, including Korea. Materials and Methods Sequences of HPV52 detected in 91 cervical samples collected from women attending Seoul St. Mary’s Hospital were analyzed. HPV52 genomic sequences were obtained by polymerase chain reaction (PCR)-based sequencing and analyzed using Seq-Scape software, and phylogenetic trees were constructed using MEGA6 software. Results Of the 91 cervical samples, 40 were normal, 22 were low-grade lesions, 21 were high-grade lesions and 8 were squamous cell carcinomas. Four HPV52 variant lineages (A, B, C and D) were identified. Lineage B was the most frequently detected lineage, followed by lineage C. By analyzing the two most frequently detected lineages (B and C), we found that distinct variations existed in each lineage. We also found that a lineage B-specific mutation K93R (A379G) was associated with an increased risk of cervical neoplasia. Conclusions To our knowledge, we are the first to reveal the predominance of the HPV52 lineages, B and C, in Korea. We also found these lineages harbored distinct genetic alterations that may affect oncogenicity. Our findings increase our understanding of the heterogeneity of HPV52 variants, and may be useful for the development of new diagnostic assays and therapeutic vaccines. PMID:27977741
Evaluation of 10 genes encoding cardiac proteins in Doberman Pinschers with dilated cardiomyopathy.

PubMed

O'Sullivan, M Lynne; O'Grady, Michael R; Pyle, W Glen; Dawson, John F

2011-07-01

To identify a causative mutation for dilated cardiomyopathy (DCM) in Doberman Pinschers by sequencing the coding regions of 10 cardiac genes known to be associated with familial DCM in humans. 5 Doberman Pinschers with DCM and congestive heart failure and 5 control mixed-breed dogs that were euthanized or died. RNA was extracted from frozen ventricular myocardial samples from each dog, and first-strand cDNA was synthesized via reverse transcription, followed by PCR amplification with gene-specific primers. Ten cardiac genes were analyzed: cardiac actin, α-actinin, α-tropomyosin, β-myosin heavy chain, metavinculin, muscle LIM protein, myosin-binding protein C, tafazzin, titin-cap (telethonin), and troponin T. Sequences for DCM-affected and control dogs and the published canine genome were compared. None of the coding sequences yielded a common causative mutation among all Doberman Pinscher samples. However, 3 variants were identified in the α-actinin gene in the DCM-affected Doberman Pinschers. One of these variants, identified in 2 of the 5 Doberman Pinschers, resulted in an amino acid change in the rod-forming triple coiled-coil domain. Mutations in the coding regions of several genes associated with DCM in humans did not appear to consistently account for DCM in Doberman Pinschers. However, an α-actinin variant was detected in some Doberman Pinschers that may contribute to the development of DCM given its potential effect on the structure of this protein. Investigation of additional candidate gene coding and noncoding regions and further evaluation of the role of α-actinin in development of DCM in Doberman Pinschers are warranted.
Rare Coding Variants in ANGPTL6 Are Associated with Familial Forms of Intracranial Aneurysm.

PubMed

Bourcier, Romain; Le Scouarnec, Solena; Bonnaud, Stéphanie; Karakachoff, Matilde; Bourcereau, Emmanuelle; Heurtebise-Chrétien, Sandrine; Menguy, Céline; Dina, Christian; Simonet, Floriane; Moles, Alexis; Lenoble, Cédric; Lindenbaum, Pierre; Chatel, Stéphanie; Isidor, Bertrand; Génin, Emmanuelle; Deleuze, Jean-François; Schott, Jean-Jacques; Le Marec, Hervé; Loirand, Gervaise; Desal, Hubert; Redon, Richard

2018-01-04

Intracranial aneurysms (IAs) are acquired cerebrovascular abnormalities characterized by localized dilation and wall thinning in intracranial arteries, possibly leading to subarachnoid hemorrhage and severe outcome in case of rupture. Here, we identified one rare nonsense variant (c.1378A>T) in the last exon of ANGPTL6 (Angiopoietin-Like 6), which encodes a circulating pro-angiogenic factor mainly secreted from the liver, shared by the four tested affected members of a large pedigree with multiple IA-affected case subjects. We showed a 50% reduction of ANGPTL6 serum concentration in individuals heterozygous for the c.1378A>T allele (p.Lys460Ter) compared to relatives homozygous for the normal allele, probably due to the non-secretion of the truncated protein produced by the c.1378A>T transcripts. Sequencing ANGPTL6 in a series of 94 additional index case subjects with familial IA identified three other rare coding variants in five case subjects. Overall, we detected a significant enrichment (p = 0.023) in rare coding variants within this gene among the 95 index case subjects with familial IA, compared to a reference population of 404 individuals with French ancestry. Among the 6 recruited families, 12 out of 13 (92%) individuals carrying IA also carry such variants in ANGPTL6, versus 15 out of 41 (37%) unaffected ones. We observed a higher rate of individuals with a history of high blood pressure among affected versus healthy individuals carrying ANGPTL6 variants, suggesting that ANGPTL6 could trigger cerebrovascular lesions when combined with other risk factors such as hypertension. Altogether, our results indicate that rare coding variants in ANGPTL6 are causally related to familial forms of IA. Copyright © 2017 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
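The within-family carrier counts quoted above (12 of 13 affected versus 15 of 41 unaffected) can be checked with a short calculation. The sketch below reproduces the two percentages and, purely as an illustration, runs a one-sided Fisher exact test on the 2x2 table. This is not the analysis reported in the abstract: the published p = 0.023 refers to the comparison of index cases with the reference population, and relatives are not independent observations, so the value computed here should not be read as a result of the study.

```python
from math import comb


def fisher_one_sided(a: int, b: int, c: int, d: int) -> float:
    """P(X >= a) for the 2x2 table [[a, b], [c, d]] under the
    hypergeometric null with fixed row and column totals."""
    row1 = a + b          # size of the first group (affected)
    col1 = a + c          # total number of carriers
    n = a + b + c + d     # all individuals
    upper = min(row1, col1)
    favourable = sum(
        comb(col1, k) * comb(n - col1, row1 - k) for k in range(a, upper + 1)
    )
    return favourable / comb(n, row1)


# Counts from the abstract: carriers and non-carriers by affection status.
affected_carriers, affected_total = 12, 13
unaffected_carriers, unaffected_total = 15, 41

affected_pct = round(100 * affected_carriers / affected_total)
unaffected_pct = round(100 * unaffected_carriers / unaffected_total)

p_value = fisher_one_sided(
    affected_carriers,
    affected_total - affected_carriers,
    unaffected_carriers,
    unaffected_total - unaffected_carriers,
)
print(affected_pct, unaffected_pct)
print(f"illustrative one-sided p = {p_value:.2g}")
```

A family-aware method would be needed for a defensible test; the point here is only to show how the 92% and 37% figures arise.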
Are special read alignment strategies necessary and cost-effective when handling sequencing reads from patient-derived tumor xenografts?

PubMed

Tso, Kai-Yuen; Lee, Sau Dan; Lo, Kwok-Wai; Yip, Kevin Y

2014-12-23

Patient-derived tumor xenografts in mice are widely used in cancer research and have become important in developing personalized therapies. When these xenografts are subjected to DNA sequencing, the samples could contain various amounts of mouse DNA. It has been unclear how the mouse reads would affect data analyses. We conducted comprehensive simulations to compare three alignment strategies at different mutation rates, read lengths, sequencing error rates, human-mouse mixing ratios and sequenced regions. We also sequenced a nasopharyngeal carcinoma xenograft and a cell line to test how the strategies work on real data. We found the "filtering" and "combined reference" strategies performed better than aligning reads directly to human reference in terms of alignment and variant calling accuracies. The combined reference strategy was particularly good at reducing false negative variant calls without significantly increasing the false positive rate. In some scenarios the performance gain of these two special handling strategies was too small for special handling to be cost-effective, but it was found crucial when false non-synonymous SNVs should be minimized, especially in exome sequencing. Our study systematically analyzes the effects of mouse contamination in the sequencing data of human-in-mouse xenografts. Our findings provide information for designing data analysis pipelines for these data.
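To make the "combined reference" idea concrete, here is a minimal sketch. It assumes the strategy means aligning reads to a concatenation of the human and mouse references and then keeping only reads placed confidently on human contigs; the abstract gives no implementation details, so the contig prefixes, the mapping-quality cut-off, and the example records are hypothetical.

```python
# Hypothetical alignment records: (read name, contig, mapping quality).
alignments = [
    ("read1", "human_chr1", 60),
    ("read2", "mouse_chr5", 60),   # mouse-derived read
    ("read3", "human_chr7", 0),    # ambiguous between the two genomes
    ("read4", "human_chrX", 42),
]


def keep_human_reads(records, prefix="human_", min_mapq=1):
    """Return names of reads aligned to human contigs with sufficient
    mapping quality (prefix and threshold are illustrative choices)."""
    return [
        name
        for name, contig, mapq in records
        if contig.startswith(prefix) and mapq >= min_mapq
    ]


human_reads = keep_human_reads(alignments)
print(human_reads)  # ['read1', 'read4']
```

Reads that map equally well to both genomes receive low mapping quality against a combined reference, which is one plausible reason such a strategy limits false positives from mouse contamination.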
VarDict: a novel and versatile variant caller for next-generation sequencing in cancer research

PubMed Central

Lai, Zhongwu; Markovets, Aleksandra; Ahdesmaki, Miika; Chapman, Brad; Hofmann, Oliver; McEwen, Robert; Johnson, Justin; Dougherty, Brian; Barrett, J. Carl; Dry, Jonathan R.

2016-01-01

Accurate variant calling in next generation sequencing (NGS) is critical to understand cancer genomes better. Here we present VarDict, a novel and versatile variant caller for both DNA- and RNA-sequencing data. VarDict simultaneously calls SNV, MNV, InDels, complex and structural variants, expanding the detected genetic driver landscape of tumors. It performs local realignments on the fly for more accurate allele frequency estimation. VarDict performance scales linearly with sequencing depth, enabling ultra-deep sequencing used to explore tumor evolution or detect tumor DNA circulating in blood. In addition, VarDict performs amplicon aware variant calling for polymerase chain reaction (PCR)-based targeted sequencing often used in diagnostic settings, and is able to detect PCR artifacts. Finally, VarDict also detects differences in somatic and loss of heterozygosity variants between paired samples. VarDict reprocessing of The Cancer Genome Atlas (TCGA) Lung Adenocarcinoma dataset called known driver mutations in KRAS, EGFR, BRAF, PIK3CA and MET in 16% more patients than previously published variant calls. We believe VarDict will greatly facilitate application of NGS in clinical cancer research. PMID:27060149
Rare TREM2 variants associated with Alzheimer's disease display reduced cell surface expression.

PubMed

Sirkis, Daniel W; Bonham, Luke W; Aparicio, Renan E; Geier, Ethan G; Ramos, Eliana Marisa; Wang, Qing; Karydas, Anna; Miller, Zachary A; Miller, Bruce L; Coppola, Giovanni; Yokoyama, Jennifer S

2016-09-02

Rare variation in TREM2 has been associated with greater risk for Alzheimer's disease (AD). TREM2 encodes a cell surface receptor expressed on microglia and related cells, and the R47H variant associated with AD appears to affect the ability of TREM2 to bind extracellular ligands. In addition, other rare TREM2 mutations causing early-onset neurodegeneration are thought to impair cell surface expression. Using a sequence kernel association (SKAT) analysis in two independent AD cohorts, we found significant enrichment of rare TREM2 variants not previously characterized at the protein level. Heterologous expression of the identified variants showed that novel variants S31F and R47C displayed significantly reduced cell surface expression. In addition, we identified rare variant R136Q in a patient with language-predominant AD that also showed impaired surface expression. The results suggest rare TREM2 variants enriched in AD may be associated with altered TREM2 function and that AD risk may be conferred, in part, from altered TREM2 surface expression.
Texture evolution during isothermal, isostrain, and isobaric loading of polycrystalline shape memory NiTi

NASA Astrophysics Data System (ADS)

Nicholson, D. E.; Padula, S. A.; Benafan, O.; Vaidyanathan, R.

2017-06-01

In situ neutron diffraction was used to provide insights into martensite variant microstructures during isothermal, isobaric, and isostrain loading in shape memory NiTi. The results show that variant microstructures were equivalent for the corresponding strain, and more importantly, the reversibility and equivalency were immediately evident in variant microstructures that were first formed isobarically but then reoriented to near random self-accommodated microstructures following isothermal deformation. Variant microstructures formed isothermally were not significantly affected by a subsequent thermal cycle under constant strain. In all loading cases considered, the resulting variant microstructure correlated with strain and did not correlate with stress. Based on the ability to select a variant microstructure for a given strain despite thermomechanical loading history, the results demonstrated here can be obtained by following any sequence of thermomechanical loading paths over multiple cycles. Thus, for training shape memory alloys (repeating thermomechanical cycling to obtain the desired variant microstructure), optimal paths can be selected so as to minimize the number of training cycles required, thereby increasing the overall stability and fatigue life of these alloys in actuator or medical applications.
FAVR (Filtering and Annotation of Variants that are Rare): methods to facilitate the analysis of rare germline genetic variants from massively parallel sequencing datasets

PubMed Central

2013-01-01

Background Characterising genetic diversity through the analysis of massively parallel sequencing (MPS) data offers enormous potential to significantly improve our understanding of the genetic basis for observed phenotypes, including predisposition to and progression of complex human disease. Great challenges remain in resolving genetic variants that are genuine from the millions of artefactual signals. Results FAVR is a suite of new methods designed to work with commonly used MPS analysis pipelines to assist in the resolution of some of the issues related to the analysis of the vast amount of resulting data, with a focus on relatively rare genetic variants. To the best of our knowledge, no equivalent method has previously been described. The most important and novel aspect of FAVR is the use of signatures in comparator sequence alignment files during variant filtering, and annotation of variants potentially shared between individuals. The FAVR methods use these signatures to facilitate filtering of (i) platform and/or mapping-specific artefacts, (ii) common genetic variants, and, where relevant, (iii) artefacts derived from imbalanced paired-end sequencing, as well as annotation of genetic variants based on evidence of co-occurrence in individuals. We applied conventional variant calling to whole-exome sequencing datasets, produced using both SOLiD and TruSeq chemistries, with or without downstream processing by FAVR methods. We demonstrate a 3-fold smaller rare single nucleotide variant shortlist with no detected reduction in sensitivity. This analysis included Sanger sequencing of rare variant signals not evident in dbSNP131, assessment of known variant signal preservation, and comparison of observed and expected rare variant numbers across a range of first cousin pairs.
The principles described herein were applied in our recent publication identifying XRCC2 as a new breast cancer risk gene and have been made publicly available as a suite of software tools. Conclusions FAVR is a platform-agnostic suite of methods that significantly enhances the analysis of large volumes of sequencing data for the study of rare genetic variants and their influence on phenotypes. PMID:23441864
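The comparator-based filtering idea in this abstract can be sketched in a few lines. This is a hypothetical illustration and not the FAVR software or its interface: it assumes that a candidate variant is dropped when read-level evidence for the same change appears in more than a chosen number of comparator samples, on the reasoning that recurrent signals are more likely artefacts or common variants. The data structures and the threshold are invented for the example.

```python
def filter_rare(candidates, comparator_evidence, max_comparators=0):
    """Keep candidate variants observed in at most `max_comparators`
    comparator samples (threshold is an illustrative assumption)."""
    kept = []
    for variant in candidates:
        seen_in = sum(
            1 for evidence in comparator_evidence.values() if variant in evidence
        )
        if seen_in <= max_comparators:
            kept.append(variant)
    return kept


# Made-up variants keyed as (chromosome, position, ref, alt).
candidates = [
    ("chr1", 1000, "A", "G"),
    ("chr2", 2000, "C", "T"),
    ("chr3", 3000, "G", "A"),
]

# Made-up read-level evidence found in unrelated comparator samples.
comparator_evidence = {
    "comparator_1": {("chr2", 2000, "C", "T")},
    "comparator_2": {("chr2", 2000, "C", "T"), ("chr3", 3000, "G", "A")},
}

shortlist = filter_rare(candidates, comparator_evidence)
print(shortlist)  # [('chr1', 1000, 'A', 'G')]
```

Raising `max_comparators` relaxes the filter, which trades a longer shortlist for a lower chance of discarding a genuine variant shared by chance.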

Clinical Validation and Implementation of a Targeted Next-Generation Sequencing Assay to Detect Somatic Variants in Non-Small Cell Lung, Melanoma, and Gastrointestinal Malignancies

PubMed Central

Fisher, Kevin E.; Zhang, Linsheng; Wang, Jason; Smith, Geoffrey H.; Newman, Scott; Schneider, Thomas M.; Pillai, Rathi N.; Kudchadkar, Ragini R.; Owonikoko, Taofeek K.; Ramalingam, Suresh S.; Lawson, David H.; Delman, Keith A.; El-Rayes, Bassel F.; Wilson, Malania M.; Sullivan, H. Clifford; Morrison, Annie S.; Balci, Serdar; Adsay, N. Volkan; Gal, Anthony A.; Sica, Gabriel L.; Saxe, Debra F.; Mann, Karen P.; Hill, Charles E.; Khuri, Fadlo R.; Rossi, Michael R.

2017-01-01

We tested and clinically validated a targeted next-generation sequencing (NGS) mutation panel using 80 formalin-fixed, paraffin-embedded (FFPE) tumor samples. Forty non-small cell lung carcinoma (NSCLC), 30 melanoma, and 30 gastrointestinal (12 colonic, 10 gastric, and 8 pancreatic adenocarcinoma) FFPE samples were selected from laboratory archives. After appropriate specimen and nucleic acid quality control, 80 NGS libraries were prepared using the Illumina TruSight tumor (TST) kit and sequenced on the Illumina MiSeq. Sequence alignment, variant calling, and sequencing quality control were performed using vendor software and laboratory-developed analysis workflows. TST generated ≥500× coverage for 98.4% of the 13,952 targeted bases. Reproducible and accurate variant calling was achieved at ≥5% variant allele frequency with 8 to 12 multiplexed samples per MiSeq flow cell. TST detected 112 variants overall, and confirmed all known single-nucleotide variants (n = 27), deletions (n = 5), insertions (n = 3), and multinucleotide variants (n = 3). TST detected at least one variant in 85.0% (68/80), and two or more variants in 36.2% (29/80), of samples. TP53 was the most frequently mutated gene in NSCLC (13 variants; 13/32 samples), gastrointestinal malignancies (15 variants; 13/25 samples), and overall (30 variants; 28/80 samples). BRAF mutations were most common in melanoma (nine variants; 9/23 samples). Clinically relevant NGS data can be obtained from routine clinical FFPE solid tumor specimens using TST, benchtop instruments, and vendor-supplied bioinformatics pipelines. PMID:26801070
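The depth and variant-allele-frequency cut-offs quoted above (≥500× coverage, ≥5% variant allele frequency) can be made concrete with a little arithmetic. The following is a minimal sketch only: the function names and the example read counts are hypothetical and are not part of the validated TST workflow or the vendor software.

```python
def variant_allele_frequency(alt_reads: int, total_reads: int) -> float:
    """Fraction of reads at a position that carry the variant allele."""
    if total_reads <= 0:
        raise ValueError("total_reads must be positive")
    return alt_reads / total_reads


def passes_threshold(alt_reads: int, total_reads: int,
                     min_vaf: float = 0.05, min_depth: int = 500) -> bool:
    """Apply the cut-offs quoted in the abstract (>=500x depth, >=5% VAF)."""
    if total_reads < min_depth:
        return False
    return variant_allele_frequency(alt_reads, total_reads) >= min_vaf


# Hypothetical read counts, for illustration only.
print(passes_threshold(alt_reads=25, total_reads=500))  # 25/500 is exactly 5%
print(passes_threshold(alt_reads=24, total_reads=500))  # 4.8%, below the VAF cut-off
print(passes_threshold(alt_reads=40, total_reads=400))  # depth below 500x
```

At the minimum depth of 500×, a variant therefore needs at least 25 supporting reads to reach the 5% threshold.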
Rare coding variants in the phospholipase D3 gene confer risk for Alzheimer's disease.

PubMed

Cruchaga, Carlos; Karch, Celeste M; Jin, Sheng Chih; Benitez, Bruno A; Cai, Yefei; Guerreiro, Rita; Harari, Oscar; Norton, Joanne; Budde, John; Bertelsen, Sarah; Jeng, Amanda T; Cooper, Breanna; Skorupa, Tara; Carrell, David; Levitch, Denise; Hsu, Simon; Choi, Jiyoon; Ryten, Mina; Sassi, Celeste; Bras, Jose; Gibbs, Raphael J; Hernandez, Dena G; Lupton, Michelle K; Powell, John; Forabosco, Paola; Ridge, Perry G; Corcoran, Christopher D; Tschanz, JoAnn T; Norton, Maria C; Munger, Ronald G; Schmutz, Cameron; Leary, Maegan; Demirci, F Yesim; Bamne, Mikhil N; Wang, Xingbin; Lopez, Oscar L; Ganguli, Mary; Medway, Christopher; Turton, James; Lord, Jenny; Braae, Anne; Barber, Imelda; Brown, Kristelle; Pastor, Pau; Lorenzo-Betancor, Oswaldo; Brkanac, Zoran; Scott, Erick; Topol, Eric; Morgan, Kevin; Rogaeva, Ekaterina; Singleton, Andy; Hardy, John; Kamboh, M Ilyas; George-Hyslop, Peter St; Cairns, Nigel; Morris, John C; Kauwe, John S K; Goate, Alison M

2014-01-23

Genome-wide association studies (GWAS) have identified several risk variants for late-onset Alzheimer's disease (LOAD). These common variants have replicable but small effects on LOAD risk and generally do not have obvious functional effects. Low-frequency coding variants, not detected by GWAS, are predicted to include functional variants with larger effects on risk. To identify low-frequency coding variants with large effects on LOAD risk, we carried out whole-exome sequencing (WES) in 14 large LOAD families and follow-up analyses of the candidate variants in several large LOAD case-control data sets. A rare variant in PLD3 (phospholipase D3; Val232Met) segregated with disease status in two independent families and doubled risk for Alzheimer's disease in seven independent case-control series with a total of more than 11,000 cases and controls of European descent. Gene-based burden analyses in 4,387 cases and controls of European descent and 302 African American cases and controls, with complete sequence data for PLD3, reveal that several variants in this gene increase risk for Alzheimer's disease in both populations. PLD3 is highly expressed in brain regions that are vulnerable to Alzheimer's disease pathology, including hippocampus and cortex, and is expressed at significantly lower levels in neurons from Alzheimer's disease brains compared to control brains. Overexpression of PLD3 leads to a significant decrease in intracellular amyloid-β precursor protein (APP) and extracellular Aβ42 and Aβ40 (the 42- and 40-residue isoforms of the amyloid-β peptide), and knockdown of PLD3 leads to a significant increase in extracellular Aβ42 and Aβ40. Together, our genetic and functional data indicate that carriers of PLD3 coding variants have a twofold increased risk for LOAD and that PLD3 influences APP processing. This study provides an example of how densely affected families may help to identify rare variants with large effects on risk for disease or other complex traits.
Mapping the Conformation Space of Wildtype and Mutant H-Ras with a Memetic, Cellular, and Multiscale Evolutionary Algorithm

PubMed Central

Clausen, Rudy; Ma, Buyong; Nussinov, Ruth; Shehu, Amarda

2015-01-01

An important goal in molecular biology is to understand functional changes upon single-point mutations in proteins. Doing so through a detailed characterization of structure spaces and underlying energy landscapes is desirable but continues to challenge methods based on Molecular Dynamics. In this paper we propose a novel algorithm, SIfTER, which is based instead on stochastic optimization to circumvent the computational challenge of exploring the breadth of a protein’s structure space. SIfTER is a data-driven evolutionary algorithm, leveraging experimentally-available structures of wildtype and variant sequences of a protein to define a reduced search space from where to efficiently draw samples corresponding to novel structures not directly observed in the wet laboratory. The main advantage of SIfTER is its ability to rapidly generate conformational ensembles, thus allowing mapping and juxtaposing landscapes of variant sequences and relating observed differences to functional changes. We apply SIfTER to variant sequences of the H-Ras catalytic domain, due to the prominent role of the Ras protein in signaling pathways that control cell proliferation, its well-studied conformational switching, and abundance of documented mutations in several human tumors. Many Ras mutations are oncogenic, but detailed energy landscapes have not been reported until now. Analysis of SIfTER-computed energy landscapes for the wildtype and two oncogenic variants, G12V and Q61L, suggests that these mutations cause constitutive activation through two different mechanisms. G12V directly affects binding specificity while leaving the energy landscape largely unchanged, whereas Q61L has pronounced, starker effects on the landscape. An implementation of SIfTER is made available at http://www.cs.gmu.edu/~ashehu/?q=OurTools. 
We believe SIfTER is useful to the community to answer the question of how sequence mutations affect the function of a protein, when there is an abundance of experimental structures that can be exploited to reconstruct an energy landscape that would be computationally impractical to do via Molecular Dynamics. PMID:26325505
Whole-genome sequence-based analysis of thyroid function.

PubMed

Taylor, Peter N; Porcu, Eleonora; Chew, Shelby; Campbell, Purdey J; Traglia, Michela; Brown, Suzanne J; Mullin, Benjamin H; Shihab, Hashem A; Min, Josine; Walter, Klaudia; Memari, Yasin; Huang, Jie; Barnes, Michael R; Beilby, John P; Charoen, Pimphen; Danecek, Petr; Dudbridge, Frank; Forgetta, Vincenzo; Greenwood, Celia; Grundberg, Elin; Johnson, Andrew D; Hui, Jennie; Lim, Ee M; McCarthy, Shane; Muddyman, Dawn; Panicker, Vijay; Perry, John R B; Bell, Jordana T; Yuan, Wei; Relton, Caroline; Gaunt, Tom; Schlessinger, David; Abecasis, Goncalo; Cucca, Francesco; Surdulescu, Gabriela L; Woltersdorf, Wolfram; Zeggini, Eleftheria; Zheng, Hou-Feng; Toniolo, Daniela; Dayan, Colin M; Naitza, Silvia; Walsh, John P; Spector, Tim; Davey Smith, George; Durbin, Richard; Richards, J Brent; Sanna, Serena; Soranzo, Nicole; Timpson, Nicholas J; Wilson, Scott G

2015-03-06

Normal thyroid function is essential for health, but its genetic architecture remains poorly understood. Here, for the heritable thyroid traits thyrotropin (TSH) and free thyroxine (FT4), we analyse whole-genome sequence data from the UK10K project (N=2,287). Using additional whole-genome sequence and deeply imputed data sets, we report meta-analysis results for common variants (MAF≥1%) associated with TSH and FT4 (N=16,335). For TSH, we identify a novel variant in SYN2 (MAF=23.5%, P=6.15 × 10⁻⁹) and a new independent variant in PDE8B (MAF=10.4%, P=5.94 × 10⁻¹⁴). For FT4, we report a low-frequency variant near B4GALT6/SLC25A52 (MAF=3.2%, P=1.27 × 10⁻⁹) tagging a rare TTR variant (MAF=0.4%, P=2.14 × 10⁻¹¹). All common variants explain ≥20% of the variance in TSH and FT4. Analysis of rare variants (MAF<1%) using sequence kernel association testing reveals a novel association with FT4 in NRG1. Our results demonstrate that increased coverage in whole-genome sequence association studies identifies novel variants associated with thyroid function.
Mitochondrial targeting sequence variants of the CHCHD2 gene are a risk for Lewy body disorders

PubMed Central

Ogaki, Kotaro; Koga, Shunsuke; Heckman, Michael G.; Fiesel, Fabienne C.; Ando, Maya; Labbé, Catherine; Lorenzo-Betancor, Oswaldo; Moussaud-Lamodière, Elisabeth L.; Soto-Ortolaza, Alexandra I.; Walton, Ronald L.; Strongosky, Audrey J.; Uitti, Ryan J.; McCarthy, Allan; Lynch, Timothy; Siuda, Joanna; Opala, Grzegorz; Rudzinska, Monika; Krygowska-Wajs, Anna; Barcikowska, Maria; Czyzewski, Krzysztof; Puschmann, Andreas; Nishioka, Kenya; Funayama, Manabu; Hattori, Nobutaka; Parisi, Joseph E.; Petersen, Ronald C.; Graff-Radford, Neill R.; Boeve, Bradley F.; Springer, Wolfdieter; Wszolek, Zbigniew K.; Dickson, Dennis W.

2015-01-01

Objective: To assess the role of CHCHD2 variants in patients with Parkinson disease (PD) and Lewy body disease (LBD) in Caucasian populations. Methods: All exons of the CHCHD2 gene were sequenced in a US Caucasian patient-control series (878 PD, 610 LBD, and 717 controls). Subsequently, exons 1 and 2 were sequenced in an Irish series (355 PD and 365 controls) and a Polish series (394 PD and 350 controls). Immunohistochemistry and immunofluorescence studies were performed on pathologic LBD cases with rare CHCHD2 variants. Results: We identified 9 rare exonic variants of unknown significance. These variants were more frequent in the combined group of PD and LBD patients compared to controls (0.6% vs 0.1%, p = 0.013). In addition, the presence of any rare variant was more common in patients with LBD (2.5% vs 1.0%, p = 0.050) compared to controls. Eight of these 9 variants were located within the gene's mitochondrial targeting sequence. Conclusions: Although the role of variants of the CHCHD2 gene in PD and LBD remains to be further elucidated, the rare variants in the mitochondrial targeting sequence may be a risk factor for Lewy body disorders, which may link CHCHD2 to other genetic forms of parkinsonism with mitochondrial dysfunction. PMID:26561290
Pooled-DNA Sequencing for Elucidating New Genomic Risk Factors, Rare Variants Underlying Alzheimer's Disease.

PubMed

Jin, Sheng Chih; Benitez, Bruno A; Deming, Yuetiva; Cruchaga, Carlos

2016-01-01

Analyses of genome-wide association studies (GWAS) for complex disorders usually identify common variants with a relatively small effect size that only explain a small proportion of phenotypic heritability. Several studies have suggested that a significant fraction of heritability may be explained by low-frequency (minor allele frequency (MAF) of 1-5%) and rare variants that are not contained in the commercial GWAS genotyping arrays (Schork et al., Curr Opin Genet Dev 19:212, 2009). Rare variants can also have relatively large effects on risk for developing human diseases or disease phenotype (Cruchaga et al., PLoS One 7:e31039, 2012). However, it is necessary to perform next-generation sequencing (NGS) studies in a large population (>4,000 samples) to detect a significant rare-variant association. Several NGS methods, such as custom capture sequencing and amplicon-based sequencing, are designed to screen a small proportion of the genome, but most of these methods are limited in the number of samples that can be multiplexed (i.e., most sequencing kits only provide 96 distinct indexes). Additionally, the sequencing library preparation for 4,000 samples remains expensive, and thus conducting NGS studies with the aforementioned methods is not feasible for most research laboratories. The need for low-cost, large-scale rare-variant detection makes pooled-DNA sequencing an efficient and cost-effective technique to identify rare variants in target regions by sequencing hundreds to thousands of samples. Our recent work has demonstrated that pooled-DNA sequencing can accurately detect rare variants in targeted regions in multiple DNA samples with high sensitivity and specificity (Jin et al., Alzheimers Res Ther 4:34, 2012). 
In these studies we used a well-established pooled-DNA sequencing approach and a computational package, SPLINTER (short indel prediction by large deviation inference and nonlinear true frequency estimation by recursion) (Vallania et al., Genome Res 20:1711, 2010), for accurate identification of rare variants in large DNA pools. Given an average sequencing coverage of 30× per haploid genome, SPLINTER can detect rare variants and short indels up to 4 base pairs (bp) with high sensitivity and specificity (up to 1 haploid allele in a pool as large as 500 individuals). Step-by-step instructions on how to conduct pooled-DNA sequencing experiments and data analyses are described in this chapter.
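The pool-size figures above imply some simple design arithmetic, sketched here under stated assumptions. The function name `pooled_design` and the returned keys are hypothetical, diploid samples are assumed, and this is not part of SPLINTER; it only shows how 30× coverage per haploid genome scales with pool size.

```python
def pooled_design(n_individuals: int, coverage_per_haploid: float = 30.0,
                  ploidy: int = 2) -> dict:
    """Back-of-envelope numbers for a pooled-DNA sequencing experiment."""
    n_alleles = n_individuals * ploidy                # haploid genomes in the pool
    total_depth = coverage_per_haploid * n_alleles    # depth needed at each position
    singleton_frequency = 1 / n_alleles               # one variant allele in the pool
    expected_alt_reads = total_depth / n_alleles      # reads expected from that allele
    return {
        "n_alleles": n_alleles,
        "total_depth": total_depth,
        "singleton_frequency": singleton_frequency,
        "expected_alt_reads": expected_alt_reads,
    }


design = pooled_design(500)
for key, value in design.items():
    print(f"{key}: {value}")
```

For a pool of 500 diploid individuals this gives 1,000 haploid genomes, a required depth of 30,000× per position, and a single variant allele present at a frequency of 0.1%, supported on average by about 30 reads.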
ATP-binding cassette subfamily A, member 4 intronic variants c.4773+3A>G and c.5461-10T>C cause Stargardt disease due to defective splicing.

PubMed

Jonsson, Frida; Westin, Ida Maria; Österman, Lennart; Sandgren, Ola; Burstedt, Marie; Holmberg, Monica; Golovleva, Irina

2018-02-20

Inherited retinal dystrophies (IRDs) represent a group of progressive conditions affecting the retina. IRDs are genetically highly heterogeneous, and to date more than 260 genes have been associated with them. Stargardt disease type 1 (STGD1), or macular degeneration with flecks, is a disease with early onset, central visual impairment, frequent appearance of yellowish flecks and mutations in the ATP-binding cassette subfamily A, member 4 (ABCA4) gene. A large number of intronic sequence variants in ABCA4 have been considered pathogenic although their functional effect was seldom demonstrated. In this study, we aimed to reveal how intronic variants present in patients with Stargardt disease from the same Swedish family affect splicing. The splicing of the ABCA4 gene was studied in human embryonic kidney cells, HEK293T, and in human retinal pigment epithelium cells, ARPE-19, using a minigene system containing variants c.4773+3A>G and c.5461-10T>C. We showed that both ABCA4 variants, c.4773+3A>G and c.5461-10T>C, cause aberrant splicing of the ABCA4 minigene resulting in exon skipping. We also demonstrated that splicing of ABCA4 has different outcomes depending on the transfected cell type. The two intronic variants c.4773+3A>G and c.5461-10T>C, both predicted to affect splicing, are indeed disease-causing mutations due to skipping of exons 33, 34, 39 and 40 of the ABCA4 gene. The experimental proof that ABCA4 mutations in STGD patients affect protein function is crucial for their inclusion in future clinical trials; therefore, functional testing of all ABCA4 intronic variants associated with Stargardt disease by minigene technology is desirable. © 2018 Acta Ophthalmologica Scandinavica Foundation. Published by John Wiley & Sons Ltd.
Rare ADAR and RNASEH2B variants and a type I interferon signature in glioma and prostate carcinoma risk and tumorigenesis.

PubMed

Beyer, Ulrike; Brand, Frank; Martens, Helge; Weder, Julia; Christians, Arne; Elyan, Natalie; Hentschel, Bettina; Westphal, Manfred; Schackert, Gabriele; Pietsch, Torsten; Hong, Bujung; Krauss, Joachim K; Samii, Amir; Raab, Peter; Das, Anibh; Dumitru, Claudia A; Sandalcioglu, I Erol; Hakenberg, Oliver W; Erbersdobler, Andreas; Lehmann, Ulrich; Reifenberger, Guido; Weller, Michael; Reijns, Martin A M; Preller, Matthias; Wiese, Bettina; Hartmann, Christian; Weber, Ruthild G

2017-12-01

In search of novel germline alterations predisposing to tumors, in particular to gliomas, we studied a family with two brothers affected by anaplastic gliomas, and their father and paternal great-uncle diagnosed with prostate carcinoma. In this family, whole-exome sequencing yielded rare, simultaneously heterozygous variants in the Aicardi-Goutières syndrome (AGS) genes ADAR and RNASEH2B co-segregating with the tumor phenotype. AGS is a genetically induced inflammatory disease particularly of the brain, which has not been associated with a consistently increased cancer risk to date. By targeted sequencing, we identified novel ADAR and RNASEH2B variants, and a 3- to 17-fold frequency increase of the AGS mutations ADAR,c.577C>G;p.(P193A) and RNASEH2B,c.529G>A;p.(A177T) in the germline of familial glioma patients as well as in test and validation cohorts of glioblastomas and prostate carcinomas versus ethnicity-matched controls, whereby rare RNASEH2B variants were significantly more frequent in familial glioma patients. Tumors with ADAR or RNASEH2B variants recapitulated features of AGS, such as calcification and increased type I interferon expression. Patients carrying ADAR or RNASEH2B variants showed upregulation of interferon-stimulated gene (ISG) transcripts in peripheral blood as seen in AGS. An increased ISG expression was also induced by ADAR and RNASEH2B variants in tumor cells and was blocked by the JAK inhibitor Ruxolitinib. Our data implicate rare variants in the AGS genes ADAR and RNASEH2B and a type I interferon signature in glioma and prostate carcinoma risk and tumorigenesis, consistent with a genetic basis underlying inflammation-driven malignant transformation in glioma and prostate carcinoma development.
A SIGMAR1 splice-site mutation causes distal hereditary motor neuropathy.

PubMed

Li, Xiaobo; Hu, Zhengmao; Liu, Lei; Xie, Yongzhi; Zhan, Yajing; Zi, Xiaohong; Wang, Junling; Wu, Lixiang; Xia, Kun; Tang, Beisha; Zhang, Ruxu

2015-06-16

To identify the underlying genetic cause in a consanguineous Chinese family segregating distal hereditary motor neuropathy (dHMN) in an autosomal recessive pattern. We used whole-exome sequencing and homozygosity mapping to detect the genetic variant in 2 affected individuals of the consanguineous Chinese family with dHMN. RNA analysis of peripheral blood leukocytes and immunofluorescence and immunoblotting of stable cell lines were performed to support the pathogenicity of the identified mutation. We identified 3 shared novel homozygous variants in 3 shared homozygous regions of the affected individuals. Sequencing of these 3 variants in family members revealed that the c.151+1G>T mutation in the SIGMAR1 gene, which is located in a homozygous region spanning approximately 5.3 Mb at chromosome 9p13.1-p13.3, segregated with the dHMN phenotype. The mutation causes an alternative splicing event and generates a transcript variant with an in-frame deletion of 60 base pairs in exon 1 (c.92_151del), resulting in an internally shortened protein, σ1R(31_50del). Proteasomal inhibitor treatment increased the intracellular amount of σ1R(31_50del) and led to the formation of nuclear aggregates. Stable expression of σ1R(31_50del) induced endoplasmic reticulum stress and enhanced apoptosis. The homozygous c.151+1G>T mutation in SIGMAR1 caused a novel form of autosomal recessive dHMN in a Chinese consanguineous family. Endoplasmic reticulum stress may have a role in the pathogenesis of dHMN. © 2015 American Academy of Neurology.
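The statement that c.92_151del is an in-frame deletion of 60 base pairs can be checked directly from the coordinates. The helper below is a minimal sketch: `deletion_length` is a hypothetical name, and it handles only simple exonic range deletions written as `c.START_ENDdel`, not the full HGVS nomenclature.

```python
import re


def deletion_length(hgvs: str) -> int:
    """Length in bp of a simple coding-sequence deletion such as 'c.92_151del'."""
    match = re.fullmatch(r"c\.(\d+)_(\d+)del", hgvs)
    if match is None:
        raise ValueError(f"unsupported description: {hgvs!r}")
    start, end = (int(group) for group in match.groups())
    if end < start:
        raise ValueError("end position precedes start position")
    return end - start + 1  # both coordinates are inclusive


length = deletion_length("c.92_151del")
in_frame = length % 3 == 0          # a multiple of 3 preserves the reading frame
residues_lost = length // 3
print(length, in_frame, residues_lost)  # 60 True 20
```

Sixty base pairs correspond to 20 codons, which agrees with the 20-residue deletion written as σ1R(31_50del).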
Novel mutation in the CHST6 gene causes macular corneal dystrophy in a black South African family.

PubMed

Carstens, Nadia; Williams, Susan; Goolam, Saadiah; Carmichael, Trevor; Cheung, Ming Sin; Büchmann-Møller, Stine; Sultan, Marc; Staedtler, Frank; Zou, Chao; Swart, Peter; Rice, Dennis S; Lacoste, Arnaud; Paes, Kim; Ramsay, Michèle

2016-07-20

Macular corneal dystrophy (MCD) is a rare autosomal recessive disorder that is characterized by progressive corneal opacity that starts in early childhood and ultimately progresses to blindness in early adulthood. The aim of this study was to identify the cause of MCD in a black South African family with two affected sisters. A multigenerational South African Sotho-speaking family with type I MCD was studied using whole exome sequencing. Variant filtering to identify the MCD-causal mutation included the disease inheritance pattern, variant minor allele frequency and potential functional impact. Ophthalmologic evaluation of the cases revealed a typical MCD phenotype and none of the other family members were affected. An average of 127 713 variants per individual was identified following exome sequencing and approximately 1.2 % were not present in any of the investigated public databases. Variant filtering identified a homozygous E71Q mutation in CHST6, a known MCD-causing gene encoding corneal N-acetyl glucosamine-6-O-sulfotransferase. This E71Q mutation results in a non-conservative amino acid change in a highly conserved functional domain of the human CHST6 that is essential for enzyme activity. We identified a novel E71Q mutation in CHST6 as the MCD-causal mutation in a black South African family with type I MCD. This is the first description of MCD in a black Sub-Saharan African family and therefore contributes valuable insights into the genetic aetiology of this disease, while improving genetic counselling for this and potentially other MCD families.
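The three filtering criteria named above (inheritance pattern, minor allele frequency and potential functional impact) can be sketched as a simple filter. Everything in this example is an assumption for illustration: the record layout, the 1% frequency cut-off, the consequence labels, the placeholder genes and the toy genotypes are hypothetical and do not reproduce the study's pipeline or data.

```python
from dataclasses import dataclass


@dataclass
class Variant:
    gene: str
    change: str
    maf: float          # highest frequency in public databases; 0.0 if absent
    consequence: str    # e.g. "missense", "synonymous"
    genotypes: dict     # sample name -> "hom_alt", "het" or "hom_ref"


# Assumed set of consequence labels treated as potentially damaging.
DAMAGING = {"missense", "nonsense", "frameshift", "splice_site"}


def recessive_candidates(variants, affected, unaffected, max_maf=0.01):
    """Keep rare, potentially damaging variants that fit autosomal recessive inheritance."""
    kept = []
    for v in variants:
        if v.maf > max_maf:
            continue  # too common to cause a rare disorder
        if v.consequence not in DAMAGING:
            continue  # no predicted effect on the protein
        if not all(v.genotypes.get(s) == "hom_alt" for s in affected):
            continue  # every affected individual must be homozygous
        if any(v.genotypes.get(s) == "hom_alt" for s in unaffected):
            continue  # no unaffected relative may be homozygous
        kept.append(v)
    return kept


# Toy data: only the CHST6 E71Q entry mirrors the abstract; genotypes are invented.
sisters = {"sister_1": "hom_alt", "sister_2": "hom_alt"}
parents = {"mother": "het", "father": "het"}
variants = [
    Variant("CHST6", "E71Q", 0.0, "missense", {**sisters, **parents}),
    Variant("GENE_A", "placeholder", 0.12, "missense", {**sisters, **parents}),
    Variant("GENE_B", "placeholder", 0.0, "synonymous", {**sisters, **parents}),
    Variant("GENE_C", "placeholder", 0.001, "missense",
            {"sister_1": "hom_alt", "sister_2": "het", **parents}),
]

result = recessive_candidates(variants, ["sister_1", "sister_2"], ["mother", "father"])
print([(v.gene, v.change) for v in result])
```

In this toy set the three placeholder variants are removed by the frequency, consequence and segregation checks respectively, leaving only the homozygous missense change.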
High depth, whole-genome sequencing of cholera isolates from Haiti and the Dominican Republic.

PubMed

Sealfon, Rachel; Gire, Stephen; Ellis, Crystal; Calderwood, Stephen; Qadri, Firdausi; Hensley, Lisa; Kellis, Manolis; Ryan, Edward T; LaRocque, Regina C; Harris, Jason B; Sabeti, Pardis C

2012-09-11

Whole-genome sequencing is an important tool for understanding microbial evolution and identifying the emergence of functionally important variants over the course of epidemics. In October 2010, a severe cholera epidemic began in Haiti, with additional cases identified in the neighboring Dominican Republic. We used whole-genome approaches to sequence four Vibrio cholerae isolates from Haiti and the Dominican Republic and three additional V. cholerae isolates to a high depth of coverage (>2000x); four of the seven isolates were previously sequenced. Using these sequence data, we examined the effect of depth of coverage and sequencing platform on genome assembly and identification of sequence variants. We found that 50x coverage is sufficient to construct a whole-genome assembly and to accurately call most variants from 100 base pair paired-end sequencing reads. Phylogenetic analysis between the newly sequenced and thirty-three previously sequenced V. cholerae isolates indicates that the Haitian and Dominican Republic isolates are closest to strains from South Asia. The Haitian and Dominican Republic isolates form a tight cluster, with only four variants unique to individual isolates. These variants are located in the CTX region, the SXT region, and the core genome. Of the 126 mutations identified that separate the Haiti-Dominican Republic cluster from the V. cholerae reference strain (N16961), 73 are non-synonymous changes, and a number of these changes cluster in specific genes and pathways. Sequence variant analyses of V. cholerae isolates, including multiple isolates from the Haitian outbreak, identify coverage-specific and technology-specific effects on variant detection, and provide insight into genomic change and functional evolution during an epidemic.
A statistical approach to detection of copy number variations in PCR-enriched targeted sequencing data.

PubMed

Demidov, German; Simakova, Tamara; Vnuchkova, Julia; Bragin, Anton

2016-10-22

Multiplex polymerase chain reaction (PCR) is a common enrichment technique for targeted massively parallel sequencing (MPS) protocols. MPS is widely used in biomedical research and clinical diagnostics as a fast and accurate tool for the detection of short genetic variations. However, identification of larger variations such as structural variants and copy number variations (CNVs) remains a challenge for targeted MPS. Some approaches and tools for structural variant detection have been proposed, but they have limitations and often require datasets of a certain type, size and expected number of amplicons affected by CNVs. In this paper, we describe a novel algorithm for high-resolution germline CNV detection in PCR-enriched targeted sequencing data and present an accompanying tool. We have developed a machine learning algorithm for the detection of large duplications and deletions in targeted sequencing data generated with a PCR-based enrichment step. We have performed verification studies and established the algorithm's sensitivity and specificity. We have compared the developed tool with other available methods applicable to the described data and found that it performs better. We showed that our method has high specificity and sensitivity for high-resolution copy number detection in targeted sequencing data using a large cohort of samples.
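The abstract does not describe the machine learning algorithm itself, so the sketch below is not the authors' method. It shows a generic read-depth comparison, a simpler and commonly used idea swapped in for illustration: per-amplicon read counts are normalised by library size and compared with the median of reference samples. The function names, the log2 thresholds and the toy counts are all assumptions.

```python
import math
from statistics import median


def log2_ratios(sample_depths, reference_depths):
    """Per-amplicon log2 ratio of a sample against the median of reference samples."""
    def normalise(depths):
        total = sum(depths)
        return [d / total for d in depths]  # correct for library size

    sample = normalise(sample_depths)
    references = [normalise(r) for r in reference_depths]
    ratios = []
    for i, value in enumerate(sample):
        ref_median = median(r[i] for r in references)
        if value == 0:
            ratios.append(float("-inf"))    # no reads at all on this amplicon
        else:
            ratios.append(math.log2(value / ref_median))
    return ratios


def call_state(ratio, loss=-0.6, gain=0.4):
    """Assumed thresholds; a heterozygous deletion is expected near log2(1/2) = -1."""
    if ratio <= loss:
        return "loss"
    if ratio >= gain:
        return "gain"
    return "neutral"


# Toy read counts for four amplicons (hypothetical data).
reference = [[100, 100, 100, 100], [110, 110, 110, 110], [90, 90, 90, 90]]
sample = [100, 50, 100, 100]        # second amplicon has half the expected depth
calls = [call_state(r) for r in log2_ratios(sample, reference)]
print(calls)
```

With so few amplicons the total-count normalisation itself is distorted by the deleted amplicon, which is one reason real tools use more robust normalisation and larger reference cohorts.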
Identification of constitutional MLH1 epimutations and promoter variants in colorectal cancer patients from the Colon Cancer Family Registry

PubMed Central

Ward, Robyn L.; Dobbins, Timothy; Lindor, Noralane M.; Rapkins, Robert W.; Hitchins, Megan P.

2013-01-01

Purpose: Constitutional MLH1 epimutations manifest as promoter methylation and silencing of the affected allele in normal tissues, predisposing to Lynch syndrome–associated cancers. This study investigated their frequency and inheritance. Methods: A total of 416 individuals with a colorectal cancer showing loss of MLH1 expression and without deleterious germline mutations in MLH1 were ascertained from the Colon Cancer Family Registry (C-CFR). Constitutive DNA samples were screened for MLH1 methylation in all 416 subjects and for promoter sequence changes in 357 individuals. Results: Constitutional MLH1 epimutations were identified in 16 subjects. Of these, seven (1.7%) had mono- or hemi-allelic methylation and eight had low-level methylation (2%). In one subject the epimutation was linked to the c.-27C>A promoter variant. Testing of 37 relatives from nine probands revealed paternal transmission of low-level methylation segregating with a c.+27G>A variant in one case. Five additional probands had a promoter variant without an MLH1 epimutation, with three showing diminished promoter activity in functional assays. Conclusion: Although rare, sequence changes in the regulatory region of MLH1 and aberrant methylation may alone or together predispose to the development of cancer. Screening for these changes is warranted in individuals who have a negative germline sequence screen of MLH1 and loss of MLH1 expression in their tumor. PMID:22878509
Detection of clinically relevant copy-number variants by exome sequencing in a large cohort of genetic disorders

PubMed Central

Pfundt, Rolph; del Rosario, Marisol; Vissers, Lisenka E.L.M.; Kwint, Michael P.; Janssen, Irene M.; de Leeuw, Nicole; Yntema, Helger G.; Nelen, Marcel R.; Lugtenberg, Dorien; Kamsteeg, Erik-Jan; Wieskamp, Nienke; Stegmann, Alexander P.A.; Stevens, Servi J.C.; Rodenburg, Richard J.T.; Simons, Annet; Mensenkamp, Arjen R.; Rinne, Tuula; Gilissen, Christian; Scheffer, Hans; Veltman, Joris A.; Hehir-Kwa, Jayne Y.

2017-01-01

Purpose: Copy-number variation is a common source of genomic variation and an important genetic cause of disease. Microarray-based analysis of copy-number variants (CNVs) has become a first-tier diagnostic test for patients with neurodevelopmental disorders, with a diagnostic yield of 10–20%. However, for most other genetic disorders, the role of CNVs is less clear and most diagnostic genetic studies are generally limited to the study of single-nucleotide variants (SNVs) and other small variants. With the introduction of exome and genome sequencing, it is now possible to detect both SNVs and CNVs using an exome- or genome-wide approach with a single test. Methods: We performed exome-based read-depth CNV screening on data from 2,603 patients affected by a range of genetic disorders for which exome sequencing was performed in a diagnostic setting. Results: In total, 123 clinically relevant CNVs ranging in size from 727 bp to 15.3 Mb were detected, which resulted in 51 conclusive diagnoses and an overall increase in diagnostic yield of ~2% (ranging from 0 to 5.8% per disorder). Conclusions: This study shows that CNVs play an important role in a broad range of genetic disorders and that detection via exome-based CNV profiling results in an increase in the diagnostic yield without additional testing, bringing us closer to single-test genomics. Genet Med advance online publication 27 October 2016 PMID:28574513
[A boy with Meier-Gorlin syndrome carrying a novel ORC6 mutation and uniparental disomy of chromosome 16].

PubMed

Li, Juan; Ding, Yu; Chang, Guoying; Cheng, Qing; Li, Xin; Wang, Jian; Wang, Xiumin; Shen, Yiping

2017-02-10

To identify the genetic cause for an 11-year-old Chinese boy with Meier-Gorlin syndrome (MGS). Chromosomal microarray analysis (CMA) was used to detect potential variations, while whole exome sequencing (WES) was used to identify sequence variants. Sanger sequencing was used to confirm the suspected variants. The boy presented with short stature, microtia, small patella, slender body build, craniofacial anomalies, and small testes with normal gonadotropin. A complete uniparental disomy of chromosome 16 was revealed by CMA. WES identified a novel homozygous mutation c.67A>G (p.Lys23Glu) in the ORC6 gene, which maps to chromosome 16. As predicted by Alamut functional software, the mutation may affect the function of a structural domain of the ORC6 protein. The patient is probably the first diagnosed MGS case in China, and carried a novel homozygous mutation of the ORC6 gene and uniparental disomy of chromosome 16. The effect of this novel mutation on growth and development needs to be further investigated.
Molecular characterization and detection of variants of Taenia multiceps in sheep in Turkey.

PubMed

Sonmez, Betul; Koroglu, Ergun; Simsek, Sami

2017-02-01

Taenia multiceps is a cestode (family Taeniidae) that in its adult stage lives in the small intestine of dogs and other canids. The metacestode, known as Coenurus cerebralis, is usually found in the central nervous system, including the brain and spinal cord, in sheep and other ruminants. The presence of cysts typically leads to neurological symptoms that in the majority of cases result in the death of the animal. Coenurosis could cause high losses in sheep farms because the disease commonly affects young animals. A total of 20 C. cerebralis isolates collected from naturally infected sheep in Mardin province of Turkey were characterized through the polymerase chain reaction and sequencing of a fragment of cytochrome c oxidase subunit 1 (CO1) gene. The results showed that the CO1 gene sequences were highly conserved in C. cerebralis isolates. Phylogenetic analysis based on partial CO1 gene sequences revealed that C. cerebralis isolates were composed of three different variants.
Evaluation of Nine Somatic Variant Callers for Detection of Somatic Mutations in Exome and Targeted Deep Sequencing Data.

PubMed

Krøigård, Anne Bruun; Thomassen, Mads; Lænkholm, Anne-Vibeke; Kruse, Torben A; Larsen, Martin Jakob

2016-01-01

Next generation sequencing is extensively applied to catalogue somatic mutations in cancer, in research settings and increasingly in clinical settings for molecular diagnostics, guiding therapy decisions. Somatic variant callers perform paired comparisons of sequencing data from cancer tissue and matched normal tissue in order to detect somatic mutations. The advent of many new somatic variant callers creates a need for comparison and validation of the tools, as no de facto standard for detection of somatic mutations exists and only limited comparisons have been reported. We have performed a comprehensive evaluation using exome sequencing and targeted deep sequencing data of paired tumor-normal samples from five breast cancer patients to evaluate the performance of nine publicly available somatic variant callers: EBCall, Mutect, Seurat, Shimmer, Indelocator, Somatic Sniper, Strelka, VarScan 2 and Virmid for the detection of single nucleotide mutations and small deletions and insertions. We report a large variation in the number of calls from the nine somatic variant callers on the same sequencing data and highly variable agreement. Sequencing depth had markedly diverse impact on individual callers, as for some callers, increased sequencing depth highly improved sensitivity. For SNV calling, we report EBCall, Mutect, Virmid and Strelka to be the most reliable somatic variant callers for both exome sequencing and targeted deep sequencing. For indel calling, EBCall is superior due to high sensitivity and robustness to changes in sequencing depths.
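The "highly variable agreement" between callers reported above can be quantified in several ways; one simple option is the pairwise Jaccard index of the call sets. The sketch below is only an illustration under that assumption: the function name, the variant key layout, and the toy caller names and calls are hypothetical, and the abstract does not state which agreement measure the authors used.

```python
from itertools import combinations


def pairwise_jaccard(calls_by_caller):
    """Return the Jaccard agreement for every pair of callers.

    calls_by_caller: dict mapping caller name -> set of variant keys,
    here (chrom, pos, ref, alt) tuples.
    """
    result = {}
    for a, b in combinations(sorted(calls_by_caller), 2):
        sa, sb = calls_by_caller[a], calls_by_caller[b]
        union = sa | sb
        # Two empty call sets are treated as fully concordant.
        result[(a, b)] = len(sa & sb) / len(union) if union else 1.0
    return result


# Toy call sets for two hypothetical callers on the same tumor-normal pair.
toy_calls = {
    "callerA": {("chr1", 100, "A", "T"), ("chr1", 200, "G", "C"), ("chr2", 50, "C", "T")},
    "callerB": {("chr1", 200, "G", "C"), ("chr2", 50, "C", "T"), ("chr3", 10, "T", "G")},
}
agreement = pairwise_jaccard(toy_calls)
```

A value of 1.0 means two callers reported identical variant sets; values near 0 mean they share almost no calls.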
Evaluation of Nine Somatic Variant Callers for Detection of Somatic Mutations in Exome and Targeted Deep Sequencing Data

PubMed Central

Krøigård, Anne Bruun; Thomassen, Mads; Lænkholm, Anne-Vibeke; Kruse, Torben A.; Larsen, Martin Jakob

2016-01-01

Next generation sequencing is extensively applied to catalogue somatic mutations in cancer, in research settings and increasingly in clinical settings for molecular diagnostics, guiding therapy decisions. Somatic variant callers perform paired comparisons of sequencing data from cancer tissue and matched normal tissue in order to detect somatic mutations. The advent of many new somatic variant callers creates a need for comparison and validation of the tools, as no de facto standard for detection of somatic mutations exists and only limited comparisons have been reported. We have performed a comprehensive evaluation using exome sequencing and targeted deep sequencing data of paired tumor-normal samples from five breast cancer patients to evaluate the performance of nine publicly available somatic variant callers: EBCall, Mutect, Seurat, Shimmer, Indelocator, Somatic Sniper, Strelka, VarScan 2 and Virmid for the detection of single nucleotide mutations and small deletions and insertions. We report a large variation in the number of calls from the nine somatic variant callers on the same sequencing data and highly variable agreement. Sequencing depth had markedly diverse impact on individual callers, as for some callers, increased sequencing depth highly improved sensitivity. For SNV calling, we report EBCall, Mutect, Virmid and Strelka to be the most reliable somatic variant callers for both exome sequencing and targeted deep sequencing. For indel calling, EBCall is superior due to high sensitivity and robustness to changes in sequencing depths. PMID:27002637
Molecular epidemiological studies on foot-and-mouth disease type O Taiwan viruses from the 1997 epidemic.

PubMed

Tsai, C P; Pan, C H; Liu, M Y; Lin, Y L; Chen, C M; Huang, T S; Cheng, I C; Jong, M H; Yang, P C

2000-06-01

Sequence diversity of the complete VP1 gene, directly amplified from 49 clinical specimens collected during an explosive foot-and-mouth disease (FMD) outbreak in Taiwan, was assessed. Type O Taiwan FMD viruses are genetically highly homogeneous, as seen by the minute divergence of 0.2-0.9% revealed in 20 variants. The O/HCP-0314/TW/97 and O/TCP-022/TW/97 viral variants dominated FMD outbreaks and were prevalent in most affected pig-raising areas. Comparison of deduced amino acid sequences around the main neutralizable antigenic sites on the VP1 polypeptide showed no significant antigenic variation. However, the O/CHP-158/TW/97 variant had an alternative critical residue at position 43 in antigenic site 3, which may be due to selective pressure in the field. Two vaccine production strains (O1/Manisa/Turkey/69 and O1/Campos/Brazil/71) probably provide partial heterologous protection of swine against O Taiwan viruses. The type O Taiwan variants clustered in sublineage A1 of four main lineages in the phylogenetic tree. The O/Hong Kong/9/94 and O/1685/Moscow/Russia/95 viruses in sublineage A2 are closely related to the O Taiwan variants. The causative agent for the 1997 epidemic presumably originated from a single common source of type O FMD viruses prevalent in neighboring areas.

Rare Variants in PLD3 Do Not Affect Risk for Early‐Onset Alzheimer Disease in a European Consortium Cohort

PubMed Central

Cacace, Rita; Van den Bossche, Tobi; Engelborghs, Sebastiaan; Geerts, Nathalie; Laureys, Annelies; Dillen, Lubina; Graff, Caroline; Thonberg, Håkan; Chiang, Huei‐Hsin; Pastor, Pau; Ortega‐Cubero, Sara; Pastor, Maria A.; Diehl‐Schmid, Janine; Alexopoulos, Panagiotis; Benussi, Luisa; Ghidoni, Roberta; Binetti, Giuliano; Nacmias, Benedetta; Sorbi, Sandro; Sanchez‐Valle, Raquel; Lladó, Albert; Gelpi, Ellen; Almeida, Maria Rosário; Santana, Isabel; Tsolaki, Magda; Koutroumani, Maria; Clarimon, Jordi; Lleó, Alberto; Fortea, Juan; de Mendonça, Alexandre; Martins, Madalena; Borroni, Barbara; Padovani, Alessandro; Matej, Radoslav; Rohan, Zdenek; Vandenbulcke, Mathieu; Vandenberghe, Rik; De Deyn, Peter P.; Cras, Patrick; van der Zee, Julie; Sleegers, Kristel

2015-01-01

Rare variants in the phospholipase D3 gene (PLD3) were associated with increased risk for late‐onset Alzheimer disease (LOAD). We identified a missense mutation in PLD3 in whole‐genome sequence data of a patient with autopsy confirmed Alzheimer disease (AD) and onset age of 50 years. Subsequently, we sequenced PLD3 in a Belgian early‐onset Alzheimer disease (EOAD) patient (N = 261) and control (N = 319) cohort, as well as in European EOAD patients (N = 946) and control individuals (N = 1,209) ascertained in different European countries. Overall, we identified 22 rare variants with a minor allele frequency <1%, 20 missense and two splicing mutations. Burden analysis did not provide significant evidence for an enrichment of rare PLD3 variants in EOAD patients in any of the patient/control cohorts. Also, meta‐analysis of the PLD3 data, including a published dataset of a German EOAD cohort, was not significant (P = 0.43; OR = 1.53, 95% CI 0.60–3.31). Consequently, our data do not support a role for PLD3 rare variants in the genetic etiology of EOAD in European EOAD patients. Our data corroborate the negative replication data obtained in LOAD studies and therefore a genetic role of PLD3 in AD remains to be demonstrated. PMID:26411346
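For readers unfamiliar with how an odds ratio and its 95% confidence interval are obtained from carrier counts, the sketch below applies the standard Woolf (log-based) interval to a single 2×2 table. It is a simplified stand-in: the abstract's figures come from a burden test and a meta-analysis across cohorts, which this does not reproduce, and the function name and counts are hypothetical.

```python
import math


def odds_ratio_ci(case_carriers, case_noncarriers, ctrl_carriers, ctrl_noncarriers, z=1.96):
    """Odds ratio with a Woolf (log-normal) confidence interval for a 2x2 table."""
    a, b, c, d = (float(x) for x in (case_carriers, case_noncarriers,
                                     ctrl_carriers, ctrl_noncarriers))
    if min(a, b, c, d) == 0:
        # Haldane-Anscombe correction avoids division by zero in sparse tables.
        a, b, c, d = a + 0.5, b + 0.5, c + 0.5, d + 0.5
    odds_ratio = (a * d) / (b * c)
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# Hypothetical counts (not taken from the study): carriers vs non-carriers of any rare variant.
or_value, ci_low, ci_high = odds_ratio_ci(10, 90, 5, 95)
```

When the interval spans 1, as it does for the abstract's reported 0.60–3.31, the data do not show a significant enrichment of carriers among patients.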
Whole exome sequencing of rare variants in EIF4G1 and VPS35 in Parkinson disease

PubMed Central

Nuytemans, Karen; Bademci, Guney; Inchausti, Vanessa; Dressen, Amy; Kinnamon, Daniel D.; Mehta, Arpit; Wang, Liyong; Züchner, Stephan; Beecham, Gary W.; Martin, Eden R.; Scott, William K.

2013-01-01

Objective: Recently, vacuolar protein sorting 35 (VPS35) and eukaryotic translation initiation factor 4 gamma 1 (EIF4G1) have been identified as 2 causal Parkinson disease (PD) genes. We used whole exome sequencing for rapid, parallel analysis of variations in these 2 genes. Methods: We performed whole exome sequencing in 213 patients with PD and 272 control individuals. Those rare variants (RVs) with <5% frequency in the exome variant server database and our own control data were considered for analysis. We performed joint gene-based tests for association using RVASSOC and SKAT (Sequence Kernel Association Test) as well as single-variant test statistics. Results: We identified 3 novel VPS35 variations that changed the coded amino acid (nonsynonymous) in 3 cases. Two variations were in multiplex families and neither segregated with PD. In EIF4G1, we identified 11 (9 nonsynonymous and 2 small indels) RVs including the reported pathogenic mutation p.R1205H, which segregated in all affected members of a large family, but also in 1 unaffected 86-year-old family member. Two additional RVs were found in isolated patients only. Whereas initial association studies suggested an association (p = 0.04) with all RVs in EIF4G1, subsequent testing in a second dataset for the driving variant (p.F1461) suggested no association between RVs in the gene and PD. Conclusions: We confirm that the specific EIF4G1 variation p.R1205H seems to be a strong PD risk factor, but is nonpenetrant in at least one 86-year-old. A few other select RVs in both genes could not be ruled out as causal. However, there was no evidence for an overall contribution of genetic variability in VPS35 or EIF4G1 to PD development in our dataset. PMID:23408866
International interlaboratory study comparing single organism 16S rRNA gene sequencing data: Beyond consensus sequence comparisons

PubMed Central

Olson, Nathan D.; Lund, Steven P.; Zook, Justin M.; Rojas-Cornejo, Fabiola; Beck, Brian; Foy, Carole; Huggett, Jim; Whale, Alexandra S.; Sui, Zhiwei; Baoutina, Anna; Dobeson, Michael; Partis, Lina; Morrow, Jayne B.

2015-01-01

This study presents the results from an interlaboratory sequencing study for which we developed a novel high-resolution method for comparing data from different sequencing platforms for a multi-copy, paralogous gene. The combination of PCR amplification and 16S ribosomal RNA gene (16S rRNA) sequencing has revolutionized bacteriology by enabling rapid identification, frequently without the need for culture. To assess variability between laboratories in sequencing 16S rRNA, six laboratories sequenced the gene encoding the 16S rRNA from Escherichia coli O157:H7 strain EDL933 and Listeria monocytogenes serovar 4b strain NCTC11994. Participants performed sequencing methods and protocols available in their laboratories: Sanger sequencing, Roche 454 pyrosequencing®, or Ion Torrent PGM®. The sequencing data were evaluated on three levels: (1) identity of biologically conserved position, (2) ratio of 16S rRNA gene copies featuring identified variants, and (3) the collection of variant combinations in a set of 16S rRNA gene copies. The same set of biologically conserved positions was identified for each sequencing method. Analytical methods using Bayesian and maximum likelihood statistics were developed to estimate variant copy ratios, which describe the ratio of nucleotides at each identified biologically variable position, as well as the likely set of variant combinations present in 16S rRNA gene copies. Our results indicate that estimated variant copy ratios at biologically variable positions were only reproducible for high throughput sequencing methods. Furthermore, the likely variant combination set was only reproducible with increased sequencing depth and longer read lengths. We also demonstrate novel methods for evaluating variable positions when comparing multi-copy gene sequence data from multiple laboratories generated using multiple sequencing technologies. PMID:27077030
Receptor homodimerization plays a critical role in a novel dominant negative P2RY12 variant identified in a family with severe bleeding.

PubMed

Mundell, S J; Rabbolini, D; Gabrielli, S; Chen, Q; Aungraheeta, R; Hutchinson, J L; Kilo, T; Mackay, J; Ward, C M; Stevenson, W; Morel-Kopp, M-C

2018-01-01

Essentials Three dominant variants for the autosomal recessive bleeding disorder type-8 have been described. To date, there has been no phenotype/genotype correlation explaining their dominant transmission. Proline plays an important role in P2Y12R ligand binding and signaling defects. P2Y12R homodimer formation is critical for the receptor function and signaling. Background Although inherited platelet disorders are still underdiagnosed worldwide, advances in molecular techniques are improving disease diagnosis and patient management. Objective To identify and characterize the mechanism underlying the bleeding phenotype in a Caucasian family with an autosomal dominant P2RY12 variant. Methods Full blood counts, platelet aggregometry, flow cytometry and western blotting were performed before next-generation sequencing (NGS). Detailed molecular analysis of the identified variant of the P2Y12 receptor (P2Y12R) was subsequently performed in mammalian cells overexpressing receptor constructs. Results All three referred individuals had markedly impaired ADP-induced platelet aggregation with primary wave only, despite normal total and surface P2Y12R expression. By NGS, a single P2RY12:c.G794C substitution (p.R265P) was identified in all affected individuals, and this was confirmed by Sanger sequencing. Mammalian cell experiments with the R265P-P2Y12R variant showed normal receptor surface expression versus wild-type (WT) P2Y12R. Agonist-stimulated R265P-P2Y12R function (both signaling and surface receptor loss) was reduced versus WT P2Y12R. Critically, R265P-P2Y12R acted in a dominant negative manner, with agonist-stimulated WT P2Y12R activity being reduced by variant coexpression, suggesting dramatic loss of WT homodimers. Importantly, platelet P2RY12 cDNA cloning and sequencing in two affected individuals also revealed three-fold mutant mRNA overexpression, decreasing even further the likelihood of WT homodimer formation. 
R265 located within extracellular loop 3 (EL3) is one of four residues that are important for receptor functional integrity, maintaining the binding pocket conformation and allowing rotation following ligand binding. Conclusion This novel dominant negative variant confirms the important role of R265 in EL3 in the functional integrity of P2Y12R, and suggests that pathologic heterodimer formation may underlie this family bleeding phenotype. © 2017 International Society on Thrombosis and Haemostasis.
Genomic prediction using preselected DNA variants from a GWAS with whole-genome sequence data in Holstein-Friesian cattle.

PubMed

Veerkamp, Roel F; Bouwman, Aniek C; Schrooten, Chris; Calus, Mario P L

2016-12-01

Whole-genome sequence data is expected to capture genetic variation more completely than common genotyping panels. Our objective was to compare the proportion of variance explained and the accuracy of genomic prediction by using imputed sequence data or preselected SNPs from a genome-wide association study (GWAS) with imputed whole-genome sequence data. Phenotypes were available for 5503 Holstein-Friesian bulls. Genotypes were imputed up to whole-genome sequence (13,789,029 segregating DNA variants) by using run 4 of the 1000 bull genomes project. The program GCTA was used to perform GWAS for protein yield (PY), somatic cell score (SCS) and interval from first to last insemination (IFL). From the GWAS, subsets of variants were selected and genomic relationship matrices (GRM) were used to estimate the variance explained in 2087 validation animals and to evaluate the genomic prediction ability. Finally, two GRM were fitted together in several models to evaluate the effect of selected variants that were in competition with all the other variants. The GRM based on full sequence data explained only marginally more genetic variation than that based on common SNP panels: for PY, SCS and IFL, genomic heritability improved from 0.81 to 0.83, 0.83 to 0.87 and 0.69 to 0.72, respectively. Sequence data also helped to identify more variants linked to quantitative trait loci and resulted in clearer GWAS peaks across the genome. The proportion of total variance explained by the selected variants combined in a GRM was considerably smaller than that explained by all variants (less than 0.31 for all traits). When selected variants were used, accuracy of genomic predictions decreased and bias increased. Although 35 to 42 variants were detected that together explained 13 to 19% of the total variance (18 to 23% of the genetic variance) when fitted alone, there was no advantage in using dense sequence information for genomic prediction in the Holstein data used in our study. 
Detection and selection of variants within a single breed are difficult due to long-range linkage disequilibrium. Stringent selection of variants resulted in more biased genomic predictions, although this might be due to the training population being the same dataset from which the selected variants were identified.
An automatic and efficient pipeline for disease gene identification through utilizing family-based sequencing data.

PubMed

Song, Dandan; Li, Ning; Liao, Lejian

2015-01-01

Because whole-exome sequencing technologies generate enormous amounts of data at lower cost and in less time, they provide major opportunities for identifying disease genes implicated in Mendelian disorders. Since thousands of genomic variants can be detected in each exome, it is challenging to filter for pathogenic variants in protein-coding regions while reducing the number of true variants that are missed. Therefore, an automatic and efficient pipeline for finding disease variants in Mendelian disorders was designed that combines several variant-filtering steps to analyze family-based exome sequencing data. Recent studies on the Freeman-Sheldon disease are revisited and show that the proposed method outperforms other existing candidate gene identification methods.
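The abstract does not list the pipeline's individual filtering steps, so the following is only a generic sketch of what a family-based filter chain can look like: keep variants that are rare, predicted to alter the protein, carried by every affected relative, and absent from unaffected relatives. The record layout, the 1% frequency cutoff, the effect categories, and the fully penetrant dominant model are all assumptions for illustration.

```python
def filter_candidates(variants, affected, unaffected, max_af=0.01):
    """Return IDs of variants that pass a simple family-based filter chain.

    variants: list of dicts with keys 'id', 'af' (population allele frequency),
              'effect' (predicted consequence) and 'carriers' (set of sample IDs).
    affected / unaffected: sets of sample IDs within the family.
    """
    # Hypothetical list of protein-altering consequence labels.
    damaging = {"missense", "nonsense", "frameshift", "splice"}
    kept = []
    for v in variants:
        if v["af"] > max_af:
            continue  # too common to explain a rare Mendelian disorder
        if v["effect"] not in damaging:
            continue  # not predicted to change the protein
        if not affected <= v["carriers"]:
            continue  # must be shared by every affected relative
        if unaffected & v["carriers"]:
            continue  # must be absent from unaffected relatives
        kept.append(v["id"])
    return kept


# Toy family: two affected relatives (a1, a2) and one unaffected relative (u1).
toy_variants = [
    {"id": "v1", "af": 0.0005, "effect": "missense", "carriers": {"a1", "a2"}},
    {"id": "v2", "af": 0.2, "effect": "missense", "carriers": {"a1", "a2"}},
    {"id": "v3", "af": 0.0001, "effect": "synonymous", "carriers": {"a1", "a2"}},
    {"id": "v4", "af": 0.0001, "effect": "nonsense", "carriers": {"a1", "a2", "u1"}},
    {"id": "v5", "af": 0.0001, "effect": "frameshift", "carriers": {"a1"}},
]
candidates = filter_candidates(toy_variants, {"a1", "a2"}, {"u1"})
```

Each added filter shrinks the candidate list but also raises the chance of discarding the true variant, which is the trade-off the abstract describes.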
Direct protein interaction underlies gene-for-gene specificity and coevolution of the flax resistance genes and flax rust avirulence genes

PubMed Central

Dodds, Peter N.; Lawrence, Gregory J.; Catanzariti, Ann-Maree; Teh, Trazel; Wang, Ching-I. A.; Ayliffe, Michael A.; Kobe, Bostjan; Ellis, Jeffrey G.

2006-01-01

Plant resistance proteins (R proteins) recognize corresponding pathogen avirulence (Avr) proteins either indirectly through detection of changes in their host protein targets or through direct R–Avr protein interaction. Although indirect recognition imposes selection against Avr effector function, pathogen effector molecules recognized through direct interaction may overcome resistance through sequence diversification rather than loss of function. Here we show that the flax rust fungus AvrL567 genes, whose products are recognized by the L5, L6, and L7 R proteins of flax, are highly diverse, with 12 sequence variants identified from six rust strains. Seven AvrL567 variants derived from Avr alleles induce necrotic responses when expressed in flax plants containing corresponding resistance genes (R genes), whereas five variants from avr alleles do not. Differences in recognition specificity between AvrL567 variants and evidence for diversifying selection acting on these genes suggest they have been involved in a gene-specific arms race with the corresponding flax R genes. Yeast two-hybrid assays indicate that recognition is based on direct R–Avr protein interaction and recapitulate the interaction specificity observed in planta. Biochemical analysis of Escherichia coli-produced AvrL567 proteins shows that variants that escape recognition nevertheless maintain a conserved structure and stability, suggesting that the amino acid sequence differences directly affect the R–Avr protein interaction. We suggest that direct recognition associated with high genetic diversity at corresponding R and Avr gene loci represents an alternative outcome of plant–pathogen coevolution to indirect recognition associated with simple balanced polymorphisms for functional and nonfunctional R and Avr genes. PMID:16731621
Effect of Next-Generation Exome Sequencing Depth for Discovery of Diagnostic Variants.

PubMed

Kim, Kyung; Seong, Moon-Woo; Chung, Won-Hyong; Park, Sung Sup; Leem, Sangseob; Park, Won; Kim, Jihyun; Lee, KiYoung; Park, Rae Woong; Kim, Namshin

2015-06-01

Sequencing depth, which is directly related to the cost and time required for the generation, processing, and maintenance of next-generation sequencing data, is an important factor in the practical utilization of such data in clinical fields. Unfortunately, identifying an exome sequencing depth adequate for clinical use is a challenge that has not been addressed extensively. Here, we investigate the effect of exome sequencing depth on the discovery of sequence variants for clinical use. Toward this, we sequenced ten germ-line blood samples from breast cancer patients on the Illumina platform GAII(x) at a high depth of ~200×. We observed that most function-related diverse variants in the human exonic regions could be detected at a sequencing depth of 120×. Furthermore, investigation using a diagnostic gene set showed that the number of clinical variants identified using exome sequencing reached a plateau at an average sequencing depth of about 120×. Moreover, the phenomena were consistent across the breast cancer samples.
Carbohydrate degrading polypeptide and uses thereof

DOEpatents

Sagt, Cornelis Maria Jacobus; Schooneveld-Bergmans, Margot Elisabeth Francoise; Roubos, Johannes Andries; Los, Alrik Pieter

2015-10-20

The invention relates to a polypeptide having carbohydrate material degrading activity which comprises the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1 or SEQ ID NO: 4, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional protein and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Improving the Performance of the Prony Method Using a Wavelet Domain Filter for MRI Denoising

PubMed Central

Lentini, Marianela; Paluszny, Marco

2014-01-01

The Prony methods are used for exponential fitting. We use a variant of the Prony method for abnormal brain tissue detection in sequences of T2-weighted magnetic resonance images. Here, MR images are considered to be affected only by Rician noise, and a new wavelet domain bilateral filtering process is implemented to reduce the noise in the images. This filter is a modification of Kazubek's algorithm and we use synthetic images to show the ability of the new procedure to suppress noise and compare its performance with respect to the original filter, using quantitative and qualitative criteria. The tissue classification process is illustrated using a real sequence of T2 MR images, and the filter is applied to each image before using the variant of the Prony method. PMID:24834108
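To make "exponential fitting" concrete, here is a minimal one-term Prony sketch for a single pixel's signal sampled at equally spaced echo times. It assumes a noise-free mono-exponential decay, signal[k] ≈ amplitude · exp(−k·dt/T2); the authors' specific Prony variant and their wavelet filter are not described in enough detail in the abstract to reproduce, so the function name and the toy values are illustrative only.

```python
import math


def fit_single_exponential(signal, dt):
    """One-term Prony fit of signal[k] ~ amplitude * exp(-k * dt / t2)."""
    if len(signal) < 2:
        raise ValueError("need at least two samples")
    # Step 1 (linear prediction): for a pure exponential, signal[k+1] = z * signal[k].
    # Solve for z in the least-squares sense.
    num = sum(x * y for x, y in zip(signal[:-1], signal[1:]))
    den = sum(x * x for x in signal[:-1])
    z = num / den
    if not 0.0 < z < 1.0:
        raise ValueError("no decaying exponential found")
    # Step 2: with z fixed, the amplitude is a linear least-squares problem.
    powers = [z ** k for k in range(len(signal))]
    amplitude = sum(s * p for s, p in zip(signal, powers)) / sum(p * p for p in powers)
    # Convert the per-sample decay factor back to a time constant.
    t2 = -dt / math.log(z)
    return amplitude, t2


# Toy decay: amplitude 100, T2 = 80 ms, eight echoes spaced 10 ms apart.
echoes = [100.0 * math.exp(-k * 10.0 / 80.0) for k in range(8)]
amp_est, t2_est = fit_single_exponential(echoes, 10.0)
```

The linear-prediction step is sensitive to noise, which is consistent with the abstract's emphasis on denoising each image before the Prony fit.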
Improving the performance of the prony method using a wavelet domain filter for MRI denoising.

PubMed

Jaramillo, Rodney; Lentini, Marianela; Paluszny, Marco

2014-01-01

The Prony methods are used for exponential fitting. We use a variant of the Prony method for abnormal brain tissue detection in sequences of T2-weighted magnetic resonance images. Here, MR images are considered to be affected only by Rician noise, and a new wavelet domain bilateral filtering process is implemented to reduce the noise in the images. This filter is a modification of Kazubek's algorithm and we use synthetic images to show the ability of the new procedure to suppress noise and compare its performance with respect to the original filter, using quantitative and qualitative criteria. The tissue classification process is illustrated using a real sequence of T2 MR images, and the filter is applied to each image before using the variant of the Prony method.
Local energetic frustration affects the dependence of green fluorescent protein folding on the chaperonin GroEL.

PubMed

Bandyopadhyay, Boudhayan; Goldenzweig, Adi; Unger, Tamar; Adato, Orit; Fleishman, Sarel J; Unger, Ron; Horovitz, Amnon

2017-12-15

The GroE chaperonin system in Escherichia coli comprises GroEL and GroES and facilitates ATP-dependent protein folding in vivo and in vitro. Proteins with very similar sequences and structures can differ in their dependence on GroEL for efficient folding. One potential but unverified source for GroEL dependence is frustration, wherein not all interactions in the native state are optimized energetically, thereby potentiating slow folding and misfolding. Here, we chose enhanced green fluorescent protein as a model system and subjected it to random mutagenesis, followed by screening for variants whose in vivo folding displays increased or decreased GroEL dependence. We confirmed the altered GroEL dependence of these variants with in vitro folding assays. Strikingly, mutations at positions predicted to be highly frustrated were found to correlate with decreased GroEL dependence. Conversely, mutations at positions with low frustration were found to correlate with increased GroEL dependence. Further support for this finding was obtained by showing that folding of an enhanced green fluorescent protein variant designed computationally to have reduced frustration is indeed less GroEL-dependent. Our results indicate that changes in local frustration also affect partitioning in vivo between spontaneous and chaperonin-mediated folding. Hence, the design of minimally frustrated sequences can reduce chaperonin dependence and improve protein expression levels. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
A mutation in sigma-1 receptor causes juvenile amyotrophic lateral sclerosis.

PubMed

Al-Saif, Amr; Al-Mohanna, Futwan; Bohlega, Saeed

2011-12-01

Amyotrophic lateral sclerosis (ALS) is a neurodegenerative disorder characterized by loss of motor neurons in the brain and spinal cord, leading to muscle weakness and eventually death from respiratory failure. ALS is familial in about 10% of cases, with SOD1 mutations accounting for 20% of familial cases. Here we describe a consanguineous family segregating juvenile ALS in an autosomal recessive pattern and describe the genetic variant responsible for the disorder. We performed homozygosity mapping and direct sequencing to detect the genetic variant and tested the effect of this variant on a motor neuron-like cell line model (NSC34) expressing the wild-type or mutant gene. We identified a shared homozygosity region in affected individuals that spans ~120 kbp on chromosome 9p13.3 containing 9 RefSeq genes. Sequencing the SIGMAR1 gene revealed a mutation affecting a highly conserved amino acid located in the transmembrane domain of the encoded protein, sigma-1 receptor. The mutated protein showed an aberrant subcellular distribution in NSC34 cells. Furthermore, cells expressing the mutant protein were less resistant to apoptosis induced by endoplasmic reticulum stress. Sigma-1 receptors are known to have neuroprotective properties, and recently Sigmar1 knockout mice have been described to have motor deficiency. Our findings emphasize the role of sigma-1 receptors in motor neuron function and disease. Copyright © 2011 American Neurological Association.
HIV-1 adaptation to antigen processing results in population-level immune evasion and affects subtype diversification.

PubMed

Tenzer, Stefan; Crawford, Hayley; Pymm, Phillip; Gifford, Robert; Sreenu, Vattipally B; Weimershaus, Mirjana; de Oliveira, Tulio; Burgevin, Anne; Gerstoft, Jan; Akkad, Nadja; Lunn, Daniel; Fugger, Lars; Bell, John; Schild, Hansjörg; van Endert, Peter; Iversen, Astrid K N

2014-04-24

The recent HIV-1 vaccine failures highlight the need to better understand virus-host interactions. One key question is why CD8(+) T cell responses to two HIV-Gag regions are uniquely associated with delayed disease progression only in patients expressing a few rare HLA class I variants when these regions encode epitopes presented by ~30 more common HLA variants. By combining epitope processing and computational analyses of the two HIV subtypes responsible for ~60% of worldwide infections, we identified a hitherto unrecognized adaptation to the antigen-processing machinery through substitutions at subtype-specific motifs. Multiple HLA variants presenting epitopes situated next to a given subtype-specific motif drive selection at this subtype-specific position, and epitope abundances correlate inversely with the HLA frequency distribution in affected populations. This adaptation reflects the sum of intrapatient adaptations, is predictable, facilitates viral subtype diversification, and increases global HIV diversity. Because low epitope abundance is associated with infrequent and weak T cell responses, this most likely results in both population-level immune evasion and inadequate responses in most people vaccinated with natural HIV-1 sequence constructs. Our results suggest that artificial sequence modifications at subtype-specific positions in vitro could refocus and reverse the poor immunogenicity of HIV proteins. Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.
X-exome sequencing identifies a HDAC8 variant in a large pedigree with X-linked intellectual disability, truncal obesity, gynaecomastia, hypogonadism and unusual face.

PubMed

Harakalova, Magdalena; van den Boogaard, Marie-Jose; Sinke, Richard; van Lieshout, Stef; van Tuil, Marc C; Duran, Karen; Renkens, Ivo; Terhal, Paulien A; de Kovel, Carolien; Nijman, Ies J; van Haelst, Mieke; Knoers, Nine V A M; van Haaften, Gijs; Kloosterman, Wigard; Hennekam, Raoul C M; Cuppen, Edwin; Ploos van Amstel, Hans Kristian

2012-08-01

We present a large Dutch family with seven males affected by a novel syndrome of X-linked intellectual disability, hypogonadism, gynaecomastia, truncal obesity, short stature and recognisable craniofacial manifestations resembling but not identical to Wilson-Turner syndrome. Seven female relatives show a much milder expression of the phenotype. We performed X chromosome exome (X-exome) sequencing in five individuals from this family and identified a novel intronic variant in the histone deacetylase 8 gene (HDAC8), c.164+5G>A, which disturbs the normal splicing of exon 2 resulting in exon skipping, and introduces a premature stop at the beginning of the histone deacetylase catalytic domain. The identified variant completely segregates in this family and was absent in 96 Dutch controls and available databases. Affected female carriers showed a notably skewed X-inactivation pattern in lymphocytes in which the mutated X-chromosome was completely inactivated. HDAC8 is a member of the protein family of histone deacetylases that play a major role in epigenetic gene silencing during development. HDAC8 specifically controls the patterning of the skull with the mouse HDAC8 knock-out showing craniofacial deformities of the skull. The present family provides the first evidence for involvement of HDAC8 in a syndromic form of intellectual disability.
Validation and optimization of the Ion Torrent S5 XL sequencer and Oncomine workflow for BRCA1 and BRCA2 genetic testing.

PubMed

Shin, Saeam; Kim, Yoonjung; Chul Oh, Seoung; Yu, Nae; Lee, Seung-Tae; Rak Choi, Jong; Lee, Kyung-A

2017-05-23

In this study, we validated the analytical performance of BRCA1/2 sequencing using Ion Torrent's new bench-top sequencer with amplicon panel with optimized bioinformatics pipelines. Using 43 samples that were previously validated by Illumina's MiSeq platform and/or by Sanger sequencing/multiplex ligation-dependent probe amplification, we amplified the target with the Oncomine™ BRCA Research Assay and sequenced on Ion Torrent S5 XL (Thermo Fisher Scientific, Waltham, MA, USA). We compared two bioinformatics pipelines for optimal processing of S5 XL sequence data: the Torrent Suite with a plug-in Torrent Variant Caller (Thermo Fisher Scientific), and commercial NextGENe software (Softgenetics, State College, PA, USA). All expected 681 single nucleotide variants, 15 small indels, and three copy number variants were correctly called, except one common variant adjacent to a rare variant on the primer-binding site. The sensitivity, specificity, false positive rate, and accuracy for detection of single nucleotide variant and small indels of S5 XL sequencing were 99.85%, 100%, 0%, and 99.99% for the Torrent Variant Caller and 99.85%, 99.99%, 0.14%, and 99.99% for NextGENe, respectively. The reproducibility of variant calling was 100%, and the precision of variant frequency also showed good performance with coefficients of variation between 0.32 and 5.29%. We obtained highly accurate data through uniform and sufficient coverage depth over all target regions and through optimization of the bioinformatics pipeline. We confirmed that our platform is accurate and practical for diagnostic BRCA1/2 testing in a clinical laboratory.
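The performance figures in this abstract follow the usual confusion-matrix definitions, which the minimal Python sketch below spells out. The true-positive and false-negative counts come from the abstract (681 single nucleotide variants plus 15 small indels expected, one call missed); the true-negative count is a hypothetical placeholder because the abstract does not report it, and the function names are illustrative only.

```python
from statistics import mean, stdev


def performance(tp, fp, tn, fn):
    """Return sensitivity, specificity, false positive rate and accuracy as fractions."""
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "false_positive_rate": fp / (fp + tn),
        "accuracy": (tp + tn) / (tp + fp + tn + fn),
    }


def coefficient_of_variation(values):
    """Sample standard deviation divided by the mean, expressed as a percentage."""
    return 100.0 * stdev(values) / mean(values)


# 681 SNVs + 15 small indels expected, one call missed (counts from the abstract).
# tn=10_000 is a hypothetical placeholder, not a figure from the study.
metrics = performance(tp=695, fp=0, tn=10_000, fn=1)

# Hypothetical replicate variant allele frequencies (percent) for one variant.
cv_percent = coefficient_of_variation([49.0, 50.0, 51.0])
```

With these counts the sensitivity is 695/696, close to the value reported for both pipelines; the abstract does not state the exact denominators or rounding used, so small differences are to be expected.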
Novel down-regulatory mechanism of the surface expression of the vasopressin V2 receptor by an alternative splice receptor variant.

PubMed

Sarmiento, José M; Añazco, Carolina C; Campos, Danae M; Prado, Gregory N; Navarro, Javier; González, Carlos B

2004-11-05

In rat kidney, two alternatively spliced transcripts are generated from the V2 vasopressin receptor gene. The large transcript (1.2 kb) encodes the canonical V2 receptor, whereas the small transcript encodes a splice variant displaying a distinct sequence corresponding to the putative seventh transmembrane domain and the intracellular C terminus of the V2 receptor. This work showed that the small spliced transcript is translated in the rat kidney collecting tubules. However, the protein encoded by the small transcript (here called the V2b splice variant) is retained inside the cell, in contrast to the preferential surface distribution of the V2 receptor (here called the V2a receptor). Cells expressing the V2b splice variant do not exhibit binding to 3H-labeled vasopressin. Interestingly, we found that expression of the splice variant V2b down-regulates the surface expression of the V2a receptor, most likely via the formation of V2a.V2b heterodimers as demonstrated by co-immunoprecipitation and fluorescence resonance energy transfer experiments between the V2a receptor and the V2b splice variant. The V2b splice variant would then be acting as a dominant negative. The effect of the V2b splice variant is specific, as it does not affect the surface expression of the G protein-coupled interleukin-8 receptor (CXCR1). Furthermore, the sequence encompassing residues 242-339, corresponding to the C-terminal domain of the V2b splice variant, also down-regulates the surface expression of the V2a receptor. We suggest that some forms of nephrogenic diabetes insipidus are due to overexpression of the splice variant V2b, which could retain the wild-type V2a receptor inside the cell via the formation of V2a.V2b heterodimers.
De Novo and Inherited Loss-of-Function Variants in TLK2: Clinical and Genotype-Phenotype Evaluation of a Distinct Neurodevelopmental Disorder.

PubMed

Reijnders, Margot R F; Miller, Kerry A; Alvi, Mohsan; Goos, Jacqueline A C; Lees, Melissa M; de Burca, Anna; Henderson, Alex; Kraus, Alison; Mikat, Barbara; de Vries, Bert B A; Isidor, Bertrand; Kerr, Bronwyn; Marcelis, Carlo; Schluth-Bolard, Caroline; Deshpande, Charu; Ruivenkamp, Claudia A L; Wieczorek, Dagmar; Baralle, Diana; Blair, Edward M; Engels, Hartmut; Lüdecke, Hermann-Josef; Eason, Jacqueline; Santen, Gijs W E; Clayton-Smith, Jill; Chandler, Kate; Tatton-Brown, Katrina; Payne, Katelyn; Helbig, Katherine; Radtke, Kelly; Nugent, Kimberly M; Cremer, Kirsten; Strom, Tim M; Bird, Lynne M; Sinnema, Margje; Bitner-Glindzicz, Maria; van Dooren, Marieke F; Alders, Marielle; Koopmans, Marije; Brick, Lauren; Kozenko, Mariya; Harline, Megan L; Klaassens, Merel; Steinraths, Michelle; Cooper, Nicola S; Edery, Patrick; Yap, Patrick; Terhal, Paulien A; van der Spek, Peter J; Lakeman, Phillis; Taylor, Rachel L; Littlejohn, Rebecca O; Pfundt, Rolph; Mercimek-Andrews, Saadet; Stegmann, Alexander P A; Kant, Sarina G; McLean, Scott; Joss, Shelagh; Swagemakers, Sigrid M A; Douzgou, Sofia; Wall, Steven A; Küry, Sébastien; Calpena, Eduardo; Koelling, Nils; McGowan, Simon J; Twigg, Stephen R F; Mathijssen, Irene M J; Nellaker, Christoffer; Brunner, Han G; Wilkie, Andrew O M

2018-06-07

Next-generation sequencing is a powerful tool for the discovery of genes related to neurodevelopmental disorders (NDDs). Here, we report the identification of a distinct syndrome due to de novo or inherited heterozygous mutations in Tousled-like kinase 2 (TLK2) in 38 unrelated individuals and two affected mothers, using whole-exome and whole-genome sequencing technologies, matchmaker databases, and international collaborations. Affected individuals had a consistent phenotype, characterized by mild-borderline neurodevelopmental delay (86%), behavioral disorders (68%), severe gastro-intestinal problems (63%), and facial dysmorphism including blepharophimosis (82%), telecanthus (74%), prominent nasal bridge (68%), broad nasal tip (66%), thin vermilion of the upper lip (62%), and upslanting palpebral fissures (55%). Analysis of cell lines from three affected individuals showed that mutations act through a loss-of-function mechanism in at least two case subjects. Genotype-phenotype analysis and comparison of computationally modeled faces showed that phenotypes of these and other individuals with loss-of-function variants significantly overlapped with phenotypes of individuals with other variant types (missense and C-terminal truncating). This suggests that haploinsufficiency of TLK2 is the most likely underlying disease mechanism, leading to a consistent neurodevelopmental phenotype. This work illustrates the power of international data sharing, by the identification of 40 individuals from 26 different centers in 7 different countries, allowing the identification, clinical delineation, and genotype-phenotype evaluation of a distinct NDD caused by mutations in TLK2. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
Exome sequencing identifies variants in two genes encoding the LIM-proteins NRAP and FHL1 in an Italian patient with BAG3 myofibrillar myopathy.

PubMed

D'Avila, Francesca; Meregalli, Mirella; Lupoli, Sara; Barcella, Matteo; Orro, Alessandro; De Santis, Francesca; Sitzia, Clementina; Farini, Andrea; D'Ursi, Pasqualina; Erratico, Silvia; Cristofani, Riccardo; Milanesi, Luciano; Braga, Daniele; Cusi, Daniele; Poletti, Angelo; Barlassina, Cristina; Torrente, Yvan

2016-06-01

Myofibrillar myopathies (MFMs) are genetically heterogeneous dystrophies characterized by the disintegration of Z-disks and myofibrils and are associated with mutations in genes encoding Z-disk or Z-disk-related proteins. The c.626 C > T (p.P209L) mutation in the BAG3 gene has been described as causative of a subtype of MFM. We report a sporadic case of a 26-year-old Italian woman, affected by MFM with axonal neuropathy, cardiomyopathy, rigid spine, who carries the c.626 C > T mutation in the BAG3 gene. The patient and her non-consanguineous healthy parents and brother were studied with whole exome sequencing (WES) to further investigate the genetic basis of this complex phenotype. In the patient, we found that the BAG3 mutation is associated with variants in the NRAP and FHL1 genes that encode muscle-specific, LIM domain containing proteins. Quantitative real time PCR, immunohistochemistry and Western blot analysis of the patient's muscular biopsy showed the absence of NRAP expression and FHL1 accumulation in aggregates in the affected skeletal muscle tissue. Molecular dynamic analysis of the mutated FHL1 domain showed a modification in its surface charge, which could affect its capability to bind its target proteins. To our knowledge this is the first study reporting, in a BAG3 MFM, the simultaneous presence of genetic variants in the BAG3 and FHL1 genes (previously described as independently associated with MFMs) and linking the NRAP gene to MFM for the first time.
A novel germline PIGA mutation in Ferro-Cerebro-Cutaneous syndrome: a neurodegenerative X-linked epileptic encephalopathy with systemic iron-overload.

PubMed

Swoboda, Kathryn J; Margraf, Rebecca L; Carey, John C; Zhou, Holly; Newcomb, Tara M; Coonrod, Emily; Durtschi, Jacob; Mallempati, Kalyan; Kumanovics, Attila; Katz, Ben E; Voelkerding, Karl V; Opitz, John M

2014-01-01

Three related males presented with a newly recognized X-linked syndrome associated with neurodegeneration, cutaneous abnormalities, and systemic iron overload. Linkage studies demonstrated that they shared a haplotype on Xp21.3-Xp22.2 and exome sequencing was used to identify candidate variants. Of the segregating variants, only a PIGA mutation segregated with disease in the family. The c.328_330delCCT PIGA variant predicts p.Leu110del (or c.1030_1032delCTT, p.Leu344del depending on the reference sequence). The unaffected great-grandfather shared his X allele with the proband but he did not have the PIGA mutation, indicating that the mutation arose de novo in his daughter. A single family with a germline PIGA mutation has been reported; affected males had a phenotype characterized by multiple congenital anomalies and severe neurologic impairment resulting in infantile lethality. In contrast, affected boys in the family described here were born without anomalies and were neurologically normal prior to onset of seizures after 6 months of age, with two surviving to the second decade. PIGA encodes an enzyme in the GPI anchor biosynthesis pathway. An affected individual in the family studied here was deficient in GPI anchor proteins on granulocytes but not erythrocytes. In conclusion, the PIGA mutation in this family likely causes a reduction in GPI anchor protein cell surface expression in various cell types, resulting in the observed pleiotropic phenotype involving central nervous system, skin, and iron metabolism. © 2013 Wiley Periodicals, Inc.

Genetic Architecture of Vitamin B12 and Folate Levels Uncovered Applying Deeply Sequenced Large Datasets

PubMed Central

Thorleifsson, Gudmar; Ahluwalia, Tarunveer S.; Steinthorsdottir, Valgerdur; Bjarnason, Helgi; Gudbjartsson, Daniel F.; Magnusson, Olafur T.; Sparsø, Thomas; Albrechtsen, Anders; Kong, Augustine; Masson, Gisli; Tian, Geng; Cao, Hongzhi; Nie, Chao; Kristiansen, Karsten; Husemoen, Lise Lotte; Thuesen, Betina; Li, Yingrui; Nielsen, Rasmus; Linneberg, Allan; Olafsson, Isleifur; Eyjolfsson, Gudmundur I.; Jørgensen, Torben; Wang, Jun; Hansen, Torben; Thorsteinsdottir, Unnur; Stefánsson, Kari; Pedersen, Oluf

2013-01-01

Genome-wide association studies have mainly relied on common HapMap sequence variations. Recently, sequencing approaches have allowed analysis of low frequency and rare variants in conjunction with common variants, thereby improving the search for functional variants and thus the understanding of the underlying biology of human traits and diseases. Here, we used a large Icelandic whole genome sequence dataset combined with Danish exome sequence data to gain insight into the genetic architecture of serum levels of vitamin B12 (B12) and folate. Up to 22.9 million sequence variants were analyzed in combined samples of 45,576 and 37,341 individuals with serum B12 and folate measurements, respectively. We found six novel loci associating with serum B12 (CD320, TCN2, ABCD4, MMAA, MMACHC) or folate levels (FOLR3) and confirmed seven loci for these traits (TCN1, FUT6, FUT2, CUBN, CLYBL, MUT, MTHFR). Conditional analyses established that four loci contain additional independent signals. Interestingly, 13 of the 18 identified variants were coding and 11 of the 13 target genes have known functions related to B12 and folate pathways. Contrary to epidemiological studies we did not find consistent association of the variants with cardiovascular diseases, cancers or Alzheimer's disease although some variants demonstrated pleiotropic effects. Although to some degree impeded by low statistical power for some of these conditions, these data suggest that sequence variants that contribute to the population diversity in serum B12 or folate levels do not modify the risk of developing these conditions. Yet, the study demonstrates the value of combining whole genome and exome sequencing approaches to ascertain the genetic and molecular architectures underlying quantitative trait associations. PMID:23754956
RNA splicing. The human splicing code reveals new insights into the genetic determinants of disease.

PubMed

Xiong, Hui Y; Alipanahi, Babak; Lee, Leo J; Bretschneider, Hannes; Merico, Daniele; Yuen, Ryan K C; Hua, Yimin; Gueroussov, Serge; Najafabadi, Hamed S; Hughes, Timothy R; Morris, Quaid; Barash, Yoseph; Krainer, Adrian R; Jojic, Nebojsa; Scherer, Stephen W; Blencowe, Benjamin J; Frey, Brendan J

2015-01-09

To facilitate precision medicine and whole-genome annotation, we developed a machine-learning technique that scores how strongly genetic variants affect RNA splicing, whose alteration contributes to many diseases. Analysis of more than 650,000 intronic and exonic variants revealed widespread patterns of mutation-driven aberrant splicing. Intronic disease mutations that are more than 30 nucleotides from any splice site alter splicing nine times as often as common variants, and missense exonic disease mutations that have the least impact on protein function are five times as likely as others to alter splicing. We detected tens of thousands of disease-causing mutations, including those involved in cancers and spinal muscular atrophy. Examination of intronic and exonic variants found using whole-genome sequencing of individuals with autism revealed misspliced genes with neurodevelopmental phenotypes. Our approach provides evidence for causal variants and should enable new discoveries in precision medicine. Copyright © 2015, American Association for the Advancement of Science.
Network perturbation by recurrent regulatory variants in cancer

PubMed Central

Cho, Ara; Lee, Insuk; Choi, Jung Kyoon

2017-01-01

Cancer driving genes have been identified as recurrently affected by variants that alter protein-coding sequences. However, a majority of cancer variants arise in noncoding regions, and some of them are thought to play a critical role through transcriptional perturbation. Here we identified putative transcriptional driver genes based on combinatorial variant recurrence in cis-regulatory regions. The identified genes showed high connectivity in the cancer type-specific transcription regulatory network, with high outdegree and many downstream genes, highlighting their causative role during tumorigenesis. In the protein interactome, the identified transcriptional drivers were not as highly connected as coding driver genes but appeared to form a network module centered on the coding drivers. The coding and regulatory variants associated via these interactions between the coding and transcriptional drivers showed exclusive and complementary occurrence patterns across tumor samples. Transcriptional cancer drivers may act through an extensive perturbation of the regulatory network and by altering protein network modules through interactions with coding driver genes. PMID:28333928
L1-associated genomic regions are deleted in somatic cells of the healthy human brain.

PubMed

Erwin, Jennifer A; Paquola, Apuã C M; Singer, Tatjana; Gallina, Iryna; Novotny, Mark; Quayle, Carolina; Bedrosian, Tracy A; Alves, Francisco I A; Butcher, Cheyenne R; Herdy, Joseph R; Sarkar, Anindita; Lasken, Roger S; Muotri, Alysson R; Gage, Fred H

2016-12-01

The healthy human brain is a mosaic of varied genomes. Long interspersed element-1 (LINE-1 or L1) retrotransposition is known to create mosaicism by inserting L1 sequences into new locations of somatic cell genomes. Using a machine learning-based, single-cell sequencing approach, we discovered that somatic L1-associated variants (SLAVs) are composed of two classes: L1 retrotransposition insertions and retrotransposition-independent L1-associated variants. We demonstrate that a subset of SLAVs comprises somatic deletions generated by L1 endonuclease cutting activity. Retrotransposition-independent rearrangements in inherited L1s resulted in the deletion of proximal genomic regions. These rearrangements were resolved by microhomology-mediated repair, which suggests that L1-associated genomic regions are hotspots for somatic copy number variants in the brain and therefore a heritable genetic contributor to somatic mosaicism. We demonstrate that SLAVs are present in crucial neural genes, such as DLG2 (also called PSD93), and affect 44-63% of the cells in the healthy brain.
Whole genome sequences of two octogenarians with sustained cognitive abilities

PubMed Central

Nickles, Dorothee; Madireddy, Lohith; Patel, Nihar; Isobe, Noriko; Miller, Bruce L.; Baranzini, Sergio E.; Kramer, Joel H.; Oksenberg, Jorge R.

2014-01-01

Although numerous genetic variants affecting aging and mortality have been identified, e.g. APOE ε4, the genetic component influencing cognitive aging has not been fully defined yet. A better knowledge of the genetics of aging will prove helpful in understanding the underlying biological processes. Here, we describe the whole genome sequences of two female octogenarians. We provide the repertoire of genomic variants that the two octogenarians have in common. We also describe the overlap with the previously reported genomes of two supercentenarians - individuals aged ≥ 110 years. We assessed the genetic disease propensities of the octogenarians and non-aged control genomes and could not find support for the hypothesis that long-lived healthy individuals might exhibit greater genetic fitness than the general population. Furthermore, there is no evidence for an accumulation of previously described variants promoting longevity in the two octogenarians. These findings suggest that genetic fitness, as currently defined, is not the sole factor enabling an increased lifespan. We identified a number of healthy-cognitive-aging candidate genetic loci awaiting confirmation in larger studies. PMID:25618617
Whole genome sequences of 2 octogenarians with sustained cognitive abilities.

PubMed

Nickles, Dorothee; Madireddy, Lohith; Patel, Nihar; Isobe, Noriko; Miller, Bruce L; Baranzini, Sergio E; Kramer, Joel H; Oksenberg, Jorge R

2015-03-01

Although numerous genetic variants affecting aging and mortality have been identified, for example, apolipoprotein E ε4, the genetic component influencing cognitive aging has not been fully defined yet. A better knowledge of the genetics of aging will prove helpful in understanding the underlying biological processes. Here, we describe the whole genome sequences of 2 female octogenarians. We provide the repertoire of genomic variants that the 2 octogenarians have in common. We also describe the overlap with the previously reported genomes of 2 supercentenarians—individuals aged ≥110 years. We assessed the genetic disease propensities of the octogenarians and non-aged control genomes and could not find support for the hypothesis that long-lived healthy individuals might exhibit greater genetic fitness than the general population. Furthermore, there is no evidence for an accumulation of previously described variants promoting longevity in the 2 octogenarians. These findings suggest that genetic fitness, as currently defined, is not the sole factor enabling an increased life span. We identified a number of healthy-cognitive-aging candidate genetic loci awaiting confirmation in larger studies. Copyright © 2015 Elsevier Inc. All rights reserved.
Factors influencing success of clinical genome sequencing across a broad spectrum of disorders

PubMed Central

Lise, Stefano; Broxholme, John; Cazier, Jean-Baptiste; Rimmer, Andy; Kanapin, Alexander; Lunter, Gerton; Fiddy, Simon; Allan, Chris; Aricescu, A. Radu; Attar, Moustafa; Babbs, Christian; Becq, Jennifer; Beeson, David; Bento, Celeste; Bignell, Patricia; Blair, Edward; Buckle, Veronica J; Bull, Katherine; Cais, Ondrej; Cario, Holger; Chapel, Helen; Copley, Richard R; Cornall, Richard; Craft, Jude; Dahan, Karin; Davenport, Emma E; Dendrou, Calliope; Devuyst, Olivier; Fenwick, Aimée L; Flint, Jonathan; Fugger, Lars; Gilbert, Rodney D; Goriely, Anne; Green, Angie; Greger, Ingo H.; Grocock, Russell; Gruszczyk, Anja V; Hastings, Robert; Hatton, Edouard; Higgs, Doug; Hill, Adrian; Holmes, Chris; Howard, Malcolm; Hughes, Linda; Humburg, Peter; Johnson, David; Karpe, Fredrik; Kingsbury, Zoya; Kini, Usha; Knight, Julian C; Krohn, Jonathan; Lamble, Sarah; Langman, Craig; Lonie, Lorne; Luck, Joshua; McCarthy, Davis; McGowan, Simon J; McMullin, Mary Frances; Miller, Kerry A; Murray, Lisa; Németh, Andrea H; Nesbit, M Andrew; Nutt, David; Ormondroyd, Elizabeth; Oturai, Annette Bang; Pagnamenta, Alistair; Patel, Smita Y; Percy, Melanie; Petousi, Nayia; Piazza, Paolo; Piret, Sian E; Polanco-Echeverry, Guadalupe; Popitsch, Niko; Powrie, Fiona; Pugh, Chris; Quek, Lynn; Robbins, Peter A; Robson, Kathryn; Russo, Alexandra; Sahgal, Natasha; van Schouwenburg, Pauline A; Schuh, Anna; Silverman, Earl; Simmons, Alison; Sørensen, Per Soelberg; Sweeney, Elizabeth; Taylor, John; Thakker, Rajesh V; Tomlinson, Ian; Trebes, Amy; Twigg, Stephen RF; Uhlig, Holm H; Vyas, Paresh; Vyse, Tim; Wall, Steven A; Watkins, Hugh; Whyte, Michael P; Witty, Lorna; Wright, Ben; Yau, Chris; Buck, David; Humphray, Sean; Ratcliffe, Peter J; Bell, John I; Wilkie, Andrew OM; Bentley, David; Donnelly, Peter; McVean, Gilean

2015-01-01

To assess factors influencing the success of whole genome sequencing for mainstream clinical diagnosis, we sequenced 217 individuals from 156 independent cases across a broad spectrum of disorders in whom prior screening had identified no pathogenic variants. We quantified the number of candidate variants identified using different strategies for variant calling, filtering, annotation and prioritisation. We found that jointly calling variants across samples, filtering against both local and external databases, deploying multiple annotation tools and using familial transmission above biological plausibility contributed to accuracy. Overall, we identified disease causing variants in 21% of cases, rising to 34% (23/68) for Mendelian disorders and 57% (8/14) in trios. We also discovered 32 potentially clinically actionable variants in 18 genes unrelated to the referral disorder, though only four were ultimately considered reportable. Our results demonstrate the value of genome sequencing for routine clinical diagnosis, but also highlight many outstanding challenges. PMID:25985138
Whole genome sequences of a male and female supercentenarian, ages greater than 114 years.

PubMed

Sebastiani, Paola; Riva, Alberto; Montano, Monty; Pham, Phillip; Torkamani, Ali; Scherba, Eugene; Benson, Gary; Milton, Jacqueline N; Baldwin, Clinton T; Andersen, Stacy; Schork, Nicholas J; Steinberg, Martin H; Perls, Thomas T

2011-01-01

Supercentenarians (age 110+ years old) generally delay or escape age-related diseases and disability well beyond the age of 100 and this exceptional survival is likely to be influenced by a genetic predisposition that includes both common and rare genetic variants. In this report, we describe the complete genomic sequences of male and female supercentenarians, both age >114 years old. We show that: (1) the sequence variant spectrum of these two individuals' DNA sequences is largely comparable to existing non-supercentenarian genomes; (2) the two individuals do not appear to carry most of the well-established human longevity enabling variants already reported in the literature; (3) they have a comparable number of known disease-associated variants relative to most human genomes sequenced to-date; (4) approximately 1% of the variants these individuals possess are novel and may point to new genes involved in exceptional longevity; and (5) both individuals are enriched for coding variants near longevity-associated variants that we discovered through a large genome-wide association study. These analyses suggest that there are both common and rare longevity-associated variants that may counter the effects of disease-predisposing variants and extend lifespan. The continued analysis of the genomes of these and other rare individuals who have survived to extremely old ages should provide insight into the processes that contribute to the maintenance of health during extreme aging.
Whole Genome Sequences of a Male and Female Supercentenarian, Ages Greater than 114 Years

PubMed Central

Sebastiani, Paola; Riva, Alberto; Montano, Monty; Pham, Phillip; Torkamani, Ali; Scherba, Eugene; Benson, Gary; Milton, Jacqueline N.; Baldwin, Clinton T.; Andersen, Stacy; Schork, Nicholas J.; Steinberg, Martin H.; Perls, Thomas T.

2012-01-01

Supercentenarians (age 110+ years old) generally delay or escape age-related diseases and disability well beyond the age of 100 and this exceptional survival is likely to be influenced by a genetic predisposition that includes both common and rare genetic variants. In this report, we describe the complete genomic sequences of male and female supercentenarians, both age >114 years old. We show that: (1) the sequence variant spectrum of these two individuals’ DNA sequences is largely comparable to existing non-supercentenarian genomes; (2) the two individuals do not appear to carry most of the well-established human longevity enabling variants already reported in the literature; (3) they have a comparable number of known disease-associated variants relative to most human genomes sequenced to-date; (4) approximately 1% of the variants these individuals possess are novel and may point to new genes involved in exceptional longevity; and (5) both individuals are enriched for coding variants near longevity-associated variants that we discovered through a large genome-wide association study. These analyses suggest that there are both common and rare longevity-associated variants that may counter the effects of disease-predisposing variants and extend lifespan. The continued analysis of the genomes of these and other rare individuals who have survived to extremely old ages should provide insight into the processes that contribute to the maintenance of health during extreme aging. PMID:22303384
Pulmonary Nontuberculous Mycobacterial Infection. A Multisystem, Multigenic Disease.

PubMed

Szymanski, Eva P; Leung, Janice M; Fowler, Cedar J; Haney, Carissa; Hsu, Amy P; Chen, Fei; Duggal, Priya; Oler, Andrew J; McCormack, Ryan; Podack, Eckhard; Drummond, Rebecca A; Lionakis, Michail S; Browne, Sarah K; Prevots, D Rebecca; Knowles, Michael; Cutting, Gary; Liu, Xinyue; Devine, Scott E; Fraser, Claire M; Tettelin, Hervé; Olivier, Kenneth N; Holland, Steven M

2015-09-01

The clinical features of patients infected with pulmonary nontuberculous mycobacteria (PNTM) are well described, but the genetic components of infection susceptibility are not. To examine genetic variants in patients with PNTM, their unaffected family members, and a control group. Whole-exome sequencing was done on 69 white patients with PNTM and 18 of their white unaffected family members. We performed a candidate gene analysis using immune, cystic fibrosis transmembrane conductance regulator (CFTR), cilia, and connective tissue gene sets. The numbers of patients, family members, and control subjects with variants in each category were compared, as was the average number of variants per person. A significantly higher number of patients with PNTM than the other subjects had low-frequency, protein-affecting variants in immune, CFTR, cilia, and connective tissue categories (35, 26, 90, and 90%, respectively). Patients with PNTM also had significantly more cilia and connective tissue variants per person than did control subjects (2.47 and 2.55 compared with 1.38 and 1.40, respectively; P = 1.4 × 10⁻⁶ and P = 2.7 × 10⁻⁸, respectively). Patients with PNTM had an average of 5.26 variants across all categories (1.98 in control subjects; P = 2.8 × 10⁻¹⁷), and they were more likely than control subjects to have variants in multiple categories. We observed similar results for family members without PNTM infection, with the exception of the immune category. Patients with PNTM have more low-frequency, protein-affecting variants in immune, CFTR, cilia, and connective tissue genes than their unaffected family members and control subjects. We propose that PNTM infection is a multigenic disease in which combinations of variants across gene categories, plus environmental exposures, increase susceptibility to the infection.
Short Stature, Accelerated Bone Maturation, and Early Growth Cessation Due to Heterozygous Aggrecan Mutations

PubMed Central

Nilsson, Ola; Guo, Michael H.; Dunbar, Nancy; Popovic, Jadranka; Flynn, Daniel; Jacobsen, Christina; Lui, Julian C.; Hirschhorn, Joel N.; Baron, Jeffrey

2014-01-01

Context: Many children with idiopathic short stature have a delayed bone age. Idiopathic short stature with advanced bone age is far less common. Objective: The aim was to identify underlying genetic causes of short stature with advanced bone age. Setting and Design: We used whole-exome sequencing to study three families with autosomal-dominant short stature, advanced bone age, and premature growth cessation. Results: Affected individuals presented with short stature [adult heights −2.3 to −4.2 standard deviation scores (SDS)] with histories of early growth cessation or childhood short stature (height SDS −1.9 to −3.5 SDS), advancement of bone age, and normal endocrine evaluations. Whole-exome sequencing identified novel heterozygous variants in ACAN, which encodes aggrecan, a proteoglycan in the extracellular matrix of growth plate and other cartilaginous tissues. The variants were present in all affected, but in no unaffected, family members. In Family 1, a novel frameshift mutation in exon 3 (c.272delA) was identified, which is predicted to cause early truncation of the aggrecan protein. In Family 2, a base-pair substitution was found in a highly conserved location within a splice donor site (c.2026+1G>A), which is also likely to alter the amino acid sequence of a large portion of the protein. In Family 3, a missense variant (c.7064T>C) in exon 14 affects a highly conserved residue (L2355P) and is strongly predicted to perturb protein function. Conclusions: Our study demonstrates that heterozygous mutations in ACAN can cause a mild skeletal dysplasia, which presents clinically as short stature with advanced bone age. The accelerating effect on skeletal maturation has not previously been noted in the few prior reports of human ACAN mutations. Our findings thus expand the spectrum of ACAN defects and provide a new molecular genetic etiology for the unusual child who presents with short stature and accelerated skeletal maturation. PMID:24762113
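The pairing of a coding-DNA position with an amino acid residue number, as in c.7064T>C and L2355P above, can be checked with simple arithmetic, because coding positions are numbered from the first base of the start codon and each codon spans three bases. The sketch below is a minimal illustration; the function names are hypothetical, and it applies only to exonic coding positions, not to intronic offsets such as c.2026+1.

```python
def codon_number(cdna_position):
    """Map a 1-based coding-DNA position (the N in c.N) to its 1-based codon number."""
    if cdna_position < 1:
        raise ValueError("coding positions start at 1")
    return (cdna_position - 1) // 3 + 1


def position_in_codon(cdna_position):
    """Return 1, 2 or 3: where the nucleotide sits inside its codon."""
    if cdna_position < 1:
        raise ValueError("coding positions start at 1")
    return (cdna_position - 1) % 3 + 1


# Missense variant from Family 3: c.7064T>C
residue = codon_number(7064)        # codon affected by the substitution
offset = position_in_codon(7064)    # base within that codon
```

Here `residue` evaluates to 2355, matching the reported L2355P, and `offset` is 2, meaning the substitution falls on the middle base of the codon.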
BETASEQ: a powerful novel method to control type-I error inflation in partially sequenced data for rare variant association testing.

PubMed

Yan, Song; Li, Yun

2014-02-15

Despite its great capability to detect rare variant associations, next-generation sequencing is still prohibitively expensive when applied to large samples. In case-control studies, it is thus appealing to sequence only a subset of cases to discover variants and genotype the identified variants in controls and the remaining cases under the reasonable assumption that causal variants are usually enriched among cases. However, this approach leads to inflated type-I error if analyzed naively for rare variant association. Several methods have been proposed in recent literature to control type-I error at the cost of either excluding some sequenced cases or correcting the genotypes of discovered rare variants. All of these approaches thus suffer from a certain extent of information loss and are therefore underpowered. We propose a novel method (BETASEQ), which corrects inflation of type-I error by supplementing pseudo-variants while keeping the original sequence and genotype data intact. Extensive simulations and real data analysis demonstrate that, in most practical situations, BETASEQ leads to higher testing power than existing approaches with guaranteed (controlled or conservative) type-I error. BETASEQ and associated R files, including documentation and examples, are available at http://www.unc.edu/~yunmli/betaseq
Telomere extension by telomerase and ALT generates variant repeats by mechanistically distinct processes

PubMed Central

Lee, Michael; Hills, Mark; Conomos, Dimitri; Stutz, Michael D.; Dagg, Rebecca A.; Lau, Loretta M.S.; Reddel, Roger R.; Pickett, Hilda A.

2014-01-01

Telomeres are terminal repetitive DNA sequences on chromosomes, and are considered to comprise almost exclusively hexameric TTAGGG repeats. We have evaluated telomere sequence content in human cells using whole-genome sequencing followed by telomere read extraction in a panel of mortal cell strains and immortal cell lines. We identified a wide range of telomere variant repeats in human cells, and found evidence that variant repeats are generated by mechanistically distinct processes during telomerase- and ALT-mediated telomere lengthening. Telomerase-mediated telomere extension resulted in biased repeat synthesis of variant repeats that differed from the canonical sequence at positions 1 and 3, but not at positions 2, 4, 5 or 6. This indicates that telomerase is most likely an error-prone reverse transcriptase that misincorporates nucleotides at specific positions on the telomerase RNA template. In contrast, cell lines that use the ALT pathway contained a large range of variant repeats that varied greatly between lines. This is consistent with variant repeats spreading from proximal telomeric regions throughout telomeres in a stochastic manner by recombination-mediated templating of DNA synthesis. The presence of unexpectedly large numbers of variant repeats in cells utilizing either telomere maintenance mechanism suggests a conserved role for variant sequences at human telomeres. PMID:24225324
Detecting very low allele fraction variants using targeted DNA sequencing and a novel molecular barcode-aware variant caller.

PubMed

Xu, Chang; Nezami Ranjbar, Mohammad R; Wu, Zhong; DiCarlo, John; Wang, Yexun

2017-01-03

Detection of DNA mutations at very low allele fractions with high accuracy will significantly improve the effectiveness of precision medicine for cancer patients. To achieve this goal through next generation sequencing, researchers need a detection method that 1) captures rare mutation-containing DNA fragments efficiently in the mix of abundant wild-type DNA; 2) sequences the DNA library extensively to deep coverage; and 3) distinguishes low level true variants from amplification and sequencing errors with high accuracy. Targeted enrichment using PCR primers provides researchers with a convenient way to achieve deep sequencing for a small, yet most relevant region using benchtop sequencers. Molecular barcoding (or indexing) provides a unique solution for reducing sequencing artifacts analytically. Although different molecular barcoding schemes have been reported in recent literature, most variant calling has been done on limited targets, using simple custom scripts. The analytical performance of barcode-aware variant calling can be significantly improved by incorporating advanced statistical models. We present here a highly efficient, simple and scalable enrichment protocol that integrates molecular barcodes in multiplex PCR amplification. In addition, we developed smCounter, an open source, generic, barcode-aware variant caller based on a Bayesian probabilistic model. smCounter was optimized and benchmarked on two independent read sets with SNVs and indels at 5 and 1% allele fractions. Variants were called with very good sensitivity and specificity within coding regions. We demonstrated that we can accurately detect somatic mutations with allele fractions as low as 1% in coding regions using our enrichment protocol and variant caller.
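As a rough illustration of why molecular barcodes help separate true low-fraction variants from amplification and sequencing errors, the sketch below collapses reads that share a barcode into one consensus base by majority vote and then computes the allele fraction over barcodes rather than over raw reads. This is not smCounter's Bayesian model; the function names, the `min_reads` and `min_agreement` thresholds, and the toy reads are all assumptions made for this example.

```python
from collections import Counter, defaultdict


def barcode_consensus(reads, min_reads=2, min_agreement=0.8):
    """Collapse reads at one position into a consensus base per barcode.

    reads: iterable of (barcode, base) pairs observed at a single position.
    Barcodes with fewer than min_reads reads, or whose most common base
    accounts for less than min_agreement of their reads, are discarded.
    (Both thresholds are hypothetical defaults, not published values.)
    """
    by_barcode = defaultdict(list)
    for barcode, base in reads:
        by_barcode[barcode].append(base)

    consensus = {}
    for barcode, bases in by_barcode.items():
        if len(bases) < min_reads:
            continue  # a single read cannot be checked against its copies
        base, count = Counter(bases).most_common(1)[0]
        if count / len(bases) >= min_agreement:
            consensus[barcode] = base
    return consensus


def allele_fraction(consensus, alt):
    """Fraction of retained barcodes whose consensus base equals alt."""
    if not consensus:
        return 0.0
    return sum(1 for b in consensus.values() if b == alt) / len(consensus)


# Toy input: bc2 is internally inconsistent and bc4 has a single read.
toy_reads = [
    ("bc1", "A"), ("bc1", "A"), ("bc1", "A"),
    ("bc2", "A"), ("bc2", "A"), ("bc2", "T"),
    ("bc3", "T"), ("bc3", "T"),
    ("bc4", "G"),
]
toy_consensus = barcode_consensus(toy_reads)
```

In this toy input the lone `T` inside `bc2` is treated as a likely error and the whole barcode is dropped, whereas the `T` supported by every read of `bc3` is kept.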
In vitro digestion of purified β-casein variants A(1), A(2), B, and I: effects on antioxidant and angiotensin-converting enzyme inhibitory capacity.

PubMed

Petrat-Melin, B; Andersen, P; Rasmussen, J T; Poulsen, N A; Larsen, L B; Young, J F

2015-01-01

Genetic polymorphisms of bovine milk proteins affect the protein profile of the milk and, hence, certain technological properties, such as casein (CN) number and cheese yield. However, reports show that such polymorphisms may also affect the health-related properties of milk. Therefore, to gain insight into their digestion pattern and bioactive potential, β-CN was purified from bovine milk originating from cows homozygous for the variants A(1), A(2), B, and I by a combination of cold storage, ultracentrifugation, and acid precipitation. The purity of the isolated β-CN was determined by HPLC, variants were verified by mass spectrometry, and molar extinction coefficients at λ=280nm were determined. β-Casein from each of the variants was subjected to in vitro digestion using pepsin and pancreatic enzymes. Antioxidant and angiotensin-converting enzyme (ACE) inhibitory capacities of the hydrolysates were assessed at 3 stages of digestion and related to that of the undigested samples. Neither molar extinction coefficients nor overall digestibility varied significantly between these 4 variants; however, clear differences in digestion pattern were indicated by gel electrophoresis. In particular, after 60min of pepsin followed by 5min of pancreatic enzyme digestion, one ≈4kDa peptide with the N-terminal sequence (106)H-K-E-M-P-F-P-K- was absent from β-CN variant B. This is likely a result of the (122)Ser to (122)Arg substitution in variant B introducing a novel trypsin cleavage site, leading to the changed digestion pattern. All investigated β-CN variants exhibited a significant increase in antioxidant capacity upon digestion, as measured by the Trolox-equivalent antioxidant capacity assay. After 60min of pepsin + 120min of pancreatic enzyme digestion, the accumulated increase in antioxidant capacity was ≈1.7-fold for the 4 β-CN variants. 
The ACE inhibitory capacity was also significantly increased by digestion, with the B variant reaching the highest inhibitory capacity at the end of digestion (60min of pepsin + 120min of pancreatic enzymes), possibly because of the observed alternative digestion pattern. These results demonstrate that genetic polymorphisms affect the digestion pattern and bioactivity of milk proteins. Moreover, their capacity for radical scavenging and ACE inhibition is affected by digestion. Copyright © 2015 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Next-generation sequencing using a pre-designed gene panel for the molecular diagnosis of congenital disorders in pediatric patients.

PubMed

Lim, Eileen C P; Brett, Maggie; Lai, Angeline H M; Lee, Siew-Peng; Tan, Ee-Shien; Jamuar, Saumya S; Ng, Ivy S L; Tan, Ene-Choo

2015-12-14

Next-generation sequencing (NGS) has revolutionized genetic research and offers enormous potential for clinical application. Sequencing the exome has the advantage of casting the net wide for all known coding regions while targeted gene panel sequencing provides enhanced sequencing depths and can be designed to avoid incidental findings in adult-onset conditions. A HaloPlex panel consisting of 180 genes within commonly altered chromosomal regions is available for use on both the Ion Personal Genome Machine (PGM) and MiSeq platforms to screen for causative mutations in these genes. We used this Haloplex ICCG panel for targeted sequencing of 15 patients with clinical presentations indicative of an abnormality in one of the 180 genes. Sequencing runs were done using the Ion 318 Chips on the Ion Torrent PGM. Variants were filtered for known polymorphisms and analysis was done to identify possible disease-causing variants before validation by Sanger sequencing. When possible, segregation of variants with phenotype in family members was performed to ascertain the pathogenicity of the variant. More than 97% of the target bases were covered at >20×. There was an average of 9.6 novel variants per patient. Pathogenic mutations were identified in five genes for six patients, with two novel variants. There were another five likely pathogenic variants, some of which were unreported novel variants. In a cohort of 15 patients, we were able to identify a likely genetic etiology in six patients (40%). Another five patients had candidate variants for which further evaluation and segregation analysis are ongoing. Our results indicate that the HaloPlex ICCG panel is useful as a rapid, high-throughput and cost-effective screening tool for 170 of the 180 genes. There is low coverage for some regions in several genes which might have to be supplemented by Sanger sequencing. 
However, comparing the cost, ease of analysis, and shorter turnaround time, it is a good alternative to exome sequencing for patients whose features are suggestive of a genetic etiology involving one of the genes in the panel.
Human papillomavirus type 16 variants in cervical intraepithelial neoplasia and invasive carcinoma in San Luis Potosí City, Mexico

PubMed Central

López-Revilla, Rubén; Pineda, Marco A; Ortiz-Valdez, Julio; Sánchez-Garza, Mireya; Riego, Lina

2009-01-01

Background: In San Luis Potosí City, cervical infection by human papillomavirus type 16 (HPV16) associated with dysplastic lesions is more prevalent in younger women. In this work, HPV16 subtypes and variants associated with low-grade intraepithelial lesions (LSIL), high-grade intraepithelial lesions (HSIL) and invasive cervical cancer (ICC) in 38 women residing in San Luis Potosí City were identified by comparing their E6 open reading frame sequences. Results: Three European (E) variants (E-P, n = 27; E-T350G, n = 7; E-C188G, n = 2) and one AA-a variant (n = 2) were identified among the 38 HPV16 sequences analyzed. E-P variant sequences contained 23 single nucleotide changes, two of which (A334G, A404T) had not been described before and allowed the phylogenetic separation from the other variants. E-P A334G sequences were the most prevalent (22 cases, 57.9%), followed by the E-P Ref prototype (8 cases, 21.1%) and E-P A404T (1 case, 2.6%) sequences. The HSIL + ICC fraction was 0.21 for the E-P A334G variants and 0.00 for the E-P Ref variants. Conclusion: We conclude that in the women included in this study the HPV16 E subtype is 19 times more frequent than the AA subtype; that the circulating E variants are E-P (71.1%) > E-T350G (18.4%) > E-C188G (5.3%); and that 71.0% of the E-P sequences carry the A334G single nucleotide change and appear to correspond to an HPV16 variant characteristic of San Luis Potosí City that is more oncogenic than the E-P Ref prototype. PMID:19216802
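Variant labels such as A334G and T350G encode the reference base, the nucleotide position, and the observed base. A minimal sketch of how such labels could be derived by comparing an aligned sample sequence with a reference is given below. The function name, the `first_position` parameter, and the three-base toy sequences are assumptions for illustration only; they are not real HPV16 E6 sequence, and real data would first need alignment and indel handling.

```python
def name_single_nucleotide_changes(reference, sample, first_position=1):
    """List substitutions between two aligned, equal-length sequences.

    Each change is written as <reference base><position><sample base>,
    where first_position is the coordinate of the first base supplied.
    """
    if len(reference) != len(sample):
        raise ValueError("sequences must be aligned to the same length")
    return [
        f"{ref_base}{position}{sample_base}"
        for position, (ref_base, sample_base) in enumerate(
            zip(reference, sample), start=first_position
        )
        if ref_base != sample_base
    ]


# Toy fragment assumed to start at coordinate 333 (made-up bases).
toy_changes = name_single_nucleotide_changes("TAG", "TGG", first_position=333)
```

With these made-up bases the only difference falls at coordinate 334, so the label produced has the same form as the A334G change named in the abstract.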
Imputation of Exome Sequence Variants into Population-Based Samples and Blood-Cell-Trait-Associated Loci in African Americans: NHLBI GO Exome Sequencing Project

PubMed Central

Auer, Paul L.; Johnsen, Jill M.; Johnson, Andrew D.; Logsdon, Benjamin A.; Lange, Leslie A.; Nalls, Michael A.; Zhang, Guosheng; Franceschini, Nora; Fox, Keolu; Lange, Ethan M.; Rich, Stephen S.; O’Donnell, Christopher J.; Jackson, Rebecca D.; Wallace, Robert B.; Chen, Zhao; Graubert, Timothy A.; Wilson, James G.; Tang, Hua; Lettre, Guillaume; Reiner, Alex P.; Ganesh, Santhi K.; Li, Yun

2012-01-01

Researchers have successfully applied exome sequencing to discover causal variants in selected individuals with familial, highly penetrant disorders. We demonstrate the utility of exome sequencing followed by imputation for discovering low-frequency variants associated with complex quantitative traits. We performed exome sequencing in a reference panel of 761 African Americans and then imputed newly discovered variants into a larger sample of more than 13,000 African Americans for association testing with the blood cell traits hemoglobin, hematocrit, white blood count, and platelet count. First, we illustrate the feasibility of our approach by demonstrating genome-wide-significant associations for variants that are not covered by conventional genotyping arrays; for example, one such association is that between higher platelet count and an MPL c.117G>T (p.Lys39Asn) variant encoding a p.Lys39Asn amino acid substitution of the thrombopoietin receptor gene (p = 1.5 × 10^-11). Second, we identified an association between missense variants of LCT and higher white blood count (p = 4 × 10^-13). Third, we identified low-frequency coding variants that might account for allelic heterogeneity at several known blood cell-associated loci: MPL c.754T>C (p.Tyr252His) was associated with higher platelet count; CD36 c.975T>G (p.Tyr325*) was associated with lower platelet count; and several missense variants at the α-globin gene locus were associated with lower hemoglobin. By identifying low-frequency missense variants associated with blood cell traits not previously reported by genome-wide association studies, we establish that exome sequencing followed by imputation is a powerful approach to dissecting complex, genetically heterogeneous traits in large population-based studies. PMID:23103231
Identification of a de novo variant in CHUK in a patient with an EEC/AEC syndrome-like phenotype and hypogammaglobulinemia.

PubMed

Khandelwal, Kriti D; Ockeloen, Charlotte W; Venselaar, Hanka; Boulanger, Cécile; Brichard, Bénédicte; Sokal, Etienne; Pfundt, Rolph; Rinne, Tuula; van Beusekom, Ellen; Bloemen, Marjon; Vriend, Gerrit; Revencu, Nicole; Carels, Carine E L; van Bokhoven, Hans; Zhou, Huiqing

2017-05-17

The cardinal features of Ectrodactyly, Ectodermal dysplasia, Cleft lip/palate (EEC), and Ankyloblepharon-Ectodermal defects-Cleft lip/palate (AEC) syndromes are ectodermal dysplasia (ED), orofacial clefting, and limb anomalies. EEC and AEC are caused by heterozygous mutations in the transcription factor p63 encoded by TP63. Here, we report a patient with an EEC/AEC syndrome-like phenotype, including ankyloblepharon, ED, cleft palate, ectrodactyly, syndactyly, additional hypogammaglobulinemia, and growth delay. Neither pathogenic mutations in TP63 nor CNVs at the TP63 locus were identified. Exome sequencing revealed de novo heterozygous variants in CHUK (conserved helix-loop-helix ubiquitous kinase), PTGER4, and IFIT2. While the variant in PTGER4 might contribute to the immunodeficiency and growth delay, the variant in CHUK appeared to be most relevant for the EEC/AEC-like phenotype. CHUK is a direct target gene of p63 and encodes a component of the IKK complex that plays a key role in NF-κB pathway activation. The identified CHUK variant (g.101980394T>C; c.425A>G; p.His142Arg) is located in the kinase domain which is responsible for the phosphorylation activity of the protein. The variant may affect CHUK function and thus contribute to the disease phenotype in three ways: (1) the variant exhibits a dominant negative effect and results in an inactive IKK complex that affects the canonical NF-κB pathway; (2) it affects the feedback loop of the canonical and non-canonical NF-κB pathways that are CHUK kinase activity-dependent; and (3) it disrupts NF-κB independent epidermal development that is often p63-dependent. Therefore, we propose that the heterozygous CHUK variant is highly likely to be causative to the EEC/AEC-like and additional hypogammaglobulinemia phenotypes in the patient presented here. © 2017 Wiley Periodicals, Inc.
Molecular diagnosis of putative Stargardt disease probands by exome sequencing

PubMed Central

2012-01-01

Background: The commonest genetic form of juvenile or early adult onset macular degeneration is Stargardt Disease (STGD) caused by recessive mutations in the gene ABCA4. However, high phenotypic and allelic heterogeneity and a small but non-trivial amount of locus heterogeneity currently impede conclusive molecular diagnosis in a significant proportion of cases. Methods: We performed whole exome sequencing (WES) of nine putative Stargardt Disease probands and searched for potentially disease-causing genetic variants in previously identified retinal or macular dystrophy genes. Follow-up dideoxy sequencing was performed for confirmation and to screen for mutations in an additional set of affected individuals lacking a definitive molecular diagnosis. Results: Whole exome sequencing revealed seven likely disease-causing variants across four genes, providing a confident genetic diagnosis in six previously uncharacterized participants. We identified four previously missed mutations in ABCA4 across three individuals. Likely disease-causing mutations in RDS/PRPH2, ELOVL, and CRB1 were also identified. Conclusions: Our findings highlight the enormous potential of whole exome sequencing in Stargardt Disease molecular diagnosis and research. WES adequately assayed all coding sequences and canonical splice sites of ABCA4 in this study. Additionally, WES enables the identification of disease-related alleles in other genes. This work highlights the importance of collecting parental genetic material for WES testing as the current knowledge of human genome variation limits the determination of causality between identified variants and disease. While larger sample sizes are required to establish the precision and accuracy of this type of testing, this study supports WES for inherited early onset macular degeneration disorders as an alternative to standard mutation screening techniques. PMID:22863181

Deep sequencing of hepatitis C virus hypervariable region 1 reveals no correlation between genetic heterogeneity and antiviral treatment outcome

PubMed Central

2014-01-01

Background: Hypervariable region 1 (HVR1), contained within the envelope protein 2 (E2) gene, is the most variable part of the HCV genome, and its translation product is a major target for the host immune response. Variability within HVR1 may facilitate evasion of the immune response and could affect treatment outcome. The aim of the study was to analyze the impact of HVR1 heterogeneity, assessed by sensitive ultra-deep sequencing, on the outcome of PEG-IFN-α (pegylated interferon α) and ribavirin treatment. Methods: HVR1 sequences were amplified from pretreatment serum samples of 25 patients infected with genotype 1b HCV (12 responders and 13 non-responders) and were subjected to pyrosequencing (GS Junior, 454/Roche). Reads were corrected for sequencing error using ShoRAH software, while population reconstruction was done using three different minimal variant frequency cut-offs of 1%, 2% and 5%. Statistical analysis was done using Mann–Whitney and Fisher’s exact tests. Results: Complexity, Shannon entropy, nucleotide diversity per site, genetic distance and the number of genetic substitutions were not significantly different between responders and non-responders when analyzing viral populations at any of the three frequencies (≥1%, ≥2% and ≥5%). When a clonal sample was used to determine pyrosequencing error, 4% of reads were found to be incorrect and the most abundant variant was present at a frequency of 1.48%. Use of ShoRAH reduced the sequencing error to 1%, with the most abundant erroneous variant present at a frequency of 0.5%. Conclusions: While deep sequencing revealed complex genetic heterogeneity of HVR1 in chronic hepatitis C patients, there was no correlation between treatment outcome and any of the analyzed quasispecies parameters. PMID:25016390
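To make the effect of a minimal variant frequency cut-off on a diversity measure concrete, the sketch below computes Shannon entropy over the variants of one viral population after discarding those below a chosen frequency. It is an illustrative assumption about how such a calculation could be set up, not the study's pipeline: the function name, the renormalization of the remaining frequencies, the use of the natural logarithm, and the toy counts are all choices made for this example.

```python
import math


def shannon_entropy(variant_counts, min_freq=0.01):
    """Shannon entropy (natural log) of variants at or above min_freq.

    variant_counts: mapping of variant identifier -> read count.
    Variants whose frequency among all reads is below min_freq are
    dropped, and the remaining frequencies are renormalized to sum to 1
    (an assumption of this sketch).
    """
    total = sum(variant_counts.values())
    if total == 0:
        return 0.0
    kept = [c for c in variant_counts.values() if c > 0 and c / total >= min_freq]
    kept_total = sum(kept)
    if kept_total == 0:
        return 0.0
    # abs() only normalizes the -0.0 produced when a single variant remains
    return abs(-sum((c / kept_total) * math.log(c / kept_total) for c in kept))


# Toy population: one dominant variant, one minor variant, one very rare one.
toy_counts = {"v1": 900, "v2": 97, "v3": 3}
entropy_by_cutoff = {
    cutoff: shannon_entropy(toy_counts, cutoff) for cutoff in (0.01, 0.05, 0.10)
}
```

Raising the cut-off removes low-frequency variants, which are also the ones most likely to be sequencing errors, so the entropy of the same sample can only stay the same or fall as the cut-off increases in this toy case.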
Sequence variants in oxytocin pathway genes and preterm birth: a candidate gene association study

PubMed Central

2013-01-01

Background: Preterm birth (PTB) is a complex disorder associated with significant neonatal mortality and morbidity and long-term adverse health consequences. Multiple lines of evidence suggest that genetic factors play an important role in its etiology. This study was designed to identify genetic variation associated with PTB in oxytocin pathway genes whose role in parturition is well known. Methods: To identify common genetic variants predisposing to PTB, we genotyped 16 single nucleotide polymorphisms (SNPs) in the oxytocin (OXT), oxytocin receptor (OXTR), and leucyl/cystinyl aminopeptidase (LNPEP) genes in 651 case infants from the U.S. and one or both of their parents. In addition, we examined the role of rare genetic variation in susceptibility to PTB by conducting direct sequence analysis of OXTR in 1394 cases and 1112 controls from the U.S., Argentina, Denmark, and Finland. This study was further extended to maternal triads (maternal grandparents-mother of a case infant, N=309). We also performed in vitro analysis of selected rare OXTR missense variants to evaluate their functional importance. Results: Maternal genetic effect analysis of the SNP genotype data revealed four SNPs in LNPEP that show significant association with prematurity. In our case–control sequence analysis, we detected fourteen coding variants in exon 3 of OXTR, all but four of which were found in cases only. Of the fourteen variants, three were previously unreported novel rare variants. When the sequence data from the maternal triads were analyzed using the transmission disequilibrium test, two common missense SNPs (rs4686302 and rs237902) in OXTR showed suggestive association for three gestational age subgroups. In vitro functional assays showed a significant difference in ligand binding between wild-type and two mutant receptors. Conclusions: Our study suggests an association between maternal common polymorphisms in LNPEP and susceptibility to PTB. 
Maternal OXTR missense SNPs rs4686302 and rs237902 may have gestational age-dependent effects on prematurity. Most of the OXTR rare variants identified do not appear to significantly contribute to the risk of PTB, but those shown to affect receptor function in our in vitro study warrant further investigation. Future studies with larger sample sizes are needed to confirm the findings of this study. PMID:23889750
Investigation of the role of TCF4 rare sequence variants in schizophrenia.

PubMed

Basmanav, F Buket; Forstner, Andreas J; Fier, Heide; Herms, Stefan; Meier, Sandra; Degenhardt, Franziska; Hoffmann, Per; Barth, Sandra; Fricker, Nadine; Strohmaier, Jana; Witt, Stephanie H; Ludwig, Michael; Schmael, Christine; Moebus, Susanne; Maier, Wolfgang; Mössner, Rainald; Rujescu, Dan; Rietschel, Marcella; Lange, Christoph; Nöthen, Markus M; Cichon, Sven

2015-07-01

Transcription factor 4 (TCF4) is one of the most robust of all reported schizophrenia risk loci and is supported by several genetic and functional lines of evidence. While numerous studies have implicated common genetic variation at TCF4 in schizophrenia risk, the role of rare, small-sized variants at this locus (such as single nucleotide variants and short indels, which are below the resolution of chip-based arrays) requires further exploration. The aim of the present study was to investigate the association between rare TCF4 sequence variants and schizophrenia. Exon-targeted resequencing was performed in 190 German schizophrenia patients. Six rare variants at the coding exons and flanking sequences of the TCF4 gene were identified, including two missense variants and one splice site variant. These six variants were then pooled with nine additional rare variants identified in 379 European participants of the 1000 Genomes Project, and all 15 variants were genotyped in an independent German sample (n = 1,808 patients; n = 2,261 controls). These data were then analyzed using six statistical methods developed for the association analysis of rare variants. No significant association (P < 0.05) was found. However, the results from our association and power analyses suggest that further research into the possible involvement of rare TCF4 sequence variants in schizophrenia risk is warranted by the assessment of larger cohorts with higher statistical power to identify rare variant associations. © 2015 Wiley Periodicals, Inc.
Investigation of rare and low-frequency variants using high-throughput sequencing with pooled DNA samples

PubMed Central

Wang, Jingwen; Skoog, Tiina; Einarsdottir, Elisabet; Kaartokallio, Tea; Laivuori, Hannele; Grauers, Anna; Gerdhem, Paul; Hytönen, Marjo; Lohi, Hannes; Kere, Juha; Jiao, Hong

2016-01-01

High-throughput sequencing using pooled DNA samples can facilitate genome-wide studies on rare and low-frequency variants in a large population. Some major questions concerning the pooling sequencing strategy are whether rare and low-frequency variants can be detected reliably, and whether estimated minor allele frequencies (MAFs) can represent the actual values obtained from individually genotyped samples. In this study, we evaluated MAF estimates using three variant detection tools with two sets of pooled whole exome sequencing (WES) and one set of pooled whole genome sequencing (WGS) data. Both GATK and Freebayes displayed high sensitivity, specificity and accuracy when detecting rare or low-frequency variants. For the WGS study, 56% of the low-frequency variants in Illumina array have identical MAFs and 26% have one allele difference between sequencing and individual genotyping data. The MAF estimates from WGS correlated well (r = 0.94) with those from Illumina arrays. The MAFs from the pooled WES data also showed high concordance (r = 0.88) with those from the individual genotyping data. In conclusion, the MAFs estimated from pooled DNA sequencing data reflect the MAFs in individually genotyped samples well. The pooling strategy can thus be a rapid and cost-effective approach for the initial screening in large-scale association studies. PMID:27633116
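The comparison described above, between MAFs estimated from pooled reads and MAFs from individual genotyping, can be sketched as follows. This is a simplified illustration under stated assumptions: diploid individuals contributing equal amounts of DNA to the pool, read fractions used directly as allele frequencies, and a plain Pearson correlation. All function names and numbers are hypothetical and do not reproduce the GATK or Freebayes workflows evaluated in the study.

```python
import math


def pooled_maf(alt_reads, total_reads):
    """Minor allele frequency estimated from read counts in a DNA pool."""
    if total_reads <= 0:
        raise ValueError("total_reads must be positive")
    freq = alt_reads / total_reads
    return min(freq, 1.0 - freq)  # fold so the minor allele is reported


def allele_count_difference(pooled, genotyped, n_individuals):
    """Difference between two MAFs expressed in whole alleles.

    A pool of n diploid individuals carries 2n alleles, so a frequency
    difference of 1 / (2n) corresponds to one allele.
    """
    return round(abs(pooled - genotyped) * 2 * n_individuals)


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / math.sqrt(var_x * var_y)
```

For example, with a hypothetical pool of 50 individuals (100 alleles), a pooled estimate of 0.05 against a genotyped MAF of 0.04 is a difference of one allele, which is the kind of discrepancy the abstract counts separately from identical MAFs.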
Expanding the clinical and genetic spectra of NKX6-2-related disorder.

PubMed

Baldi, C; Bertoli-Avella, A M; Al-Sannaa, N; Alfadhel, M; Al-Thihli, K; Alameer, S; Elmonairy, A A; Al Shamsi, A M; Abdelrahman, H A; Al-Gazali, L; Shawli, A; Al-Hakami, F; Yavuz, H; Kandaswamy, K K; Rolfs, A; Brandau, O; Bauer, P

2018-05-01

Hypomyelinating leukodystrophies (HLDs) affect the white matter of the central nervous system and manifest as neurological disorders. They are genetically heterogeneous. Very recently, biallelic variants in NKX6-2 have been suggested to cause a novel form of autosomal recessive HLD. Using whole-exome or whole-genome sequencing, we identified the previously reported c.196delC and c.487C>G variants in NKX6-2 in 3 and 2 unrelated index cases, respectively; the novel c.608G>A variant was identified in a sixth patient. All variants were homozygous in affected family members only. Our patients share a primary diagnosis of psychomotor delay, and they show spastic quadriparesis, nystagmus and hypotonia. Seizures and dysmorphic features (observed in 2 families each) represent an addition to the phenotype, while developmental regression (observed in 3 families) appears to be a notable and previously underestimated clinical feature. Our findings extend the clinical and mutational spectra associated with this novel form of HLD. Comparative analysis of our 10 patients and the 15 reported previously did not, however, reveal clear evidence for a genotype-phenotype correlation. © 2018 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Characterization of self-generated variants in Pseudoalteromonas lipolytica biofilm with increased antifouling activities.

PubMed

Zeng, Zhenshun; Guo, Xing-Pan; Li, Baiyuan; Wang, Pengxia; Cai, Xingsheng; Tian, Xinpeng; Zhang, Si; Yang, Jin-Long; Wang, Xiaoxue

2015-12-01

Pseudoalteromonas is widespread in various marine environments, and most strains can affect invertebrate larval settlement and metamorphosis by forming biofilms. However, the impact and the molecular basis of population diversification occurring in Pseudoalteromonas biofilms are poorly understood. Here, we show that morphological diversification is prevalent in Pseudoalteromonas species during biofilm formation. Two types of genetic variants, wrinkled (frequency of 12±5%) and translucent (frequency of 5±3%), were found in Pseudoalteromonas lipolytica biofilms. The inducing activities of biofilms formed by the two variants on larval settlement and metamorphosis of the mussel Mytilus coruscus were significantly decreased, suggesting strong antifouling activities. Using whole-genome re-sequencing combined with genetic manipulation, two genes were identified as responsible for the morphology alterations. A nonsense mutation in AT00_08765 led to a wrinkled morphology due to the overproduction of cellulose, whereas a point mutation in AT00_17125 led to a translucent morphology via a reduction in capsular polysaccharide production. Taken together, the results suggest that the microbial effect on larval settlement and metamorphosis in the marine environment could be altered by the self-generated variants that arise during the formation of marine biofilms, suggesting potential applications in the biocontrol of marine biofouling.
Genetics and epigenetics of obesity.

PubMed

Herrera, Blanca M; Keildson, Sarah; Lindgren, Cecilia M

2011-05-01

Obesity results from interactions between environmental and genetic factors. Despite a relatively high heritability of common, non-syndromic obesity (40-70%), the search for genetic variants contributing to susceptibility has been a challenging task. Genome-wide association (GWA) studies have dramatically changed the pace of detection of common genetic susceptibility variants. To date, more than 40 genetic variants have been associated with obesity and fat distribution. However, since these variants do not fully explain the heritability of obesity, other forms of variation, such as epigenetic marks, must be considered. Epigenetic marks, or "imprinting", affect gene expression without actually changing the DNA sequence. Failures in imprinting are known to cause extreme forms of obesity (e.g. Prader-Willi syndrome), but have also been convincingly associated with susceptibility to obesity. Furthermore, environmental exposures during critical developmental periods can affect the profile of epigenetic marks and result in obesity. We review the most recent evidence for genetic and epigenetic mechanisms involved in the susceptibility and development of obesity. Only a comprehensive understanding of the underlying genetic and epigenetic mechanisms, and the metabolic processes they govern, will allow us to manage, and eventually prevent, obesity. Copyright © 2011 Elsevier Ireland Ltd. All rights reserved.
Targeted Deep Resequencing Identifies Coding Variants in the PEAR1 Gene That Play a Role in Platelet Aggregation

PubMed Central

Kim, Yoonhee; Suktitipat, Bhoom; Yanek, Lisa R.; Faraday, Nauder; Wilson, Alexander F.; Becker, Diane M.; Becker, Lewis C.; Mathias, Rasika A.

2013-01-01

Platelet aggregation is heritable, and genome-wide association studies have detected strong associations with a common intronic variant of the platelet endothelial aggregation receptor 1 (PEAR1) gene both in African American and European American individuals. In this study, we used a sequencing approach to identify additional exonic variants in PEAR1 that may also determine variability in platelet aggregation in the GeneSTAR Study. A 0.3 Mb targeted region on chromosome 1q23.1 including the entire PEAR1 gene was Sanger sequenced in 104 subjects (45% male, 49% African American, age = 52±13) selected on the basis of hyper- and hypo-aggregation across three different agonists (collagen, epinephrine, and adenosine diphosphate). Single-variant and multi-variant burden tests for association were performed. Of the 235 variants identified through sequencing, 61 were novel, and three of these were missense variants. More rare variants (MAF<5%) were noted in African Americans compared to European Americans (108 vs. 45). The common intronic GWAS-identified variant (rs12041331) demonstrated the most significant association signal in African Americans (p = 4.020×10^-4); no association was seen for additional exonic variants in this group. In contrast, multi-variant burden tests indicated that exonic variants play a more significant role in European Americans (p = 0.0099 for the collective coding variants compared to p = 0.0565 for intronic variant rs12041331). Imputation of the individual exonic variants in the rest of the GeneSTAR European American cohort (N = 1,965) supports the results noted in the sequenced discovery sample: p = 3.56×10^-4, 2.27×10^-7, 5.20×10^-5 for coding synonymous variant rs56260937 and collagen, epinephrine and adenosine diphosphate induced platelet aggregation, respectively. 
Sequencing approaches confirm that a common intronic variant has the strongest association with platelet aggregation in African Americans, and show that exonic variants play an additional role in platelet aggregation in European Americans. PMID:23704978
Identification of candidate genes for familial early-onset essential tremor.

PubMed

Liu, Xinmin; Hernandez, Nora; Kisselev, Sergey; Floratos, Aris; Sawle, Ashley; Ionita-Laza, Iuliana; Ottman, Ruth; Louis, Elan D; Clark, Lorraine N

2016-07-01

Essential tremor (ET) is one of the most common causes of tremor in humans. Despite its high heritability and prevalence, few susceptibility genes for ET have been identified. To identify ET genes, whole-exome sequencing was performed in 37 early-onset ET families with an autosomal-dominant inheritance pattern. We identified candidate genes for follow-up functional studies in five ET families. In two independent families, we identified variants predicted to affect function in the nitric oxide (NO) synthase 3 gene (NOS3) that cosegregated with disease. NOS3 is highly expressed in the central nervous system (including cerebellum), neurons and endothelial cells, and is one of three enzymes that converts l-arginine to the neurotransmitter NO. In one family, a heterozygous variant, c.46G>A (p.(Gly16Ser)), in NOS3, was identified in three affected ET cases and was absent in an unaffected family member; and in a second family, a heterozygous variant, c.164C>T (p.(Pro55Leu)), was identified in three affected ET cases (dizygotic twins and their mother). Both variants result in amino-acid substitutions of highly conserved amino-acid residues that are predicted to be deleterious and damaging by in silico analysis. In three independent families, variants predicted to affect function were also identified in other genes, including KCNS2 (KV9.2), HAPLN4 (BRAL2) and USP46. These genes are highly expressed in the cerebellum and Purkinje cells, and influence function of the gamma-amino butyric acid (GABA)-ergic system. This is in concordance with recent evidence that the pathophysiological process in ET involves cerebellar dysfunction and possibly cerebellar degeneration with a reduction in Purkinje cells, and a decrease in GABA-ergic tone.
Heterozygous missense variants of LMX1A lead to nonsyndromic hearing impairment and vestibular dysfunction.

PubMed

Wesdorp, Mieke; de Koning Gans, Pia A M; Schraders, Margit; Oostrik, Jaap; Huynen, Martijn A; Venselaar, Hanka; Beynon, Andy J; van Gaalen, Judith; Piai, Vitória; Voermans, Nicol; van Rossum, Michelle M; Hartel, Bas P; Lelieveld, Stefan H; Wiel, Laurens; Verbist, Berit; Rotteveel, Liselotte J; van Dooren, Marieke F; Lichtner, Peter; Kunst, Henricus P M; Feenstra, Ilse; Admiraal, Ronald J C; Yntema, Helger G; Hoefsloot, Lies H; Pennings, Ronald J E; Kremer, Hannie

2018-05-12

Unraveling the causes and pathomechanisms of progressive disorders is essential for the development of therapeutic strategies. Here, we identified heterozygous pathogenic missense variants of LMX1A in two families of Dutch origin with progressive nonsyndromic hearing impairment (HI), using whole exome sequencing. One variant, c.721G>C (p.Val241Leu), occurred de novo and is predicted to affect the homeodomain of LMX1A, which is essential for DNA binding. The second variant, c.290G>C (p.Cys97Ser), is predicted to affect a zinc-binding residue of the second LIM domain that is involved in protein-protein interactions. Bi-allelic deleterious variants of Lmx1a are associated with a complex phenotype in mice, including deafness and vestibular defects, due to arrest of inner ear development. Although Lmx1a mouse mutants demonstrate neurological, skeletal, pigmentation and reproductive system abnormalities, no syndromic features were present in the participating subjects of either family. LMX1A has previously been suggested as a candidate gene for intellectual disability, but our data do not support this, as affected subjects displayed normal cognition. Large variability was observed in the age of onset, (a)symmetry, severity and progression rate of HI. About half of the affected individuals displayed vestibular dysfunction and experienced symptoms thereof. The late-onset progressive phenotype and the absence of cochleovestibular malformations on computed tomography scans indicate that heterozygous defects of LMX1A do not result in severe developmental abnormalities in humans. We propose that a single LMX1A wild-type copy is sufficient for normal development but insufficient for maintenance of cochleovestibular function. Alternatively, minor cochleovestibular developmental abnormalities could eventually lead to the progressive phenotype seen in the families.
A Missense Variant in KCNJ10 in Belgian Shepherd Dogs Affected by Spongy Degeneration with Cerebellar Ataxia (SDCA1).

PubMed

Mauri, Nico; Kleiter, Miriam; Leschnik, Michael; Högler, Sandra; Dietschi, Elisabeth; Wiedmer, Michaela; Dietrich, Joëlle; Henke, Diana; Steffen, Frank; Schuller, Simone; Gurtner, Corinne; Stokar-Regenscheit, Nadine; O'Toole, Donal; Bilzer, Thomas; Herden, Christiane; Oevermann, Anna; Jagannathan, Vidhya; Leeb, Tosso

2017-02-09

Spongy degeneration with cerebellar ataxia (SDCA) is a severe neurodegenerative disease with monogenic autosomal recessive inheritance in Malinois dogs, one of the four varieties of the Belgian Shepherd breed. We performed a genetic investigation in six families and seven isolated cases of Malinois dogs with signs of cerebellar dysfunction. Linkage analysis revealed an unexpected genetic heterogeneity within the studied cases. The affected dogs from four families and one isolated case shared a ∼1.4 Mb common homozygous haplotype segment on chromosome 38. Whole genome sequence analysis of three affected and 140 control dogs revealed a missense variant in the KCNJ10 gene encoding a potassium channel (c.986T>C; p.Leu329Pro). Pathogenic variants in KCNJ10 were reported previously in humans, mice, and dogs with neurological phenotypes. Therefore, we consider KCNJ10 :c.986T>C the most likely candidate causative variant for one subtype of SDCA in Malinois dogs, which we propose to term spongy degeneration with cerebellar ataxia 1 (SDCA1). However, our study also comprised samples from 12 Malinois dogs with cerebellar dysfunction which were not homozygous for this variant, suggesting a different genetic basis in these dogs. A retrospective detailed clinical and histopathological analysis revealed subtle neuropathological differences with respect to SDCA1-affected dogs. Thus, our study highlights the genetic and phenotypic complexity underlying cerebellar dysfunction in Malinois dogs and provides the basis for a genetic test to eradicate one specific neurodegenerative disease from the breeding population. These dogs represent an animal model for the human EAST syndrome. Copyright © 2017 Mauri et al.
First detection of canine parvovirus type 2b from diarrheic dogs in Himachal Pradesh.

PubMed

Sharma, Shalini; Dhar, Prasenjit; Thakur, Aneesh; Sharma, Vivek; Sharma, Mandeep

2016-09-01

The present study was conducted to detect the presence of canine parvovirus (CPV) among diarrheic dogs in Himachal Pradesh and to identify the most prevalent antigenic variant of CPV based on molecular typing and sequence analysis of the VP2 gene. A total of 102 fecal samples were collected from clinical cases of diarrhea or hemorrhagic gastroenteritis from CPV vaccinated or non-vaccinated dogs. Samples were tested using CPV-specific polymerase chain reaction (PCR) targeting the VP2 gene, multiplex PCR for detection of CPV-2a and CPV-2b antigenic variants, and a PCR for the detection of CPV-2c. A CPV-2b isolate was cultured on Madin-Darby canine kidney (MDCK) cell lines and its VP2 structural protein gene was sequenced. Multiple alignment and phylogenetic analysis were done using ClustalW and MEGA6, and the phylogeny was inferred using the Neighbor-Joining method. No sample was found positive for the original CPV strain usually present in the vaccine. However, about 50% (52 out of 102) of the samples were found to be positive with the CPV-2ab PCR assay that detects newer variants of CPV circulating in the field. In addition, the multiplex PCR assay that identifies both CPV-2ab and CPV-2b revealed that CPV-2b was the major antigenic variant present in the affected dogs. A PCR-positive isolate of CPV-2b was adapted to grow in MDCK cells and produced a characteristic cytopathic effect after the fifth passage. Multiple sequence alignment showed that the VP2 structural gene of the CPV-2b isolate used in the study (accession number HG004610) was similar to other sequenced isolates in the NCBI sequence database, with 98-99% homology. This study reports the first detection of CPV-2b in dogs with hemorrhagic gastroenteritis in Himachal Pradesh and the absence of other antigenic types of CPV. Further, the CPV-specific PCR assay can be used for rapid confirmation of circulating virus strains under field conditions.
Uptake, Results, and Outcomes of Germline Multiple-Gene Sequencing After Diagnosis of Breast Cancer.

PubMed

Kurian, Allison W; Ward, Kevin C; Hamilton, Ann S; Deapen, Dennis M; Abrahamse, Paul; Bondarenko, Irina; Li, Yun; Hawley, Sarah T; Morrow, Monica; Jagsi, Reshma; Katz, Steven J

2018-05-10

Low-cost sequencing of multiple genes is increasingly available for cancer risk assessment. Little is known about uptake or outcomes of multiple-gene sequencing after breast cancer diagnosis in community practice. To examine the effect of multiple-gene sequencing on the experience and treatment outcomes for patients with breast cancer. For this population-based retrospective cohort study, patients with breast cancer diagnosed from January 2013 to December 2015 and accrued from SEER registries across Georgia and in Los Angeles, California, were surveyed (n = 5080, response rate = 70%). Responses were merged with SEER data and results of clinical genetic tests, either BRCA1 and BRCA2 (BRCA1/2) sequencing only or including additional other genes (multiple-gene sequencing), provided by 4 laboratories. Type of testing (multiple-gene sequencing vs BRCA1/2-only sequencing), test results (negative, variant of unknown significance, or pathogenic variant), patient experiences with testing (timing of testing, who discussed results), and treatment (strength of patient consideration of, and surgeon recommendation for, prophylactic mastectomy), and prophylactic mastectomy receipt. We defined a patient subgroup with higher pretest risk of carrying a pathogenic variant according to practice guidelines. Among 5026 patients (mean [SD] age, 59.9 [10.7]), 1316 (26.2%) were linked to genetic results from any laboratory. Multiple-gene sequencing increasingly replaced BRCA1/2-only testing over time: in 2013, the rate of multiple-gene sequencing was 25.6% and BRCA1/2-only testing, 74.4%; in 2015 the rate of multiple-gene sequencing was 66.5% and BRCA1/2-only testing, 33.5%. Multiple-gene sequencing was more often ordered by genetic counselors (multiple-gene sequencing, 25.5% and BRCA1/2-only testing, 15.3%) and delayed until after surgery (multiple-gene sequencing, 32.5% and BRCA1/2-only testing, 19.9%). Multiple-gene sequencing substantially increased rate of detection of any pathogenic variant (multiple-gene sequencing: higher-risk patients, 12%; average-risk patients, 4.2% and BRCA1/2-only testing: higher-risk patients, 7.8%; average-risk patients, 2.2%) and variants of uncertain significance, especially in minorities (multiple-gene sequencing: white patients, 23.7%; black patients, 44.5%; and Asian patients, 50.9% and BRCA1/2-only testing: white patients, 2.2%; black patients, 5.6%; and Asian patients, 0%). Multiple-gene sequencing was not associated with an increase in the rate of prophylactic mastectomy use, which was highest with pathogenic variants in BRCA1/2 (BRCA1/2, 79.0%; other pathogenic variant, 37.6%; variant of uncertain significance, 30.2%; negative, 35.3%). Multiple-gene sequencing rapidly replaced BRCA1/2-only testing for patients with breast cancer in the community and enabled 2-fold higher detection of clinically relevant pathogenic variants without an associated increase in prophylactic mastectomy. However, important targets for improvement in the clinical utility of multiple-gene sequencing include postsurgical delay and racial/ethnic disparity in variants of uncertain significance.
A dominant negative mutation at the ATP binding domain of AMHR2 is associated with a defective anti-Müllerian hormone signaling pathway.

PubMed

Li, Lin; Zhou, Xueya; Wang, Xi; Wang, Jing; Zhang, Wei; Wang, Binbin; Cao, Yunxia; Kee, Kehkooi

2016-09-01

Does a heterozygous mutation in AMHR2, identified by whole-exome sequencing (WES) of patients with primary ovarian insufficiency (POI), cause a defect in anti-Müllerian hormone (AMH) signaling? The I209N mutation at the adenosine triphosphate binding domain of AMHR2 exerts dominant negative defects in the AMH signaling pathway. Previous studies have demonstrated the associations of several sequence variants in AMH or AMHR2 with POI, but no functional assay has been performed to verify whether there was any defect in AMH signaling. Ninety-six unrelated female Chinese Han patients were diagnosed with idiopathic POI and subjected to WES. In silico analysis was done for the sequence variants, followed by molecular assays to examine the functional effects of the sequence variants in human granulosa cells. In silico analysis, immunostaining, Western analysis, genome-wide expression analysis, and quantitative polymerase chain reaction were applied to the characterization of the sequence variants. We identified one novel heterozygous missense variant, p.Ala17Glu (A17E), in AMHR2. Subsequently, A17E and two independently reported missense variants, p.Ile209Asn (I209N) and p.Leu354Phe (L354F), were evaluated for effects on the AMH signaling pathway. In silico analysis predicted that all three variants may be deleterious. However, only one variant, I209N, showed severe defects in transducing the AMH signal as well as impaired SMAD1/5/8 phosphorylation. Furthermore, using genome-wide gene expression analysis, we identified genes whose expression was affected by the mutation; these included genes previously reported to participate in AMH signaling as well as newly identified genes. They are EMILIN2, FAM155A, GATA2, HES5, ID1, ID2, RLTPR, SMAD7, CBL, MALAT1 and SMARCA2. None. Although the in vitro assays demonstrated the causative effect of I209N on AMH signaling, further studies need to validate its long-term effects on folliculogenesis and POI. These results will aid both researchers and clinicians in understanding the molecular pathology of AMH signaling and POI to develop diagnostic assays or therapeutic approaches. Research funding is provided by the Ministry of Science and Technology of China [2012CB944704; 2012CB966702], and the National Natural Science Foundation of China [Grant number: 31171429]. The authors declare no conflict of interest. © The Author 2016. Published by Oxford University Press on behalf of the European Society of Human Reproduction and Embryology.
Identification of novel point mutations in splicing sites integrating whole-exome and RNA-seq data in myeloproliferative diseases.

PubMed

Spinelli, Roberta; Pirola, Alessandra; Redaelli, Sara; Sharma, Nitesh; Raman, Hima; Valletta, Simona; Magistroni, Vera; Piazza, Rocco; Gambacorti-Passerini, Carlo

2013-11-01

Point mutations in intronic regions near mRNA splice junctions can affect the splicing process. To identify novel splicing variants from exome sequencing data, we developed a bioinformatics splice-site prediction procedure to analyze next-generation sequencing (NGS) data (SpliceFinder). SpliceFinder integrates two functional annotation tools for NGS, ANNOVAR and MutationTaster and two canonical splice site prediction programs for single mutation analysis, SSPNN and NetGene2. By SpliceFinder, we identified somatic mutations affecting RNA splicing in a colon cancer sample, in eight atypical chronic myeloid leukemia (aCML), and eight CML patients. A novel homozygous splicing mutation was found in APC (NM_000038.4:c.1312+5G>A) and six heterozygous in GNAQ (NM_002072.2:c.735+1C>T), ABCC3 (NM_003786.3:c.1783-1G>A), KLHDC1 (NM_172193.1:c.568-2A>G), HOOK1 (NM_015888.4:c.1662-1G>A), SMAD9 (NM_001127217.2:c.1004-1C>T), and DNAH9 (NM_001372.3:c.10242+5G>A). Integrating whole-exome and RNA sequencing in aCML and CML, we assessed the phenotypic effect of mutations on mRNA splicing for GNAQ, ABCC3, HOOK1. In ABCC3 and HOOK1, RNA-Seq showed the presence of aberrant transcripts with activation of a cryptic splice site or intron retention, validated by the reverse transcription-polymerase chain reaction (RT-PCR) in the case of HOOK1. In GNAQ, RNA-Seq showed 22% of wild-type transcript and 78% of mRNA skipping exon 5, resulting in a 4-6 frameshift fusion confirmed by RT-PCR. The pipeline can be useful to identify intronic variants affecting RNA sequence by complementing conventional exome analysis.
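The intronic variants in this abstract are written in HGVS coding notation, where `c.N+M` counts M bases into the intron after exon base N (the donor side) and `c.N-M` counts M bases back from exon base N (the acceptor side). The Python sketch below parses that notation for simple substitutions. It is only an illustration of how to read the variant names: the names `parse_intronic_substitution` and `hits_canonical_dinucleotide` are hypothetical, the regular expression covers only single-base intronic substitutions, and none of this is part of SpliceFinder.

```python
import re
from typing import NamedTuple


class IntronicVariant(NamedTuple):
    transcript: str   # RefSeq accession with version, e.g. "NM_000038.4"
    exon_anchor: int  # nearest exonic coding position (the N in c.N+M)
    offset: int       # signed distance into the intron (+ donor side, - acceptor side)
    ref: str
    alt: str
    side: str         # "donor" or "acceptor"


# Covers only single-base intronic substitutions such as NM_x.y:c.123+5G>A.
_PATTERN = re.compile(
    r"^(?P<tx>[A-Z]{2}_\d+\.\d+):c\.(?P<anchor>\d+)"
    r"(?P<sign>[+-])(?P<dist>\d+)(?P<ref>[ACGT])>(?P<alt>[ACGT])$"
)


def parse_intronic_substitution(hgvs: str) -> IntronicVariant:
    """Split an HGVS intronic substitution into its components."""
    match = _PATTERN.match(hgvs)
    if match is None:
        raise ValueError(f"not a simple intronic substitution: {hgvs!r}")
    sign = 1 if match["sign"] == "+" else -1
    return IntronicVariant(
        transcript=match["tx"],
        exon_anchor=int(match["anchor"]),
        offset=sign * int(match["dist"]),
        ref=match["ref"],
        alt=match["alt"],
        side="donor" if sign > 0 else "acceptor",
    )


def hits_canonical_dinucleotide(variant: IntronicVariant) -> bool:
    """True when the change falls in the first or last two intronic bases."""
    return abs(variant.offset) <= 2


if __name__ == "__main__":
    # Variant names copied from the abstract above.
    for name in ("NM_000038.4:c.1312+5G>A", "NM_003786.3:c.1783-1G>A"):
        v = parse_intronic_substitution(name)
        print(name, v.side, v.offset, hits_canonical_dinucleotide(v))
```

Read this way, the APC and DNAH9 changes (+5) lie just outside the canonical donor dinucleotide, while the others in the list fall at the ±1 or ±2 positions.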
Systematic comparison of variant calling pipelines using gold standard personal exome variants

PubMed Central

Hwang, Sohyun; Kim, Eiru; Lee, Insuk; Marcotte, Edward M.

2015-01-01

The success of clinical genomics using next generation sequencing (NGS) requires the accurate and consistent identification of personal genome variants. Assorted variant calling methods have been developed, which show low concordance between their calls. Hence, a systematic comparison of the variant callers could give important guidance to NGS-based clinical genomics. Recently, a set of high-confidence variant calls for one individual (NA12878) has been published by the Genome in a Bottle (GIAB) consortium, enabling performance benchmarking of different variant calling pipelines. Based on the gold standard reference variant calls from GIAB, we compared the performance of thirteen variant calling pipelines, testing combinations of three read aligners (BWA-MEM, Bowtie2, and Novoalign) and four variant callers (Genome Analysis Tool Kit HaplotypeCaller [GATK-HC], Samtools mpileup, Freebayes and Ion Proton Variant Caller [TVC]), for twelve data sets for the NA12878 genome sequenced by different platforms including Illumina2000, Illumina2500, and Ion Proton, with various exome capture systems and exome coverage. We observed different biases toward specific types of SNP genotyping errors by the different variant callers. The results of our study provide useful guidelines for reliable variant identification from deep sequencing of personal genomes. PMID:26639839
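As a rough illustration of the benchmarking idea in this abstract, the Python sketch below scores one call set against a gold-standard set using precision, recall, and F1. The function name `benchmark_calls`, the tuple representation of a variant, and the toy coordinates are all assumptions made for illustration. They are not taken from the study or from GIAB data, and a real comparison would normally also restrict scoring to high-confidence regions and normalize variant representation first.

```python
from typing import NamedTuple, Set, Tuple

# A variant is identified here by (chromosome, position, ref allele, alt allele).
Variant = Tuple[str, int, str, str]


class Benchmark(NamedTuple):
    true_positives: int
    false_positives: int
    false_negatives: int
    precision: float
    recall: float
    f1: float


def benchmark_calls(calls: Set[Variant], truth: Set[Variant]) -> Benchmark:
    """Compare a pipeline's calls with a gold-standard set by exact match."""
    tp = len(calls & truth)   # called and in the gold standard
    fp = len(calls - truth)   # called but absent from the gold standard
    fn = len(truth - calls)   # in the gold standard but missed
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return Benchmark(tp, fp, fn, precision, recall, f1)


# Toy data, invented for this example only.
truth = {
    ("chr1", 100, "A", "G"),
    ("chr1", 250, "C", "T"),
    ("chr2", 75, "G", "A"),
    ("chr2", 300, "T", "C"),
}
pipeline_a = {
    ("chr1", 100, "A", "G"),
    ("chr1", 250, "C", "T"),
    ("chr2", 75, "G", "A"),
    ("chr3", 10, "A", "T"),   # not in the gold standard
}

result = benchmark_calls(pipeline_a, truth)

if __name__ == "__main__":
    print(result)
```

Running the same function over each aligner and caller combination would give one row per pipeline, which is the kind of table such a comparison needs.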
Kaposi's Sarcoma-Associated Herpesvirus MicroRNA Single-Nucleotide Polymorphisms Identified in Clinical Samples Can Affect MicroRNA Processing, Level of Expression, and Silencing Activity

PubMed Central

Han, Soo-Jin; Marshall, Vickie; Barsov, Eugene; Quiñones, Octavio; Ray, Alex; Labo, Nazzarena; Trivett, Matthew; Ott, David; Renne, Rolf

2013-01-01

Kaposi's sarcoma-associated herpesvirus (KSHV) encodes 12 pre-microRNAs that can produce 25 KSHV mature microRNAs. We previously reported single-nucleotide polymorphisms (SNPs) in KSHV-encoded pre-microRNA and mature microRNA sequences from clinical samples (V. Marshall et al., J. Infect. Dis., 195:645–659, 2007). To determine whether microRNA SNPs affect pre-microRNA processing and, ultimately, mature microRNA expression levels, we performed a detailed comparative analysis of (i) mature microRNA expression levels, (ii) in vitro Drosha/Dicer processing, and (iii) RNA-induced silencing complex-dependent targeting of wild-type (wt) and variant microRNA genes. Expression of pairs of wt and variant pre-microRNAs from retroviral vectors and measurement of KSHV mature microRNA expression by real-time reverse transcription-PCR (RT-PCR) revealed differential expression levels that correlated with the presence of specific sequence polymorphisms. Measurement of KSHV mature microRNA expression in a panel of primary effusion lymphoma cell lines by real-time RT-PCR recapitulated some observed expression differences but suggested a more complex relationship between sequence differences and expression of mature microRNA. Furthermore, in vitro maturation assays demonstrated significant SNP-associated changes in Drosha/DGCR8 and/or Dicer processing. These data demonstrate that SNPs within KSHV-encoded pre-microRNAs are associated with differential microRNA expression levels. Given the multiple reports on the involvement of microRNAs in cancer, the biological significance of these phenotypic and genotypic variants merits further studies in patients with KSHV-associated malignancies. PMID:24006441
Evolutionary conservation analysis increases the colocalization of predicted exonic splicing enhancers in the BRCA1 gene with missense sequence changes and in-frame deletions, but not polymorphisms

PubMed Central

Pettigrew, Christopher; Wayte, Nicola; Lovelock, Paul K; Tavtigian, Sean V; Chenevix-Trench, Georgia; Spurdle, Amanda B; Brown, Melissa A

2005-01-01

Introduction: Aberrant pre-mRNA splicing can be more detrimental to the function of a gene than changes in the length or nature of the encoded amino acid sequence. Although predicting the effects of changes in consensus 5' and 3' splice sites near intron:exon boundaries is relatively straightforward, predicting the possible effects of changes in exonic splicing enhancers (ESEs) remains a challenge. Methods: As an initial step toward determining which ESEs predicted by the web-based tool ESEfinder in the breast cancer susceptibility gene BRCA1 are likely to be functional, we have determined their evolutionary conservation and compared their location with known BRCA1 sequence variants. Results: Using the default settings of ESEfinder, we initially detected 669 potential ESEs in the coding region of the BRCA1 gene. Increasing the threshold score reduced the total number to 464, while taking into consideration the proximity to splice donor and acceptor sites reduced the number to 211. Approximately 11% of these ESEs (23/211) either are identical at the nucleotide level in human, primates, mouse, cow, dog and opossum Brca1 (conserved) or are detectable by ESEfinder in the same position in the Brca1 sequence (shared). The frequency of conserved and shared predicted ESEs between human and mouse is higher in BRCA1 exons (2.8 per 100 nucleotides) than in introns (0.6 per 100 nucleotides). Of conserved or shared putative ESEs, 61% (14/23) were predicted to be affected by sequence variants reported in the Breast Cancer Information Core database. Applying the filters described above increased the colocalization of predicted ESEs with missense changes, in-frame deletions and unclassified variants predicted to be deleterious to protein function, whereas they decreased the colocalization with known polymorphisms or unclassified variants predicted to be neutral. Conclusion: In this report we show that evolutionary conservation analysis may be used to improve the specificity of an ESE prediction tool. This is the first report on the prediction of the frequency and distribution of ESEs in the BRCA1 gene, and it is the first reported attempt to predict which ESEs are most likely to be functional and therefore which sequence variants in ESEs are most likely to be pathogenic. PMID:16280041
Truncating Variants in NAA15 Are Associated with Variable Levels of Intellectual Disability, Autism Spectrum Disorder, and Congenital Anomalies.

PubMed

Cheng, Hanyin; Dharmadhikari, Avinash V; Varland, Sylvia; Ma, Ning; Domingo, Deepti; Kleyner, Robert; Rope, Alan F; Yoon, Margaret; Stray-Pedersen, Asbjørg; Posey, Jennifer E; Crews, Sarah R; Eldomery, Mohammad K; Akdemir, Zeynep Coban; Lewis, Andrea M; Sutton, Vernon R; Rosenfeld, Jill A; Conboy, Erin; Agre, Katherine; Xia, Fan; Walkiewicz, Magdalena; Longoni, Mauro; High, Frances A; van Slegtenhorst, Marjon A; Mancini, Grazia M S; Finnila, Candice R; van Haeringen, Arie; den Hollander, Nicolette; Ruivenkamp, Claudia; Naidu, Sakkubai; Mahida, Sonal; Palmer, Elizabeth E; Murray, Lucinda; Lim, Derek; Jayakar, Parul; Parker, Michael J; Giusto, Stefania; Stracuzzi, Emanuela; Romano, Corrado; Beighley, Jennifer S; Bernier, Raphael A; Küry, Sébastien; Nizon, Mathilde; Corbett, Mark A; Shaw, Marie; Gardner, Alison; Barnett, Christopher; Armstrong, Ruth; Kassahn, Karin S; Van Dijck, Anke; Vandeweyer, Geert; Kleefstra, Tjitske; Schieving, Jolanda; Jongmans, Marjolijn J; de Vries, Bert B A; Pfundt, Rolph; Kerr, Bronwyn; Rojas, Samantha K; Boycott, Kym M; Person, Richard; Willaert, Rebecca; Eichler, Evan E; Kooy, R Frank; Yang, Yaping; Wu, Joseph C; Lupski, James R; Arnesen, Thomas; Cooper, Gregory M; Chung, Wendy K; Gecz, Jozef; Stessman, Holly A F; Meng, Linyan; Lyon, Gholson J

2018-05-03

N-alpha-acetylation is a common co-translational protein modification that is essential for normal cell function in humans. We previously identified the genetic basis of an X-linked infantile lethal Mendelian disorder involving a c.109T>C (p.Ser37Pro) missense variant in NAA10, which encodes the catalytic subunit of the N-terminal acetyltransferase A (NatA) complex. The auxiliary subunit of the NatA complex, NAA15, is the dimeric binding partner for NAA10. Through a genotype-first approach with whole-exome or genome sequencing (WES/WGS) and targeted sequencing analysis, we identified and phenotypically characterized 38 individuals from 33 unrelated families with 25 different de novo or inherited, dominantly acting likely gene disrupting (LGD) variants in NAA15. Clinical features of affected individuals with LGD variants in NAA15 include variable levels of intellectual disability, delayed speech and motor milestones, and autism spectrum disorder. Additionally, mild craniofacial dysmorphology, congenital cardiac anomalies, and seizures are present in some subjects. RNA analysis in cell lines from two individuals showed degradation of the transcripts with LGD variants, probably as a result of nonsense-mediated decay. Functional assays in yeast confirmed a deleterious effect for two of the LGD variants in NAA15. Further supporting a mechanism of haploinsufficiency, individuals with copy-number variant (CNV) deletions involving NAA15 and surrounding genes can present with mild intellectual disability, mild dysmorphic features, motor delays, and decreased growth. We propose that defects in NatA-mediated N-terminal acetylation (NTA) lead to variable levels of neurodevelopmental disorders in humans, supporting the importance of the NatA complex in normal human development. Copyright © 2018 American Society of Human Genetics. All rights reserved.
Is IGSF1 involved in human pituitary tumor formation?

PubMed

Faucz, Fabio R; Horvath, Anelia D; Azevedo, Monalisa F; Levy, Isaac; Bak, Beata; Wang, Ying; Xekouki, Paraskevi; Szarek, Eva; Gourgari, Evgenia; Manning, Allison D; de Alexandre, Rodrigo Bertollo; Saloustros, Emmanouil; Trivellin, Giampaolo; Lodish, Maya; Hofman, Paul; Anderson, Yvonne C; Holdaway, Ian; Oldfield, Edward; Chittiboina, Prashant; Nesterova, Maria; Biermasz, Nienke R; Wit, Jan M; Bernard, Daniel J; Stratakis, Constantine A

2015-02-01

IGSF1 is a membrane glycoprotein highly expressed in the anterior pituitary. Pathogenic mutations in the IGSF1 gene (on Xq26.2) are associated with X-linked central hypothyroidism and testicular enlargement in males. In this study, we tested the hypothesis that IGSF1 is involved in the development of pituitary tumors, especially those that produce growth hormone (GH). IGSF1 was sequenced in 21 patients with gigantism or acromegaly and 92 healthy individuals. Expression studies with a candidate pathogenic IGSF1 variant were carried out in transfected cells and immunohistochemistry for IGSF1 was performed in the sections of GH-producing adenomas, familial somatomammotroph hyperplasia, and in normal pituitary. We identified the sequence variant p.N604T, which in silico analysis suggested could affect IGSF1 function, in two male patients and one female with somatomammotroph hyperplasia from the same family. Of 60 female controls, two carried the same variant and seven were heterozygous for other variants. Immunohistochemistry showed increased IGSF1 staining in the GH-producing tumor from the patient with the IGSF1 p.N604T variant compared with a GH-producing adenoma from a patient negative for any IGSF1 variants and with normal control pituitary tissue. The IGSF1 gene appears polymorphic in the general population. A potentially pathogenic variant identified in the germline of three patients with gigantism from the same family (segregating with the disease) was also detected in two healthy female controls. Variations in IGSF1 expression in pituitary tissue in patients with or without IGSF1 germline mutations point to the need for further studies of IGSF1 action in pituitary adenoma formation. © 2015 Society for Endocrinology.

Is IGSF1 involved in human pituitary tumor formation?

PubMed Central

Faucz, Fabio R.; Horvath, Anelia D.; Azevedo, Monalisa F.; Levy, Isaac; Bak, Beata; Wang, Ying; Xekouki, Paraskevi; Szarek, Eva; Gourgari, Evgenia; Manning, Allison D.; de Alexandre, Rodrigo Bertollo; Saloustros, Emmanouil; Trivellin, Giampaolo; Lodish, Maya; Hofman, Paul; Anderson, Yvonne C; Holdaway, Ian; Oldfield, Edward; Chittiboina, Prashant; Nesterova, Maria; Biermasz, Nienke R.; Wit, Jan M.; Bernard, Daniel J.; Stratakis, Constantine A.

2014-01-01

IGSF1 is a membrane glycoprotein highly expressed in the anterior pituitary. Pathogenic mutations in the IGSF1 gene (on Xq26.2) are associated with X-linked central hypothyroidism and testicular enlargement in males. In this study we tested the hypothesis that IGSF1 is involved in the development of pituitary tumors, especially those that produce growth hormone (GH). IGSF1 was sequenced in 21 patients with gigantism or acromegaly and 92 healthy individuals. Expression studies with a candidate pathogenic IGSF1 variant were carried out in transfected cells and immunohistochemistry for IGSF1 was performed in sections from GH-producing adenomas, familial somatomammotroph hyperplasia and in normal pituitary. In two male patients, and in one female, with somatomammotroph hyperplasia from the same family, we identified the sequence variant p.N604T, which in silico analysis suggested could affect IGSF1 function. Of 60 female controls, two carried the same variant, and seven were heterozygous for other variants. Immunohistochemistry showed increased IGSF1 staining in the GH-producing tumor from the patient with the IGSF1 p.N604T variant compared to a GH-producing adenoma from a patient negative for any IGSF1 variants and to normal control pituitary tissue. The IGSF1 gene appears polymorphic in the general population. A potentially pathogenic variant identified in the germline of three patients with gigantism from the same family (segregating with the disease) was also detected in two healthy female controls. Variations in IGSF1 expression in pituitary tissue in patients with or without IGSF1 germline mutations point to the need for further studies of IGSF1 action in pituitary adenoma formation. PMID:25527509
Association of Germline CHEK2 Gene Variants with Risk and Prognosis of Non-Hodgkin Lymphoma

PubMed Central

Havranek, Ondrej; Kleiblova, Petra; Hojny, Jan; Lhota, Filip; Soucek, Pavel; Trneny, Marek; Kleibl, Zdenek

2015-01-01

The checkpoint kinase 2 gene (CHEK2) codes for the CHK2 protein, an important mediator of the DNA damage response pathway. The CHEK2 gene has been recognized as a multi-cancer susceptibility gene; however, its role in non-Hodgkin lymphoma (NHL) remains unclear. We performed mutation analysis of the entire CHEK2 coding sequence in 340 NHL patients using denaturing high-performance liquid chromatography (DHPLC) and multiplex ligation-dependent probe amplification (MLPA). Identified hereditary variants were genotyped in 445 non-cancer controls. The influence of CHEK2 variants on disease risk was statistically evaluated. Identified CHEK2 germline variants included four truncating mutations (found in five patients and no control; P = 0.02) and nine missense variants (found in 21 patients and 12 controls; P = 0.02). Carriers of non-synonymous variants had an increased risk of NHL development [odds ratio (OR) 2.86; 95% confidence interval (CI) 1.42–5.79] and an unfavorable prognosis [hazard ratio (HR) of progression-free survival (PFS) 2.1; 95% CI 1.12–4.05]. In contrast, the most frequent intronic variant c.319+43dupA (identified in 22% of patients and 31% of controls) was associated with a decreased NHL risk (OR = 0.62; 95% CI 0.45–0.86), but its positive prognostic effect was limited to NHL patients with diffuse large B-cell lymphoma (DLBCL) treated by conventional chemotherapy without rituximab (HR-PFS 0.4; 94% CI 0.17–0.74). Our results show that germ-line CHEK2 mutations affecting protein coding sequence confer a moderately-increased risk of NHL, they are associated with an unfavorable NHL prognosis, and they may represent a valuable predictive biomarker for patients with DLBCL. PMID:26506619
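The odds ratios quoted in this abstract summarize 2x2 tables of carriers and non-carriers among patients and controls. The Python sketch below shows the unadjusted calculation with a Wald (Woolf) confidence interval for the intronic variant c.319+43dupA. Two things are assumptions here: the carrier counts are reconstructed from the rounded percentages in the abstract (22% of 340 patients, 31% of 445 controls), and the Wald interval may not be the method the authors used. The result should therefore land near, not exactly on, the reported OR of 0.62 (95% CI 0.45–0.86). The function name `odds_ratio_wald_ci` is hypothetical.

```python
import math
from typing import Tuple


def odds_ratio_wald_ci(
    carrier_cases: int,
    noncarrier_cases: int,
    carrier_controls: int,
    noncarrier_controls: int,
    z: float = 1.96,  # normal quantile for an approximate 95% interval
) -> Tuple[float, float, float]:
    """Return (odds ratio, lower bound, upper bound) for a 2x2 table."""
    cells = (carrier_cases, noncarrier_cases, carrier_controls, noncarrier_controls)
    if min(cells) <= 0:
        raise ValueError("all four cells must be positive for the Wald interval")
    odds_ratio = (carrier_cases * noncarrier_controls) / (
        noncarrier_cases * carrier_controls
    )
    # Standard error of log(OR) under the Woolf approximation.
    se_log_or = math.sqrt(sum(1.0 / n for n in cells))
    log_or = math.log(odds_ratio)
    return (
        odds_ratio,
        math.exp(log_or - z * se_log_or),
        math.exp(log_or + z * se_log_or),
    )


# Counts reconstructed from rounded percentages in the abstract (an assumption).
n_cases, n_controls = 340, 445
carrier_cases = round(0.22 * n_cases)        # about 75 patients
carrier_controls = round(0.31 * n_controls)  # about 138 controls

or_estimate, ci_low, ci_high = odds_ratio_wald_ci(
    carrier_cases,
    n_cases - carrier_cases,
    carrier_controls,
    n_controls - carrier_controls,
)

if __name__ == "__main__":
    print(f"OR = {or_estimate:.2f} (95% CI {ci_low:.2f} to {ci_high:.2f})")
```

An interval that excludes 1, as this one does, is what underlies the abstract's statement that the variant is associated with decreased risk.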
Association of Germline CHEK2 Gene Variants with Risk and Prognosis of Non-Hodgkin Lymphoma.

PubMed

Havranek, Ondrej; Kleiblova, Petra; Hojny, Jan; Lhota, Filip; Soucek, Pavel; Trneny, Marek; Kleibl, Zdenek

2015-01-01

The checkpoint kinase 2 gene (CHEK2) codes for the CHK2 protein, an important mediator of the DNA damage response pathway. The CHEK2 gene has been recognized as a multi-cancer susceptibility gene; however, its role in non-Hodgkin lymphoma (NHL) remains unclear. We performed mutation analysis of the entire CHEK2 coding sequence in 340 NHL patients using denaturing high-performance liquid chromatography (DHPLC) and multiplex ligation-dependent probe amplification (MLPA). Identified hereditary variants were genotyped in 445 non-cancer controls. The influence of CHEK2 variants on disease risk was statistically evaluated. Identified CHEK2 germline variants included four truncating mutations (found in five patients and no control; P = 0.02) and nine missense variants (found in 21 patients and 12 controls; P = 0.02). Carriers of non-synonymous variants had an increased risk of NHL development [odds ratio (OR) 2.86; 95% confidence interval (CI) 1.42-5.79] and an unfavorable prognosis [hazard ratio (HR) of progression-free survival (PFS) 2.1; 95% CI 1.12-4.05]. In contrast, the most frequent intronic variant c.319+43dupA (identified in 22% of patients and 31% of controls) was associated with a decreased NHL risk (OR = 0.62; 95% CI 0.45-0.86), but its positive prognostic effect was limited to NHL patients with diffuse large B-cell lymphoma (DLBCL) treated by conventional chemotherapy without rituximab (HR-PFS 0.4; 94% CI 0.17-0.74). Our results show that germ-line CHEK2 mutations affecting protein coding sequence confer a moderately-increased risk of NHL, they are associated with an unfavorable NHL prognosis, and they may represent a valuable predictive biomarker for patients with DLBCL.
Mutation detection of E6 and LCR genes from HPV 16 associated with carcinogenesis.

PubMed

Mosmann, Jessica P; Monetti, Marina S; Frutos, Maria C; Kiguen, Ana X; Venezuela, Raul F; Cuffini, Cecilia G

2015-01-01

Human papillomavirus (HPV) is responsible for one of the most frequent sexually transmitted infections. The first phylogenetic analysis was based on an LCR region fragment. Nowadays, 4 variants are known: African (Af-1, Af-2), Asian-American (AA) and European (E). However, the existence of sub-lineages of the European variant has been proposed, specific mutations in the E6 and LCR sequences being possibly related to persistent viral infections. The aim of this study was a phylogenetic study of HPV16 sequences of endocervical samples from Cordoba, in order to detect the circulating lineages and analyze the presence of mutations that could be correlated with malignant disease. The phylogenetic analysis determined that 86% of the samples belonged to the E variant, 7% to Af-1 and the remaining 7% to Af-2. The most frequent mutation in LCR sequences was G7521A, in 80% of the analyzed samples; it affects the binding site of a transcription factor that could contribute to carcinogenesis. In the E6 sequences, the most common mutation was T350G (L83V), detected in 67% of the samples, associated with increased risk of persistent infection. The high detection rate of the European lineage correlated with patterns of human migration. This study emphasizes the importance of recognizing circulating lineages, as well as the detection of mutations associated with high-grade neoplastic lesions that could be correlated to the development of carcinogenic lesions.
Preconception Carrier Screening by Genome Sequencing: Results from the Clinical Laboratory.

PubMed

Punj, Sumit; Akkari, Yassmine; Huang, Jennifer; Yang, Fei; Creason, Allison; Pak, Christine; Potter, Amiee; Dorschner, Michael O; Nickerson, Deborah A; Robertson, Peggy D; Jarvik, Gail P; Amendola, Laura M; Schleit, Jennifer; Simpson, Dana Kostiner; Rope, Alan F; Reiss, Jacob; Kauffman, Tia; Gilmore, Marian J; Himes, Patricia; Wilfond, Benjamin; Goddard, Katrina A B; Richards, C Sue

2018-06-07

Advances in sequencing technologies permit the analysis of a larger selection of genes for preconception carrier screening. The study was designed as a sequential carrier screen using genome sequencing to analyze 728 gene-disorder pairs for carrier and medically actionable conditions in 131 women and their partners (n = 71) who were planning a pregnancy. We report here on the clinical laboratory results from this expanded carrier screening program. Variants were filtered and classified using the latest American College of Medical Genetics and Genomics (ACMG) guideline; only pathogenic and likely pathogenic variants were confirmed by orthogonal methods before being reported. Novel missense variants were classified as variants of uncertain significance. We reported 304 variants in 202 participants. Twelve carrier couples (12/71 couples tested) were identified for common conditions; eight were carriers for hereditary hemochromatosis. Although both known and novel variants were reported, 48% of all reported variants were missense. For novel splice-site variants, RNA-splicing assays were performed to aid in classification. We reported ten copy-number variants and five variants in non-coding regions. One novel variant was reported in F8, associated with hemophilia A; prenatal testing showed that the male fetus harbored this variant and the neonate suffered a life-threatening hemorrhage which was anticipated and appropriately managed. Moreover, 3% of participants had variants that were medically actionable. Compared with targeted mutation screening, genome sequencing improves the sensitivity of detecting clinically significant variants. While certain novel variant interpretation remains challenging, the ACMG guidelines are useful to classify variants in a healthy population. Copyright © 2018 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
India Allele Finder: a web-based annotation tool for identifying common alleles in next-generation sequencing data of Indian origin.

PubMed

Zhang, Jimmy F; James, Francis; Shukla, Anju; Girisha, Katta M; Paciorkowski, Alex R

2017-06-27

We built India Allele Finder, an online searchable database and command line tool, that gives researchers access to variant frequencies of Indian Telugu individuals, using publicly available fastq data from the 1000 Genomes Project. Access to appropriate population-based genomic variant annotation can accelerate the interpretation of genomic sequencing data. In particular, exome analysis of individuals of Indian descent will identify population variants not reflected in European exomes, complicating genomic analysis for such individuals. India Allele Finder offers improved ease-of-use to investigators seeking to identify and annotate sequencing data from Indian populations. We describe the use of India Allele Finder to identify common population variants in a disease quartet whole exome dataset, reducing the number of candidate single nucleotide variants from 84 to 7. India Allele Finder is freely available to investigators to annotate genomic sequencing data from Indian populations. Use of India Allele Finder allows efficient identification of population variants in genomic sequencing data, and is an example of a population-specific annotation tool that simplifies analysis and encourages international collaboration in genomics research.
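The filtering step described in this abstract, removing candidates that are common in a matched reference population, can be sketched as follows. This is a generic illustration under stated assumptions, not the India Allele Finder interface: the function name, the variant key layout `(chrom, pos, ref, alt)`, the in-memory frequency table, and the 1% cutoff are all hypothetical.

```python
def filter_common_variants(candidates, population_af, max_af=0.01):
    """Keep candidate variants whose population allele frequency is below max_af.

    candidates: iterable of (chrom, pos, ref, alt) tuples.
    population_af: dict mapping the same tuples to allele frequencies.
    max_af: hypothetical cutoff; choose it to suit the disease model.
    """
    kept = []
    for variant in candidates:
        # A variant absent from the population table is treated as frequency 0.
        allele_frequency = population_af.get(variant, 0.0)
        if allele_frequency < max_af:
            kept.append(variant)
    return kept


# Made-up example data, for illustration only.
example_candidates = [
    ("1", 1000, "A", "G"),
    ("2", 2000, "C", "T"),
    ("X", 3000, "G", "A"),
]
example_frequencies = {
    ("1", 1000, "A", "G"): 0.25,   # common in the reference population
    ("2", 2000, "C", "T"): 0.002,  # rare
}
remaining = filter_common_variants(example_candidates, example_frequencies)
```

Treating unlisted variants as frequency 0 keeps novel variants in the candidate list, which is usually the safer default when searching for a rare causative variant.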
Mutational analysis of patients with neurofibromatosis 2

DOE Office of Scientific and Technical Information (OSTI.GOV)

MacCollin, M.; Ramesh, V.; Pulaski, K.

Neurofibromatosis 2 (NF2) is a genetic disorder characterized by the development of multiple nervous-system tumors in young adulthood. The NF2 gene has recently been isolated and found to encode a new member, merlin, of the protein 4.1 family of cytoskeleton-associated proteins. To define the molecular basis of NF2 in affected individuals, the authors have used SSCP analysis to scan the exons of the NF2 gene from 33 unrelated patients with NF2. Twenty unique SSCP variants were seen in 21 patients; 10 of these individuals were known to be the only affected person in their kindred, while 7 had at least one other known affected relative. In all cases in which family members were available, the SSCP variant segregated with the disease; comparison of sporadic cases with their parents confirmed the de novo variants. DNA sequence analysis revealed that 19 of the 20 variants observed are predicted to lead to a truncated protein due to frameshift, creation of a stop codon, or interference with normal RNA splicing. A single patient carried a 3-bp deletion removing a phenylalanine residue. The authors conclude that the majority of NF2 patients carry an inactivating mutation of the NF2 gene and that neutral polymorphism in the gene is rare. 18 refs., 3 figs., 2 tabs.
Whole-Exome Sequencing in Familial Parkinson Disease

PubMed Central

Farlow, Janice L.; Robak, Laurie A.; Hetrick, Kurt; Bowling, Kevin; Boerwinkle, Eric; Coban-Akdemir, Zeynep H.; Gambin, Tomasz; Gibbs, Richard A.; Gu, Shen; Jain, Preti; Jankovic, Joseph; Jhangiani, Shalini; Kaw, Kaveeta; Lai, Dongbing; Lin, Hai; Ling, Hua; Liu, Yunlong; Lupski, James R.; Muzny, Donna; Porter, Paula; Pugh, Elizabeth; White, Janson; Doheny, Kimberly; Myers, Richard M.; Shulman, Joshua M.; Foroud, Tatiana

2016-01-01

IMPORTANCE Parkinson disease (PD) is a progressive neurodegenerative disease for which susceptibility is linked to genetic and environmental risk factors. OBJECTIVE To identify genetic variants contributing to disease risk in familial PD. DESIGN, SETTING, AND PARTICIPANTS A 2-stage study design that included a discovery cohort of families with PD and a replication cohort of familial probands was used. In the discovery cohort, rare exonic variants that segregated in multiple affected individuals in a family and were predicted to be conserved or damaging were retained. Genes with retained variants were prioritized if expressed in the brain and located within PD-relevant pathways. Genes in which prioritized variants were observed in at least 4 families were selected as candidate genes for replication in the replication cohort. The setting was among individuals with familial PD enrolled from academic movement disorder specialty clinics across the United States. All participants had a family history of PD. MAIN OUTCOMES AND MEASURES Identification of genes containing rare, likely deleterious, genetic variants in individuals with familial PD using a 2-stage exome sequencing study design. RESULTS The 93 individuals from 32 families in the discovery cohort (49.5% [46 of 93] female) had a mean (SD) age at onset of 61.8 (10.0) years. The 49 individuals with familial PD in the replication cohort (32.6% [16 of 49] female) had a mean (SD) age at onset of 50.1 (15.7) years. Discovery cohort recruitment dates were 1999 to 2009, and replication cohort recruitment dates were 2003 to 2014. Data analysis dates were 2011 to 2015. Three genes containing a total of 13 rare and potentially damaging variants were prioritized in the discovery cohort. Two of these genes (TNK2 and TNR) also had rare variants that were predicted to be damaging in the replication cohort. 
All 9 variants identified in the 2 replicated genes in 12 families across the discovery and replication cohorts were confirmed via Sanger sequencing. CONCLUSIONS AND RELEVANCE TNK2 and TNR harbored rare, likely deleterious, variants in individuals having familial PD, with similar findings in an independent cohort. To our knowledge, these genes have not been previously associated with PD, although they have been linked to critical neuronal functions. Further studies are required to confirm a potential role for these genes in the pathogenesis of PD. PMID:26595808
Germline EMSY sequence alterations in hereditary breast cancer and ovarian cancer families.

PubMed

Määttä, Kirsi M; Nurminen, Riikka; Kankuri-Tammilehto, Minna; Kallioniemi, Anne; Laasanen, Satu-Leena; Schleutker, Johanna

2017-07-24

BRCA1 and BRCA2 mutations explain approximately one-fifth of the inherited susceptibility in high-risk Finnish hereditary breast and ovarian cancer (HBOC) families. EMSY is located in the breast cancer-associated chromosomal region 11q13. The EMSY gene encodes a BRCA2-interacting protein that has been implicated in DNA damage repair and genomic instability. We analysed the role of germline EMSY variation in breast/ovarian cancer predisposition. The present study describes the first EMSY screening in patients with high familial risk for this disease. Index individuals from 71 high-risk, BRCA1/2-negative HBOC families were screened for germline EMSY sequence alterations in protein coding regions and exon-intron boundaries using Sanger sequencing and TaqMan assays. The identified variants were further screened in 36 Finnish HBOC patients and 904 controls. Moreover, one novel intronic deletion was screened in a cohort of 404 breast cancer patients unselected for family history. Haplotype block structure and the association of haplotypes with breast/ovarian cancer were analysed using Haploview. The functionality of the identified variants was predicted using Haploreg, RegulomeDB, Human Splicing Finder, and Pathogenic-or-Not-Pipeline 2. Altogether, 12 germline EMSY variants were observed. Two alterations were located in the coding region, five alterations were intronic, and five alterations were located in the 3' untranslated region (UTR). Variant frequencies did not significantly differ between cases and controls. The novel variant, c.2709+122delT, was detected in 1 out of 107 (0.9%) breast cancer patients, and the carrier showed a bilateral form of the disease. The deletion was absent in 897 controls (OR = 25.28; P = 0.1) and in 404 breast cancer patients unselected for family history. No haplotype was identified to increase the risk of breast/ovarian cancer. 
Functional analyses suggested that variants, particularly in the 3'UTR, were located within regulatory elements. The novel deletion was predicted to affect splicing regulatory elements. These results suggest that the identified EMSY variants are likely neutral at the population level. However, these variants may contribute to breast/ovarian cancer risk in single families. Additional analyses are warranted for rare novel intronic deletions and the 3'UTR variants predicted to have functional roles.
Simple and efficient identification of rare recessive pathologically important sequence variants from next generation exome sequence data.

PubMed

Carr, Ian M; Morgan, Joanne; Watson, Christopher; Melnik, Svitlana; Diggle, Christine P; Logan, Clare V; Harrison, Sally M; Taylor, Graham R; Pena, Sergio D J; Markham, Alexander F; Alkuraya, Fowzan S; Black, Graeme C M; Ali, Manir; Bonthron, David T

2013-07-01

Massively parallel ("next generation") DNA sequencing (NGS) has quickly become the method of choice for seeking pathogenic mutations in rare uncharacterized monogenic diseases. Typically, before DNA sequencing, protein-coding regions are enriched from patient genomic DNA, representing either the entire genome ("exome sequencing") or selected mapped candidate loci. Sequence variants, identified as differences between the patient's and the human genome reference sequences, are then filtered according to various quality parameters. Changes are screened against datasets of known polymorphisms, such as dbSNP and the 1000 Genomes Project, in the effort to narrow the list of candidate causative variants. An increasing number of commercial services now offer to both generate and align NGS data to a reference genome. This potentially allows small groups with limited computing infrastructure and informatics skills to utilize this technology. However, the capability to effectively filter and assess sequence variants is still an important bottleneck in the identification of deleterious sequence variants in both research and diagnostic settings. We have developed an approach to this problem comprising a user-friendly suite of programs that can interactively analyze, filter and screen data from enrichment-capture NGS data. These programs ("Agile Suite") are particularly suitable for small-scale gene discovery or for diagnostic analysis. © 2013 WILEY PERIODICALS, INC.
Paternal lineage early onset hereditary ovarian cancers: A Familial Ovarian Cancer Registry study.

PubMed

Eng, Kevin H; Szender, J Brian; Etter, John Lewis; Kaur, Jasmine; Poblete, Samantha; Huang, Ruea-Yea; Zhu, Qianqian; Grzesik, Katherine A; Battaglia, Sebastiano; Cannioto, Rikki; Krolewski, John J; Zsiros, Emese; Frederick, Peter J; Lele, Shashikant B; Moysich, Kirsten B; Odunsi, Kunle O

2018-02-01

Given prior evidence that an affected woman conveys a higher risk of ovarian cancer to her sister than to her mother, we hypothesized that there exists an X-linked variant evidenced by transmission to a woman from her paternal grandmother via her father. We ascertained 3,499 grandmother/granddaughter pairs from the Familial Ovarian Cancer Registry at the Roswell Park Cancer Institute observing 892 informative pairs with 157 affected granddaughters. We performed germline X-chromosome exome sequencing on 186 women with ovarian cancer from the registry. The rate of cancers was 28.4% in paternal grandmother/granddaughter pairs and 13.9% in maternal pairs consistent with an X-linked dominant model (Chi-square test X2 = 0.02, p = 0.89) and inconsistent with an autosomal dominant model (X2 = 20.4, p<0.001). Paternal grandmother cases had an earlier age-of-onset versus maternal cases (hazard ratio HR = 1.59, 95%CI: 1.12-2.25) independent of BRCA1/2 status. Reinforcing the X-linked hypothesis, we observed an association between prostate cancer in men and ovarian cancer in his mother and daughters (odds ratio, OR = 2.34, p = 0.034). Unaffected mothers with affected daughters produced significantly more daughters than sons (ratio = 1.96, p<0.005). We performed exome sequencing in reported BRCA negative cases from the registry. Considering age-of-onset, one missense variant (rs176026 in MAGEC3) reached chromosome-wide significance (Hazard ratio HR = 2.85, 95%CI: 1.75-4.65) advancing the age of onset by 6.7 years. In addition to the well-known contribution of BRCA, we demonstrate that a genetic locus on the X-chromosome contributes to ovarian cancer risk. An X-linked pattern of inheritance has implications for genetic risk stratification. Women with an affected paternal grandmother and sisters of affected women are at increased risk for ovarian cancer. Further work is required to validate this variant and to characterize carrier families.
Paternal lineage early onset hereditary ovarian cancers: A Familial Ovarian Cancer Registry study

PubMed Central

Eng, Kevin H.; Szender, J. Brian; Etter, John Lewis; Kaur, Jasmine; Poblete, Samantha; Huang, Ruea-Yea; Zhu, Qianqian; Battaglia, Sebastiano; Cannioto, Rikki; Krolewski, John J.; Zsiros, Emese; Frederick, Peter J.; Lele, Shashikant B.; Moysich, Kirsten B.; Odunsi, Kunle O.

2018-01-01

Given prior evidence that an affected woman conveys a higher risk of ovarian cancer to her sister than to her mother, we hypothesized that there exists an X-linked variant evidenced by transmission to a woman from her paternal grandmother via her father. We ascertained 3,499 grandmother/granddaughter pairs from the Familial Ovarian Cancer Registry at the Roswell Park Cancer Institute observing 892 informative pairs with 157 affected granddaughters. We performed germline X-chromosome exome sequencing on 186 women with ovarian cancer from the registry. The rate of cancers was 28.4% in paternal grandmother/granddaughter pairs and 13.9% in maternal pairs consistent with an X-linked dominant model (Chi-square test X2 = 0.02, p = 0.89) and inconsistent with an autosomal dominant model (X2 = 20.4, p<0.001). Paternal grandmother cases had an earlier age-of-onset versus maternal cases (hazard ratio HR = 1.59, 95%CI: 1.12–2.25) independent of BRCA1/2 status. Reinforcing the X-linked hypothesis, we observed an association between prostate cancer in men and ovarian cancer in his mother and daughters (odds ratio, OR = 2.34, p = 0.034). Unaffected mothers with affected daughters produced significantly more daughters than sons (ratio = 1.96, p<0.005). We performed exome sequencing in reported BRCA negative cases from the registry. Considering age-of-onset, one missense variant (rs176026 in MAGEC3) reached chromosome-wide significance (Hazard ratio HR = 2.85, 95%CI: 1.75–4.65) advancing the age of onset by 6.7 years. In addition to the well-known contribution of BRCA, we demonstrate that a genetic locus on the X-chromosome contributes to ovarian cancer risk. An X-linked pattern of inheritance has implications for genetic risk stratification. Women with an affected paternal grandmother and sisters of affected women are at increased risk for ovarian cancer. Further work is required to validate this variant and to characterize carrier families. PMID:29447163
Regularized rare variant enrichment analysis for case-control exome sequencing data.

PubMed

Larson, Nicholas B; Schaid, Daniel J

2014-02-01

Rare variants have recently garnered an immense amount of attention in genetic association analysis. However, unlike methods traditionally used for single marker analysis in GWAS, rare variant analysis often requires some method of aggregation, since single marker approaches are poorly powered for typical sequencing study sample sizes. Advancements in sequencing technologies have rendered next-generation sequencing platforms a realistic alternative to traditional genotyping arrays. Exome sequencing in particular not only provides base-level resolution of genetic coding regions, but also a natural paradigm for aggregation via genes and exons. Here, we propose the use of penalized regression in combination with variant aggregation measures to identify rare variant enrichment in exome sequencing data. In contrast to marginal gene-level testing, we simultaneously evaluate the effects of rare variants in multiple genes, focusing on gene-based least absolute shrinkage and selection operator (LASSO) and exon-based sparse group LASSO models. By using gene membership as a grouping variable, the sparse group LASSO can be used as a gene-centric analysis of rare variants while also providing a penalized approach toward identifying specific regions of interest. We apply extensive simulations to evaluate the performance of these approaches with respect to specificity and sensitivity, comparing these results to multiple competing marginal testing methods. Finally, we discuss our findings and outline future research. © 2013 WILEY PERIODICALS, INC.
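For reference, the sparse group LASSO mentioned in this abstract is commonly written as a penalized likelihood problem. The form below is the standard textbook formulation and is offered as a sketch; the abstract does not give the authors' exact parametrization, so the symbols here are assumptions: n is the number of subjects, \ell(\beta) is the logistic log-likelihood for case-control status, g indexes the G genes, \beta^{(g)} holds the p_g exon-level coefficients of gene g, \lambda \ge 0 sets the overall penalty strength, and \alpha \in [0, 1] balances the two penalty terms.

```latex
\hat{\beta} = \arg\min_{\beta}\;
  -\frac{1}{n}\,\ell(\beta)
  + (1-\alpha)\,\lambda \sum_{g=1}^{G} \sqrt{p_g}\,\lVert \beta^{(g)} \rVert_2
  + \alpha\,\lambda\,\lVert \beta \rVert_1
```

Setting \alpha = 1 recovers the ordinary LASSO, and \alpha = 0 recovers the group LASSO. Intermediate values can zero out whole genes through the group term while also zeroing individual exons within a retained gene through the \ell_1 term, which matches the gene-centric yet region-specific behaviour the abstract describes.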
Amyotrophic lateral sclerosis onset is influenced by the burden of rare variants in known amyotrophic lateral sclerosis genes.

PubMed

Cady, Janet; Allred, Peggy; Bali, Taha; Pestronk, Alan; Goate, Alison; Miller, Timothy M; Mitra, Robi D; Ravits, John; Harms, Matthew B; Baloh, Robert H

2015-01-01

To define the genetic landscape of amyotrophic lateral sclerosis (ALS) and assess the contribution of possible oligogenic inheritance, we aimed to comprehensively sequence 17 known ALS genes in 391 ALS patients from the United States. Targeted pooled-sample sequencing was used to identify variants in 17 ALS genes. Fragment size analysis was used to define ATXN2 and C9ORF72 expansion sizes. Genotype-phenotype correlations were made with individual variants and total burden of variants. Rare variant associations for risk of ALS were investigated at both the single variant and gene level. A total of 64.3% of familial and 27.8% of sporadic subjects carried potentially pathogenic novel or rare coding variants identified by sequencing or an expanded repeat in C9ORF72 or ATXN2; 3.8% of subjects had variants in >1 ALS gene, and these individuals had disease onset 10 years earlier (p = 0.0046) than subjects with variants in a single gene. The number of potentially pathogenic coding variants did not influence disease duration or site of onset. Rare and potentially pathogenic variants in known ALS genes are present in >25% of apparently sporadic and 64% of familial patients, significantly higher than previous reports using less comprehensive sequencing approaches. A significant number of subjects carried variants in >1 gene, which influenced the age of symptom onset and supports oligogenic inheritance as relevant to disease pathogenesis. © 2014 American Neurological Association.
NCI-60 Whole Exome Sequencing and Pharmacological CellMiner Analyses

PubMed Central

Reinhold, William C.; Varma, Sudhir; Sousa, Fabricio; Sunshine, Margot; Abaan, Ogan D.; Davis, Sean R.; Reinhold, Spencer W.; Kohn, Kurt W.; Morris, Joel; Meltzer, Paul S.; Doroshow, James H.; Pommier, Yves

2014-01-01

Exome sequencing provides unprecedented insights into cancer biology and pharmacological response. Here we assess these two parameters for the NCI-60, which is among the richest genomic and pharmacological publicly available cancer cell line databases. Homozygous genetic variants that putatively affect protein function were identified in 1,199 genes (approximately 6% of all genes). Variants that are either enriched or depleted compared to non-cancerous genomes, and thus may be influential in cancer progression and differential drug response were identified for 2,546 genes. Potential gene knockouts are made available. Assessment of cell line response to 19,940 compounds, including 110 FDA-approved drugs, reveals ≈80-fold range in resistance versus sensitivity response across cell lines. 103,422 gene variants were significantly correlated with at least one compound (at p<0.0002). These include genes of known pharmacological importance such as IGF1R, BRAF, RAD52, MTOR, STAT2 and TSC2 as well as a large number of candidate genes such as NOM1, TLL2, and XDH. We introduce two new web-based CellMiner applications that enable exploration of variant-to-compound relationships for a broad range of researchers, especially those without bioinformatics support. The first tool, “Genetic variant versus drug visualization”, provides a visualization of significant correlations between drug activity-gene variant combinations. Examples are given for the known vemurafenib-BRAF, and novel ifosfamide-RAD52 pairings. The second, “Genetic variant summation” allows an assessment of cumulative genetic variations for up to 150 combined genes together; and is designed to identify the variant burden for molecular pathways or functional grouping of genes. An example of its use is provided for the EGFR-ERBB2 pathway gene variant data and the identification of correlated EGFR, ERBB2, MTOR, BRAF, MEK and ERK inhibitors. 
The new tools are implemented as an updated web-based CellMiner version, for which the present publication serves as a compendium. PMID:25032700
Comparison of Ion Personal Genome Machine Platforms for the Detection of Variants in BRCA1 and BRCA2.

PubMed

Hwang, Sang Mee; Lee, Ki Chan; Lee, Min Seob; Park, Kyoung Un

2018-01-01

Transition to next generation sequencing (NGS) for BRCA1/BRCA2 analysis in clinical laboratories is ongoing, but different platforms and/or data analysis pipelines give different results, resulting in difficulties in implementation. We have evaluated the Ion Personal Genome Machine (PGM) Platforms (Ion PGM, Ion PGM Dx, Thermo Fisher Scientific) for the analysis of BRCA1/2. The results of Ion PGM with OTG-snpcaller, a pipeline based on Torrent mapping alignment program and Genome Analysis Toolkit, from 75 clinical samples and 14 reference DNA samples were compared with Sanger sequencing for BRCA1/BRCA2. Ten clinical samples and 14 reference DNA samples were additionally sequenced by Ion PGM Dx with Torrent Suite. Fifty types of variants including 18 pathogenic or variants of unknown significance were identified from 75 clinical samples and known variants of the reference samples were confirmed by Sanger sequencing and/or NGS. One false-negative result was present for Ion PGM/OTG-snpcaller for an indel variant misidentified as a single nucleotide variant. However, eight discordant results were present for Ion PGM Dx/Torrent Suite with both false-positive and -negative results. A 40-bp deletion, a 4-bp deletion and a 1-bp deletion variant were not called and a false-positive deletion was identified. Four other variants were misidentified as another variant. Ion PGM/OTG-snpcaller showed acceptable performance with good concordance with Sanger sequencing. However, Ion PGM Dx/Torrent Suite showed many discrepant results not suitable for use in a clinical laboratory, requiring further optimization of the data analysis for calling variants.
Human papillomavirus type 18 variant lineages in United States populations characterized by sequence analysis of LCR-E6, E2, and L1 regions.

PubMed

Arias-Pulido, Hugo; Peyton, Cheri L; Torrez-Martínez, Norah; Anderson, D Nelson; Wheeler, Cosette M

2005-07-20

While HPV 16 variant lineages have been well characterized, the knowledge about HPV 18 variants is limited. In this study, HPV 18 nucleotide variations in the E2 hinge region were characterized by sequence analysis in 47 control and 51 tumor specimens. Fifty of these specimens were randomly selected for sequencing of an LCR-E6 segment and 20 samples representative of LCR-E6 and E2 sequence variants were examined across the L1 region. A total of 2770 nucleotides per HPV 18 variant genome were considered in this study. HPV 18 variant nucleotides were linked among all gene segments analyzed and grouped into three main branches: Asian-American (AA), European (E), and African (Af). These three branches were equally distributed among controls and cases and when stratified by Hispanic and non-Hispanic ethnicities. Among invasive cervical cancer cases, no significant differences in the three HPV variant branches were observed among ethnic groups or when stratified by histopathology (squamous vs. adenocarcinoma). The Af branch showed the greatest nucleotide variability when compared to the HPV 18 reference sequence and was more closely related to HPV 45 than either AA or E branches. Our data also characterize nucleotide and amino acid variations in the L1 capsid gene among HPV 18 variants, which may be relevant to vaccine strategies and subsequent studies of naturally occurring HPV 18 variants. Several novel HPV 18 nucleotide variations were identified in this study.
VarBin, a novel method for classifying true and false positive variants in NGS data

PubMed Central

2013-01-01

Background Variant discovery for rare genetic diseases using Illumina genome or exome sequencing involves screening of up to millions of variants to find only the one or few causative variant(s). Sequencing or alignment errors create "false positive" variants, which are often retained in the variant screening process. Methods to remove false positive variants often retain many false positive variants. This report presents VarBin, a method to prioritize variants based on a false positive variant likelihood prediction. Methods VarBin uses the Genome Analysis Toolkit variant calling software to calculate the variant-to-wild type genotype likelihood ratio at each variant change and position divided by read depth. The resulting Phred-scaled, likelihood-ratio by depth (PLRD) was used to segregate variants into 4 Bins with Bin 1 variants most likely true and Bin 4 most likely false positive. PLRD values were calculated for a proband of interest and 41 additional Illumina HiSeq, exome and whole genome samples (proband's family or unrelated samples). At variant sites without apparent sequencing or alignment error, wild type/non-variant calls cluster near -3 PLRD and variant calls typically cluster above 10 PLRD. Sites with systematic variant calling problems (evident by variant quality scores and biases as well as displayed on the IGV viewer) tend to have higher and more variable wild type/non-variant PLRD values. Depending on the separation of a proband's variant PLRD value from the cluster of wild type/non-variant PLRD values for background samples at the same variant change and position, the VarBin method's classification is assigned to each proband variant (Bin 1 to Bin 4). Results To assess VarBin performance, Sanger sequencing was performed on 98 variants in the proband and background samples. True variants were confirmed in 97% of Bin 1 variants, 30% of Bin 2, and 0% of Bin 3/Bin 4. 
Conclusions These data indicate that VarBin correctly classifies the majority of true variants as Bin 1 and Bin 3/4 contained only false positive variants. The "uncertain" Bin 2 contained both true and false positive variants. Future work will further differentiate the variants in Bin 2. PMID:24266885
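The PLRD quantity described in the abstract above can be illustrated with a minimal sketch. It assumes that the likelihoods are available as GATK-style Phred-scaled genotype likelihoods (PL = -10·log10 L), in which case the Phred-scaled variant-to-wild-type ratio reduces to a difference of PL values. The function name, its arguments, and the example PL values are hypothetical; the abstract does not give the exact formula or the Bin cut-offs, so no binning is attempted here.

```python
def plrd(pl_ref: float, pl_var: float, depth: int) -> float:
    """Phred-scaled variant-to-wild-type likelihood ratio divided by read depth.

    Assumption: pl_ref and pl_var are Phred-scaled genotype likelihoods
    (PL = -10 * log10(L)) for the wild-type and variant genotypes, so
    10 * log10(L_var / L_ref) simplifies to pl_ref - pl_var.
    """
    if depth <= 0:
        raise ValueError("depth must be positive")
    return (pl_ref - pl_var) / depth


# Hypothetical wild-type call at 30x: the variant genotype is penalised
# by roughly 3 Phred units per read, giving a PLRD near -3.
wild_type_site = plrd(pl_ref=0, pl_var=90, depth=30)

# Hypothetical clean variant call at 30x: the wild-type genotype is
# strongly disfavoured, giving a PLRD well above 10.
variant_site = plrd(pl_ref=600, pl_var=0, depth=30)

print(wild_type_site, variant_site)
```

Under these assumed inputs the two example sites fall near the clusters the abstract reports (about -3 for wild-type calls, above 10 for variant calls). Real PL values would come from the caller's output.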
Unlocking hidden genomic sequence

PubMed Central

Keith, Jonathan M.; Cochran, Duncan A. E.; Lala, Gita H.; Adams, Peter; Bryant, Darryn; Mitchelson, Keith R.

2004-01-01

Despite the success of conventional Sanger sequencing, significant regions of many genomes still present major obstacles to sequencing. Here we propose a novel approach with the potential to alleviate a wide range of sequencing difficulties. The technique involves extracting target DNA sequence from variants generated by introduction of random mutations. The introduction of mutations does not destroy original sequence information, but distributes it amongst multiple variants. Some of these variants lack problematic features of the target and are more amenable to conventional sequencing. The technique has been successfully demonstrated with mutation levels up to an average 18% base substitution and has been used to read previously intractable poly(A), AT-rich and GC-rich motifs. PMID:14973330
Correlation of rare coding variants in the gene encoding human glucokinase regulatory protein with phenotypic, cellular, and kinetic outcomes.

PubMed

Rees, Matthew G; Ng, David; Ruppert, Sarah; Turner, Clesson; Beer, Nicola L; Swift, Amy J; Morken, Mario A; Below, Jennifer E; Blech, Ilana; Mullikin, James C; McCarthy, Mark I; Biesecker, Leslie G; Gloyn, Anna L; Collins, Francis S

2012-01-01

Defining the genetic contribution of rare variants to common diseases is a major basic and clinical science challenge that could offer new insights into disease etiology and provide potential for directed gene- and pathway-based prevention and treatment. Common and rare nonsynonymous variants in the GCKR gene are associated with alterations in metabolic traits, most notably serum triglyceride levels. GCKR encodes glucokinase regulatory protein (GKRP), a predominantly nuclear protein that inhibits hepatic glucokinase (GCK) and plays a critical role in glucose homeostasis. The mode of action of rare GCKR variants remains unexplored. We identified 19 nonsynonymous GCKR variants among 800 individuals from the ClinSeq medical sequencing project. Excluding the previously described common missense variant p.Pro446Leu, all variants were rare in the cohort. Accordingly, we functionally characterized all variants to evaluate their potential phenotypic effects. Defects were observed for the majority of the rare variants after assessment of cellular localization, ability to interact with GCK, and kinetic activity of the encoded proteins. Comparing the individuals with functional rare variants to those without such variants showed associations with lipid phenotypes. Our findings suggest that, while nonsynonymous GCKR variants, excluding p.Pro446Leu, are rare in individuals of mixed European descent, the majority do affect protein function. In sum, this study utilizes computational, cell biological, and biochemical methods to present a model for interpreting the clinical significance of rare genetic variants in common disease.

Whole exome sequencing is necessary to clarify ID/DD cases with de novo copy number variants of uncertain significance: Two proof-of-concept examples.

PubMed

Giorgio, Elisa; Ciolfi, Andrea; Biamino, Elisa; Caputo, Viviana; Di Gregorio, Eleonora; Belligni, Elga Fabia; Calcia, Alessandro; Gaidolfi, Elena; Bruselles, Alessandro; Mancini, Cecilia; Cavalieri, Simona; Molinatto, Cristina; Cirillo Silengo, Margherita; Ferrero, Giovanni Battista; Tartaglia, Marco; Brusco, Alfredo

2016-07-01

Whole exome sequencing (WES) is a powerful tool to identify clinically undefined forms of intellectual disability/developmental delay (ID/DD), especially in consanguineous families. Here we report the genetic definition of two sporadic cases, with syndromic ID/DD for whom array-Comparative Genomic Hybridization (aCGH) identified a de novo copy number variant (CNV) of uncertain significance. The phenotypes included microcephaly with brachycephaly and a distinctive facies in one proband, and hypotonia in the legs and mild ataxia in the other. WES allowed identification of a functionally relevant homozygous variant affecting a known disease gene for rare syndromic ID/DD in each proband, that is, c.1423C>T (p.Arg377*) in the Trafficking Protein Particle Complex 9 (TRAPPC9), and c.154T>C (p.Cys52Arg) in the Very Low Density Lipoprotein Receptor (VLDLR). Four mutations affecting TRAPPC9 have been previously reported, and the present finding further depicts this syndromic form of ID, which includes microcephaly with brachycephaly, corpus callosum hypoplasia, facial dysmorphism, and overweight. VLDLR-associated cerebellar hypoplasia (VLDLR-CH) is characterized by non-progressive congenital ataxia and moderate-to-profound intellectual disability. The c.154T>C (p.Cys52Arg) mutation was associated with a very mild form of ataxia, mild intellectual disability, and cerebellar hypoplasia without cortical gyri simplification. In conclusion, we report two novel cases with rare causes of autosomal recessive ID, which document how interpreting de novo array-CGH variants represents a challenge in consanguineous families; as such, clinical WES should be considered in diagnostic testing. © 2016 Wiley Periodicals, Inc.
Clinical, biochemical and molecular characterization of cystinuria in a cohort of 12 patients.

PubMed

Barbosa, M; Lopes, A; Mota, C; Martins, E; Oliveira, J; Alves, S; De Bonis, P; Mota, M do Céu; Dias, C; Rodrigues-Santos, P; Fortuna, A M; Quelhas, D; Lacerda, L; Bisceglia, L; Cardoso, M L

2012-01-01

Cystinuria is a rare autosomal inherited disorder characterized by impaired transport of cystine and dibasic amino acids in the proximal renal tubule. Classically, cystinuria is classified as type I (silent heterozygotes) and non-type I (heterozygotes with urinary hyperexcretion of cystine). Molecularly, cystinuria is classified as type A (mutations in the SLC3A1 gene) and type B (mutations in the SLC7A9 gene). The goal of this study is to provide a comprehensive clinical, biochemical and molecular characterization of a cohort of 12 Portuguese patients affected with cystinuria in order to provide insight into genotype-phenotype correlations. We describe seven type I and five non-type I patients. Regarding the molecular classification, seven patients were type A and five were type B. In the SLC3A1 gene, two large genomic rearrangements and 13 sequence variants, including four new variants c.611-2A>C; c.1136+44G>A; c.1597T (p.Y533N); c.*70A>G, were found. One large genomic rearrangement was found in the SLC7A9 gene as well as 24 sequence variants including 3 novel variants: c.216C>T (p.C72C), c.1119G>A (p.S373S) and c.*82C>T. In our cohort the most frequent pathogenic mutations were: large rearrangements (33.3% of mutant alleles) and a missense mutation c.1400T>C (p.M467T) (11.1%). This report expands the spectrum of SLC3A1 and SLC7A9 mutations and provides guidance in the clinical implementation of molecular assays in routine genetic counseling of Portuguese patients affected with cystinuria. © 2011 John Wiley & Sons A/S.
Loss-of-Function Mutations in the WNT Co-receptor LRP6 Cause Autosomal-Dominant Oligodontia.

PubMed

Massink, Maarten P G; Créton, Marijn A; Spanevello, Francesca; Fennis, Willem M M; Cune, Marco S; Savelberg, Sanne M C; Nijman, Isaäc J; Maurice, Madelon M; van den Boogaard, Marie-José H; van Haaften, Gijs

2015-10-01

Tooth agenesis is one of the most common developmental anomalies in man. Oligodontia, a severe form of tooth agenesis, occurs both as an isolated anomaly and as a syndromal feature. We performed exome sequencing on 20 unrelated individuals with apparent non-syndromic oligodontia and failed to detect mutations in genes previously associated with oligodontia. In three of the probands, we detected heterozygous variants in LRP6, and sequencing of additional oligodontia-affected individuals yielded one additional mutation in LRP6. Three mutations (c.1144_1145dupAG [p.Ala383Glyfs(∗)8], c.1779dupT [p.Glu594(∗)], and c.2224_2225dupTT [p.Leu742Phefs(∗)7]) are predicted to truncate the protein, whereas the fourth (c.56C>T [p.Ala19Val]) is a missense variant of a conserved residue located at the cleavage site of the protein's signal peptide. All four affected individuals harboring a LRP6 mutation had a family history of tooth agenesis. LRP6 encodes a transmembrane cell-surface protein that functions as a co-receptor with members from the Frizzled protein family in the canonical Wnt/β-catenin signaling cascade. In this same pathway, WNT10A was recently identified as a major contributor in the etiology of non-syndromic oligodontia. We show that the LRP6 missense variant (c.56C>T) results in altered glycosylation and improper subcellular localization of the protein, resulting in abrogated activation of the Wnt pathway. Our results identify LRP6 variants as contributing to the etiology of non-syndromic autosomal-dominant oligodontia and suggest that this gene is a candidate for screening in DNA diagnostics. Copyright © 2015 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
Loss-of-Function Mutations in the WNT Co-receptor LRP6 Cause Autosomal-Dominant Oligodontia

PubMed Central

Massink, Maarten P.G.; Créton, Marijn A.; Spanevello, Francesca; Fennis, Willem M.M.; Cune, Marco S.; Savelberg, Sanne M.C.; Nijman, Isaäc J.; Maurice, Madelon M.; van den Boogaard, Marie-José H.; van Haaften, Gijs

2015-01-01

Tooth agenesis is one of the most common developmental anomalies in man. Oligodontia, a severe form of tooth agenesis, occurs both as an isolated anomaly and as a syndromal feature. We performed exome sequencing on 20 unrelated individuals with apparent non-syndromic oligodontia and failed to detect mutations in genes previously associated with oligodontia. In three of the probands, we detected heterozygous variants in LRP6, and sequencing of additional oligodontia-affected individuals yielded one additional mutation in LRP6. Three mutations (c.1144_1145dupAG [p.Ala383Glyfs∗8], c.1779dupT [p.Glu594∗], and c.2224_2225dupTT [p.Leu742Phefs∗7]) are predicted to truncate the protein, whereas the fourth (c.56C>T [p.Ala19Val]) is a missense variant of a conserved residue located at the cleavage site of the protein’s signal peptide. All four affected individuals harboring a LRP6 mutation had a family history of tooth agenesis. LRP6 encodes a transmembrane cell-surface protein that functions as a co-receptor with members from the Frizzled protein family in the canonical Wnt/β-catenin signaling cascade. In this same pathway, WNT10A was recently identified as a major contributor in the etiology of non-syndromic oligodontia. We show that the LRP6 missense variant (c.56C>T) results in altered glycosylation and improper subcellular localization of the protein, resulting in abrogated activation of the Wnt pathway. Our results identify LRP6 variants as contributing to the etiology of non-syndromic autosomal-dominant oligodontia and suggest that this gene is a candidate for screening in DNA diagnostics. PMID:26387593
Genotype-specific signal generation based on digestion of 3-way DNA junctions: application to KRAS variation detection.

PubMed

Amicarelli, Giulia; Adlerstein, Daniel; Shehi, Erlet; Wang, Fengfei; Makrigiorgos, G Mike

2006-10-01

Genotyping methods that reveal single-nucleotide differences are useful for a wide range of applications. We used digestion of 3-way DNA junctions in a novel technology, OneCutEventAmplificatioN (OCEAN) that allows sequence-specific signal generation and amplification. We combined OCEAN with peptide-nucleic-acid (PNA)-based variant enrichment to detect and simultaneously genotype v-Ki-ras2 Kirsten rat sarcoma viral oncogene homolog (KRAS) codon 12 sequence variants in human tissue specimens. We analyzed KRAS codon 12 sequence variants in 106 lung cancer surgical specimens. We conducted a PNA-PCR reaction that suppresses wild-type KRAS amplification and genotyped the product with a set of OCEAN reactions carried out in fluorescence microplate format. The isothermal OCEAN assay enabled a 3-way DNA junction to form between the specific target nucleic acid, a fluorescently labeled "amplifier", and an "anchor". The amplifier-anchor contact contains the recognition site for a restriction enzyme. Digestion produces a cleaved amplifier and generation of a fluorescent signal. The cleaved amplifier dissociates from the 3-way DNA junction, allowing a new amplifier to bind and propagate the reaction. The system detected and genotyped KRAS sequence variants down to approximately 0.3% variant-to-wild-type alleles. PNA-PCR/OCEAN had a concordance rate with PNA-PCR/sequencing of 93% to 98%, depending on the exact implementation. Concordance rate with restriction endonuclease-mediated selective-PCR/sequencing was 89%. OCEAN is a practical and low-cost novel technology for sequence-specific signal generation. Reliable analysis of KRAS sequence alterations in human specimens circumvents the requirement for sequencing. Application is expected in genotyping KRAS codon 12 sequence variants in surgical specimens or in bodily fluids, as well as single-base variations and sequence alterations in other genes.
The Personal Genome Project Canada: findings from whole genome sequences of the inaugural 56 participants

PubMed Central

Reuter, Miriam S.; Walker, Susan; Thiruvahindrapuram, Bhooma; Whitney, Joe; Cohn, Iris; Sondheimer, Neal; Yuen, Ryan K.C.; Trost, Brett; Paton, Tara A.; Pereira, Sergio L.; Herbrick, Jo-Anne; Wintle, Richard F.; Merico, Daniele; Howe, Jennifer; MacDonald, Jeffrey R.; Lu, Chao; Nalpathamkalam, Thomas; Sung, Wilson W.L.; Wang, Zhuozhi; Patel, Rohan V.; Pellecchia, Giovanna; Wei, John; Strug, Lisa J.; Bell, Sherilyn; Kellam, Barbara; Mahtani, Melanie M.; Bassett, Anne S.; Bombard, Yvonne; Weksberg, Rosanna; Shuman, Cheryl; Cohn, Ronald D.; Stavropoulos, Dimitri J.; Bowdin, Sarah; Hildebrandt, Matthew R.; Wei, Wei; Romm, Asli; Pasceri, Peter; Ellis, James; Ray, Peter; Meyn, M. Stephen; Monfared, Nasim; Hosseini, S. Mohsen; Joseph-George, Ann M.; Keeley, Fred W.; Cook, Ryan A.; Fiume, Marc; Lee, Hin C.; Marshall, Christian R.; Davies, Jill; Hazell, Allison; Buchanan, Janet A.; Szego, Michael J.; Scherer, Stephen W.

2018-01-01

BACKGROUND: The Personal Genome Project Canada is a comprehensive public data resource that integrates whole genome sequencing data and health information. We describe genomic variation identified in the initial recruitment cohort of 56 volunteers. METHODS: Volunteers were screened for eligibility and provided informed consent for open data sharing. Using blood DNA, we performed whole genome sequencing and identified all possible classes of DNA variants. A genetic counsellor explained the implication of the results to each participant. RESULTS: Whole genome sequencing of the first 56 participants identified 207 662 805 sequence variants and 27 494 copy number variations. We analyzed a prioritized disease-associated data set (n = 1606 variants) according to standardized guidelines, and interpreted 19 variants in 14 participants (25%) as having obvious health implications. Six of these variants (e.g., in BRCA1 or mosaic loss of an X chromosome) were pathogenic or likely pathogenic. Seven were risk factors for cancer, cardiovascular or neurobehavioural conditions. Four other variants — associated with cancer, cardiac or neurodegenerative phenotypes — remained of uncertain significance because of discrepancies among databases. We also identified a large structural chromosome aberration and a likely pathogenic mitochondrial variant. There were 172 recessive disease alleles (e.g., 5 individuals carried mutations for cystic fibrosis). Pharmacogenomics analyses revealed another 3.9 potentially relevant genotypes per individual. INTERPRETATION: Our analyses identified a spectrum of genetic variants with potential health impact in 25% of participants. When also considering recessive alleles and variants with potential pharmacologic relevance, all 56 participants had medically relevant findings. Although access is mostly limited to research, whole genome sequencing can provide specific and novel information with the potential of major impact for health care. PMID:29431110
Network Analysis of Sequence-Function Relationships and Exploration of Sequence Space of TEM β-Lactamases.

PubMed

Zeil, Catharina; Widmann, Michael; Fademrecht, Silvia; Vogel, Constantin; Pleiss, Jürgen

2016-05-01

The Lactamase Engineering Database (www.LacED.uni-stuttgart.de) was developed to facilitate the classification and analysis of TEM β-lactamases. The current version contains 474 TEM variants. Two hundred fifty-nine variants form a large scale-free network of highly connected point mutants. The network was divided into three subnetworks which were enriched by single phenotypes: one network with predominantly 2be and two networks with 2br phenotypes. Fifteen positions were found to be highly variable, contributing to the majority of the observed variants. Since it is expected that a considerable fraction of the theoretical sequence space is functional, the currently sequenced 474 variants represent only the tip of the iceberg of functional TEM β-lactamase variants which form a huge natural reservoir of highly interconnected variants. Almost 50% of the variants are part of a quartet. Thus, two single mutations that result in functional enzymes can be combined into a functional protein. Most of these quartets consist of the same phenotype, or the mutations are additive with respect to the phenotype. By predicting quartets from triplets, 3,916 unknown variants were constructed. Eighty-seven variants complement multiple quartets and therefore have a high probability of being functional. The construction of a TEM β-lactamase network and subsequent analyses by clustering and quartet prediction are valuable tools to gain new insights into the viable sequence space of TEM β-lactamases and to predict their phenotype. The highly connected sequence space of TEM β-lactamases is ideally suited to network analysis and demonstrates the strengths of network analysis over tree reconstruction methods. Copyright © 2016, American Society for Microbiology. All Rights Reserved.
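The quartet idea in the abstract above (a parent sequence, two single mutants, and the corresponding double mutant) can be sketched as a small set operation. The sketch below assumes each variant is represented as a set of mutation labels relative to a common reference; the function name and the labels `mutA`, `mutB`, `mutC` are placeholders, not entries from the database, and the database's own procedure may differ.

```python
from itertools import combinations


def predict_quartet_completions(known_variants):
    """Return double mutants that would complete a quartet from a known triplet.

    known_variants: iterable of frozensets of mutation labels.
    A triplet is (base, base + m1, base + m2); the predicted fourth
    member is base + m1 + m2 when it is not already known.
    """
    known = set(known_variants)
    all_mutations = set().union(*known)
    predicted = set()
    for base in known:
        # Single mutations that lead from `base` to another known variant.
        steps = [m for m in all_mutations
                 if m not in base and (base | {m}) in known]
        for m1, m2 in combinations(steps, 2):
            double_mutant = base | {m1, m2}
            if double_mutant not in known:
                predicted.add(double_mutant)
    return predicted


# Toy input: one complete quartet (reference, mutA, mutB, mutA+mutB)
# plus a third single mutant, mutC.
known = [
    frozenset(),
    frozenset({"mutA"}),
    frozenset({"mutB"}),
    frozenset({"mutA", "mutB"}),
    frozenset({"mutC"}),
]
for variant in sorted(predict_quartet_completions(known), key=sorted):
    print(sorted(variant))
```

In this toy input the complete quartet yields no prediction, while the two triplets involving `mutC` each propose one unseen double mutant.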
The Personal Genome Project Canada: findings from whole genome sequences of the inaugural 56 participants.

PubMed

Reuter, Miriam S; Walker, Susan; Thiruvahindrapuram, Bhooma; Whitney, Joe; Cohn, Iris; Sondheimer, Neal; Yuen, Ryan K C; Trost, Brett; Paton, Tara A; Pereira, Sergio L; Herbrick, Jo-Anne; Wintle, Richard F; Merico, Daniele; Howe, Jennifer; MacDonald, Jeffrey R; Lu, Chao; Nalpathamkalam, Thomas; Sung, Wilson W L; Wang, Zhuozhi; Patel, Rohan V; Pellecchia, Giovanna; Wei, John; Strug, Lisa J; Bell, Sherilyn; Kellam, Barbara; Mahtani, Melanie M; Bassett, Anne S; Bombard, Yvonne; Weksberg, Rosanna; Shuman, Cheryl; Cohn, Ronald D; Stavropoulos, Dimitri J; Bowdin, Sarah; Hildebrandt, Matthew R; Wei, Wei; Romm, Asli; Pasceri, Peter; Ellis, James; Ray, Peter; Meyn, M Stephen; Monfared, Nasim; Hosseini, S Mohsen; Joseph-George, Ann M; Keeley, Fred W; Cook, Ryan A; Fiume, Marc; Lee, Hin C; Marshall, Christian R; Davies, Jill; Hazell, Allison; Buchanan, Janet A; Szego, Michael J; Scherer, Stephen W

2018-02-05

The Personal Genome Project Canada is a comprehensive public data resource that integrates whole genome sequencing data and health information. We describe genomic variation identified in the initial recruitment cohort of 56 volunteers. Volunteers were screened for eligibility and provided informed consent for open data sharing. Using blood DNA, we performed whole genome sequencing and identified all possible classes of DNA variants. A genetic counsellor explained the implication of the results to each participant. Whole genome sequencing of the first 56 participants identified 207 662 805 sequence variants and 27 494 copy number variations. We analyzed a prioritized disease-associated data set ( n = 1606 variants) according to standardized guidelines, and interpreted 19 variants in 14 participants (25%) as having obvious health implications. Six of these variants (e.g., in BRCA1 or mosaic loss of an X chromosome) were pathogenic or likely pathogenic. Seven were risk factors for cancer, cardiovascular or neurobehavioural conditions. Four other variants - associated with cancer, cardiac or neurodegenerative phenotypes - remained of uncertain significance because of discrepancies among databases. We also identified a large structural chromosome aberration and a likely pathogenic mitochondrial variant. There were 172 recessive disease alleles (e.g., 5 individuals carried mutations for cystic fibrosis). Pharmacogenomics analyses revealed another 3.9 potentially relevant genotypes per individual. Our analyses identified a spectrum of genetic variants with potential health impact in 25% of participants. When also considering recessive alleles and variants with potential pharmacologic relevance, all 56 participants had medically relevant findings. Although access is mostly limited to research, whole genome sequencing can provide specific and novel information with the potential of major impact for health care. © 2018 Joule Inc. or its licensors.
Variants in Nebulin (NEB) Are Linked to the Development of Familial Primary Angle Closure Glaucoma in Basset Hounds

PubMed Central

Ahram, Dina F.; Grozdanic, Sinisa D.; Kecova, Helga; Henkes, Arjen; Collin, Rob W. J.; Kuehn, Markus H.

2015-01-01

Several dog breeds are susceptible to developing primary angle closure glaucoma (PACG), which suggests a genetic basis for the disease. We have identified a four-generation Basset Hound pedigree with characteristic autosomal recessive PACG that closely recapitulates PACG in humans. Our aim is to utilize gene mapping and whole exome sequencing approaches to identify PACG-causing sequence variants in the Basset. Extensive clinical phenotyping of all pedigree members was conducted. SNP-chip genotyping was carried out in 9 affected and 15 unaffected pedigree members. Two-point and multipoint linkage analyses of genome-wide SNP data were performed using Superlink-Online SNP-1.1 and a locus was mapped to chromosome 19q with a maximum LOD score of 3.24. The locus contains 12 Ensembl-predicted canine genes and is syntenic to a region on chromosome 2 in the human genome. Using exome-sequencing analysis, a possibly damaging, non-synonymous variant in the gene Nebulin (NEB) was found to segregate with PACG; it alters a phylogenetically conserved lysine residue. The association of this variant with PACG was confirmed in a secondary cohort of unrelated Basset Hounds (p = 3.4 × 10⁻⁴, OR = 15.3 for homozygosity). Nebulin, a protein that promotes the contractile function of sarcomeres, was found to be prominently expressed in the ciliary muscles of the anterior segment. Our findings may provide insight into the molecular mechanisms that underlie PACG. The phenotypic similarities of disease presentation in dogs and humans may enable the translation of findings made in this study to patients with PACG. PMID:25938837
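As a reading aid for the LOD score quoted above: a LOD score is the base-10 logarithm of the likelihood ratio for linkage versus no linkage, so it converts to an odds ratio by exponentiation. The helper below is a hypothetical illustration of that conversion only and does not reproduce the linkage analysis itself.

```python
def lod_to_odds(lod: float) -> float:
    """Convert a LOD score (log10 likelihood ratio) to odds in favour of linkage."""
    return 10.0 ** lod


# The reported maximum LOD score and the conventional threshold of 3.
print(round(lod_to_odds(3.24)))
print(round(lod_to_odds(3.0)))
```

A LOD score of 3.24 therefore corresponds to odds of roughly 1,700 to 1 in favour of linkage, above the conventional 1,000 to 1 threshold.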
Variants in Nebulin (NEB) Are Linked to the Development of Familial Primary Angle Closure Glaucoma in Basset Hounds.

PubMed

Ahram, Dina F; Grozdanic, Sinisa D; Kecova, Helga; Henkes, Arjen; Collin, Rob W J; Kuehn, Markus H

2015-01-01

Several dog breeds are susceptible to developing primary angle closure glaucoma (PACG), which suggests a genetic basis for the disease. We have identified a four-generation Basset Hound pedigree with characteristic autosomal recessive PACG that closely recapitulates PACG in humans. Our aim is to utilize gene mapping and whole exome sequencing approaches to identify PACG-causing sequence variants in the Basset. Extensive clinical phenotyping of all pedigree members was conducted. SNP-chip genotyping was carried out in 9 affected and 15 unaffected pedigree members. Two-point and multipoint linkage analyses of genome-wide SNP data were performed using Superlink-Online SNP-1.1 and a locus was mapped to chromosome 19q with a maximum LOD score of 3.24. The locus contains 12 Ensembl-predicted canine genes and is syntenic to a region on chromosome 2 in the human genome. Using exome-sequencing analysis, a possibly damaging, non-synonymous variant in the gene Nebulin (NEB) was found to segregate with PACG; it alters a phylogenetically conserved lysine residue. The association of this variant with PACG was confirmed in a secondary cohort of unrelated Basset Hounds (p = 3.4 × 10⁻⁴, OR = 15.3 for homozygosity). Nebulin, a protein that promotes the contractile function of sarcomeres, was found to be prominently expressed in the ciliary muscles of the anterior segment. Our findings may provide insight into the molecular mechanisms that underlie PACG. The phenotypic similarities of disease presentation in dogs and humans may enable the translation of findings made in this study to patients with PACG.
Emergent biomarker derived from next-generation sequencing to identify pain patients requiring uncommonly high opioid doses

PubMed Central

Kringel, D; Ultsch, A; Zimmermann, M; Jansen, J-P; Ilias, W; Freynhagen, R; Griessinger, N; Kopf, A; Stein, C; Doehring, A; Resch, E; Lötsch, J

2017-01-01

Next-generation sequencing (NGS) provides unrestricted access to the genome, but it produces ‘big data’ exceeding in amount and complexity the classical analytical approaches. We introduce a bioinformatics-based classifying biomarker that uses emergent properties in genetics to separate pain patients requiring extremely high opioid doses from controls. Following precisely calculated selection of the 34 most informative markers in the OPRM1, OPRK1, OPRD1 and SIGMAR1 genes, pattern of genotypes belonging to either patient group could be derived using a k-nearest neighbor (kNN) classifier that provided a diagnostic accuracy of 80.6±4%. This outperformed alternative classifiers such as reportedly functional opioid receptor gene variants or complex biomarkers obtained via multiple regression or decision tree analysis. The accumulation of several genetic variants with only minor functional influences may result in a qualitative consequence affecting complex phenotypes, pointing at emergent properties in genetics. PMID:27139154
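The k-nearest-neighbor step mentioned in the abstract above can be illustrated with a minimal, dependency-free sketch. Everything specific here is an assumption made for illustration: genotypes are coded 0/1/2, distance is the number of mismatching markers, k is 3, and the six training vectors over four markers are invented toy data (the study used 34 markers, and its coding and distance measure are not stated in the abstract).

```python
from collections import Counter


def mismatch_distance(a, b):
    """Number of marker positions at which two genotype vectors differ."""
    return sum(x != y for x, y in zip(a, b))


def knn_predict(training_set, query, k=3):
    """Majority vote among the k training genotypes closest to `query`.

    training_set: list of (genotype_vector, label) pairs.
    """
    neighbours = sorted(
        training_set, key=lambda item: mismatch_distance(item[0], query)
    )[:k]
    votes = Counter(label for _, label in neighbours)
    return votes.most_common(1)[0][0]


# Invented toy data: 0/1/2 = copies of the minor allele at each marker.
training_set = [
    ((0, 0, 1, 0), "control"),
    ((0, 1, 1, 0), "control"),
    ((0, 0, 0, 0), "control"),
    ((2, 1, 0, 2), "high_dose"),
    ((2, 2, 0, 2), "high_dose"),
    ((1, 2, 0, 2), "high_dose"),
]

print(knn_predict(training_set, (2, 1, 0, 1)))
print(knn_predict(training_set, (0, 0, 1, 1)))
```

The classifier uses the whole genotype pattern rather than any single marker, which is the sense in which several variants with small individual effects can jointly separate the groups.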
Emergent biomarker derived from next-generation sequencing to identify pain patients requiring uncommonly high opioid doses.

PubMed

Kringel, D; Ultsch, A; Zimmermann, M; Jansen, J-P; Ilias, W; Freynhagen, R; Griessinger, N; Kopf, A; Stein, C; Doehring, A; Resch, E; Lötsch, J

2017-10-01

Next-generation sequencing (NGS) provides unrestricted access to the genome, but it produces 'big data' exceeding in amount and complexity the classical analytical approaches. We introduce a bioinformatics-based classifying biomarker that uses emergent properties in genetics to separate pain patients requiring extremely high opioid doses from controls. Following precisely calculated selection of the 34 most informative markers in the OPRM1, OPRK1, OPRD1 and SIGMAR1 genes, pattern of genotypes belonging to either patient group could be derived using a k-nearest neighbor (kNN) classifier that provided a diagnostic accuracy of 80.6±4%. This outperformed alternative classifiers such as reportedly functional opioid receptor gene variants or complex biomarkers obtained via multiple regression or decision tree analysis. The accumulation of several genetic variants with only minor functional influences may result in a qualitative consequence affecting complex phenotypes, pointing at emergent properties in genetics.
Integration of bioinformatics and imaging informatics for identifying rare PSEN1 variants in Alzheimer's disease.

PubMed

Nho, Kwangsik; Horgusluoglu, Emrin; Kim, Sungeun; Risacher, Shannon L; Kim, Dokyoon; Foroud, Tatiana; Aisen, Paul S; Petersen, Ronald C; Jack, Clifford R; Shaw, Leslie M; Trojanowski, John Q; Weiner, Michael W; Green, Robert C; Toga, Arthur W; Saykin, Andrew J

2016-08-12

Pathogenic mutations in PSEN1 are known to cause familial early-onset Alzheimer's disease (EOAD) but common variants in PSEN1 have not been found to strongly influence late-onset AD (LOAD). The association of rare variants in PSEN1 with LOAD-related endophenotypes has received little attention. In this study, we performed a rare variant association analysis of PSEN1 with quantitative biomarkers of LOAD using whole genome sequencing (WGS) by integrating bioinformatics and imaging informatics. A WGS data set (N = 815) from the Alzheimer's Disease Neuroimaging Initiative (ADNI) cohort was used in this analysis. 757 non-Hispanic Caucasian participants underwent WGS from a blood sample and high resolution T1-weighted structural MRI at baseline. An automated MRI analysis technique (FreeSurfer) was used to measure cortical thickness and volume of neuroanatomical structures. We assessed imaging and cerebrospinal fluid (CSF) biomarkers as LOAD-related quantitative endophenotypes. Single variant analyses were performed using PLINK and gene-based analyses of rare variants were performed using the optimal Sequence Kernel Association Test (SKAT-O). A total of 839 rare variants (MAF < 1/√(2N) = 0.0257) were found within a region of ±10 kb from PSEN1. Among them, six exonic (three non-synonymous) variants were observed. A single variant association analysis showed that the PSEN1 p.E318G variant increases the risk of LOAD only in participants carrying APOE ε4 allele where individuals carrying the minor allele of this PSEN1 risk variant have lower CSF Aβ1-42 and higher CSF tau. A gene-based analysis resulted in a significant association of rare but not common (MAF ≥ 0.0257) PSEN1 variants with bilateral entorhinal cortical thickness. This is the first study to show that PSEN1 rare variants collectively show a significant association with the brain atrophy in regions preferentially affected by LOAD, providing further support for a role of PSEN1 in LOAD. The PSEN1 p.E318G variant increases the risk of LOAD only in APOE ε4 carriers. Integrating bioinformatics with imaging informatics for identification of rare variants could help explain the missing heritability in LOAD.
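The rare-variant cut-off quoted in the abstract above, MAF < 1/√(2N), can be checked with a short calculation. The function name is a hypothetical label for that formula. The quoted value of 0.0257 is reproduced with N = 757 (the participants who underwent WGS and MRI), which suggests, as an inference from the arithmetic rather than a statement in the abstract, that this was the sample size used for the threshold.

```python
import math


def rare_variant_maf_threshold(n_samples: int) -> float:
    """Minor allele frequency cut-off 1 / sqrt(2N) for N diploid samples."""
    return 1.0 / math.sqrt(2 * n_samples)


print(round(rare_variant_maf_threshold(757), 4))  # 0.0257
print(round(rare_variant_maf_threshold(815), 4))  # 0.0248
```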
Multiplexed enrichment of rare DNA variants via sequence-selective and temperature-robust amplification

PubMed Central

Wu, Lucia R.; Chen, Sherry X.; Wu, Yalei; Patel, Abhijit A.; Zhang, David Yu

2018-01-01

Rare DNA-sequence variants hold important clinical and biological information, but existing detection techniques are expensive, complex, allele-specific, or don’t allow for significant multiplexing. Here, we report a temperature-robust polymerase-chain-reaction method, which we term blocker displacement amplification (BDA), that selectively amplifies all sequence variants, including single-nucleotide variants (SNVs), within a roughly 20-nucleotide window by 1,000-fold over wild-type sequences. This allows for easy detection and quantitation of hundreds of potential variants originally at ≤0.1% in allele frequency. BDA is compatible with inexpensive thermocycler instrumentation and employs a rationally designed competitive hybridization reaction to achieve comparable enrichment performance across annealing temperatures ranging from 56 °C to 64 °C. To show the sequence generality of BDA, we demonstrate enrichment of 156 SNVs and the reliable detection of single-digit copies. We also show that the BDA detection of rare driver mutations in cell-free DNA samples extracted from the blood plasma of lung-cancer patients is highly consistent with deep sequencing using molecular lineage tags, with a receiver operator characteristic accuracy of 95%. PMID:29805844
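The effect of the roughly 1,000-fold enrichment reported above can be put in terms of the allele fraction seen after amplification. The sketch below assumes a deliberately simple model in which every variant molecule is amplified `fold` times more than every wild-type molecule; the function name and this model are illustrative assumptions, not the authors' quantitation procedure.

```python
def vaf_after_enrichment(vaf: float, fold: float) -> float:
    """Variant allele fraction after enriching variants `fold`-fold over wild type."""
    if not 0.0 <= vaf <= 1.0:
        raise ValueError("vaf must be between 0 and 1")
    enriched_variant = vaf * fold
    wild_type = 1.0 - vaf
    return enriched_variant / (enriched_variant + wild_type)


# A variant starting at 0.1% allele frequency, enriched 1,000-fold.
print(round(vaf_after_enrichment(0.001, 1000), 4))
```

Under this model a variant at 0.1% ends up at about half of the amplified product, which is why such variants become detectable without deep sequencing.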
A deletion in the VLDLR gene in Eurasier dogs with cerebellar hypoplasia resembling a Dandy-Walker-like malformation (DWLM).

PubMed

Gerber, Martina; Fischer, Andrea; Jagannathan, Vidhya; Drögemüller, Michaela; Drögemüller, Cord; Schmidt, Martin J; Bernardino, Filipa; Manz, Eberhard; Matiasek, Kaspar; Rentmeister, Kai; Leeb, Tosso

2015-01-01

Dandy-Walker-like malformation (DWLM) is the result of aberrant brain development and mainly characterized by cerebellar hypoplasia. DWLM affected dogs display a non-progressive cerebellar ataxia. Several DWLM cases were recently observed in the Eurasier dog breed, which strongly suggested a monogenic autosomal recessive inheritance in this breed. We performed a genome-wide association study (GWAS) with 9 cases and 11 controls and found the best association of DWLM with markers on chromosome 1. Subsequent homozygosity mapping confirmed that all 9 cases were homozygous for a shared haplotype in this region, which delineated a critical interval of 3.35 Mb. We sequenced the genome of an affected Eurasier and compared it with the Boxer reference genome and 47 control genomes of dogs from other breeds. This analysis revealed 4 private non-synonymous variants in the critical interval of the affected Eurasier. We genotyped these variants in additional dogs and found perfect association for only one of these variants, a single base deletion in the VLDLR gene encoding the very low density lipoprotein receptor. This variant, VLDLR:c.1713delC is predicted to cause a frameshift and premature stop codon (p.W572Gfs*10). Variants in the VLDLR gene have been shown to cause congenital cerebellar ataxia and mental retardation in human patients and Vldlr knockout mice also display an ataxia phenotype. Our combined genetic data together with the functional knowledge on the VLDLR gene from other species thus strongly suggest that VLDLR:c.1713delC is indeed causing DWLM in Eurasier dogs.
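The homozygosity-mapping step described above (finding a region where all cases are homozygous for the same haplotype) can be sketched in Python as follows. This is a toy illustration under stated assumptions: genotypes are coded as the strings "AA", "AB", and "BB" at ordered markers, and the function name and example data are hypothetical. The abstract does not say which software the authors used.

```python
def shared_homozygous_runs(genotypes_by_case, min_markers=2):
    """Return (start, end) marker-index pairs where every case carries the
    same homozygous genotype at each marker in the run."""
    n_markers = len(genotypes_by_case[0])
    runs, start = [], None
    for i in range(n_markers):
        calls = {case[i] for case in genotypes_by_case}
        # Shared only if all cases agree and the call is homozygous.
        shared = len(calls) == 1 and next(iter(calls)) in ("AA", "BB")
        if shared and start is None:
            start = i
        elif not shared and start is not None:
            if i - start >= min_markers:
                runs.append((start, i - 1))
            start = None
    if start is not None and n_markers - start >= min_markers:
        runs.append((start, n_markers - 1))
    return runs


# Hypothetical genotypes for three cases at five ordered markers.
cases = [
    ["AB", "AA", "AA", "BB", "AB"],
    ["AA", "AA", "AA", "BB", "BB"],
    ["BB", "AA", "AA", "BB", "AA"],
]
print(shared_homozygous_runs(cases))  # [(1, 3)]
```

In a real analysis the marker indices would be mapped back to genomic coordinates to delineate the critical interval.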
Rare Variants in PLD3 Do Not Affect Risk for Early-Onset Alzheimer Disease in a European Consortium Cohort.

PubMed

Cacace, Rita; Van den Bossche, Tobi; Engelborghs, Sebastiaan; Geerts, Nathalie; Laureys, Annelies; Dillen, Lubina; Graff, Caroline; Thonberg, Håkan; Chiang, Huei-Hsin; Pastor, Pau; Ortega-Cubero, Sara; Pastor, Maria A; Diehl-Schmid, Janine; Alexopoulos, Panagiotis; Benussi, Luisa; Ghidoni, Roberta; Binetti, Giuliano; Nacmias, Benedetta; Sorbi, Sandro; Sanchez-Valle, Raquel; Lladó, Albert; Gelpi, Ellen; Almeida, Maria Rosário; Santana, Isabel; Tsolaki, Magda; Koutroumani, Maria; Clarimon, Jordi; Lleó, Alberto; Fortea, Juan; de Mendonça, Alexandre; Martins, Madalena; Borroni, Barbara; Padovani, Alessandro; Matej, Radoslav; Rohan, Zdenek; Vandenbulcke, Mathieu; Vandenberghe, Rik; De Deyn, Peter P; Cras, Patrick; van der Zee, Julie; Sleegers, Kristel; Van Broeckhoven, Christine

2015-12-01

Rare variants in the phospholipase D3 gene (PLD3) were associated with increased risk for late-onset Alzheimer disease (LOAD). We identified a missense mutation in PLD3 in whole-genome sequence data of a patient with autopsy confirmed Alzheimer disease (AD) and onset age of 50 years. Subsequently, we sequenced PLD3 in a Belgian early-onset Alzheimer disease (EOAD) patient (N = 261) and control (N = 319) cohort, as well as in European EOAD patients (N = 946) and control individuals (N = 1,209) ascertained in different European countries. Overall, we identified 22 rare variants with a minor allele frequency <1%, 20 missense and two splicing mutations. Burden analysis did not provide significant evidence for an enrichment of rare PLD3 variants in EOAD patients in any of the patient/control cohorts. Also, meta-analysis of the PLD3 data, including a published dataset of a German EOAD cohort, was not significant (P = 0.43; OR = 1.53, 95% CI 0.60-3.31). Consequently, our data do not support a role for PLD3 rare variants in the genetic etiology of EOAD in European EOAD patients. Our data corroborate the negative replication data obtained in LOAD studies and therefore a genetic role of PLD3 in AD remains to be demonstrated. © 2015 The Authors. Human Mutation published by Wiley Periodicals, Inc.
Whole-exome sequencing and high throughput genotyping identified KCNJ11 as the thirteenth MODY gene.

PubMed

Bonnefond, Amélie; Philippe, Julien; Durand, Emmanuelle; Dechaume, Aurélie; Huyvaert, Marlène; Montagne, Louise; Marre, Michel; Balkau, Beverley; Fajardy, Isabelle; Vambergue, Anne; Vatin, Vincent; Delplanque, Jérôme; Le Guilcher, David; De Graeve, Franck; Lecoeur, Cécile; Sand, Olivier; Vaxillaire, Martine; Froguel, Philippe

2012-01-01

Maturity-onset diabetes of the young (MODY) is a clinically heterogeneous form of diabetes characterized by an autosomal-dominant mode of inheritance, an onset before the age of 25 years, and a primary defect in the pancreatic beta-cell function. Approximately 30% of MODY families remain genetically unexplained (MODY-X). Here, we aimed to use whole-exome sequencing (WES) in a four-generation MODY-X family to identify a new susceptibility gene for MODY. WES (Agilent-SureSelect capture/Illumina-GAIIx sequencing) was performed in three affected and one non-affected relatives in the MODY-X family. We then performed a high-throughput multiplex genotyping (Illumina-GoldenGate assay) of the putative causal mutations in the whole family and in 406 controls. A linkage analysis was also carried out. By focusing on variants of interest (i.e. gains of stop codon, frameshift, non-synonymous and splice-site variants not reported in dbSNP130) present in the three affected relatives and not present in the control, we found 69 mutations. However, as WES was not uniform between samples, a total of 324 mutations had to be assessed in the whole family and in controls. Only one mutation (p.Glu227Lys in KCNJ11) co-segregated with diabetes in the family (with a LOD-score of 3.68). No KCNJ11 mutation was found in 25 other MODY-X unrelated subjects. Beyond neonatal diabetes mellitus (NDM), KCNJ11 is also a MODY gene ('MODY13'), confirming the wide spectrum of diabetes related phenotypes due to mutations in NDM genes (i.e. KCNJ11, ABCC8 and INS). Therefore, the molecular diagnosis of MODY should include KCNJ11 as affected carriers can be ideally treated with oral sulfonylureas.
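The filtering logic in this abstract (keep variants present in all affected relatives, absent from the unaffected relative, and not already catalogued) can be expressed with set operations. The Python sketch below is an illustration under assumptions: the function name and the variant labels `v1` to `v5` are hypothetical placeholders, not data from the study.

```python
def candidate_variants(affected, unaffected, known):
    """Variants shared by every affected relative, minus those seen in any
    unaffected relative or in a catalogue of known variants."""
    shared = set.intersection(*affected)
    excluded = set().union(*unaffected) | known
    return shared - excluded


# Hypothetical variant calls per individual.
affected = [{"v1", "v2", "v3"}, {"v1", "v2", "v4"}, {"v1", "v2", "v5"}]
unaffected = [{"v2", "v5"}]
known = {"v3"}  # stands in for variants already reported in a database

print(candidate_variants(affected, unaffected, known))  # {'v1'}
```

A strict intersection like this drops any variant that was simply not covered in one affected relative. The abstract notes that WES coverage was not uniform between samples, which is why a larger set of 324 mutations had to be genotyped in the whole family rather than only the 69 that passed the filter.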
Linkage disequilibrium among commonly genotyped SNP and variants detected from bull sequence

USDA-ARS's Scientific Manuscript database

Genomic prediction utilizing causal variants could increase selection accuracy above that achieved with SNP genotyped by commercial assays. A number of variants detected from sequencing influential sires are likely to be causal, but noticeable improvements in prediction accuracy using imputed sequen...
High prevalence of DUOX2 mutations in Japanese patients with permanent congenital hypothyroidism or transient hypothyroidism.

PubMed

Matsuo, Kumihiro; Tanahashi, Yusuke; Mukai, Tokuo; Suzuki, Shigeru; Tajima, Toshihiro; Azuma, Hiroshi; Fujieda, Kenji

2016-07-01

Dual oxidase 2 (DUOX2) mutations are a cause of dyshormonogenesis (DH) and have been identified in patients with permanent congenital hypothyroidism (PH) and with transient hypothyroidism (TH). We aimed to elucidate the prevalence and phenotypical variations of DUOX2 mutations. Forty-eight Japanese DH patients were enrolled and analysed for sequence variants of DUOX2, DUOXA2, and TPO using polymerase chain reaction-amplified direct sequencing. Fourteen sequence variants of DUOX2, including 10 novel variants, were identified in 11 patients. DUOX2 variants were more prevalent (11/48, 22.9%) than TPO (3/48, 6.3%) (p=0.020). The prevalence of DUOX2 variants in TH was slightly, but not significantly, higher than in PH. Furthermore, one patient had digenic heterozygous sequence variants of both DUOX2 and TPO. Our results suggest that DUOX2 mutations might be the most common cause of both PH and TH, and that phenotypes of these mutations might be milder than those of other causes.
Genome-wide association study using high-density single nucleotide polymorphism arrays and whole-genome sequences for clinical mastitis traits in dairy cattle.

PubMed

Sahana, G; Guldbrandtsen, B; Thomsen, B; Holm, L-E; Panitz, F; Brøndum, R F; Bendixen, C; Lund, M S

2014-11-01

Mastitis is a mammary disease that frequently affects dairy cattle. Despite considerable research on the development of effective prevention and treatment strategies, mastitis continues to be a significant issue in bovine veterinary medicine. To identify major genes that affect mastitis in dairy cattle, 6 chromosomal regions on Bos taurus autosome (BTA) 6, 13, 16, 19, and 20 were selected from a genome scan for 9 mastitis phenotypes using imputed high-density single nucleotide polymorphism arrays. Association analyses using sequence-level variants for the 6 targeted regions were carried out to map causal variants using whole-genome sequence data from 3 breeds. The quantitative trait loci (QTL) discovery population comprised 4,992 progeny-tested Holstein bulls, and QTL were confirmed in 4,442 Nordic Red and 1,126 Jersey cattle. The targeted regions were imputed to the sequence level. The highest association signal for clinical mastitis was observed on BTA 6 at 88.97 Mb in Holstein cattle and was confirmed in Nordic Red cattle. The peak association region on BTA 6 contained 2 genes: vitamin D-binding protein precursor (GC) and neuropeptide FF receptor 2 (NPFFR2), which, based on known biological functions, are good candidates for affecting mastitis. However, strong linkage disequilibrium in this region prevented conclusive determination of the causal gene. A different QTL on BTA 6 located at 88.32 Mb in Holstein cattle affected mastitis. In addition, QTL on BTA 13 and 19 were confirmed to segregate in Nordic Red cattle and QTL on BTA 16 and 20 were confirmed in Jersey cattle. Although several candidate genes were identified in these targeted regions, it was not possible to identify a gene or polymorphism as the causal factor for any of these regions. Copyright © 2014 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.

Deep whole-genome sequencing of 90 Han Chinese genomes.

PubMed

Lan, Tianming; Lin, Haoxiang; Zhu, Wenjuan; Laurent, Tellier Christian Asker Melchior; Yang, Mengcheng; Liu, Xin; Wang, Jun; Wang, Jian; Yang, Huanming; Xu, Xun; Guo, Xiaosen

2017-09-01

Next-generation sequencing provides a high-resolution insight into human genetic information. However, the focus of previous studies has primarily been on low-coverage data due to the high cost of sequencing. Although the 1000 Genomes Project and the Haplotype Reference Consortium have both provided powerful reference panels for imputation, low-frequency and novel variants remain difficult to discover and call with accuracy on the basis of low-coverage data. Deep sequencing provides an optimal solution for the problem of these low-frequency and novel variants. Although whole-exome sequencing is also a viable choice for exome regions, it cannot account for noncoding regions, sometimes resulting in the absence of important, causal variants. For Han Chinese populations, the majority of variants have been discovered based upon low-coverage data from the 1000 Genomes Project. However, high-coverage, whole-genome sequencing data are limited for any population, and a large amount of low-frequency, population-specific variants remain uncharacterized. We have performed whole-genome sequencing at a high depth (∼×80) of 90 unrelated individuals of Chinese ancestry, collected from the 1000 Genomes Project samples, including 45 Northern Han Chinese and 45 Southern Han Chinese samples. Eighty-three of these 90 have been sequenced by the 1000 Genomes Project. We have identified 12 568 804 single nucleotide polymorphisms, 2 074 210 short InDels, and 26 142 structural variations from these 90 samples. Compared to the Han Chinese data from the 1000 Genomes Project, we have found 7 000 629 novel variants with low frequency (defined as minor allele frequency < 5%), including 5 813 503 single nucleotide polymorphisms, 1 169 199 InDels, and 17 927 structural variants. Using deep sequencing data, we have built a greatly expanded spectrum of genetic variation for the Han Chinese genome. 
Compared to the 1000 Genomes Project, these Han Chinese deep sequencing data enhance the characterization of a large number of low-frequency, novel variants. This will be a valuable resource for promoting Chinese genetics research and medical development. Additionally, it will provide a valuable supplement to the 1000 Genomes Project, as well as to other human genome projects. © The Authors 2017. Published by Oxford University Press.
Analysis of CHRNA7 rare variants in autism spectrum disorder susceptibility.

PubMed

Bacchelli, Elena; Battaglia, Agatino; Cameli, Cinzia; Lomartire, Silvia; Tancredi, Raffaella; Thomson, Susanne; Sutcliffe, James S; Maestrini, Elena

2015-04-01

Chromosome 15q13.3 recurrent microdeletions are causally associated with a wide range of phenotypes, including autism spectrum disorder (ASD), seizures, intellectual disability, and other psychiatric conditions. Whether the reciprocal microduplication is pathogenic is less certain. CHRNA7, encoding for the alpha7 subunit of the neuronal nicotinic acetylcholine receptor, is considered the likely culprit gene in mediating neurological phenotypes in 15q13.3 deletion cases. To assess if CHRNA7 rare variants confer risk to ASD, we performed copy number variant analysis and Sanger sequencing of the CHRNA7 coding sequence in a sample of 135 ASD cases. Sequence variation in this gene remains largely unexplored, given the existence of a fusion gene, CHRFAM7A, which includes a nearly identical partial duplication of CHRNA7. Hence, attempts to sequence coding exons must distinguish between CHRNA7 and CHRFAM7A, making next-generation sequencing approaches unreliable for this purpose. A CHRNA7 microduplication was detected in a patient with autism and moderate cognitive impairment; while no rare damaging variants were identified in the coding region, we detected rare variants in the promoter region, previously described to functionally reduce transcription. This study represents the first sequence variant analysis of CHRNA7 in a sample of idiopathic autism. © 2015 Wiley Periodicals, Inc.
Rare coding variants in Phospholipase D3 (PLD3) confer risk for Alzheimer's disease

PubMed Central

Cruchaga, Carlos; Benitez, Bruno A.; Cai, Yefei; Guerreiro, Rita; Harari, Oscar; Norton, Joanne; Budde, John; Bertelsen, Sarah; Jeng, Amanda T.; Cooper, Breanna; Skorupa, Tara; Carrell, David; Levitch, Denise; Hsu, Simon; Choi, Jiyoon; Ryten, Mina; Sassi, Celeste; Bras, Jose; Gibbs, Raphael J.; Hernandez, Dena G.; Lupton, Michelle K.; Powell, John; Forabosco, Paola; Ridge, Perry G.; Corcoran, Christopher D.; Tschanz, JoAnn T.; Norton, Maria C.; Munger, Ronald G.; Schmutz, Cameron; Leary, Maegan; Demirci, F. Yesim; Bamne, Mikhil N.; Wang, Xingbin; Lopez, Oscar L.; Ganguli, Mary; Medway, Christopher; Turton, James; Lord, Jenny; Braae, Anne; Barber, Imelda; Brown, Kristelle; Pastor, Pau; Lorenzo-Betancor, Oswaldo; Brkanac, Zoran; Scott, Erick; Topol, Eric; Morgan, Kevin; Rogaeva, Ekaterina; Singleton, Andy; Hardy, John; Kamboh, M. Ilyas; George-Hyslop, Peter St; Cairns, Nigel; Morris, John C.; Kauwe, John S.K.; Goate, Alison M.

2014-01-01

Genome-wide association studies (GWAS) have identified several risk variants for late-onset Alzheimer's disease (LOAD)1,2. These common variants have replicable but small effects on LOAD risk and generally do not have obvious functional effects. Low-frequency coding variants, not detected by GWAS, are predicted to include functional variants with larger effects on risk. To identify low frequency coding variants with large effects on LOAD risk, we performed whole exome-sequencing (WES) in 14 large LOAD families and follow-up analyses of the candidate variants in several large case-control datasets. A rare variant in PLD3 (phospholipase-D family, member 3, rs145999145; V232M) segregated with disease status in two independent families and doubled risk for AD in seven independent case-control series (V232M meta-analysis; OR= 2.10, CI=1.47-2.99; p= 2.93×10^-5, 11,354 cases and controls of European-descent). Gene-based burden analyses in 4,387 cases and controls of European-descent and 302 African American cases and controls, with complete sequence data for PLD3, indicate that several variants in this gene increase risk for AD in both populations (EA: OR= 2.75, CI=2.05-3.68; p=1.44×10^-11, AA: OR= 5.48, CI=1.77-16.92; p=1.40×10^-3). PLD3 is highly expressed in brain regions vulnerable to AD pathology, including hippocampus and cortex, and is expressed at lower levels in neurons from AD brains compared to control brains (p=8.10×10^-10). Over-expression of PLD3 leads to a significant decrease in intracellular APP and extracellular Aβ42 and Aβ40, while knock-down of PLD3 leads to a significant increase in extracellular Aβ42 and Aβ40. Together, our genetic and functional data indicate that carriers of PLD3 coding variants have a two-fold increased risk for LOAD and that PLD3 influences APP processing. This study provides an example of how densely affected families may be used to identify rare variants with large effects on risk for disease or other complex traits. PMID:24336208
Variant discovery in the sheep milk transcriptome using RNA sequencing.

PubMed

Suárez-Vega, Aroa; Gutiérrez-Gil, Beatriz; Klopp, Christophe; Tosser-Klopp, Gwenola; Arranz, Juan José

2017-02-15

The identification of genetic variation underlying desired phenotypes is one of the main challenges of current livestock genetic research. High-throughput transcriptome sequencing (RNA-Seq) offers new opportunities for the detection of transcriptome variants (SNPs and short indels) in different tissues and species. In this study, we used RNA-Seq on Milk Sheep Somatic Cells (MSCs) with the goal of characterizing the genetic variation within the coding regions of the milk transcriptome in Churra and Assaf sheep, two common dairy sheep breeds farmed in Spain. A total of 216,637 variants were detected in the MSCs transcriptome of the eight ewes analyzed. Among them, a total of 57,795 variants were detected in the regions harboring Quantitative Trait Loci (QTL) for milk yield, protein percentage and fat percentage, of which 21.44% were novel variants. Among the total variants detected, 561 (2.52%) and 1,649 (7.42%) were predicted to produce high or moderate impact changes in the corresponding transcriptional unit, respectively. In the functional enrichment analysis of the genes positioned within selected QTL regions harboring novel relevant functional variants (high and moderate impact), the KEGG pathway with the highest enrichment was "protein processing in endoplasmic reticulum". Additionally, a total of 504 and 1,063 variants were identified in the genes encoding principal milk proteins and molecules involved in the lipid metabolism, respectively. Of these variants, 20 mutations were found to have putative relevant effects on the encoded proteins. We present herein the first transcriptomic approach aimed at identifying genetic variants of the genes expressed in the lactating mammary gland of sheep. Through the transcriptome analysis of variability within regions harboring QTL for milk yield, protein percentage and fat percentage, we have found several pathways and genes that harbor mutations that could affect dairy production traits. 
Moreover, remarkable variants were also found in candidate genes coding for major milk proteins and proteins related to milk fat metabolism. Several of the SNPs found in this study could be included as suitable markers in genotyping platforms or custom SNP arrays to perform association analyses in commercial populations and apply genomic selection protocols in the dairy production industry.
RareVariantVis: new tool for visualization of causative variants in rare monogenic disorders using whole genome sequencing data.

PubMed

Stokowy, Tomasz; Garbulowski, Mateusz; Fiskerstrand, Torunn; Holdhus, Rita; Labun, Kornel; Sztromwasser, Pawel; Gilissen, Christian; Hoischen, Alexander; Houge, Gunnar; Petersen, Kjell; Jonassen, Inge; Steen, Vidar M

2016-10-01

The search for causative genetic variants in rare diseases of presumed monogenic inheritance has been boosted by the implementation of whole exome (WES) and whole genome (WGS) sequencing. In many cases, WGS seems to be superior to WES, but the analysis and visualization of the vast amounts of data is demanding. To aid this challenge, we have developed a new tool, RareVariantVis, for analysis of genome sequence data (including non-coding regions) for both germ line and somatic variants. It visualizes variants along their respective chromosomes, providing information about exact chromosomal position, zygosity and frequency, with point-and-click information regarding dbSNP IDs, gene association and variant inheritance. Rare variants as well as de novo variants can be flagged in different colors. We show the performance of the RareVariantVis tool in the Genome in a Bottle WGS data set. Availability: https://www.bioconductor.org/packages/3.3/bioc/html/RareVariantVis.html Contact: tomasz.stokowy@k2.uib.no. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Efficient mutation identification in zebrafish by microarray capturing and next generation sequencing.

PubMed

Bontems, Franck; Baerlocher, Loic; Mehenni, Sabrina; Bahechar, Ilham; Farinelli, Laurent; Dosch, Roland

2011-02-18

Fish models like medaka, stickleback or zebrafish provide a valuable resource to study vertebrate genes. However, finding genetic variants e.g. mutations in the genome is still arduous. Here we used a combination of microarray capturing and next generation sequencing to identify the affected gene in the mozartkugelp11cv (mzlp11cv) mutant zebrafish. We discovered a 31-bp deletion in macf1 demonstrating the potential of this technique to efficiently isolate mutations in a vertebrate genome. Copyright © 2011 Elsevier Inc. All rights reserved.
Single-Exome sequencing identified a novel RP2 mutation in a child with X-linked retinitis pigmentosa.

PubMed

Lim, Hassol; Park, Young-Mi; Lee, Jong-Keuk; Taek Lim, Hyun

2016-10-01

To present an efficient and successful application of a single-exome sequencing study in a family clinically diagnosed with X-linked retinitis pigmentosa. Exome sequencing study based on clinical examination data. An 8-year-old proband and his family. The proband and his family members underwent comprehensive ophthalmologic examinations. Exome sequencing was undertaken in the proband using Agilent SureSelect Human All Exon Kit and Illumina HiSeq 2000 platform. Bioinformatic analysis used Illumina pipeline with Burrows-Wheeler Aligner-Genome Analysis Toolkit (BWA-GATK), followed by ANNOVAR to perform variant functional annotation. All variants passing filter criteria were validated by Sanger sequencing to confirm familial segregation. Analysis of exome sequence data identified a novel frameshift mutation in RP2 gene resulting in a premature stop codon (c.665delC, p.Pro222fsTer237). Sanger sequencing revealed this mutation co-segregated with the disease phenotype in the child's family. We identified a novel causative mutation in RP2 from a single proband's exome sequence data analysis. This study highlights the effectiveness of the whole-exome sequencing in the genetic diagnosis of X-linked retinitis pigmentosa, over the conventional sequencing methods. Even using a single exome, exome sequencing technology would be able to pinpoint pathogenic variant(s) for X-linked retinitis pigmentosa, when properly applied with aid of adequate variant filtering strategy. Copyright © 2016 Canadian Ophthalmological Society. Published by Elsevier Inc. All rights reserved.
The effect of rare alleles on estimated genomic relationships from whole genome sequence data.

PubMed

Eynard, Sonia E; Windig, Jack J; Leroy, Grégoire; van Binsbergen, Rianne; Calus, Mario P L

2015-03-12

Relationships between individuals and inbreeding coefficients are commonly used for breeding decisions, but may be affected by the type of data used for their estimation. The proportion of variants with low Minor Allele Frequency (MAF) is larger in whole genome sequence (WGS) data compared to Single Nucleotide Polymorphism (SNP) chips. Therefore, WGS data provide true relationships between individuals and may influence breeding decisions and prioritisation for conservation of genetic diversity in livestock. This study identifies differences between relationships and inbreeding coefficients estimated using pedigree, SNP or WGS data for 118 Holstein bulls from the 1000 Bull genomes project. To determine the impact of rare alleles on the estimates we compared three scenarios of MAF restrictions: variants with a MAF higher than 5%, variants with a MAF higher than 1% and variants with a MAF between 1% and 5%. We observed significant differences between estimated relationships and, although less significantly, inbreeding coefficients from pedigree, SNP or WGS data, and between MAF restriction scenarios. Computed correlations between pedigree and genomic relationships, within groups with similar relationships, ranged from negative to moderate for both estimated relationships and inbreeding coefficients, but were high between estimates from SNP and WGS (0.49 to 0.99). Estimated relationships from genomic information exhibited higher variation than from pedigree. Inbreeding coefficients analysis showed that more complete pedigree records lead to higher correlation between inbreeding coefficients from pedigree and genomic data. Finally, estimates and correlations between additive genetic (A) and genomic (G) relationship matrices were lower, and variances of the relationships were larger when accounting for allele frequencies than without accounting for allele frequencies. 
Using pedigree data or genomic information, and including or excluding variants with a MAF below 5% showed significant differences in relationship and inbreeding coefficient estimates. Estimated relationships and inbreeding coefficients are the basis for selection decisions. Therefore, it can be expected that using WGS instead of SNP can affect selection decision. Inclusion of rare variants will give access to the variation they carry, which is of interest for conservation of genetic diversity.
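The three MAF restriction scenarios compared in this abstract can be illustrated with a short Python sketch. It assumes genotypes are coded as alternate-allele dosages (0, 1, or 2) per animal. The function names, the dictionary keys, and the choice of which interval boundaries are inclusive are assumptions made for illustration, since the abstract does not specify them.

```python
def minor_allele_frequency(dosages):
    """MAF from per-individual alternate-allele dosages (0, 1 or 2)."""
    freq = sum(dosages) / (2 * len(dosages))
    return min(freq, 1 - freq)


def maf_scenarios(maf):
    """Which of the three MAF restriction scenarios a variant falls into."""
    return {
        "maf_gt_5pct": maf > 0.05,
        "maf_gt_1pct": maf > 0.01,
        "maf_1_to_5pct": 0.01 < maf <= 0.05,  # boundary handling is an assumption
    }


# Hypothetical variant: 3 heterozygous carriers among 50 animals.
maf = minor_allele_frequency([1, 1, 1] + [0] * 47)
print(round(maf, 2))       # 0.03
print(maf_scenarios(maf))
```

A variant like this one would be excluded under the 5% restriction but retained under the 1% restriction, which is the kind of variant whose inclusion the study found to change relationship and inbreeding estimates.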
Stabilizing Selection, Purifying Selection, and Mutational Bias in Finite Populations

PubMed Central

Charlesworth, Brian

2013-01-01

Genomic traits such as codon usage and the lengths of noncoding sequences may be subject to stabilizing selection rather than purifying selection. Mutations affecting these traits are often biased in one direction. To investigate the potential role of stabilizing selection on genomic traits, the effects of mutational bias on the equilibrium value of a trait under stabilizing selection in a finite population were investigated, using two different mutational models. Numerical results were generated using a matrix method for calculating the probability distribution of variant frequencies at sites affecting the trait, as well as by Monte Carlo simulations. Analytical approximations were also derived, which provided useful insights into the numerical results. A novel conclusion is that the scaled intensity of selection acting on individual variants is nearly independent of the effective population size over a wide range of parameter space and is strongly determined by the logarithm of the mutational bias parameter. This is true even when there is a very small departure of the mean from the optimum, as is usually the case. This implies that studies of the frequency spectra of DNA sequence variants may be unable to distinguish between stabilizing and purifying selection. A similar investigation of purifying selection against deleterious mutations was also carried out. Contrary to previous suggestions, the scaled intensity of purifying selection with synergistic fitness effects is sensitive to population size, which is inconsistent with the general lack of sensitivity of codon usage to effective population size. PMID:23709636
Novel Genetic Variants of Sporadic Atrial Septal Defect (ASD) in a Chinese Population Identified by Whole-Exome Sequencing (WES)

PubMed Central

Liu, Yong; Cao, Yu; Li, Yaxiong; Lei, Dongyun; Li, Lin; Hou, Zong Liu; Han, Shen; Meng, Mingyao; Shi, Jianlin; Zhang, Yayong; Wang, Yi; Niu, Zhaoyi; Xie, Yanhua; Xiao, Benshan; Wang, Yuanfei; Li, Xiao; Yang, Lirong

2018-01-01

Background Recently, mutations in several genes have been described to be associated with sporadic ASD, but some genetic variants remain to be identified. The aim of this study was to use whole-exome sequencing (WES) combined with bioinformatics analysis to identify novel genetic variants in cases of sporadic congenital ASD, followed by validation by Sanger sequencing. Material/Methods Five Han patients with secundum ASD were recruited, and their tissue samples were analyzed by WES, followed by verification by Sanger sequencing of tissue and blood samples. Further evaluation using blood samples included 452 additional patients with sporadic secundum ASD (212 male and 240 female patients) and 519 healthy subjects (252 male and 267 female subjects) for further verification by a multiplexed MassARRAY system. Bioinformatic analyses were performed to identify novel genetic variants associated with sporadic ASD. Results From five patients with sporadic ASD, a total of 181,762 genomic variants in 33 exon loci, validated by Sanger sequencing, were selected and underwent MassARRAY analysis in 452 patients with ASD and 519 healthy subjects. Three loci with high mutation frequencies, the 138665410 FOXL2 gene variant, the 23862952 MYH6 gene variant, and the 71098693 HYDIN gene variant were found to be significantly associated with sporadic ASD (P<0.05); variants in FOXL2 and MYH6 were found in patients with isolated, sporadic ASD (P<5×10−4). Conclusions This was the first study that demonstrated variants in FOXL2 and HYDIN associated with sporadic ASD, and supported the use of WES and bioinformatics analysis to identify disease-associated mutations. PMID:29505555
Novel Genetic Variants of Sporadic Atrial Septal Defect (ASD) in a Chinese Population Identified by Whole-Exome Sequencing (WES).

PubMed

Liu, Yong; Cao, Yu; Li, Yaxiong; Lei, Dongyun; Li, Lin; Hou, Zong Liu; Han, Shen; Meng, Mingyao; Shi, Jianlin; Zhang, Yayong; Wang, Yi; Niu, Zhaoyi; Xie, Yanhua; Xiao, Benshan; Wang, Yuanfei; Li, Xiao; Yang, Lirong; Wang, Wenju; Jiang, Lihong

2018-03-05

BACKGROUND Recently, mutations in several genes have been described to be associated with sporadic ASD, but some genetic variants remain to be identified. The aim of this study was to use whole-exome sequencing (WES) combined with bioinformatics analysis to identify novel genetic variants in cases of sporadic congenital ASD, followed by validation by Sanger sequencing. MATERIAL AND METHODS Five Han patients with secundum ASD were recruited, and their tissue samples were analyzed by WES, followed by verification by Sanger sequencing of tissue and blood samples. Further evaluation using blood samples included 452 additional patients with sporadic secundum ASD (212 male and 240 female patients) and 519 healthy subjects (252 male and 267 female subjects) for further verification by a multiplexed MassARRAY system. Bioinformatic analyses were performed to identify novel genetic variants associated with sporadic ASD. RESULTS From five patients with sporadic ASD, a total of 181,762 genomic variants in 33 exon loci, validated by Sanger sequencing, were selected and underwent MassARRAY analysis in 452 patients with ASD and 519 healthy subjects. Three loci with high mutation frequencies, the 138665410 FOXL2 gene variant, the 23862952 MYH6 gene variant, and the 71098693 HYDIN gene variant were found to be significantly associated with sporadic ASD (P<0.05); variants in FOXL2 and MYH6 were found in patients with isolated, sporadic ASD (P<5×10^-4). CONCLUSIONS This was the first study that demonstrated variants in FOXL2 and HYDIN associated with sporadic ASD, and supported the use of WES and bioinformatics analysis to identify disease-associated mutations.
A novel LPL intronic variant: g.18704C>A identified by re-sequencing Kuwaiti Arab samples is associated with high-density lipoprotein, very low-density lipoprotein and triglyceride lipid levels.

PubMed

Al-Bustan, Suzanne A; Al-Serri, Ahmad; Annice, Babitha G; Alnaqeeb, Majed A; Al-Kandari, Wafa Y; Dashti, Mohammed

2018-01-01

The role interethnic genetic differences play in plasma lipid level variation across populations is a global health concern. Several genes involved in lipid metabolism and transport are strong candidates for the genetic association with lipid level variation, especially lipoprotein lipase (LPL). The objective of this study was to re-sequence the full LPL gene in Kuwaiti Arabs, analyse the sequence variation and identify variants that could contribute to variation in plasma lipid levels for further genetic association. Samples (n = 100) of an Arab ethnic group from Kuwait were analysed for sequence variation by Sanger sequencing across the 30 Kb LPL gene and its flanking sequences. A total of 293 variants including 252 single nucleotide polymorphisms (SNPs) and 39 insertions/deletions (InDels) were identified among which 47 variants (32 SNPs and 15 InDels) were novel to Kuwaiti Arabs. This study is the first to report sequence data and analysis of frequencies of variants at the LPL gene locus in an Arab ethnic group with a novel "rare" variant (LPL:g.18704C>A) significantly associated with HDL (B = -0.181; 95% CI (-0.357, -0.006); p = 0.043), TG (B = 0.134; 95% CI (0.004-0.263); p = 0.044) and VLDL (B = 0.131; 95% CI (-0.001-0.263); p = 0.043) levels. Sequence variation in Kuwaiti Arabs was compared to other populations and was found to be similar with regards to the number of SNPs, InDels and distribution of the number of variants across the LPL gene locus and minor allele frequency (MAF). Moreover, comparison of the identified variants and their MAF with other reports provided a list of 46 potential variants across the LPL gene to be considered for future genetic association studies. The findings warrant further investigation into the association of g.18704C>A with lipid levels in other ethnic groups and with clinical manifestations of dyslipidemia.
A novel LPL intronic variant: g.18704C>A identified by re-sequencing Kuwaiti Arab samples is associated with high-density lipoprotein, very low-density lipoprotein and triglyceride lipid levels

PubMed Central

Al-Serri, Ahmad; Annice, Babitha G.; Alnaqeeb, Majed A.; Al-Kandari, Wafa Y.; Dashti, Mohammed

2018-01-01

The role interethnic genetic differences play in plasma lipid level variation across populations is a global health concern. Several genes involved in lipid metabolism and transport are strong candidates for the genetic association with lipid level variation, especially lipoprotein lipase (LPL). The objective of this study was to re-sequence the full LPL gene in Kuwaiti Arabs, analyse the sequence variation and identify variants that could contribute to variation in plasma lipid levels for further genetic association. Samples (n = 100) of an Arab ethnic group from Kuwait were analysed for sequence variation by Sanger sequencing across the 30 Kb LPL gene and its flanking sequences. A total of 293 variants including 252 single nucleotide polymorphisms (SNPs) and 39 insertions/deletions (InDels) were identified among which 47 variants (32 SNPs and 15 InDels) were novel to Kuwaiti Arabs. This study is the first to report sequence data and analysis of frequencies of variants at the LPL gene locus in an Arab ethnic group with a novel “rare” variant (LPL:g.18704C>A) significantly associated with HDL (B = -0.181; 95% CI (-0.357, -0.006); p = 0.043), TG (B = 0.134; 95% CI (0.004–0.263); p = 0.044) and VLDL (B = 0.131; 95% CI (-0.001–0.263); p = 0.043) levels. Sequence variation in Kuwaiti Arabs was compared to other populations and was found to be similar with regards to the number of SNPs, InDels and distribution of the number of variants across the LPL gene locus and minor allele frequency (MAF). Moreover, comparison of the identified variants and their MAF with other reports provided a list of 46 potential variants across the LPL gene to be considered for future genetic association studies. The findings warrant further investigation into the association of g.18704C>A with lipid levels in other ethnic groups and with clinical manifestations of dyslipidemia. PMID:29438437
GWASeq: targeted re-sequencing follow up to GWAS.

PubMed

Salomon, Matthew P; Li, Wai Lok Sibon; Edlund, Christopher K; Morrison, John; Fortini, Barbara K; Win, Aung Ko; Conti, David V; Thomas, Duncan C; Duggan, David; Buchanan, Daniel D; Jenkins, Mark A; Hopper, John L; Gallinger, Steven; Le Marchand, Loïc; Newcomb, Polly A; Casey, Graham; Marjoram, Paul

2016-03-03

For the last decade the conceptual framework of the Genome-Wide Association Study (GWAS) has dominated the investigation of human disease and other complex traits. While GWAS have been successful in identifying a large number of variants associated with various phenotypes, the overall amount of heritability explained by these variants remains small. This raises the question of how best to follow up on a GWAS, localize causal variants accounting for GWAS hits, and as a consequence explain more of the so-called "missing" heritability. Advances in high throughput sequencing technologies now allow for the efficient and cost-effective collection of vast amounts of fine-scale genomic data to complement GWAS. We investigate these issues using a colon cancer dataset. After QC, our data consisted of 1993 cases, 899 controls. Using marginal tests of associations, we identify 10 variants distributed among six targeted regions that are significantly associated with colorectal cancer, with eight of the variants being novel to this study. Additionally, we perform so-called 'SNP-set' tests of association and identify two sets of variants that implicate both common and rare variants in the etiology of colorectal cancer. Here we present a large-scale targeted re-sequencing resource focusing on genomic regions implicated in colorectal cancer susceptibility previously identified in several GWAS, which aims to 1) provide fine-scale targeted sequencing data for fine-mapping and 2) provide data resources to address methodological questions regarding the design of sequencing-based follow-up studies to GWAS. Additionally, we show that this strategy successfully identifies novel variants associated with colorectal cancer susceptibility and can implicate both common and rare variants.
Molecular genetic studies of DMT1 on 12q in French-Canadian restless legs syndrome patients and families.

PubMed

Xiong, Lan; Dion, Patrick; Montplaisir, Jacques; Levchenko, Anastasia; Thibodeau, Pascale; Karemera, Liliane; Rivière, Jean-Baptiste; St-Onge, Judith; Gaspar, Claudia; Dubé, Marie-Pierre; Desautels, Alex; Turecki, Gustavo; Rouleau, Guy A

2007-10-05

Converging evidence from clinical observations, brain imaging and pathological findings strongly indicates impaired brain iron regulation in restless legs syndrome (RLS). Animal models with mutations in the divalent metal transporter 1 (DMT1) gene, an important brain iron transporter, demonstrate an iron deficiency profile similar to that found in RLS brain. The human DMT1 gene, mapped to chromosome 12q near the RLS1 locus, qualifies as an excellent functional and possible positional candidate for RLS. DMT1 protein levels were assessed in lymphoblastoid cell lines from RLS patients and controls. Linkage analyses were carried out with markers flanking and within the DMT1 gene. Selected patient samples from RLS families with compatible linkage to the RLS1 locus on 12q were fully sequenced in both the coding regions and the long stretches of UTR sequences. Finally, selected sequence variants were further studied in case/control and family-based association tests. A clinical association of anemia and RLS was further confirmed in this study. There was no detectable difference in DMT1 protein levels between RLS patient lymphoblastoid cell lines and normal controls. Non-parametric linkage analyses failed to identify any significant linkage signals within the DMT1 gene region. Sequencing of selected patients did not detect any sequence variant(s) compatible with DMT1 harboring RLS causative mutation(s). Further studies did not find any association between ten SNPs, spanning the whole DMT1 gene region, and RLS affection status. Finally, two DMT1 intronic SNPs showed positive association with RLS in patients with a history of anemia, when compared to RLS patients without anemia. (c) 2007 Wiley-Liss, Inc.
Mutation load in melanoma is affected by MC1R genotype.

PubMed

Johansson, Peter A; Pritchard, Antonia L; Patch, Ann-Marie; Wilmott, James S; Pearson, John V; Waddell, Nicola; Scolyer, Richard A; Mann, Graham J; Hayward, Nicholas K

2017-03-01

Whole-genome sequencing of matched germline and tumour pairs in a well-characterized cohort of melanoma patients allowed investigation of associations between melanoma body site, age at melanoma onset and MC1R variant status with overall mutation burden and specific base pair changes observed in the corresponding melanoma. We observed statistically significant associations between mutation burden in melanoma and body site, age at onset and MC1R genotype, for both ultraviolet radiation (UVR) signature changes (C>T and CC>TT) and non-UVR base pair substitutions, as well as with overall variant load. © 2016 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
GWAS and fine-mapping of 35 production, reproduction and conformation traits with imputed sequences of 27K Holstein bulls

USDA-ARS's Scientific Manuscript database

Fine-mapping of causal variants is becoming feasible for complex traits in livestock GWAS, as an increasing number of animals are sequenced. Imputation has been routinely applied to ascertain sequence variants in large genotyped populations based on small reference populations of sequenced animals. ...
Animal selection for whole genome sequencing by quantifying the unique contribution of homozygous haplotypes sequenced

USDA-ARS's Scientific Manuscript database

Major whole genome sequencing projects promise to identify rare and causal variants within livestock species; however, the efficient selection of animals for sequencing remains a major problem within these surveys. The goal of this project was to develop a library of high accuracy genetic variants f...
GWAS and fine-mapping of 35 production, reproduction, and conformation traits with imputed sequences of 27K Holstein bulls

USDA-ARS's Scientific Manuscript database

Imputation has been routinely applied to ascertain sequence variants in large genotyped populations based on reference populations of sequenced animals. With the implementation of the 1000 Bull Genomes Project and increasing numbers of animals sequenced, fine-mapping of causal variants is becoming f...
Autosomal Dominant Diabetes Arising From a Wolfram Syndrome 1 Mutation

PubMed Central

Bonnycastle, Lori L.; Chines, Peter S.; Hara, Takashi; Huyghe, Jeroen R.; Swift, Amy J.; Heikinheimo, Pirkko; Mahadevan, Jana; Peltonen, Sirkku; Huopio, Hanna; Nuutila, Pirjo; Narisu, Narisu; Goldfeder, Rachel L.; Stitzel, Michael L.; Lu, Simin; Boehnke, Michael; Urano, Fumihiko; Collins, Francis S.; Laakso, Markku

2013-01-01

We used an unbiased genome-wide approach to identify exonic variants segregating with diabetes in a multigenerational Finnish family. At least eight members of this family presented with diabetes with age of diagnosis ranging from 18 to 51 years and a pattern suggesting autosomal dominant inheritance. We sequenced the exomes of four affected members of this family and performed follow-up genotyping of additional affected and unaffected family members. We uncovered a novel nonsynonymous variant (p.Trp314Arg) in the Wolfram syndrome 1 (WFS1) gene that segregates completely with the diabetic phenotype. Multipoint parametric linkage analysis with 13 members of this family identified a single linkage signal with maximum logarithm of odds score 3.01 at 4p16.2-p16.1, corresponding to a region harboring the WFS1 locus. Functional studies demonstrate a role for this variant in endoplasmic reticulum stress, which is consistent with the β-cell failure phenotype seen in mutation carriers. This represents the first compelling report of a mutation in WFS1 associated with dominantly inherited nonsyndromic adult-onset diabetes. PMID:23903355
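As a reading aid for the linkage result above, the following is a minimal sketch of what a LOD score means numerically. It assumes only the standard definition of a LOD score as the base-10 logarithm of the likelihood ratio for linkage versus no linkage; the function name `lod_to_odds` is hypothetical and the code is not taken from the study.

```python
def lod_to_odds(lod):
    """Convert a LOD score (log10 of a likelihood ratio) into odds in favour of linkage."""
    return 10 ** lod


# Maximum LOD score reported in the abstract for 4p16.2-p16.1.
reported_lod = 3.01
odds = lod_to_odds(reported_lod)

# A LOD of 3 (odds of 1000:1) is the conventional threshold for significant linkage.
exceeds_conventional_threshold = reported_lod >= 3.0

print(f"LOD {reported_lod} corresponds to odds of roughly {odds:.0f}:1 in favour of linkage")
```

Under this definition the reported score sits just above the conventional threshold of 3, which is consistent with the abstract describing a single linkage signal.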

Whole Genome Sequencing of High-Risk Families to Identify New Mutational Mechanisms of Breast Cancer Predisposition

DTIC Science & Technology

2015-12-01

proportion greater than 0.25 (iv) Read depth greater than 8 in at least one sample. The Table below shows variant data from Family 1041 categorized by...patients from a severely affected breast cancer Family 1041. Table columns: All, Shared, Rare, Excluding IBD0. Intergenic: 3,345,727; 1,650,045; 35,927; 3,990. ncRNA
Analysis of intra-host genetic diversity of Prunus necrotic ringspot virus (PNRSV) using amplicon next generation sequencing.

PubMed

Kinoti, Wycliff M; Constable, Fiona E; Nancarrow, Narelle; Plummer, Kim M; Rodoni, Brendan

2017-01-01

PCR amplicon next generation sequencing (NGS) analysis offers a broadly applicable and targeted approach to detect populations of both high- and low-frequency virus variants in one or more plant samples. In this study, amplicon NGS was used to explore the diversity of the tripartite genome virus, Prunus necrotic ringspot virus (PNRSV) from 53 PNRSV-infected trees using amplicons from conserved gene regions of each of PNRSV RNA1, RNA2 and RNA3. Sequencing of the amplicons from 53 PNRSV-infected trees revealed differing levels of polymorphism across the three different components of the PNRSV genome with a total number of 5040, 2083 and 5486 sequence variants observed for RNA1, RNA2 and RNA3 respectively. The RNA2 had the lowest diversity of sequences compared to RNA1 and RNA3, reflecting the lack of flexibility tolerated by the replicase gene that is encoded by this RNA component. Distinct PNRSV phylo-groups, consisting of closely related clusters of sequence variants, were observed in each of PNRSV RNA1, RNA2 and RNA3. Most plant samples had a single phylo-group for each RNA component. Haplotype network analysis showed that smaller clusters of PNRSV sequence variants were genetically connected to the largest sequence variant cluster within a phylo-group of each RNA component. Some plant samples had sequence variants occurring in multiple PNRSV phylo-groups in at least one of each RNA and these phylo-groups formed distinct clades that represent PNRSV genetic strains. Variants within the same phylo-group of each Prunus plant sample had ≥97% similarity and phylo-groups within a Prunus plant sample and between samples had ≤97% similarity. Based on the analysis of diversity, a definition of a PNRSV genetic strain was proposed. The proposed definition was applied to determine the number of PNRSV genetic strains in each of the plant samples and the complexity in defining genetic strains in multipartite genome viruses was explored.
Pooled Resequencing of 122 Ulcerative Colitis Genes in a Large Dutch Cohort Suggests Population-Specific Associations of Rare Variants in MUC2.

PubMed

Visschedijk, Marijn C; Alberts, Rudi; Mucha, Soren; Deelen, Patrick; de Jong, Dirk J; Pierik, Marieke; Spekhorst, Lieke M; Imhann, Floris; van der Meulen-de Jong, Andrea E; van der Woude, C Janneke; van Bodegraven, Adriaan A; Oldenburg, Bas; Löwenberg, Mark; Dijkstra, Gerard; Ellinghaus, David; Schreiber, Stefan; Wijmenga, Cisca; Rivas, Manuel A; Franke, Andre; van Diemen, Cleo C; Weersma, Rinse K

2016-01-01

Genome-wide association studies have revealed several common genetic risk variants for ulcerative colitis (UC). However, little is known about the contribution of rare, large effect genetic variants to UC susceptibility. In this study, we performed a deep targeted re-sequencing of 122 genes in Dutch UC patients in order to investigate the contribution of rare variants to the genetic susceptibility to UC. The selection of genes consists of 111 established human UC susceptibility genes and 11 genes that lead to spontaneous colitis when knocked-out in mice. In addition, we sequenced the promoter regions of 45 genes where known variants exert cis-eQTL-effects. Targeted pooled re-sequencing was performed on DNA of 790 Dutch UC cases. The Genome of the Netherlands project provided sequence data of 500 healthy controls. After quality control and prioritization based on allele frequency and pathogenicity probability, follow-up genotyping of 171 rare variants was performed on 1021 Dutch UC cases and 1166 Dutch controls. Single-variant association and gene-based analyses identified an association of rare variants in the MUC2 gene with UC. The associated variants in the Dutch population could not be replicated in a German replication cohort (1026 UC cases, 3532 controls). In conclusion, this study has identified a putative role for MUC2 on UC susceptibility in the Dutch population and suggests a population-specific contribution of rare variants to UC.
Quick, sensitive and specific detection and evaluation of quantification of minor variants by high-throughput sequencing.

PubMed

Leung, Ross Ka-Kit; Dong, Zhi Qiang; Sa, Fei; Chong, Cheong Meng; Lei, Si Wan; Tsui, Stephen Kwok-Wing; Lee, Simon Ming-Yuen

2014-02-01

Minor variants have significant implications in quasispecies evolution, early cancer detection and non-invasive fetal genotyping but their accurate detection by next-generation sequencing (NGS) is hampered by sequencing errors. We generated sequencing data from mixtures at predetermined ratios in order to provide insight into sequencing errors and variations that can arise for which simulation cannot be performed. The information also enables better parameterization in depth of coverage, read quality and heterogeneity, library preparation techniques, technical repeatability for mathematical modeling, theory development and simulation experimental design. We devised minor variant authentication rules that achieved 100% accuracy in both testing and validation experiments. The rules are free from tedious inspection of alignment accuracy, sequencing read quality or errors introduced by homopolymers. The authentication processes only require minor variants to: (1) have minimum depth of coverage larger than 30; (2) be reported by (a) four or more variant callers, or (b) DiBayes or LoFreq, plus SNVer (or BWA when no results are returned by SNVer), and with the interassay coefficient of variation (CV) no larger than 0.1. Quantification accuracy undermined by sequencing errors could neither be overcome by ultra-deep sequencing, nor recruiting more variant callers to reach a consensus, such that consistent underestimation and overestimation (i.e. low CV) were observed. To accommodate stochastic error and adjust the observed ratio within a specified accuracy, we presented a proof of concept for the use of a double calibration curve for quantification, which provides an important reference towards potential industrial-scale fabrication of calibrants for NGS.
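The authentication rules listed in the abstract above can be sketched as a small predicate. This is an illustrative reading, not the authors' implementation: the function and parameter names are hypothetical, the coefficient of variation is assumed to be computed on allele frequencies from replicate assays, and the CV limit is assumed to apply to both branch (a) and branch (b), which the abstract's wording leaves ambiguous.

```python
from statistics import mean, stdev


def coefficient_of_variation(values):
    """Sample standard deviation divided by the mean (needs at least two values)."""
    return stdev(values) / mean(values)


def is_authentic_minor_variant(min_depth, callers, replicate_freqs,
                               snver_returned_results=True):
    """Apply the depth, caller-agreement and CV rules described in the abstract.

    min_depth: minimum depth of coverage at the variant site.
    callers: names of the variant callers that reported the variant.
    replicate_freqs: variant frequencies from replicate assays (assumed input).
    snver_returned_results: if False, BWA stands in for SNVer in rule (2b).
    """
    callers = set(callers)

    # Rule (1): minimum depth of coverage larger than 30.
    if min_depth <= 30:
        return False

    # Interassay CV no larger than 0.1 (assumed here to apply to every variant).
    if coefficient_of_variation(replicate_freqs) > 0.1:
        return False

    # Rule (2a): four or more variant callers report the variant.
    if len(callers) >= 4:
        return True

    # Rule (2b): DiBayes or LoFreq, plus SNVer (or BWA when SNVer returns nothing).
    confirmer = "SNVer" if snver_returned_results else "BWA"
    return bool(callers & {"DiBayes", "LoFreq"}) and confirmer in callers


example = is_authentic_minor_variant(45, ["LoFreq", "SNVer"], [0.020, 0.021, 0.019])
print(example)
```

Keeping the depth and CV checks ahead of the caller checks makes the cheap numeric filters run first; the order does not change the result.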
Analysis of human papillomavirus 16 E6, E7 genes and Long Control Region in cervical samples from Uruguayan women.

PubMed

Ramas, Viviana; Mirazo, Santiago; Bonilla, Sylvia; Ruchansky, Dora; Arbiza, Juan

2018-05-15

This study aims to investigate the HPV16 variant distribution by sequence analyses of E6, E7 oncogenes and the Long Control Region (LCR), from cervical cells collected from Uruguayan women, and to reconstruct the phylogenetic relationships among variants. Forty-seven HPV16 variants, obtained from women with HSIL, LSIL, ASCUS and NILM cytological classes, were analyzed for LCR and 12 were further studied for E6 and E7. Detailed sequence comparison, genetic heterogeneity analyses and phylogenetic reconstruction were performed. A high variability was observed among LCR sequences, which were distributed in 18 different variants. E6 and E7 sequences exhibited novel non-synonymous substitutions. Uruguayan sequences mainly belonged to the European lineage, and only 5 sequences clustered in non-European branches; 3 of them in the Asian-American and North-American lineage and 2 in an African branch. Additionally, 6 new variants from European and African clusters were identified. HPV16 isolates mainly belonged to the European lineage, though strains from African and Asian-American lineages were also identified. Herein is reported for the first time the distribution and molecular characterization of HPV16 variants from Uruguay, providing novel insights on the molecular epidemiology of this infectious disease in South America. The high variability among HPV16 isolates, which mainly belonged to the European lineage, provides an extensive sequence dataset from a country with a high burden of cervical cancer. Copyright © 2018 Elsevier B.V. All rights reserved.
Whole Exome Sequencing Identifies Rare Protein-Coding Variants in Behçet's Disease.

PubMed

Ognenovski, Mikhail; Renauer, Paul; Gensterblum, Elizabeth; Kötter, Ina; Xenitidis, Theodoros; Henes, Jörg C; Casali, Bruno; Salvarani, Carlo; Direskeneli, Haner; Kaufman, Kenneth M; Sawalha, Amr H

2016-05-01

Behçet's disease (BD) is a systemic inflammatory disease with an incompletely understood etiology. Despite the identification of multiple common genetic variants associated with BD, rare genetic variants have been less explored. We undertook this study to investigate the role of rare variants in BD by performing whole exome sequencing in BD patients of European descent. Whole exome sequencing was performed in a discovery set comprising 14 German BD patients of European descent. For replication and validation, Sanger sequencing and Sequenom genotyping were performed in the discovery set and in 2 additional independent sets of 49 German BD patients and 129 Italian BD patients of European descent. Genetic association analysis was then performed in BD patients and 503 controls of European descent. Functional effects of associated genetic variants were assessed using bioinformatic approaches. Using whole exome sequencing, we identified 77 rare variants (in 74 genes) with predicted protein-damaging effects in BD. These variants were genotyped in 2 additional patient sets and then analyzed to reveal significant associations with BD at 2 genetic variants detected in all 3 patient sets that remained significant after Bonferroni correction. We detected genetic association between BD and LIMK2 (rs149034313), involved in regulating cytoskeletal reorganization, and between BD and NEIL1 (rs5745908), involved in base excision DNA repair (P = 3.22 × 10^-4 and P = 5.16 × 10^-4, respectively). The LIMK2 association is a missense variant with predicted protein damage that may influence functional interactions with proteins involved in cytoskeletal regulation by Rho GTPase, inflammation mediated by chemokine and cytokine signaling pathways, T cell activation, and angiogenesis (Bonferroni-corrected P = 5.63 × 10^-14, P = 7.29 × 10^-6, P = 1.15 × 10^-5, and P = 6.40 × 10^-3, respectively). The genetic association in NEIL1 is a predicted splice donor variant that may introduce a deleterious intron retention and result in a noncoding transcript variant. We used whole exome sequencing in BD for the first time and identified 2 rare putative protein-damaging genetic variants associated with this disease. These genetic variants might influence cytoskeletal regulation and DNA repair mechanisms in BD and might provide further insight into increased leukocyte tissue infiltration and the role of oxidative stress in BD. © 2016, American College of Rheumatology.
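The Bonferroni correction mentioned in the abstract above can be illustrated with a short worked check. The sketch assumes a family-wise significance level of 0.05 and treats the 77 genotyped rare variants as the number of tests; neither figure is stated as the correction basis in the abstract, so both are assumptions made purely for illustration.

```python
def bonferroni_threshold(alpha, n_tests):
    """Per-test significance threshold under a Bonferroni correction."""
    return alpha / n_tests


# P values reported in the abstract for the two associated variants.
p_values = {
    "LIMK2 rs149034313": 3.22e-4,
    "NEIL1 rs5745908": 5.16e-4,
}

# Assumed inputs: alpha = 0.05 and one test per genotyped rare variant (77).
threshold = bonferroni_threshold(alpha=0.05, n_tests=77)

# A variant passes if its uncorrected P value is below the per-test threshold.
significant = {name: p < threshold for name, p in p_values.items()}

for name, passed in significant.items():
    print(f"{name}: P = {p_values[name]:.2e}, passes corrected threshold: {passed}")
```

Under these assumptions the per-test threshold is about 6.5 × 10^-4, so both reported P values fall below it, in line with the abstract's statement that the two associations remained significant after correction.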
Identification of verotoxin type 2 variant B subunit genes in Escherichia coli by the polymerase chain reaction and restriction fragment length polymorphism analysis.

PubMed Central

Tyler, S D; Johnson, W M; Lior, H; Wang, G; Rozee, K R

1991-01-01

A set of synthetic oligonucleotide primers was designed for use in a polymerase chain reaction protocol to specifically detect the B subunit genes in vtx2ha and vtx2hb, which code for the production of the VT2 (Shiga-like toxin II) variant cytotoxins VT2v-a and VT2v-b, respectively. An additional set of primers amplified a fragment common to the B subunits of the VT2 and the VT2 variant genes. Subsequent restriction endonuclease digestion of this amplicon permitted prediction of specific VT2 and variant genotypes on the basis of predetermined restriction fragment length polymorphisms. Genotypes of 21 VT2-producing strains of Escherichia coli were determined using this polymerase chain reaction-restriction fragment length polymorphism procedure. Four strains contained B subunit target sequences only for VT2 genes, 9 strains contained sequences only for VT2v-a genes, and 3 strains contained sequences only for VT2v-b. For genes in combination, one strain contained B subunit genes for both VT2 and VT2v-a and two strains contained B subunit genes for VT2 and VT2v-b. Two strains of E. coli O91:H21 contained both VT2v-a and VT2v-b B subunit genes. The VT2 reference strain of E. coli, E32511, was found to contain the targeted sequences from both VT2 and VT2v-a genes, whereas the recombinant E. coli, pEB1, possessed only that of the VT2 gene. The specific activities of extracellular VT2 determined in HeLa cells ranged from 0.3 to 41.7 TCD50 per microgram of protein in strains carrying the VT2 gene target and from 0 to 50.0 TCD50 per microgram of protein in strains carrying only the VT2 variant target (TCD50 is the tissue culture dose by which 50% of the cells were affected), suggesting that phenotypic expression does not correlate with genotype. PMID:1679436
Use of whole-exome sequencing to determine the genetic basis of multiple mitochondrial respiratory chain complex deficiencies.

PubMed

Taylor, Robert W; Pyle, Angela; Griffin, Helen; Blakely, Emma L; Duff, Jennifer; He, Langping; Smertenko, Tania; Alston, Charlotte L; Neeve, Vivienne C; Best, Andrew; Yarham, John W; Kirschner, Janbernd; Schara, Ulrike; Talim, Beril; Topaloglu, Haluk; Baric, Ivo; Holinski-Feder, Elke; Abicht, Angela; Czermin, Birgit; Kleinle, Stephanie; Morris, Andrew A M; Vassallo, Grace; Gorman, Grainne S; Ramesh, Venkateswaran; Turnbull, Douglass M; Santibanez-Koref, Mauro; McFarland, Robert; Horvath, Rita; Chinnery, Patrick F

2014-07-02

Mitochondrial disorders have emerged as a common cause of inherited disease, but their diagnosis remains challenging. Multiple respiratory chain complex defects are particularly difficult to diagnose at the molecular level because of the massive number of nuclear genes potentially involved in intramitochondrial protein synthesis, with many not yet linked to human disease. To determine the molecular basis of multiple respiratory chain complex deficiencies. We studied 53 patients referred to 2 national centers in the United Kingdom and Germany between 2005 and 2012. All had biochemical evidence of multiple respiratory chain complex defects but no primary pathogenic mitochondrial DNA mutation. Whole-exome sequencing was performed using 62-Mb exome enrichment, followed by variant prioritization using bioinformatic prediction tools, variant validation by Sanger sequencing, and segregation of the variant with the disease phenotype in the family. Presumptive causal variants were identified in 28 patients (53%; 95% CI, 39%-67%) and possible causal variants were identified in 4 (8%; 95% CI, 2%-18%). Together these accounted for 32 patients (60%; 95% CI, 46%-74%) and involved 18 different genes. These included recurrent mutations in RMND1, AARS2, and MTO1, each on a haplotype background consistent with a shared founder allele, and potential novel mutations in 4 possible mitochondrial disease genes (VARS2, GARS, FLAD1, and PTCD1). Distinguishing clinical features included deafness and renal involvement associated with RMND1 and cardiomyopathy with AARS2 and MTO1. However, atypical clinical features were present in some patients, including normal liver function and Leigh syndrome (subacute necrotizing encephalomyelopathy) seen in association with TRMU mutations and no cardiomyopathy with founder SCO2 mutations. It was not possible to confidently identify the underlying genetic basis in 21 patients (40%; 95% CI, 26%-54%). Exome sequencing enhances the ability to identify potential nuclear gene mutations in patients with biochemically defined defects affecting multiple mitochondrial respiratory chain complexes. Additional study is required in independent patient populations to determine the utility of this approach in comparison with traditional diagnostic methods.
A novel ATTR L32V mutation causes familial amyloid polyneuropathy in a Bolivian family.

PubMed

Martínez-Ulloa, Pedro L; Vallejo, Manuela; Corral, Iñigo; García-Barragán, Nuria; Alcazar, Alberto; Martínez-Alonso, Emma; Martínez-Poles, Javier; Pian, Hector; Jiménez-Escrig, Adriano

2017-09-01

We report a new transthyretin (ATTR) gene c.272C>G mutation and variant protein, p.Leu32Val, in a kindred of Bolivian origin with a rapidly progressive peripheral neuropathy and cardiomyopathy. Three individuals from a kindred with peripheral nerve and cardiac amyloidosis were examined. Analysis of the TTR gene was performed by Sanger direct sequencing. Neuropathologic examination was obtained on the index patient with mass spectrometry study of the ATTR deposition. Direct DNA sequence analysis of exons 2, 3, and 4 of the TTR gene demonstrated a c.272C>G mutation in exon 2 (p.L32V). Sural nerve biopsy revealed massive amyloid deposition in the perineurium, endoneurium and vasa nervorum. Mass spectrometric analyses of ATTR immunoprecipitated from nerve biopsy showed the presence of both wild-type and variant proteins. The observed mass results for the wild-type and variant proteins were consistent with the predicted values calculated from the genetic analysis data. The ATTR L32V is associated with a severe course. This has implications for treatment of affected individuals and counseling of family members. © 2017 Peripheral Nerve Society.
De novo variants in EBF3 are associated with hypotonia, developmental delay, intellectual disability, and autism

PubMed Central

Tanaka, Akemi J.; Cho, Megan T.; Willaert, Rebecca; Retterer, Kyle; Zarate, Yuri A.; Bosanko, Katie; Stefans, Vikki; Oishi, Kimihiko; Williamson, Amy; Wilson, Golder N.; Basinger, Alice; Barbaro-Dieber, Tina; Ortega, Lucia; Sorrentino, Susanna; Gabriel, Melissa K.; Anderson, Ilse J.; Sacoto, Maria J. Guillen; Schnur, Rhonda E.; Chung, Wendy K.

2017-01-01

Using whole-exome sequencing, we identified seven unrelated individuals with global developmental delay, hypotonia, dysmorphic facial features, and an increased frequency of short stature, ataxia, and autism with de novo heterozygous frameshift, nonsense, splice, and missense variants in the Early B-cell Transcription Factor Family Member 3 (EBF3) gene. EBF3 is a member of the collier/olfactory-1/early B-cell factor (COE) family of proteins, which are required for central nervous system (CNS) development. COE proteins are highly evolutionarily conserved and regulate neuronal specification, migration, axon guidance, and dendritogenesis during development and are essential for maintaining neuronal identity in adult neurons. Haploinsufficiency of EBF3 may affect brain development and function, resulting in developmental delay, intellectual disability, and behavioral differences observed in individuals with a deleterious variant in EBF3. PMID:29162653
Presynaptic congenital myasthenic syndrome with a homozygous sequence variant in LAMA5 combines myopia, facial tics, and failure of neuromuscular transmission.

PubMed

Maselli, Ricardo A; Arredondo, Juan; Vázquez, Jessica; Chong, Jessica X; Bamshad, Michael J; Nickerson, Deborah A; Lara, Marian; Ng, Fiona; Lo, Victoria L; Pytel, Peter; McDonald, Craig M

2017-08-01

Defects in genes encoding the isoforms of the laminin alpha subunit have been linked to various phenotypic manifestations, including brain malformations, muscular dystrophy, ocular defects, cardiomyopathy, and skin abnormalities. We report here a severe defect of neuromuscular transmission in a consanguineous patient with a homozygous variant in the laminin alpha-5 subunit gene (LAMA5). The variant c.8046C>T (p.Arg2659Trp) is rare and has a predicted deleterious effect. The affected individual, who also carries a rare homozygous sequence variant in LAMA1, had muscle weakness, myopia, and facial tics. Magnetic resonance imaging of brain showed mild volume loss and periventricular T2 prolongation. Repetitive nerve stimulation revealed 50% decrement of compound muscle action potential amplitudes and 250% facilitation immediately after exercise. Endplate studies identified a profound reduction of the endplate potential quantal content and endplates with normal postsynaptic folding that were denuded or partially occupied by small nerve terminals. Expression studies revealed that p.Arg2659Trp caused decreased binding of laminin alpha-5 to SV2A and impaired laminin-521 cell-adhesion and cell projection support in primary neuronal cultures. In summary, this report describing severe neuromuscular transmission failure in a patient with a LAMA5 mutation expands the list of phenotypes associated with defects in genes encoding alpha-laminins. © 2017 Wiley Periodicals, Inc.
Screening of SHOX gene sequence variants in Saudi Arabian children with idiopathic short stature.

PubMed

Alharthi, Abdulla A; El-Hallous, Ehab I; Talaat, Iman M; Alghamdi, Hamed A; Almalki, Matar I; Gaber, Ahmed

2017-10-01

Short stature affects approximately 2%-3% of children, representing one of the most frequent disorders for which clinical attention is sought during childhood. Despite assumed genetic heterogeneity, mutations or deletions in the short stature homeobox-containing gene (SHOX) are frequently detected in subjects with short stature. Idiopathic short stature (ISS) refers to patients with short stature for various unknown reasons. The goal of this study was to screen all the exons of SHOX to identify related mutations. We screened all the exons of SHOX for mutations in 105 children with ISS (57 girls and 48 boys) living in Taif governorate, KSA, using a direct DNA sequencing method. Height, arm span, and sitting height were recorded, and subischial leg length was calculated. A total of 30 of 105 ISS patients (28%) carried six polymorphic variants in exons 1, 2, 4, and 6. One mutation was found in the DNA-binding domain region of exon 4. Three of these polymorphic variants were novel, while the others were reported previously. There were no significant differences in anthropometric measures in ISS patients with and without identifiable polymorphic variants in SHOX. In Saudi Arabian ISS patients, it is possible that genes other than SHOX are involved in longitudinal growth. Additional molecular analysis is required to diagnose and understand the etiology of this disease.
Evaluation of amplification refractory mutation system (ARMS) technique for quick and accurate prenatal gene diagnosis of CHM variant in choroideremia.

PubMed

Yang, Lisha; Ijaz, Iqra; Cheng, Jingliang; Wei, Chunli; Tan, Xiaojun; Khan, Md Asaduzzaman; Fu, Xiaodong; Fu, Junjiang

2018-01-01

Choroideremia is a rare X-linked recessive inherited disorder that causes chorioretinal dystrophy, leading to visual impairment in its early stages and finally to total blindness in the affected person. It is caused by mutations in the CHM gene. In this study, we recruited a pedigree with choroideremia and detected a nonsense variant (c.C799T:p.R267X) in CHM of the proband (I:1). Different primer sets for the amplification refractory mutation system (ARMS) were designed and PCR conditions were optimized. Then, we evaluated the sequence variant in the patient, carrier, and a fetus by using the ARMS technique to identify whether they inherited the pathogenic gene from the parental generation; we used amniotic fluid DNA for the diagnosis of the gene in the fetus. The primer pairs, WT2+C and MT+C, amplified highly specific products in different DNAs, which were verified by Sanger sequencing. Based on our results, the ARMS technique is a fast, accurate, and reliable prenatal gene diagnostic tool to assess CHM variants. Taken together, our study indicates that the ARMS technique can be used as a potential molecular tool in the prenatal diagnosis of mutations for choroideremia as well as other genetic diseases in undeveloped and developing countries, where there might be a shortage of medical resources and supplies.
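The allele-specific principle behind ARMS can be sketched in a few lines: two primers share the same stem and differ only at the 3' terminal base, one matching the wild-type allele and one the mutant, and each is paired with a common primer on the other side. The template sequence, position, primer length and function name below are hypothetical and are not the WT2, MT or C primers of the study; practical ARMS designs usually also introduce a deliberate mismatch near the 3' end, which this sketch omits:

```python
def arms_allele_primers(template: str, snv_index: int, ref_base: str,
                        alt_base: str, length: int = 20) -> tuple[str, str]:
    """Return (wild_type_primer, mutant_primer) ending exactly at the SNV.

    Both primers share the same stem; only the 3' terminal base differs,
    so each one is expected to extend efficiently only on its own allele.
    """
    if template[snv_index] != ref_base:
        raise ValueError("template does not carry the reference base at snv_index")
    start = snv_index - length + 1
    if start < 0:
        raise ValueError("not enough upstream sequence for a primer of this length")
    stem = template[start:snv_index]
    return stem + ref_base, stem + alt_base


# Hypothetical 40-base template with a C>T change at 0-based index 25.
template = "ACGT" * 10
wild_type_primer, mutant_primer = arms_allele_primers(template, 25, "C", "T", length=10)
print(wild_type_primer, mutant_primer)
```

In a design like this, amplification with the wild-type primer but not the mutant primer would suggest the reference allele, and the reverse would suggest the variant; the abstract reports that such products were confirmed by Sanger sequencing.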
Expression, immunogenicity and variation of iron-regulated surface protein A from bovine isolates of Staphylococcus aureus.

PubMed

Misra, Neha; Wines, Tyler F; Knopp, Colton L; McGuire, Mark A; Tinker, Juliette K

2017-05-01

Staphylococcus aureus iron-regulated surface protein A (IsdA) is a fibrinogen and fibronectin adhesin that also contributes to iron sequestration and resistance to innate immunity. IsdA is conserved in human isolates and has been investigated as a human vaccine candidate. Here we report the expression of isdA, the efficacy of anti-IsdA responses and the existence of IsdA sequence variants from bovine Staphylococcus. Clinical staphylococci were obtained from US dairy farms and assayed by PCR for the presence and expression of isdA. isdA-positive species from bovines included S. aureus, S. haemolyticus and S. chromogenes. Immunoassays on bovine milk and serum confirmed the induction and opsonophagocytic activity of anti-IsdA humoral responses. The variable region of isdA was sequenced and protein alignments predicted the presence of two main variants consistent with those from human S. aureus. Mouse antibodies against one IsdA variant reduced staphylococcal binding to fibronectin in vitro in an isotype-dependent manner. Purified IsdA variants bound distinctly to fibronectin and fibrinogen. Our findings demonstrate that variability within the C-terminus of this adhesin affects immune reactivity and binding specificity, but are consistent with the significance of IsdA in bovine disease and relevant for vaccine development. © FEMS 2017. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Reducing false-positive incidental findings with ensemble genotyping and logistic regression based variant filtering methods.

PubMed

Hwang, Kyu-Baek; Lee, In-Hee; Park, Jin-Ho; Hambuch, Tina; Choe, Yongjoon; Kim, MinHyeok; Lee, Kyungjoon; Song, Taemin; Neu, Matthew B; Gupta, Neha; Kohane, Isaac S; Green, Robert C; Kong, Sek Won

2014-08-01

As whole genome sequencing (WGS) uncovers variants associated with rare and common diseases, an immediate challenge is to minimize false-positive findings due to sequencing and variant calling errors. False positives can be reduced by combining results from orthogonal sequencing methods, but this is costly. Here, we present variant filtering approaches using logistic regression (LR) and ensemble genotyping to minimize false positives without sacrificing sensitivity. We evaluated the methods using paired WGS datasets of an extended family prepared using two sequencing platforms and a validated set of variants in NA12878. Using LR or ensemble genotyping based filtering, false-negative rates were significantly reduced by 1.1- to 17.8-fold at the same levels of false discovery rates (5.4% for heterozygous and 4.5% for homozygous single nucleotide variants (SNVs); 30.0% for heterozygous and 18.7% for homozygous insertions; 25.2% for heterozygous and 16.6% for homozygous deletions) compared to the filtering based on genotype quality scores. Moreover, ensemble genotyping excluded > 98% (105,080 of 107,167) of false positives while retaining > 95% (897 of 937) of true positives in de novo mutation (DNM) discovery in NA12878, and performed better than a consensus method using two sequencing platforms. Our proposed methods were effective in prioritizing phenotype-associated variants, and an ensemble genotyping would be essential to minimize false-positive DNM candidates. © 2014 WILEY PERIODICALS, INC.
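The de novo mutation counts quoted in the record above can be checked with a short calculation. The sketch below uses only the four counts given in the abstract; the derived precision is an inference that assumes the remaining candidates are exactly the retained true positives plus the false positives that were not excluded, and it is not a figure reported by the authors:

```python
# Counts quoted in the abstract for de novo mutation discovery in NA12878.
false_positives_total = 107_167
false_positives_excluded = 105_080
true_positives_total = 937
true_positives_retained = 897

# Share of false positives removed and share of true positives kept.
fp_excluded_rate = false_positives_excluded / false_positives_total
tp_retained_rate = true_positives_retained / true_positives_total

# Derived quantity (assumption: no other candidates remain after filtering).
fp_remaining = false_positives_total - false_positives_excluded
precision_after_filtering = true_positives_retained / (
    true_positives_retained + fp_remaining
)

print(f"false positives excluded: {fp_excluded_rate:.1%}")
print(f"true positives retained:  {tp_retained_rate:.1%}")
print(f"false positives remaining: {fp_remaining}")
print(f"implied precision: {precision_after_filtering:.1%}")
```

Both quoted thresholds hold for these counts. Under the stated assumption, true de novo mutations would still be a minority of the remaining candidates, which is consistent with the abstract's emphasis on minimizing false-positive DNM calls.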
Targeted next-generation sequencing makes new molecular diagnoses and expands genotype-phenotype relationship in Ehlers-Danlos syndrome.

PubMed

Weerakkody, Ruwan A; Vandrovcova, Jana; Kanonidou, Christina; Mueller, Michael; Gampawar, Piyush; Ibrahim, Yousef; Norsworthy, Penny; Biggs, Jennifer; Abdullah, Abdulshakur; Ross, David; Black, Holly A; Ferguson, David; Cheshire, Nicholas J; Kazkaz, Hanadi; Grahame, Rodney; Ghali, Neeti; Vandersteen, Anthony; Pope, F Michael; Aitman, Timothy J

2016-11-01

Ehlers-Danlos syndrome (EDS) comprises a group of overlapping hereditary disorders of connective tissue with significant morbidity and mortality, including major vascular complications. We sought to identify the diagnostic utility of a next-generation sequencing (NGS) panel in a mixed EDS cohort. We developed and applied PCR-based NGS assays for targeted, unbiased sequencing of 12 collagen and aortopathy genes to a cohort of 177 unrelated EDS patients. Variants were scored blind to previous genetic testing and then compared with results of previous Sanger sequencing. Twenty-eight pathogenic variants in COL5A1/2, COL3A1, FBN1, and COL1A1 and four likely pathogenic variants in COL1A1, TGFBR1/2, and SMAD3 were identified by the NGS assays. These included all previously detected single-nucleotide and other short pathogenic variants in these genes, and seven newly detected pathogenic or likely pathogenic variants leading to clinically significant diagnostic revisions. Twenty-two variants of uncertain significance were identified, seven of which were in aortopathy genes and required clinical follow-up. Unbiased NGS-based sequencing made new molecular diagnoses outside the expected EDS genotype-phenotype relationship and identified previously undetected clinically actionable variants in aortopathy susceptibility genes. These data may be of value in guiding future clinical pathways for genetic diagnosis in EDS. Genet Med 18(11), 1119-1127.
Whole-Exome Sequencing Identifies Rare and Low-Frequency Coding Variants Associated with LDL Cholesterol

PubMed Central

Lange, Leslie A.; Hu, Youna; Zhang, He; Xue, Chenyi; Schmidt, Ellen M.; Tang, Zheng-Zheng; Bizon, Chris; Lange, Ethan M.; Smith, Joshua D.; Turner, Emily H.; Jun, Goo; Kang, Hyun Min; Peloso, Gina; Auer, Paul; Li, Kuo-ping; Flannick, Jason; Zhang, Ji; Fuchsberger, Christian; Gaulton, Kyle; Lindgren, Cecilia; Locke, Adam; Manning, Alisa; Sim, Xueling; Rivas, Manuel A.; Holmen, Oddgeir L.; Gottesman, Omri; Lu, Yingchang; Ruderfer, Douglas; Stahl, Eli A.; Duan, Qing; Li, Yun; Durda, Peter; Jiao, Shuo; Isaacs, Aaron; Hofman, Albert; Bis, Joshua C.; Correa, Adolfo; Griswold, Michael E.; Jakobsdottir, Johanna; Smith, Albert V.; Schreiner, Pamela J.; Feitosa, Mary F.; Zhang, Qunyuan; Huffman, Jennifer E.; Crosby, Jacy; Wassel, Christina L.; Do, Ron; Franceschini, Nora; Martin, Lisa W.; Robinson, Jennifer G.; Assimes, Themistocles L.; Crosslin, David R.; Rosenthal, Elisabeth A.; Tsai, Michael; Rieder, Mark J.; Farlow, Deborah N.; Folsom, Aaron R.; Lumley, Thomas; Fox, Ervin R.; Carlson, Christopher S.; Peters, Ulrike; Jackson, Rebecca D.; van Duijn, Cornelia M.; Uitterlinden, André G.; Levy, Daniel; Rotter, Jerome I.; Taylor, Herman A.; Gudnason, Vilmundur; Siscovick, David S.; Fornage, Myriam; Borecki, Ingrid B.; Hayward, Caroline; Rudan, Igor; Chen, Y. Eugene; Bottinger, Erwin P.; Loos, Ruth J.F.; Sætrom, Pål; Hveem, Kristian; Boehnke, Michael; Groop, Leif; McCarthy, Mark; Meitinger, Thomas; Ballantyne, Christie M.; Gabriel, Stacey B.; O’Donnell, Christopher J.; Post, Wendy S.; North, Kari E.; Reiner, Alexander P.; Boerwinkle, Eric; Psaty, Bruce M.; Altshuler, David; Kathiresan, Sekar; Lin, Dan-Yu; Jarvik, Gail P.; Cupples, L. Adrienne; Kooperberg, Charles; Wilson, James G.; Nickerson, Deborah A.; Abecasis, Goncalo R.; Rich, Stephen S.; Tracy, Russell P.; Willer, Cristen J.; Gabriel, Stacey B.; Altshuler, David M.; Abecasis, Gonçalo R.; Allayee, Hooman; Cresci, Sharon; Daly, Mark J.; de Bakker, Paul I.W.; DePristo, Mark A.; Do, Ron; Donnelly, Peter; Farlow, Deborah N.; Fennell, Tim; Garimella, Kiran; Hazen, Stanley L.; Hu, Youna; Jordan, Daniel M.; Jun, Goo; Kathiresan, Sekar; Kang, Hyun Min; Kiezun, Adam; Lettre, Guillaume; Li, Bingshan; Li, Mingyao; Newton-Cheh, Christopher H.; Padmanabhan, Sandosh; Peloso, Gina; Pulit, Sara; Rader, Daniel J.; Reich, David; Reilly, Muredach P.; Rivas, Manuel A.; Schwartz, Steve; Scott, Laura; Siscovick, David S.; Spertus, John A.; Stitziel, Nathaniel O.; Stoletzki, Nina; Sunyaev, Shamil R.; Voight, Benjamin F.; Willer, Cristen J.; Rich, Stephen S.; Akylbekova, Ermeg; Atwood, Larry D.; Ballantyne, Christie M.; Barbalic, Maja; Barr, R. Graham; Benjamin, Emelia J.; Bis, Joshua; Boerwinkle, Eric; Bowden, Donald W.; Brody, Jennifer; Budoff, Matthew; Burke, Greg; Buxbaum, Sarah; Carr, Jeff; Chen, Donna T.; Chen, Ida Y.; Chen, Wei-Min; Concannon, Pat; Crosby, Jacy; Cupples, L. Adrienne; D’Agostino, Ralph; DeStefano, Anita L.; Dreisbach, Albert; Dupuis, Josée; Durda, J. Peter; Ellis, Jaclyn; Folsom, Aaron R.; Fornage, Myriam; Fox, Caroline S.; Fox, Ervin; Funari, Vincent; Ganesh, Santhi K.; Gardin, Julius; Goff, David; Gordon, Ora; Grody, Wayne; Gross, Myron; Guo, Xiuqing; Hall, Ira M.; Heard-Costa, Nancy L.; Heckbert, Susan R.; Heintz, Nicholas; Herrington, David M.; Hickson, DeMarc; Huang, Jie; Hwang, Shih-Jen; Jacobs, David R.; Jenny, Nancy S.; Johnson, Andrew D.; Johnson, Craig W.; Kawut, Steven; Kronmal, Richard; Kurz, Raluca; Lange, Ethan M.; Lange, Leslie A.; Larson, Martin G.; Lawson, Mark; Lewis, Cora E.; Levy, Daniel; Li, Dalin; Lin, Honghuang; Liu, Chunyu; Liu, Jiankang; Liu, Kiang; Liu, Xiaoming; Liu, Yongmei; Longstreth, William T.; Loria, Cay; Lumley, Thomas; Lunetta, Kathryn; Mackey, Aaron J.; Mackey, Rachel; Manichaikul, Ani; Maxwell, Taylor; McKnight, Barbara; Meigs, James B.; Morrison, Alanna C.; Musani, Solomon K.; Mychaleckyj, Josyf C.; Nettleton, Jennifer A.; North, Kari; O’Donnell, Christopher J.; O’Leary, Daniel; Ong, Frank; Palmas, Walter; Pankow, James S.; Pankratz, Nathan D.; Paul, Shom; Perez, Marco; Person, Sharina D.; Polak, Joseph; Post, Wendy S.; Psaty, Bruce M.; Quinlan, Aaron R.; Raffel, Leslie J.; Ramachandran, Vasan S.; Reiner, Alexander P.; Rice, Kenneth; Rotter, Jerome I.; Sanders, Jill P.; Schreiner, Pamela; Seshadri, Sudha; Shea, Steve; Sidney, Stephen; Silverstein, Kevin; Smith, Nicholas L.; Sotoodehnia, Nona; Srinivasan, Asoke; Taylor, Herman A.; Taylor, Kent; Thomas, Fridtjof; Tracy, Russell P.; Tsai, Michael Y.; Volcik, Kelly A.; Wassel, Chrstina L.; Watson, Karol; Wei, Gina; White, Wendy; Wiggins, Kerri L.; Wilk, Jemma B.; Williams, O. Dale; Wilson, Gregory; Wilson, James G.; Wolf, Phillip; Zakai, Neil A.; Hardy, John; Meschia, James F.; Nalls, Michael; Singleton, Andrew; Worrall, Brad; Bamshad, Michael J.; Barnes, Kathleen C.; Abdulhamid, Ibrahim; Accurso, Frank; Anbar, Ran; Beaty, Terri; Bigham, Abigail; Black, Phillip; Bleecker, Eugene; Buckingham, Kati; Cairns, Anne Marie; Caplan, Daniel; Chatfield, Barbara; Chidekel, Aaron; Cho, Michael; Christiani, David C.; Crapo, James D.; Crouch, Julia; Daley, Denise; Dang, Anthony; Dang, Hong; De Paula, Alicia; DeCelie-Germana, Joan; Drumm, Allen DozorMitch; Dyson, Maynard; Emerson, Julia; Emond, Mary J.; Ferkol, Thomas; Fink, Robert; Foster, Cassandra; Froh, Deborah; Gao, Li; Gershan, William; Gibson, Ronald L.; Godwin, Elizabeth; Gondor, Magdalen; Gutierrez, Hector; Hansel, Nadia N.; Hassoun, Paul M.; Hiatt, Peter; Hokanson, John E.; Howenstine, Michelle; Hummer, Laura K.; Kanga, Jamshed; Kim, Yoonhee; Knowles, Michael R.; Konstan, Michael; Lahiri, Thomas; Laird, Nan; Lange, Christoph; Lin, Lin; Lin, Xihong; Louie, Tin L.; Lynch, David; Make, Barry; Martin, Thomas R.; Mathai, Steve C.; Mathias, Rasika A.; McNamara, John; McNamara, Sharon; Meyers, Deborah; Millard, Susan; Mogayzel, Peter; Moss, Richard; Murray, Tanda; Nielson, Dennis; Noyes, Blakeslee; O’Neal, Wanda; Orenstein, David; O’Sullivan, Brian; Pace, Rhonda; Pare, Peter; Parker, H. Worth; Passero, Mary Ann; Perkett, Elizabeth; Prestridge, Adrienne; Rafaels, Nicholas M.; Ramsey, Bonnie; Regan, Elizabeth; Ren, Clement; Retsch-Bogart, George; Rock, Michael; Rosen, Antony; Rosenfeld, Margaret; Ruczinski, Ingo; Sanford, Andrew; Schaeffer, David; Sell, Cindy; Sheehan, Daniel; Silverman, Edwin K.; Sin, Don; Spencer, Terry; Stonebraker, Jackie; Tabor, Holly K.; Varlotta, Laurie; Vergara, Candelaria I.; Weiss, Robert; Wigley, Fred; Wise, Robert A.; Wright, Fred A.; Wurfel, Mark M.; Zanni, Robert; Zou, Fei; Nickerson, Deborah A.; Rieder, Mark J.; Green, Phil; Shendure, Jay; Akey, Joshua M.; Bustamante, Carlos D.; Crosslin, David R.; Eichler, Evan E.; Fox, P. Keolu; Fu, Wenqing; Gordon, Adam; Gravel, Simon; Jarvik, Gail P.; Johnsen, Jill M.; Kan, Mengyuan; Kenny, Eimear E.; Kidd, Jeffrey M.; Lara-Garduno, Fremiet; Leal, Suzanne M.; Liu, Dajiang J.; McGee, Sean; O’Connor, Timothy D.; Paeper, Bryan; Robertson, Peggy D.; Smith, Joshua D.; Staples, Jeffrey C.; Tennessen, Jacob A.; Turner, Emily H.; Wang, Gao; Yi, Qian; Jackson, Rebecca; Peters, Ulrike; Carlson, Christopher S.; Anderson, Garnet; Anton-Culver, Hoda; Assimes, Themistocles L.; Auer, Paul L.; Beresford, Shirley; Bizon, Chris; Black, Henry; Brunner, Robert; Brzyski, Robert; Burwen, Dale; Caan, Bette; Carty, Cara L.; Chlebowski, Rowan; Cummings, Steven; Curb, J. David; Eaton, Charles B.; Ford, Leslie; Franceschini, Nora; Fullerton, Stephanie M.; Gass, Margery; Geller, Nancy; Heiss, Gerardo; Howard, Barbara V.; Hsu, Li; Hutter, Carolyn M.; Ioannidis, John; Jiao, Shuo; Johnson, Karen C.; Kooperberg, Charles; Kuller, Lewis; LaCroix, Andrea; Lakshminarayan, Kamakshi; Lane, Dorothy; Lasser, Norman; LeBlanc, Erin; Li, Kuo-Ping; Limacher, Marian; Lin, Dan-Yu; Logsdon, Benjamin A.; Ludlam, Shari; Manson, JoAnn E.; Margolis, Karen; Martin, Lisa; McGowan, Joan; Monda, Keri L.; Kotchen, Jane Morley; Nathan, Lauren; Ockene, Judith; O’Sullivan, Mary Jo; Phillips, Lawrence S.; Prentice, Ross L.; Robbins, John; Robinson, Jennifer G.; Rossouw, Jacques E.; Sangi-Haghpeykar, Haleh; Sarto, Gloria E.; Shumaker, Sally; Simon, Michael S.; Stefanick, Marcia L.; Stein, Evan; Tang, Hua; Taylor, Kira C.; Thomson, Cynthia A.; Thornton, Timothy A.; Van Horn, Linda; Vitolins, Mara; Wactawski-Wende, Jean; Wallace, Robert; Wassertheil-Smoller, Sylvia; Zeng, Donglin; Applebaum-Bowden, Deborah; Feolo, Michael; Gan, Weiniu; Paltoo, Dina N.; Sholinsky, Phyliss; Sturcke, Anne

2014-01-01

Elevated low-density lipoprotein cholesterol (LDL-C) is a treatable, heritable risk factor for cardiovascular disease. Genome-wide association studies (GWASs) have identified 157 variants associated with lipid levels but are not well suited to assess the impact of rare and low-frequency variants. To determine whether rare or low-frequency coding variants are associated with LDL-C, we exome sequenced 2,005 individuals, including 554 individuals selected for extreme LDL-C (>98th or <2nd percentile). Follow-up analyses included sequencing of 1,302 additional individuals and genotype-based analysis of 52,221 individuals. We observed significant evidence of association between LDL-C and the burden of rare or low-frequency variants in PNPLA5, encoding a phospholipase-domain-containing protein, and both known and previously unidentified variants in PCSK9, LDLR and APOB, three known lipid-related genes. The effect sizes for the burden of rare variants for each associated gene were substantially higher than those observed for individual SNPs identified from GWASs. We replicated the PNPLA5 signal in an independent large-scale sequencing study of 2,084 individuals. In conclusion, this large whole-exome-sequencing study for LDL-C identified a gene not known to be implicated in LDL-C and provides unique insight into the design and analysis of similar experiments. PMID:24507775
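The extreme-phenotype selection described in the record above (individuals above the 98th or below the 2nd percentile of LDL-C) can be illustrated with a small sketch. The function name, the use of the standard-library percentile estimate and the synthetic values are assumptions for illustration; the abstract does not describe how the study computed its percentiles or adjusted LDL-C before ranking:

```python
import statistics


def select_extremes(values, low_pct=2, high_pct=98):
    """Return indices of values below the low or above the high percentile."""
    # statistics.quantiles with n=100 returns the 99 percentile cut points.
    cuts = statistics.quantiles(values, n=100)
    low_cut = cuts[low_pct - 1]
    high_cut = cuts[high_pct - 1]
    return [i for i, v in enumerate(values) if v < low_cut or v > high_cut]


# Synthetic trait values 1..1000 standing in for measured LDL-C levels.
synthetic_ldl = list(range(1, 1001))
extreme_indices = select_extremes(synthetic_ldl)
print(len(extreme_indices), "individuals selected from", len(synthetic_ldl))
```

Sampling from both tails in this way concentrates carriers of large-effect rare variants in a comparatively small sequenced set, which is the motivation for including 554 extreme individuals among the 2,005 exomes.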
Whole-exome sequencing identifies rare and low-frequency coding variants associated with LDL cholesterol.

PubMed

Lange, Leslie A; Hu, Youna; Zhang, He; Xue, Chenyi; Schmidt, Ellen M; Tang, Zheng-Zheng; Bizon, Chris; Lange, Ethan M; Smith, Joshua D; Turner, Emily H; Jun, Goo; Kang, Hyun Min; Peloso, Gina; Auer, Paul; Li, Kuo-Ping; Flannick, Jason; Zhang, Ji; Fuchsberger, Christian; Gaulton, Kyle; Lindgren, Cecilia; Locke, Adam; Manning, Alisa; Sim, Xueling; Rivas, Manuel A; Holmen, Oddgeir L; Gottesman, Omri; Lu, Yingchang; Ruderfer, Douglas; Stahl, Eli A; Duan, Qing; Li, Yun; Durda, Peter; Jiao, Shuo; Isaacs, Aaron; Hofman, Albert; Bis, Joshua C; Correa, Adolfo; Griswold, Michael E; Jakobsdottir, Johanna; Smith, Albert V; Schreiner, Pamela J; Feitosa, Mary F; Zhang, Qunyuan; Huffman, Jennifer E; Crosby, Jacy; Wassel, Christina L; Do, Ron; Franceschini, Nora; Martin, Lisa W; Robinson, Jennifer G; Assimes, Themistocles L; Crosslin, David R; Rosenthal, Elisabeth A; Tsai, Michael; Rieder, Mark J; Farlow, Deborah N; Folsom, Aaron R; Lumley, Thomas; Fox, Ervin R; Carlson, Christopher S; Peters, Ulrike; Jackson, Rebecca D; van Duijn, Cornelia M; Uitterlinden, André G; Levy, Daniel; Rotter, Jerome I; Taylor, Herman A; Gudnason, Vilmundur; Siscovick, David S; Fornage, Myriam; Borecki, Ingrid B; Hayward, Caroline; Rudan, Igor; Chen, Y Eugene; Bottinger, Erwin P; Loos, Ruth J F; Sætrom, Pål; Hveem, Kristian; Boehnke, Michael; Groop, Leif; McCarthy, Mark; Meitinger, Thomas; Ballantyne, Christie M; Gabriel, Stacey B; O'Donnell, Christopher J; Post, Wendy S; North, Kari E; Reiner, Alexander P; Boerwinkle, Eric; Psaty, Bruce M; Altshuler, David; Kathiresan, Sekar; Lin, Dan-Yu; Jarvik, Gail P; Cupples, L Adrienne; Kooperberg, Charles; Wilson, James G; Nickerson, Deborah A; Abecasis, Goncalo R; Rich, Stephen S; Tracy, Russell P; Willer, Cristen J

2014-02-06

Elevated low-density lipoprotein cholesterol (LDL-C) is a treatable, heritable risk factor for cardiovascular disease. Genome-wide association studies (GWASs) have identified 157 variants associated with lipid levels but are not well suited to assess the impact of rare and low-frequency variants. To determine whether rare or low-frequency coding variants are associated with LDL-C, we exome sequenced 2,005 individuals, including 554 individuals selected for extreme LDL-C (>98th or <2nd percentile). Follow-up analyses included sequencing of 1,302 additional individuals and genotype-based analysis of 52,221 individuals. We observed significant evidence of association between LDL-C and the burden of rare or low-frequency variants in PNPLA5, encoding a phospholipase-domain-containing protein, and both known and previously unidentified variants in PCSK9, LDLR and APOB, three known lipid-related genes. The effect sizes for the burden of rare variants for each associated gene were substantially higher than those observed for individual SNPs identified from GWASs. We replicated the PNPLA5 signal in an independent large-scale sequencing study of 2,084 individuals. In conclusion, this large whole-exome-sequencing study for LDL-C identified a gene not known to be implicated in LDL-C and provides unique insight into the design and analysis of similar experiments. Copyright © 2014 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
Novel mutations in PAX6, OTX2 and NDP in anophthalmia, microphthalmia and coloboma.

PubMed

Deml, Brett; Reis, Linda M; Lemyre, Emmanuelle; Clark, Robin D; Kariminejad, Ariana; Semina, Elena V

2016-04-01

Anophthalmia and microphthalmia (A/M) are developmental ocular malformations defined as the complete absence or reduction in size of the eye. A/M is a highly heterogeneous disorder with SOX2 and FOXE3 playing major roles in dominant and recessive pedigrees, respectively; however, the majority of cases lack a genetic etiology. We analyzed 28 probands affected with A/M spectrum (without mutations in SOX2/FOXE3) by whole-exome sequencing. Analysis of 83 known A/M factors identified pathogenic/likely pathogenic variants in PAX6, OTX2 and NDP in three patients. A novel heterozygous likely pathogenic variant in PAX6, c.767T>C, p.(Val256Ala), was identified in two brothers with bilateral microphthalmia, coloboma, primary aphakia, iris hypoplasia, sclerocornea and congenital glaucoma; the unaffected mother appears to be a mosaic carrier. While A/M has been reported as a rare feature, this is the first report of congenital primary aphakia in association with PAX6 and the identified allele represents the first variant in the PAX6 homeodomain to be associated with A/M. A novel pathogenic variant in OTX2, c.651delC, p.(Thr218Hisfs*76), in a patient with syndromic bilateral anophthalmia and a hemizygous pathogenic variant in NDP, c.293C>T, p.(Pro98Leu), in two brothers with isolated bilateral microphthalmia and sclerocornea were also identified. Pathogenic/likely pathogenic variants were not discovered in the 25 remaining A/M cases. This study underscores the utility of whole-exome sequencing for identification of causative mutations in highly variable ocular phenotypes as well as the extreme genetic heterogeneity of A/M conditions.
Novel mutations in PAX6, OTX2 and NDP in anophthalmia, microphthalmia and coloboma

PubMed Central

Deml, Brett; Reis, Linda M; Lemyre, Emmanuelle; Clark, Robin D; Kariminejad, Ariana; Semina, Elena V

2016-01-01

Anophthalmia and microphthalmia (A/M) are developmental ocular malformations defined as the complete absence or reduction in size of the eye. A/M is a highly heterogeneous disorder with SOX2 and FOXE3 playing major roles in dominant and recessive pedigrees, respectively; however, the majority of cases lack a genetic etiology. We analyzed 28 probands affected with A/M spectrum (without mutations in SOX2/FOXE3) by whole-exome sequencing. Analysis of 83 known A/M factors identified pathogenic/likely pathogenic variants in PAX6, OTX2 and NDP in three patients. A novel heterozygous likely pathogenic variant in PAX6, c.767T>C, p.(Val256Ala), was identified in two brothers with bilateral microphthalmia, coloboma, primary aphakia, iris hypoplasia, sclerocornea and congenital glaucoma; the unaffected mother appears to be a mosaic carrier. While A/M has been reported as a rare feature, this is the first report of congenital primary aphakia in association with PAX6 and the identified allele represents the first variant in the PAX6 homeodomain to be associated with A/M. A novel pathogenic variant in OTX2, c.651delC, p.(Thr218Hisfs*76), in a patient with syndromic bilateral anophthalmia and a hemizygous pathogenic variant in NDP, c.293C>T, p.(Pro98Leu), in two brothers with isolated bilateral microphthalmia and sclerocornea were also identified. Pathogenic/likely pathogenic variants were not discovered in the 25 remaining A/M cases. This study underscores the utility of whole-exome sequencing for identification of causative mutations in highly variable ocular phenotypes as well as the extreme genetic heterogeneity of A/M conditions. PMID:26130484

PVRL1 Variants Contribute to Non-Syndromic Cleft Lip and Palate in Multiple Populations

PubMed Central

Avila, Joseph R.; Jezewski, Peter A.; Vieira, Alexandre R.; Orioli, Iêda M.; Castilla, Eduardo E.; Christensen, Kaare; Daack-Hirsch, Sandra; Romitti, Paul A.; Murray, Jeffrey C.

2007-01-01

Poliovirus Receptor Like-1 (PVRL1) is a member of the immunoglobulin super family that acts in the initiation and maintenance of epithelial adherens junctions and is mutated in the cleft lip and palate/ectodermal dysplasia 1 syndrome (CLPED1, OMIM #225000). In addition, a common non-sense mutation in PVRL1 was discovered more often among non-syndromic sporadic clefting cases in Northern Venezuela in a previous case-control study. The present work sought to ascertain the role of PVRL1 in the sporadic forms of orofacial clefting in multiple populations. Multiple rare and common variants from all three splice isoforms were initially ascertained by sequencing 92 Iowan and 86 Filipino cases and CEPH controls. Using a family-based analysis to examine these variants, the common glycine allele of the G361V coding variant was significantly overtransmitted among all orofacial clefting phenotypes (P = 0.005). This represented G361V genotyping from over 800 Iowan, Danish, and Filipino families. Among four rare amino acid changes found within the V1 and C1 domains, S112T and T131A were found adjacent to critical amino acid positions within the V1 variable domain, regions previously shown to mediate cell-to-cell and cell-to-virus adhesion. The T131A variant was not found in over 1,300 non-affected control samples although the alanine is found in other species. The serine of the S112T variant position is conserved across all known PVRL1 sequences. Together these data suggest that both rare and common mutations within PVRL1 make a minor contribution to disrupting the initiation and regulation of cell-to-cell adhesion and downstream morphogenesis of the embryonic face. PMID:17089422
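Overtransmission of an allele in a family-based analysis, as reported for G361V in the record above, is often assessed with a transmission disequilibrium test (TDT). The abstract does not name the exact test used, so the sketch below shows the classic TDT statistic only as an illustration of the idea; the function name and the transmission counts are hypothetical and are not data from the study:

```python
import math


def tdt(transmitted: int, untransmitted: int) -> tuple[float, float]:
    """Classic TDT: chi-square (1 df) and p-value from heterozygous parents.

    transmitted   -- times the allele of interest was passed to an affected child
    untransmitted -- times the other allele was passed instead
    """
    total = transmitted + untransmitted
    if total == 0:
        raise ValueError("no informative transmissions")
    chi2 = (transmitted - untransmitted) ** 2 / total
    # Survival function of a chi-square distribution with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p_value


# Hypothetical counts: 60 transmissions versus 40 non-transmissions.
chi2, p_value = tdt(60, 40)
print(f"chi2 = {chi2:.2f}, p = {p_value:.4f}")
```

Because only heterozygous parents contribute and each family serves as its own control, a statistic of this kind is robust to population stratification, which matters when families from several populations (here Iowan, Danish and Filipino) are analysed together.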
Contributions of Function-Altering Variants in Genes Implicated in Pubertal Timing and Body Mass for Self-Limited Delayed Puberty.

PubMed

Howard, Sasha R; Guasti, Leonardo; Poliandri, Ariel; David, Alessia; Cabrera, Claudia P; Barnes, Michael R; Wehkalampi, Karoliina; O'Rahilly, Stephen; Aiken, Catherine E; Coll, Anthony P; Ma, Marcella; Rimmington, Debra; Yeo, Giles S H; Dunkel, Leo

2018-02-01

Self-limited delayed puberty (DP) is often associated with a delay in physical maturation, but although highly heritable the causal genetic factors remain elusive. Genome-wide association studies of the timing of puberty have identified multiple loci for age at menarche in females and voice break in males, particularly in pathways controlling energy balance. We sought to assess the contribution of rare variants in such genes to the phenotype of familial DP. We performed whole-exome sequencing in 67 pedigrees (125 individuals with DP and 35 unaffected controls) from our unique cohort of familial self-limited DP. Using a whole-exome sequencing filtering pipeline one candidate gene [fat mass and obesity-associated gene (FTO)] was identified. In silico, in vitro, and mouse model studies were performed to investigate the pathogenicity of FTO variants and timing of puberty in FTO+/- mice. We identified potentially pathogenic, rare variants in genes in linkage disequilibrium with genome-wide association studies of age at menarche loci in 283 genes. Of these, five genes were implicated in the control of body mass. After filtering for segregation with trait, one candidate, FTO, was retained. Two FTO variants, found in 14 affected individuals from three families, were also associated with leanness in these patients with DP. One variant (p.Leu44Val) demonstrated altered demethylation activity of the mutant protein in vitro. Fto+/- mice displayed a significantly delayed timing of pubertal onset (P < 0.05). Mutations in genes implicated in body mass and timing of puberty in the general population may contribute to the pathogenesis of self-limited DP. Copyright © 2017 Endocrine Society
Malan syndrome: Sotos-like overgrowth with de novo NFIX sequence variants and deletions in six new patients and a review of the literature.

PubMed

Klaassens, Merel; Morrogh, Deborah; Rosser, Elisabeth M; Jaffer, Fatima; Vreeburg, Maaike; Bok, Levinus A; Segboer, Tim; van Belzen, Martine; Quinlivan, Ros M; Kumar, Ajith; Hurst, Jane A; Scott, Richard H

2015-05-01

De novo monoallelic variants in NFIX cause two distinct syndromes. Whole gene deletions, nonsense variants and missense variants affecting the DNA-binding domain have been seen in association with a Sotos-like phenotype that we propose is referred to as Malan syndrome. Frameshift and splice-site variants thought to avoid nonsense-mediated RNA decay have been seen in Marshall-Smith syndrome. We report six additional patients with Malan syndrome and de novo NFIX deletions or sequence variants and review the 20 patients now reported. The phenotype is characterised by moderate postnatal overgrowth and macrocephaly. Median height and head circumference in childhood are 2.0 and 2.3 standard deviations (SD) above the mean, respectively. There is overlap of the facial phenotype with NSD1-positive Sotos syndrome in some cases including a prominent forehead, high anterior hairline, downslanting palpebral fissures and prominent chin. Neonatal feeding difficulties and/or hypotonia have been reported in 30% of patients. Developmental delay/learning disability have been reported in all cases and are typically moderate. Ocular phenotypes are common, including strabismus (65%), nystagmus (25%) and optic disc pallor/hypoplasia (25%). Other recurrent features include pectus excavatum (40%) and scoliosis (25%). Eight reported patients have a deletion also encompassing CACNA1A, haploinsufficiency of which causes episodic ataxia type 2 or familial hemiplegic migraine. One previous case had episodic ataxia and one case we report has had cyclical vomiting responsive to pizotifen. In individuals with this contiguous gene deletion syndrome, awareness of possible later neurological manifestations is important, although their penetrance is not yet clear.
Silent genetic alterations identified by targeted next-generation sequencing in pheochromocytoma/paraganglioma: A clinicopathological correlations.

PubMed

Pillai, Suja; Gopalan, Vinod; Lo, Chung Y; Liew, Victor; Smith, Robert A; Lam, Alfred King Y

2017-02-01

The goal of this pilot study was to develop a customized, cost-effective amplicon panel (Ampliseq) for target sequencing in a cohort of patients with sporadic phaeochromocytoma/paraganglioma. Phaeochromocytoma/paragangliomas from 25 patients were analysed by a targeted next-generation sequencing approach using an Ion Torrent PGM instrument. Primers for 15 target genes (NF1, RET, VHL, SDHA, SDHB, SDHC, SDHD, SDHAF2, TMEM127, MAX, MEN1, KIF1Bβ, EPAS1, CDKN2 & PHD2) were designed using ion ampliseq designer. Ion Reporter software and Ingenuity® Variant Analysis™ software (www.ingenuity.com/variants) from Ingenuity Systems were used to analyse these results. Overall, 713 variants were identified. The variants identified from the Ion Reporter ranged from 64 to 161 per patient. Single nucleotide variants (SNV) were the most common. Further annotation with the help of Ingenuity variant analysis revealed that 29 of these 713 variants were deletions. Of these, six variants were non-pathogenic and four were likely to be pathogenic. The remaining 19 variants were of uncertain significance. The most frequently altered gene in the cohort was KIF1B followed by NF1. A novel KIF1B pathogenic variant, c.3375+1G>A, was identified. The mutation was noted in a patient with clinically confirmed neurofibromatosis. Chromosome 1 carried the largest number of variants. Targeted next-generation sequencing is a sensitive method for detecting genetic changes in patients with phaeochromocytoma/paraganglioma. The precise detection of these genetic changes helps in understanding the pathogenesis of these tumours. Copyright © 2016 Elsevier Inc. All rights reserved.
Next generation sequencing to identify novel genetic variants causative of autosomal dominant familial hypercholesterolemia associated with increased risk of coronary heart disease.

PubMed

Al-Allaf, Faisal A; Athar, Mohammad; Abduljaleel, Zainularifeen; Taher, Mohiuddin M; Khan, Wajahatullah; Ba-Hammam, Faisal A; Abalkhail, Hala; Alashwal, Abdullah

2015-07-01

Familial hypercholesterolemia (FH) is an autosomal dominant inherited disease characterized by elevated plasma low-density lipoprotein cholesterol (LDL-C). It is caused by variants in Ldlr, ApoB or Pcsk9, which result in high levels of LDL-C leading to early coronary heart disease. Whole-genome sequencing is not suitable for screening FH variants because of its high cost. Hence, in this study we performed targeted customized sequencing of 12 FH genes (Ldlr, ApoB, Pcsk9, Abca1, Apoa2, Apoc3, Apon2, Arh, Ldlrap1, Apoc2, ApoE, and Lpl) that have been implicated in the homozygous phenotype of a proband pedigree, to identify candidate variants by NGS on the Ion Torrent PGM. Only three genes (Ldlr, ApoB, and Pcsk9) were found to be highly associated with FH based on the variant rate. The results showed that seven deleterious variants in the Ldlr, ApoB, and Pcsk9 genes were pathological and clinically significant based on predictions by SIFT and PolyPhen. Targeted customized sequencing is an efficient technique for screening variants among targeted FH genes. Final validation of the seven deleterious variants by capillary sequencing confirmed only one novel variant, located in exon 14 of the Ldlr gene (c.2026delG, p.Gly676fs). The variant found in the Ldlr gene was a novel heterozygous variant derived from a male in the proband. Copyright © 2015 Elsevier B.V. All rights reserved.
Genomic Rearrangements in Arabidopsis Considered as Quantitative Traits.

PubMed

Imprialou, Martha; Kahles, André; Steffen, Joshua G; Osborne, Edward J; Gan, Xiangchao; Lempe, Janne; Bhomra, Amarjit; Belfield, Eric; Visscher, Anne; Greenhalgh, Robert; Harberd, Nicholas P; Goram, Richard; Hein, Jotun; Robert-Seilaniantz, Alexandre; Jones, Jonathan; Stegle, Oliver; Kover, Paula; Tsiantis, Miltos; Nordborg, Magnus; Rätsch, Gunnar; Clark, Richard M; Mott, Richard

2017-04-01

To understand the population genetics of structural variants and their effects on phenotypes, we developed an approach to mapping structural variants that segregate in a population sequenced at low coverage. We avoid calling structural variants directly. Instead, the evidence for a potential structural variant at a locus is indicated by variation in the counts of short-reads that map anomalously to that locus. These structural variant traits are treated as quantitative traits and mapped genetically, analogously to a gene expression study. Association between a structural variant trait at one locus, and genotypes at a distant locus indicate the origin and target of a transposition. Using ultra-low-coverage (0.3×) population sequence data from 488 recombinant inbred Arabidopsis thaliana genomes, we identified 6502 segregating structural variants. Remarkably, 25% of these were transpositions. While many structural variants cannot be delineated precisely, we validated 83% of 44 predicted transposition breakpoints by polymerase chain reaction. We show that specific structural variants may be causative for quantitative trait loci for germination and resistance to infection by the fungus Albugo laibachii, isolate Nc14. Further we show that the phenotypic heritability attributable to read-mapping anomalies differs from, and, in the case of time to germination and bolting, exceeds that due to standard genetic variation. Genes within structural variants are also more likely to be silenced or dysregulated. This approach complements the prevalent strategy of structural variant discovery in fewer individuals sequenced at high coverage. It is generally applicable to large populations sequenced at low-coverage, and is particularly suited to mapping transpositions. Copyright © 2017 by the Genetics Society of America.
Early-Onset Progressive Retinal Atrophy Associated with an IQCB1 Variant in African Black-Footed Cats (Felis nigripes)

PubMed Central

Oh, Annie; Pearce, Jacqueline W.; Gandolfi, Barbara; Creighton, Erica K.; Suedmeyer, William K.; Selig, Michael; Bosiack, Ann P.; Castaner, Leilani J.; Whiting, Rebecca E. H.; Belknap, Ellen B.; Lyons, Leslie A.; Aderdein, Danielle; Alves, Paulo C.; Barsh, Gregory S.; Beale, Holly C.; Boyko, Adam R.; Castelhano, Marta G.; Chan, Patricia; Ellinwood, N. Matthew; Garrick, Dorian J.; Helps, Christopher R.; Kaelin, Christopher B.; Leeb, Tosso; Lohi, Hannes; Longeri, Maria; Malik, Richard; Montague, Michael J.; Munday, John S.; Murphy, William J.; Pedersen, Niels C.; Rothschild, Max F.; Swanson, William F.; Terio, Karen A.; Todhunter, Rory J.; Warren, Wesley C.

2017-01-01

African black-footed cats (Felis nigripes) are endangered wild felids. One male and full-sibling female African black-footed cat developed vision deficits and mydriasis as early as 3 months of age. The diagnosis of early-onset progressive retinal atrophy (PRA) was supported by reduced direct and consensual pupillary light reflexes, phenotypic presence of retinal degeneration, and a non-recordable electroretinogram with negligible amplitudes in both eyes. Whole genome sequencing, conducted on two unaffected parents and one affected offspring and compared to a variant database from 51 domestic cats and a Pallas cat, revealed 50 candidate variants that segregated concordantly with the PRA phenotype. Testing in additional affected cats confirmed that cats homozygous for a 2 base pair (bp) deletion within IQ calmodulin-binding motif-containing protein-1 (IQCB1), the gene that encodes nephrocystin-5 (NPHP5), had vision loss. The variant segregated concordantly in other related individuals within the pedigree, supporting the identification of a recessively inherited early-onset feline PRA. Analysis of the black-footed cat studbook suggests additional captive cats are at risk. Genetic testing for IQCB1 and avoidance of matings between carriers should be added to the species survival plan for captive management. PMID:28322220
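The recommendation to avoid carrier-to-carrier matings follows from standard Mendelian expectations for a recessive allele. The sketch below is an illustration only: the allele labels `N` (normal) and `d` (the 2 bp IQCB1 deletion) and the function name are assumptions made here, not notation from the study, and simple segregation at a single autosomal locus is assumed.

```python
from collections import Counter
from fractions import Fraction
from itertools import product


def offspring_genotypes(parent_a, parent_b):
    """Return {genotype: probability} for one autosomal locus.

    Each parent is a two-character string of alleles, e.g. "Nd".
    Assumes each allele is transmitted with probability 1/2.
    """
    # Enumerate the four equally likely allele pairings and normalise
    # the genotype label so that "dN" and "Nd" are counted together.
    counts = Counter("".join(sorted(pair)) for pair in product(parent_a, parent_b))
    total = sum(counts.values())
    return {genotype: Fraction(n, total) for genotype, n in counts.items()}


# Hypothetical matings: "Nd" is a carrier, "NN" carries no deletion allele.
carrier_x_carrier = offspring_genotypes("Nd", "Nd")
carrier_x_clear = offspring_genotypes("Nd", "NN")

for label, result in [("carrier x carrier", carrier_x_carrier),
                      ("carrier x clear", carrier_x_clear)]:
    print(label, {g: str(p) for g, p in result.items()})
```

Under these assumptions a carrier-to-carrier mating is expected to produce homozygous (`dd`) offspring one time in four, whereas a mating between a carrier and a non-carrier produces none, which is the rationale for testing breeding animals.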
Process of labeling specific chromosomes using recombinant repetitive DNA

DOEpatents

Moyzis, R.K.; Meyne, J.

1988-02-12

Chromosome preferential nucleotide sequences are first determined from a library of recombinant DNA clones having families of repetitive sequences. Library clones are identified with a low homology with a sequence of repetitive DNA families to which the first clones respectively belong and variant sequences are then identified by selecting clones having a pattern of hybridization with genomic DNA dissimilar to the hybridization pattern shown by the respective families. In another embodiment, variant sequences are selected from a sequence of a known repetitive DNA family. The selected variant sequence is classified as chromosome specific, chromosome preferential, or chromosome nonspecific. Sequences which are classified as chromosome preferential are further sequenced and regions are identified having a low homology with other regions of the chromosome preferential sequence or with known sequences of other family members and consensus sequences of the repetitive DNA families for the chromosome preferential sequences. The selected low homology regions are then hybridized with chromosomes to determine those low homology regions hybridized with a specific chromosome under normal stringency conditions.
Whole-exome sequencing of primary plasma cell leukemia discloses heterogeneous mutational patterns.

PubMed

Cifola, Ingrid; Lionetti, Marta; Pinatel, Eva; Todoerti, Katia; Mangano, Eleonora; Pietrelli, Alessandro; Fabris, Sonia; Mosca, Laura; Simeon, Vittorio; Petrucci, Maria Teresa; Morabito, Fortunato; Offidani, Massimo; Di Raimondo, Francesco; Falcone, Antonietta; Caravita, Tommaso; Battaglia, Cristina; De Bellis, Gianluca; Palumbo, Antonio; Musto, Pellegrino; Neri, Antonino

2015-07-10

Primary plasma cell leukemia (pPCL) is a rare and aggressive form of plasma cell dyscrasia and may represent a valid model for high-risk multiple myeloma (MM). To provide novel information concerning the mutational profile of this disease, we performed whole-exome sequencing of a prospective series of 12 pPCL cases included in a Phase II multicenter clinical trial and previously characterized at clinical and molecular levels. We identified 1,928 coding somatic non-silent variants on 1,643 genes, with a mean of 166 variants per sample, and only a few variants and genes recurrent in two or more samples. An excess of C > T transitions and the presence of two main mutational signatures (related to APOBEC over-activity and aging) occurring in different translocation groups were observed. We identified 14 candidate cancer driver genes, mainly involved in cell-matrix adhesion, cell cycle, genome stability, RNA metabolism and protein folding. Furthermore, integration of mutation data with copy number alteration profiles revealed biallelically disrupted genes with potential tumor suppressor functions. Globally, cadherin/Wnt signaling, extracellular matrix and cell cycle checkpoint were the most affected functional pathways. Sequencing results were finally combined with gene expression data to better elucidate the biological relevance of mutated genes. This study represents the first whole-exome sequencing screen of pPCL and revealed a remarkable genetic heterogeneity of mutational patterns. This may contribute to the understanding of the pathogenetic mechanisms associated with this aggressive form of PC dyscrasia and potentially with high-risk MM.
CEP78 is mutated in a distinct type of Usher syndrome.

PubMed

Fu, Qing; Xu, Mingchu; Chen, Xue; Sheng, Xunlun; Yuan, Zhisheng; Liu, Yani; Li, Huajin; Sun, Zixi; Li, Huiping; Yang, Lizhu; Wang, Keqing; Zhang, Fangxia; Li, Yumei; Zhao, Chen; Sui, Ruifang; Chen, Rui

2017-03-01

Usher syndrome is a genetically heterogeneous disorder characterized by combined visual impairment and hearing loss. Although a dozen genes involved in Usher syndrome have been identified, the genetic basis remains unknown in 20-30% of patients. In this study, we aimed to identify the novel disease-causing gene of a distinct subtype of Usher syndrome. Ophthalmic examinations and hearing tests were performed on patients with Usher syndrome in two consanguineous families. Target capture sequencing was initially performed to screen for causative mutations in known retinal disease-causing loci. Whole exome sequencing (WES) and whole genome sequencing (WGS) were applied to identify novel disease-causing genes. RT-PCR and Sanger sequencing were performed to evaluate the splicing-altering effect of identified CEP78 variants. Patients from the two independent families show a mild Usher syndrome phenotype characterized by juvenile or adult-onset cone-rod dystrophy and sensorineural hearing loss. WES and WGS identified two homozygous rare variants that affect mRNA splicing of the ciliary gene CEP78. RT-PCR confirmed that the two variants indeed lead to abnormal splicing, resulting in premature stop of protein translation due to frameshift. Our results provide evidence that CEP78 is a novel disease-causing gene for Usher syndrome, demonstrating an additional link between ciliopathy and the Usher protein network in photoreceptor cells and inner ear hair cells. Published by the BMJ Publishing Group Limited.
Rare variants in APP, PSEN1 and PSEN2 increase risk for AD in late-onset Alzheimer's disease families.

PubMed

Cruchaga, Carlos; Haller, Gabe; Chakraverty, Sumitra; Mayo, Kevin; Vallania, Francesco L M; Mitra, Robi D; Faber, Kelley; Williamson, Jennifer; Bird, Tom; Diaz-Arrastia, Ramon; Foroud, Tatiana M; Boeve, Bradley F; Graff-Radford, Neill R; St Jean, Pamela; Lawson, Michael; Ehm, Margaret G; Mayeux, Richard; Goate, Alison M

2012-01-01

Pathogenic mutations in APP, PSEN1, PSEN2, MAPT and GRN have previously been linked to familial early onset forms of dementia. Mutation screening in these genes has been performed either in very small series or in single families with late onset AD (LOAD). Similarly, studies in single families have reported mutations in MAPT and GRN associated with clinical AD but no systematic screen of a large dataset has been performed to determine how frequently this occurs. We report sequence data for 439 probands from late-onset AD families with a history of four or more affected individuals. Sixty sequenced individuals (13.7%) carried a novel or pathogenic mutation. Eight pathogenic variants (one each in APP and MAPT, two in PSEN1 and four in GRN), three of which are novel, were found in 14 samples. Thirteen additional variants, present in 23 families, did not segregate with disease, but the frequency of these variants is higher in AD cases than controls, indicating that these variants may also modify risk for disease. The frequency of rare variants in these genes in this series is significantly higher than in the 1,000 genome project (p = 5.09 × 10⁻⁵; OR = 2.21; 95%CI = 1.49-3.28) or an unselected population of 12,481 samples (p = 6.82 × 10⁻⁵; OR = 2.19; 95%CI = 1.347-3.26). Rare coding variants in APP, PSEN1 and PSEN2 increase risk for or cause late onset AD. The presence of variants in these genes in LOAD and early-onset AD demonstrates that factors other than the mutation can impact the age at onset and penetrance of at least some variants associated with AD. MAPT and GRN mutations can be found in clinical series of AD, most likely due to misdiagnosis. This study clearly demonstrates that rare variants in these genes could explain an important proportion of genetic heritability of AD, which is not detected by GWAS.
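The summary statistics quoted in this abstract can be sanity-checked with a few lines of arithmetic. The sketch below is not the authors' analysis: it assumes the confidence interval is a Wald-type interval that is symmetric on the log-odds scale, and the function names are introduced here for illustration. Because the abstract does not say which test produced the p-value, only rough agreement should be expected.

```python
import math


def carrier_percentage(carriers, probands):
    """Percentage of sequenced probands carrying a variant."""
    return 100.0 * carriers / probands


def wald_summary(odds_ratio, ci_low, ci_high, z_crit=1.959964):
    """Back-calculate SE, z and a two-sided p-value from an OR and its 95% CI.

    Assumes (as an approximation) a Wald interval on the log-odds scale.
    """
    se = (math.log(ci_high) - math.log(ci_low)) / (2.0 * z_crit)
    z = math.log(odds_ratio) / se
    p_two_sided = math.erfc(abs(z) / math.sqrt(2.0))
    # For a log-symmetric interval the geometric mean of the bounds
    # should be close to the point estimate.
    geometric_mean = math.sqrt(ci_low * ci_high)
    return se, z, p_two_sided, geometric_mean


pct = carrier_percentage(60, 439)
se, z, p, geo = wald_summary(2.21, 1.49, 3.28)

print(f"carriers: {pct:.1f}%")
print(f"SE(log OR) = {se:.3f}, z = {z:.2f}, approx. p = {p:.1e}, geometric mean of CI = {geo:.2f}")
```

For the first comparison (OR = 2.21, 95% CI 1.49-3.28) the back-calculated p-value lands in the same order of magnitude as the reported one. For the second comparison (OR = 2.19, 95% CI 1.347-3.26) the interval is noticeably less symmetric on the log scale, so this approximation is rougher there.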
An investigation of causes of false positive single nucleotide polymorphisms using simulated reads from a small eukaryote genome.

PubMed

Ribeiro, Antonio; Golicz, Agnieszka; Hackett, Christine Anne; Milne, Iain; Stephen, Gordon; Marshall, David; Flavell, Andrew J; Bayer, Micha

2015-11-11

Single Nucleotide Polymorphisms (SNPs) are widely used molecular markers, and their use has increased massively since the inception of Next Generation Sequencing (NGS) technologies, which allow detection of large numbers of SNPs at low cost. However, both NGS data and their analysis are error-prone, which can lead to the generation of false positive (FP) SNPs. We explored the relationship between FP SNPs and seven factors involved in mapping-based variant calling - quality of the reference sequence, read length, choice of mapper and variant caller, mapping stringency and filtering of SNPs by read mapping quality and read depth. This resulted in 576 possible factor level combinations. We used error- and variant-free simulated reads to ensure that every SNP found was indeed a false positive. The variation in the number of FP SNPs generated ranged from 0 to 36,621 for the 120 million base pairs (Mbp) genome. All of the experimental factors tested had statistically significant effects on the number of FP SNPs generated and there was a considerable amount of interaction between the different factors. Using a fragmented reference sequence led to a dramatic increase in the number of FP SNPs generated, as did relaxed read mapping and a lack of SNP filtering. The choice of reference assembler, mapper and variant caller also significantly affected the outcome. The effect of read length was more complex and suggests a possible interaction between mapping specificity and the potential for contributing more false positives as read length increases. The choice of tools and parameters involved in variant calling can have a dramatic effect on the number of FP SNPs produced, with particularly poor combinations of software and/or parameter settings yielding tens of thousands in this experiment. Between-factor interactions make simple recommendations difficult for a SNP discovery pipeline but the quality of the reference sequence is clearly of paramount importance. 
Our findings are also a stark reminder that it can be unwise to use the relaxed mismatch settings provided as defaults by some read mappers when reads are being mapped to a relatively unfinished reference sequence from e.g. a non-model organism in its early stages of genomic exploration.
Polymorphism discovery and allele frequency estimation using high-throughput DNA sequencing of target-enriched pooled DNA samples

PubMed Central

2012-01-01

Background: The central role of the somatotrophic axis in animal post-natal growth, development and fertility is well established. Therefore, the identification of genetic variants affecting quantitative traits within this axis is an attractive goal. However, large sample numbers are a pre-requisite for the identification of genetic variants underlying complex traits and although technologies are improving rapidly, high-throughput sequencing of large numbers of complete individual genomes remains prohibitively expensive. Therefore using a pooled DNA approach coupled with target enrichment and high-throughput sequencing, the aim of this study was to identify polymorphisms and estimate allele frequency differences across 83 candidate genes of the somatotrophic axis, in 150 Holstein-Friesian dairy bulls divided into two groups divergent for genetic merit for fertility. Results: In total, 4,135 SNPs and 893 indels were identified during the resequencing of the 83 candidate genes. Nineteen percent (n = 952) of variants were located within 5' and 3' UTRs. Seventy-two percent (n = 3,612) were intronic and 9% (n = 464) were exonic, including 65 indels and 236 SNPs resulting in non-synonymous substitutions (NSS). Significant (P < 0.01) mean allele frequency differentials between the low and high fertility groups were observed for 720 SNPs (58 NSS). Allele frequencies for 43 of the SNPs were also determined by genotyping the 150 individual animals (Sequenom® MassARRAY). No significant differences (P > 0.1) were observed between the two methods for any of the 43 SNPs across both pools (i.e., 86 tests in total). Conclusions: The results of the current study support previous findings of the use of DNA sample pooling and high-throughput sequencing as a viable strategy for polymorphism discovery and allele frequency estimation.
Using this approach we have characterised the genetic variation within genes of the somatotrophic axis and related pathways, central to mammalian post-natal growth and development and subsequent lactogenesis and fertility. We have identified a large number of variants segregating at significantly different frequencies between cattle groups divergent for calving interval plausibly harbouring causative variants contributing to heritable variation. To our knowledge, this is the first report describing sequencing of targeted genomic regions in any livestock species using groups with divergent phenotypes for an economically important trait. PMID:22235840
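The variant counts reported in the Results are internally consistent, which a short check makes explicit. The first part below uses only numbers quoted in the abstract. The second part is a minimal sketch of how an allele frequency can be estimated from pooled reads: the estimator (alternate-allele reads divided by total reads at a site) is a common simplification that assumes each animal contributes equally to the pool, and the read counts and function name are hypothetical, not data from the study.

```python
# Part 1: consistency of the counts quoted in the abstract.
snps, indels = 4135, 893
total_variants = snps + indels

by_region = {"UTR": 952, "intronic": 3612, "exonic": 464}
percent_by_region = {
    region: round(100 * count / total_variants)
    for region, count in by_region.items()
}

snps_genotyped, pools = 43, 2
comparisons = snps_genotyped * pools  # one test per SNP per pool

print(total_variants, percent_by_region, comparisons)


# Part 2: sketch of pooled allele-frequency estimation (hypothetical data).
def pooled_allele_frequency(alt_reads, total_reads):
    """Estimate the alternate-allele frequency in a DNA pool from read counts.

    Assumes equal DNA contribution per animal and unbiased sequencing.
    """
    if total_reads <= 0:
        raise ValueError("total_reads must be positive")
    return alt_reads / total_reads


# Hypothetical read counts at one SNP in the two fertility pools.
freq_low = pooled_allele_frequency(alt_reads=120, total_reads=400)
freq_high = pooled_allele_frequency(alt_reads=220, total_reads=400)
frequency_differential = freq_high - freq_low
```

The region counts sum to the total of SNPs plus indels, the rounded percentages match those quoted, and 43 SNPs compared in two pools give the 86 tests mentioned.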
A survey of tools for variant analysis of next-generation genome sequencing data

PubMed Central

Pabinger, Stephan; Dander, Andreas; Fischer, Maria; Snajder, Rene; Sperk, Michael; Efremova, Mirjana; Krabichler, Birgit; Speicher, Michael R.; Zschocke, Johannes

2014-01-01

Recent advances in genome sequencing technologies provide unprecedented opportunities to characterize individual genomic landscapes and identify mutations relevant for diagnosis and therapy. Specifically, whole-exome sequencing using next-generation sequencing (NGS) technologies is gaining popularity in the human genetics community due to the moderate costs, manageable data amounts and straightforward interpretation of analysis results. While whole-exome and, in the near future, whole-genome sequencing are becoming commodities, data analysis still poses significant challenges and has led to the development of a plethora of tools supporting specific parts of the analysis workflow or providing a complete solution. Here, we surveyed 205 tools for whole-genome/whole-exome sequencing data analysis supporting five distinct analytical steps: quality assessment, alignment, variant identification, variant annotation and visualization. We report an overview of the functionality, features and specific requirements of the individual tools. We then selected 32 programs for variant identification, variant annotation and visualization, which were subjected to hands-on evaluation using four data sets: one set of exome data from two patients with a rare disease for testing identification of germline mutations, two cancer data sets for testing variant callers for somatic mutations, copy number variations and structural variations, and one semi-synthetic data set for testing identification of copy number variations. Our comprehensive survey and evaluation of NGS tools provides a valuable guideline for human geneticists working on Mendelian disorders, complex diseases and cancers. PMID:23341494
Whole exome sequencing identifies genetic variants in inherited thrombocytopenia with secondary qualitative function defects

PubMed Central

Johnson, Ben; Lowe, Gillian C.; Futterer, Jane; Lordkipanidzé, Marie; MacDonald, David; Simpson, Michael A.; Sanchez-Guiú, Isabel; Drake, Sian; Bem, Danai; Leo, Vincenzo; Fletcher, Sarah J.; Dawood, Ban; Rivera, José; Allsup, David; Biss, Tina; Bolton-Maggs, Paula HB; Collins, Peter; Curry, Nicola; Grimley, Charlotte; James, Beki; Makris, Mike; Motwani, Jayashree; Pavord, Sue; Talks, Katherine; Thachil, Jecko; Wilde, Jonathan; Williams, Mike; Harrison, Paul; Gissen, Paul; Mundell, Stuart; Mumford, Andrew; Daly, Martina E.; Watson, Steve P.; Morgan, Neil V.

2016-01-01

Inherited thrombocytopenias are a heterogeneous group of disorders characterized by abnormally low platelet counts which can be associated with abnormal bleeding. Next-generation sequencing has previously been employed in these disorders for the confirmation of suspected genetic abnormalities, and more recently in the discovery of novel disease-causing genes. However, its full potential has not yet been exploited. Over the past 6 years we have sequenced the exomes from 55 patients, including 37 index cases and 18 additional family members, all of whom were recruited to the UK Genotyping and Phenotyping of Platelets study. All patients had inherited or sustained thrombocytopenia of unknown etiology with platelet counts varying from 11 × 10⁹/L to 186 × 10⁹/L. Of the 51 patients phenotypically tested, 37 (73%) had an additional secondary qualitative platelet defect. Using whole exome sequencing analysis we have identified “pathogenic” or “likely pathogenic” variants in 46% (17/37) of our index patients with thrombocytopenia. In addition, we report variants of uncertain significance in 12 index cases, including novel candidate genetic variants in previously unreported genes in four index cases. These results demonstrate that whole exome sequencing is an efficient method for elucidating potential pathogenic genetic variants in inherited thrombocytopenia. Whole exome sequencing also has the added benefit of discovering potentially pathogenic genetic variants for further study in novel genes not previously implicated in inherited thrombocytopenia. PMID:27479822
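The cohort sizes and percentages quoted in this abstract can be reproduced directly from the counts it gives. The snippet below uses only those counts. The helper name is introduced here for illustration, and the final line simply adds the 12 index cases with variants of uncertain significance to the 17 with pathogenic or likely pathogenic variants, on the assumption (not stated in the abstract) that the two groups do not overlap.

```python
def percent(part, whole):
    """Return part/whole as a percentage rounded to the nearest integer."""
    return round(100 * part / whole)


index_cases, family_members = 37, 18
cohort_size = index_cases + family_members

secondary_defect_pct = percent(37, 51)   # patients with a qualitative platelet defect
diagnostic_yield_pct = percent(17, 37)   # index cases with (likely) pathogenic variants

# Assumption: the 12 VUS index cases are distinct from the 17 diagnosed ones.
index_cases_with_any_reported_variant = 17 + 12

print(cohort_size, secondary_defect_pct, diagnostic_yield_pct,
      index_cases_with_any_reported_variant)
```

Under that assumption, 29 of the 37 index cases would carry at least one reported variant, leaving 8 with no candidate reported.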
Inferring Short-Range Linkage Information from Sequencing Chromatograms

PubMed Central

Beggel, Bastian; Neumann-Fraune, Maria; Kaiser, Rolf; Verheyen, Jens; Lengauer, Thomas

2013-01-01

Direct Sanger sequencing of viral genome populations yields multiple ambiguous sequence positions. It is not straightforward to derive linkage information from sequencing chromatograms, which in turn hampers the correct interpretation of the sequence data. We present a method for determining the variants existing in a viral quasispecies in the case of two nearby ambiguous sequence positions by exploiting the effect of sequence context-dependent incorporation of dideoxynucleotides. The computational model was trained on data from sequencing chromatograms of clonal variants and was evaluated on two test sets of in vitro mixtures. The approach achieved high accuracies in identifying the mixture components of 97.4% on a test set in which the positions to be analyzed are only one base apart from each other, and of 84.5% on a test set in which the ambiguous positions are separated by three bases. In silico experiments suggest two major limitations of our approach in terms of accuracy. First, due to a basic limitation of Sanger sequencing, it is not possible to reliably detect minor variants with a relative frequency of no more than 10%. Second, the model cannot distinguish between mixtures of two or four clonal variants, if one of two sets of linear constraints is fulfilled. Furthermore, the approach requires repetitive sequencing of all variants that might be present in the mixture to be analyzed. Nevertheless, the effectiveness of our method on the two in vitro test sets shows that short-range linkage information of two ambiguous sequence positions can be inferred from Sanger sequencing chromatograms without any further assumptions on the mixture composition. Additionally, our model provides new insights into the established and widely used Sanger sequencing technology. The source code of our method is made available at http://bioinf.mpi-inf.mpg.de/publications/beggel/linkageinformation.zip. PMID:24376502
Common and rare variants associated with kidney stones and biochemical traits

PubMed Central

Oddsson, Asmundur; Sulem, Patrick; Helgason, Hannes; Edvardsson, Vidar O.; Thorleifsson, Gudmar; Sveinbjörnsson, Gardar; Haraldsdottir, Eik; Eyjolfsson, Gudmundur I.; Sigurdardottir, Olof; Olafsson, Isleifur; Masson, Gisli; Holm, Hilma; Gudbjartsson, Daniel F.; Thorsteinsdottir, Unnur; Indridason, Olafur S.; Palsson, Runolfur; Stefansson, Kari

2015-01-01

Kidney stone disease is a complex disorder with a strong genetic component. We conducted a genome-wide association study of 28.3 million sequence variants detected through whole-genome sequencing of 2,636 Icelanders that were imputed into 5,419 kidney stone cases, including 2,172 cases with a history of recurrent kidney stones, and 279,870 controls. We identify sequence variants associating with kidney stones at ALPL (rs1256328[T], odds ratio (OR)=1.21, P=5.8 × 10⁻¹⁰) and a suggestive association at CASR (rs7627468[A], OR=1.16, P=2.0 × 10⁻⁸). Focusing our analysis on coding sequence variants in 63 genes with preferential kidney expression we identify two rare missense variants SLC34A1 p.Tyr489Cys (OR=2.38, P=2.8 × 10⁻⁵) and TRPV5 p.Leu530Arg (OR=3.62, P=4.1 × 10⁻⁵) associating with recurrent kidney stones. We also observe associations of the identified kidney stone variants with biochemical traits in a large population set, indicating potential biological mechanism. PMID:26272126
Common and rare variants associated with kidney stones and biochemical traits.

PubMed

Oddsson, Asmundur; Sulem, Patrick; Helgason, Hannes; Edvardsson, Vidar O; Thorleifsson, Gudmar; Sveinbjörnsson, Gardar; Haraldsdottir, Eik; Eyjolfsson, Gudmundur I; Sigurdardottir, Olof; Olafsson, Isleifur; Masson, Gisli; Holm, Hilma; Gudbjartsson, Daniel F; Thorsteinsdottir, Unnur; Indridason, Olafur S; Palsson, Runolfur; Stefansson, Kari

2015-08-14

Kidney stone disease is a complex disorder with a strong genetic component. We conducted a genome-wide association study of 28.3 million sequence variants detected through whole-genome sequencing of 2,636 Icelanders that were imputed into 5,419 kidney stone cases, including 2,172 cases with a history of recurrent kidney stones, and 279,870 controls. We identify sequence variants associating with kidney stones at ALPL (rs1256328[T], odds ratio (OR)=1.21, P=5.8 × 10(-10)) and a suggestive association at CASR (rs7627468[A], OR=1.16, P=2.0 × 10(-8)). Focusing our analysis on coding sequence variants in 63 genes with preferential kidney expression we identify two rare missense variants SLC34A1 p.Tyr489Cys (OR=2.38, P=2.8 × 10(-5)) and TRPV5 p.Leu530Arg (OR=3.62, P=4.1 × 10(-5)) associating with recurrent kidney stones. We also observe associations of the identified kidney stone variants with biochemical traits in a large population set, indicating potential biological mechanism.
From days to hours: reporting clinically actionable variants from whole genome sequencing.

PubMed

Middha, Sumit; Baheti, Saurabh; Hart, Steven N; Kocher, Jean-Pierre A

2014-01-01

As the cost of whole genome sequencing (WGS) decreases, clinical laboratories will be looking at broadly adopting this technology to screen for variants of clinical significance. To fully leverage this technology in a clinical setting, results need to be reported quickly, as the turnaround time could potentially impact patient care. The latest sequencers can sequence a whole human genome in about 24 hours. However, depending on the computing infrastructure available, the processing of data can take several days, with the majority of computing time devoted to aligning reads to genomic regions that are to date not clinically interpretable. In an attempt to accelerate the reporting of clinically actionable variants, we have investigated the utility of a multi-step alignment algorithm focused on aligning reads and calling variants in genomic regions of clinical relevance prior to processing the remaining reads on the whole genome. This iterative workflow significantly accelerates the reporting of clinically actionable variants with no loss of accuracy when compared to genotypes obtained with the OMNI SNP platform or to variants detected with a standard workflow that combines Novoalign and GATK.
The genetic architecture of type 2 diabetes.

PubMed

Fuchsberger, Christian; Flannick, Jason; Teslovich, Tanya M; Mahajan, Anubha; Agarwala, Vineeta; Gaulton, Kyle J; Ma, Clement; Fontanillas, Pierre; Moutsianas, Loukas; McCarthy, Davis J; Rivas, Manuel A; Perry, John R B; Sim, Xueling; Blackwell, Thomas W; Robertson, Neil R; Rayner, N William; Cingolani, Pablo; Locke, Adam E; Tajes, Juan Fernandez; Highland, Heather M; Dupuis, Josee; Chines, Peter S; Lindgren, Cecilia M; Hartl, Christopher; Jackson, Anne U; Chen, Han; Huyghe, Jeroen R; van de Bunt, Martijn; Pearson, Richard D; Kumar, Ashish; Müller-Nurasyid, Martina; Grarup, Niels; Stringham, Heather M; Gamazon, Eric R; Lee, Jaehoon; Chen, Yuhui; Scott, Robert A; Below, Jennifer E; Chen, Peng; Huang, Jinyan; Go, Min Jin; Stitzel, Michael L; Pasko, Dorota; Parker, Stephen C J; Varga, Tibor V; Green, Todd; Beer, Nicola L; Day-Williams, Aaron G; Ferreira, Teresa; Fingerlin, Tasha; Horikoshi, Momoko; Hu, Cheng; Huh, Iksoo; Ikram, Mohammad Kamran; Kim, Bong-Jo; Kim, Yongkang; Kim, Young Jin; Kwon, Min-Seok; Lee, Juyoung; Lee, Selyeong; Lin, Keng-Han; Maxwell, Taylor J; Nagai, Yoshihiko; Wang, Xu; Welch, Ryan P; Yoon, Joon; Zhang, Weihua; Barzilai, Nir; Voight, Benjamin F; Han, Bok-Ghee; Jenkinson, Christopher P; Kuulasmaa, Teemu; Kuusisto, Johanna; Manning, Alisa; Ng, Maggie C Y; Palmer, Nicholette D; Balkau, Beverley; Stančáková, Alena; Abboud, Hanna E; Boeing, Heiner; Giedraitis, Vilmantas; Prabhakaran, Dorairaj; Gottesman, Omri; Scott, James; Carey, Jason; Kwan, Phoenix; Grant, George; Smith, Joshua D; Neale, Benjamin M; Purcell, Shaun; Butterworth, Adam S; Howson, Joanna M M; Lee, Heung Man; Lu, Yingchang; Kwak, Soo-Heon; Zhao, Wei; Danesh, John; Lam, Vincent K L; Park, Kyong Soo; Saleheen, Danish; So, Wing Yee; Tam, Claudia H T; Afzal, Uzma; Aguilar, David; Arya, Rector; Aung, Tin; Chan, Edmund; Navarro, Carmen; Cheng, Ching-Yu; Palli, Domenico; Correa, Adolfo; Curran, Joanne E; Rybin, Denis; Farook, Vidya S; Fowler, Sharon P; Freedman, Barry I; Griswold, Michael; 
Hale, Daniel Esten; Hicks, Pamela J; Khor, Chiea-Chuen; Kumar, Satish; Lehne, Benjamin; Thuillier, Dorothée; Lim, Wei Yen; Liu, Jianjun; van der Schouw, Yvonne T; Loh, Marie; Musani, Solomon K; Puppala, Sobha; Scott, William R; Yengo, Loïc; Tan, Sian-Tsung; Taylor, Herman A; Thameem, Farook; Wilson, Gregory; Wong, Tien Yin; Njølstad, Pål Rasmus; Levy, Jonathan C; Mangino, Massimo; Bonnycastle, Lori L; Schwarzmayr, Thomas; Fadista, João; Surdulescu, Gabriela L; Herder, Christian; Groves, Christopher J; Wieland, Thomas; Bork-Jensen, Jette; Brandslund, Ivan; Christensen, Cramer; Koistinen, Heikki A; Doney, Alex S F; Kinnunen, Leena; Esko, Tõnu; Farmer, Andrew J; Hakaste, Liisa; Hodgkiss, Dylan; Kravic, Jasmina; Lyssenko, Valeriya; Hollensted, Mette; Jørgensen, Marit E; Jørgensen, Torben; Ladenvall, Claes; Justesen, Johanne Marie; Käräjämäki, Annemari; Kriebel, Jennifer; Rathmann, Wolfgang; Lannfelt, Lars; Lauritzen, Torsten; Narisu, Narisu; Linneberg, Allan; Melander, Olle; Milani, Lili; Neville, Matt; Orho-Melander, Marju; Qi, Lu; Qi, Qibin; Roden, Michael; Rolandsson, Olov; Swift, Amy; Rosengren, Anders H; Stirrups, Kathleen; Wood, Andrew R; Mihailov, Evelin; Blancher, Christine; Carneiro, Mauricio O; Maguire, Jared; Poplin, Ryan; Shakir, Khalid; Fennell, Timothy; DePristo, Mark; de Angelis, Martin Hrabé; Deloukas, Panos; Gjesing, Anette P; Jun, Goo; Nilsson, Peter; Murphy, Jacquelyn; Onofrio, Robert; Thorand, Barbara; Hansen, Torben; Meisinger, Christa; Hu, Frank B; Isomaa, Bo; Karpe, Fredrik; Liang, Liming; Peters, Annette; Huth, Cornelia; O'Rahilly, Stephen P; Palmer, Colin N A; Pedersen, Oluf; Rauramaa, Rainer; Tuomilehto, Jaakko; Salomaa, Veikko; Watanabe, Richard M; Syvänen, Ann-Christine; Bergman, Richard N; Bharadwaj, Dwaipayan; Bottinger, Erwin P; Cho, Yoon Shin; Chandak, Giriraj R; Chan, Juliana C N; Chia, Kee Seng; Daly, Mark J; Ebrahim, Shah B; Langenberg, Claudia; Elliott, Paul; Jablonski, Kathleen A; Lehman, Donna M; Jia, Weiping; Ma, Ronald C W; 
Pollin, Toni I; Sandhu, Manjinder; Tandon, Nikhil; Froguel, Philippe; Barroso, Inês; Teo, Yik Ying; Zeggini, Eleftheria; Loos, Ruth J F; Small, Kerrin S; Ried, Janina S; DeFronzo, Ralph A; Grallert, Harald; Glaser, Benjamin; Metspalu, Andres; Wareham, Nicholas J; Walker, Mark; Banks, Eric; Gieger, Christian; Ingelsson, Erik; Im, Hae Kyung; Illig, Thomas; Franks, Paul W; Buck, Gemma; Trakalo, Joseph; Buck, David; Prokopenko, Inga; Mägi, Reedik; Lind, Lars; Farjoun, Yossi; Owen, Katharine R; Gloyn, Anna L; Strauch, Konstantin; Tuomi, Tiinamaija; Kooner, Jaspal Singh; Lee, Jong-Young; Park, Taesung; Donnelly, Peter; Morris, Andrew D; Hattersley, Andrew T; Bowden, Donald W; Collins, Francis S; Atzmon, Gil; Chambers, John C; Spector, Timothy D; Laakso, Markku; Strom, Tim M; Bell, Graeme I; Blangero, John; Duggirala, Ravindranath; Tai, E Shyong; McVean, Gilean; Hanis, Craig L; Wilson, James G; Seielstad, Mark; Frayling, Timothy M; Meigs, James B; Cox, Nancy J; Sladek, Rob; Lander, Eric S; Gabriel, Stacey; Burtt, Noël P; Mohlke, Karen L; Meitinger, Thomas; Groop, Leif; Abecasis, Goncalo; Florez, Jose C; Scott, Laura J; Morris, Andrew P; Kang, Hyun Min; Boehnke, Michael; Altshuler, David; McCarthy, Mark I

2016-08-04

The genetic architecture of common traits, including the number, frequency, and effect sizes of inherited variants that contribute to individual risk, has been long debated. Genome-wide association studies have identified scores of common variants associated with type 2 diabetes, but in aggregate, these explain only a fraction of the heritability of this disease. Here, to test the hypothesis that lower-frequency variants explain much of the remainder, the GoT2D and T2D-GENES consortia performed whole-genome sequencing in 2,657 European individuals with and without diabetes, and exome sequencing in 12,940 individuals from five ancestry groups. To increase statistical power, we expanded the sample size via genotyping and imputation in a further 111,548 subjects. Variants associated with type 2 diabetes after sequencing were overwhelmingly common and most fell within regions previously identified by genome-wide association studies. Comprehensive enumeration of sequence variation is necessary to identify functional alleles that provide important clues to disease pathophysiology, but large-scale sequencing does not support the idea that lower-frequency variants have a major role in predisposition to type 2 diabetes.

The genetic architecture of type 2 diabetes

PubMed Central

Ma, Clement; Fontanillas, Pierre; Moutsianas, Loukas; McCarthy, Davis J; Rivas, Manuel A; Perry, John R B; Sim, Xueling; Blackwell, Thomas W; Robertson, Neil R; Rayner, N William; Cingolani, Pablo; Locke, Adam E; Tajes, Juan Fernandez; Highland, Heather M; Dupuis, Josee; Chines, Peter S; Lindgren, Cecilia M; Hartl, Christopher; Jackson, Anne U; Chen, Han; Huyghe, Jeroen R; van de Bunt, Martijn; Pearson, Richard D; Kumar, Ashish; Müller-Nurasyid, Martina; Grarup, Niels; Stringham, Heather M; Gamazon, Eric R; Lee, Jaehoon; Chen, Yuhui; Scott, Robert A; Below, Jennifer E; Chen, Peng; Huang, Jinyan; Go, Min Jin; Stitzel, Michael L; Pasko, Dorota; Parker, Stephen C J; Varga, Tibor V; Green, Todd; Beer, Nicola L; Day-Williams, Aaron G; Ferreira, Teresa; Fingerlin, Tasha; Horikoshi, Momoko; Hu, Cheng; Huh, Iksoo; Ikram, Mohammad Kamran; Kim, Bong-Jo; Kim, Yongkang; Kim, Young Jin; Kwon, Min-Seok; Lee, Juyoung; Lee, Selyeong; Lin, Keng-Han; Maxwell, Taylor J; Nagai, Yoshihiko; Wang, Xu; Welch, Ryan P; Yoon, Joon; Zhang, Weihua; Barzilai, Nir; Voight, Benjamin F; Han, Bok-Ghee; Jenkinson, Christopher P; Kuulasmaa, Teemu; Kuusisto, Johanna; Manning, Alisa; Ng, Maggie C Y; Palmer, Nicholette D; Balkau, Beverley; Stančáková, Alena; Abboud, Hanna E; Boeing, Heiner; Giedraitis, Vilmantas; Prabhakaran, Dorairaj; Gottesman, Omri; Scott, James; Carey, Jason; Kwan, Phoenix; Grant, George; Smith, Joshua D; Neale, Benjamin M; Purcell, Shaun; Butterworth, Adam S; Howson, Joanna M M; Lee, Heung Man; Lu, Yingchang; Kwak, Soo-Heon; Zhao, Wei; Danesh, John; Lam, Vincent K L; Park, Kyong Soo; Saleheen, Danish; So, Wing Yee; Tam, Claudia H T; Afzal, Uzma; Aguilar, David; Arya, Rector; Aung, Tin; Chan, Edmund; Navarro, Carmen; Cheng, Ching-Yu; Palli, Domenico; Correa, Adolfo; Curran, Joanne E; Rybin, Denis; Farook, Vidya S; Fowler, Sharon P; Freedman, Barry I; Griswold, Michael; Hale, Daniel Esten; Hicks, Pamela J; Khor, Chiea-Chuen; Kumar, Satish; Lehne, Benjamin; Thuillier, Dorothée; Lim, 
Wei Yen; Liu, Jianjun; van der Schouw, Yvonne T; Loh, Marie; Musani, Solomon K; Puppala, Sobha; Scott, William R; Yengo, Loïc; Tan, Sian-Tsung; Taylor, Herman A; Thameem, Farook; Wilson, Gregory; Wong, Tien Yin; Njølstad, Pål Rasmus; Levy, Jonathan C; Mangino, Massimo; Bonnycastle, Lori L; Schwarzmayr, Thomas; Fadista, João; Surdulescu, Gabriela L; Herder, Christian; Groves, Christopher J; Wieland, Thomas; Bork-Jensen, Jette; Brandslund, Ivan; Christensen, Cramer; Koistinen, Heikki A; Doney, Alex S F; Kinnunen, Leena; Esko, Tõnu; Farmer, Andrew J; Hakaste, Liisa; Hodgkiss, Dylan; Kravic, Jasmina; Lyssenko, Valeriya; Hollensted, Mette; Jørgensen, Marit E; Jørgensen, Torben; Ladenvall, Claes; Justesen, Johanne Marie; Käräjämäki, Annemari; Kriebel, Jennifer; Rathmann, Wolfgang; Lannfelt, Lars; Lauritzen, Torsten; Narisu, Narisu; Linneberg, Allan; Melander, Olle; Milani, Lili; Neville, Matt; Orho-Melander, Marju; Qi, Lu; Qi, Qibin; Roden, Michael; Rolandsson, Olov; Swift, Amy; Rosengren, Anders H; Stirrups, Kathleen; Wood, Andrew R; Mihailov, Evelin; Blancher, Christine; Carneiro, Mauricio O; Maguire, Jared; Poplin, Ryan; Shakir, Khalid; Fennell, Timothy; DePristo, Mark; de Angelis, Martin Hrabé; Deloukas, Panos; Gjesing, Anette P; Jun, Goo; Nilsson, Peter; Murphy, Jacquelyn; Onofrio, Robert; Thorand, Barbara; Hansen, Torben; Meisinger, Christa; Hu, Frank B; Isomaa, Bo; Karpe, Fredrik; Liang, Liming; Peters, Annette; Huth, Cornelia; O'Rahilly, Stephen P; Palmer, Colin N A; Pedersen, Oluf; Rauramaa, Rainer; Tuomilehto, Jaakko; Salomaa, Veikko; Watanabe, Richard M; Syvänen, Ann-Christine; Bergman, Richard N; Bharadwaj, Dwaipayan; Bottinger, Erwin P; Cho, Yoon Shin; Chandak, Giriraj R; Chan, Juliana C N; Chia, Kee Seng; Daly, Mark J; Ebrahim, Shah B; Langenberg, Claudia; Elliott, Paul; Jablonski, Kathleen A; Lehman, Donna M; Jia, Weiping; Ma, Ronald C W; Pollin, Toni I; Sandhu, Manjinder; Tandon, Nikhil; Froguel, Philippe; Barroso, Inês; Teo, Yik Ying; Zeggini, 
Eleftheria; Loos, Ruth J F; Small, Kerrin S; Ried, Janina S; DeFronzo, Ralph A; Grallert, Harald; Glaser, Benjamin; Metspalu, Andres; Wareham, Nicholas J; Walker, Mark; Banks, Eric; Gieger, Christian; Ingelsson, Erik; Im, Hae Kyung; Illig, Thomas; Franks, Paul W; Buck, Gemma; Trakalo, Joseph; Buck, David; Prokopenko, Inga; Mägi, Reedik; Lind, Lars; Farjoun, Yossi; Owen, Katharine R; Gloyn, Anna L; Strauch, Konstantin; Tuomi, Tiinamaija; Kooner, Jaspal Singh; Lee, Jong-Young; Park, Taesung; Donnelly, Peter; Morris, Andrew D; Hattersley, Andrew T; Bowden, Donald W; Collins, Francis S; Atzmon, Gil; Chambers, John C; Spector, Timothy D; Laakso, Markku; Strom, Tim M; Bell, Graeme I; Blangero, John; Duggirala, Ravindranath; Tai, E Shyong; McVean, Gilean; Hanis, Craig L; Wilson, James G; Seielstad, Mark; Frayling, Timothy M; Meigs, James B; Cox, Nancy J; Sladek, Rob; Lander, Eric S; Gabriel, Stacey; Burtt, Noël P; Mohlke, Karen L; Meitinger, Thomas; Groop, Leif; Abecasis, Goncalo; Florez, Jose C; Scott, Laura J; Morris, Andrew P; Kang, Hyun Min; Boehnke, Michael; Altshuler, David; McCarthy, Mark I

2016-01-01

The genetic architecture of common traits, including the number, frequency, and effect sizes of inherited variants that contribute to individual risk, has been long debated. Genome-wide association studies have identified scores of common variants associated with type 2 diabetes, but in aggregate, these explain only a fraction of heritability. To test the hypothesis that lower-frequency variants explain much of the remainder, the GoT2D and T2D-GENES consortia performed whole genome sequencing in 2,657 Europeans with and without diabetes, and exome sequencing in a total of 12,940 subjects from five ancestral groups. To increase statistical power, we expanded sample size via genotyping and imputation in a further 111,548 subjects. Variants associated with type 2 diabetes after sequencing were overwhelmingly common and most fell within regions previously identified by genome-wide association studies. Comprehensive enumeration of sequence variation is necessary to identify functional alleles that provide important clues to disease pathophysiology, but large-scale sequencing does not support a major role for lower-frequency variants in predisposition to type 2 diabetes. PMID:27398621
Sequence Variation in the Small-Subunit rRNA Gene of Plasmodium malariae and Prevalence of Isolates with the Variant Sequence in Sichuan, China

PubMed Central

Liu, Qing; Zhu, Shenghua; Mizuno, Sahoko; Kimura, Masatsugu; Liu, Peina; Isomura, Shin; Wang, Xingzhen; Kawamoto, Fumihiko

1998-01-01

By two PCR-based diagnostic methods, Plasmodium malariae infections have been rediscovered at two foci in the Sichuan province of China, a region where no cases of P. malariae have been officially reported for the last 2 decades. In addition, a variant form of P. malariae which has a deletion of 19 bp and seven substitutions of base pairs in the target sequence of the small-subunit (SSU) rRNA gene was detected with high frequency. Alignment analysis of Plasmodium sp. SSU rRNA gene sequences revealed that the 5′ region of the variant sequence is identical to that of P. vivax or P. knowlesi and its 3′ region is identical to that of P. malariae. The same sequence variations were also found in P. malariae isolates collected along the Thai-Myanmar border, suggesting a wide distribution of this variant form from southern China to Southeast Asia. PMID:9774600
Germline Missense Variants in the BTNL2 Gene Are Associated with Prostate Cancer Susceptibility

PubMed Central

FitzGerald, Liesel M.; Kumar, Akash; Boyle, Evan A.; Zhang, Yuzheng; McIntosh, Laura M.; Kolb, Suzanne; Stott-Miller, Marni; Smith, Tiffany; Karyadi, Danielle M.; Ostrander, Elaine A.; Hsu, Li; Shendure, Jay; Stanford, Janet L.

2013-01-01

Background Rare, inherited mutations account for 5%–10% of all prostate cancer (PCa) cases. However, to date, few causative mutations have been identified. Methods To identify rare mutations for PCa, we performed whole-exome sequencing (WES) in multiple kindreds (n = 91) from 19 hereditary prostate cancer (HPC) families characterized by aggressive or early onset phenotypes. Candidate variants (n = 130) identified through family- and bioinformatics-based filtering of WES data were then genotyped in an independent set of 270 HPC families (n = 819 PCa cases; n = 496 unaffected relatives) for replication. Two variants with supportive evidence were subsequently genotyped in a population-based case-control study (n = 1,155 incident PCa cases; n = 1,060 age-matched controls) for further confirmation. All participants were men of European ancestry. Results The strongest evidence was for two germline missense variants in the butyrophilin-like 2 (BTNL2) gene (rs41441651, p.Asp336Asn and rs28362675, p.Gly454Cys) that segregated with affection status in two of the WES families. In the independent set of 270 HPC families, 1.5% (rs41441651; P = 0.0032) and 1.2% (rs28362675; P = 0.0070) of affected men, but no unaffected men, carried a variant. Both variants were associated with elevated PCa risk in the population-based study (rs41441651: OR = 2.7; 95% CI, 1.27–5.87; P = 0.010; rs28362675: OR = 2.5; 95% CI, 1.16–5.46; P = 0.019). Conclusions Results indicate that rare BTNL2 variants play a role in susceptibility to both familial and sporadic prostate cancer. Impact Results implicate BTNL2 as a novel PCa susceptibility gene. PMID:23833122
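A quick consistency check on the odds ratios quoted above, offered as a sketch rather than a reconstruction of the authors' analysis: if the 95% confidence intervals are Wald intervals (symmetric on the log scale, which is an assumption, since the abstract does not name the method), then the geometric mean of the two bounds should reproduce the point estimate. The function name `wald_midpoint` is hypothetical; the numbers are the ones reported in the abstract.

```python
import math

def wald_midpoint(lower: float, upper: float) -> float:
    """Geometric mean of CI bounds; equals the OR if the CI is symmetric in log(OR)."""
    return math.sqrt(lower * upper)

def implied_log_se(lower: float, upper: float, z: float = 1.96) -> float:
    """Standard error of log(OR) implied by a Wald CI at the given z value."""
    return (math.log(upper) - math.log(lower)) / (2 * z)

# (reported OR, lower bound, upper bound) as quoted in the abstract
reported = {
    "rs41441651": (2.7, 1.27, 5.87),
    "rs28362675": (2.5, 1.16, 5.46),
}

for rsid, (odds_ratio, lower, upper) in reported.items():
    mid = wald_midpoint(lower, upper)
    se = implied_log_se(lower, upper)
    print(f"{rsid}: reported OR={odds_ratio}, midpoint={mid:.2f}, implied SE(log OR)={se:.2f}")
```

Under that assumption the midpoints come out at roughly 2.73 and 2.52, in line with the reported 2.7 and 2.5 after rounding.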
Higher criticism approach to detect rare variants using whole genome sequencing data

PubMed Central

2014-01-01

Because of low statistical power of single-variant tests for whole genome sequencing (WGS) data, the association test for variant groups is a key approach for genetic mapping. To address the features of sparse and weak genetic effects to be detected, the higher criticism (HC) approach has been proposed and theoretically has proven optimal for detecting sparse and weak genetic effects. Here we develop a strategy to apply the HC approach to WGS data that contains rare variants as the majority. By using Genetic Analysis Workshop 18 "dose" genetic data with simulated phenotypes, we assess the performance of HC under a variety of strategies for grouping variants and collapsing rare variants. The HC approach is compared with the minimal p-value method and the sequence kernel association test. The results show that the HC approach is preferred for detecting weak genetic effects. PMID:25519367
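For readers unfamiliar with higher criticism, the following is a minimal sketch of the statistic in its commonly cited Donoho-Jin form, computed from the p-values of the variants in one group. The abstract does not say which variant of the statistic or which grouping and collapsing rules the authors used, so the function name, the `alpha0` cutoff, and the toy p-values are all illustrative assumptions.

```python
import math

def higher_criticism(pvalues, alpha0=0.5):
    """Higher criticism statistic over the smallest alpha0 fraction of p-values.

    HC = max_i sqrt(n) * (i/n - p_(i)) / sqrt(p_(i) * (1 - p_(i))),
    where p_(1) <= ... <= p_(n) are the sorted p-values.
    Returns -inf if no p-value lies strictly between 0 and 1.
    """
    ordered = sorted(pvalues)
    n = len(ordered)
    best = float("-inf")
    for i, p in enumerate(ordered, start=1):
        if i > alpha0 * n:
            break  # only scan the smallest alpha0 fraction
        if p <= 0.0 or p >= 1.0:
            continue  # the standardisation is undefined at 0 and 1
        z = math.sqrt(n) * (i / n - p) / math.sqrt(p * (1.0 - p))
        best = max(best, z)
    return best

# Toy illustration (made-up p-values, not data from the study):
# an evenly spread "null" set versus the same set with three very small p-values.
null_p = [i / 101 for i in range(1, 101)]
signal_p = [1e-6, 1e-6, 1e-6] + null_p[3:]
print(higher_criticism(null_p), higher_criticism(signal_p))
```

The statistic is large when a few p-values are far smaller than expected under the null, which is the sparse, weak-signal setting the abstract describes. Some formulations additionally restrict the maximum to p-values above 1/n; that refinement is omitted here.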
ANGPTL8/Betatrophin R59W variant is associated with higher glucose level in non-diabetic Arabs living in Kuwaits.

PubMed

Abu-Farha, Mohamed; Melhem, Motasem; Abubaker, Jehad; Behbehani, Kazem; Alsmadi, Osama; Elkum, Naser

2016-02-11

ANGPTL8 (betatrophin) has been recently identified as a regulator of lipid metabolism through its interaction with ANGPTL3. A sequence variant in ANGPTL8 has been shown to associate with lower levels of Low Density Lipoprotein (LDL) and High Density Lipoprotein (HDL). The objective of this study is to identify sequence variants in the ANGPTL8 gene in Arabs and investigate their association with ANGPTL8 plasma level and clinical parameters. A cross-sectional study was designed to examine the level of ANGPTL8 in 283 non-diabetic Arabs, and to identify its sequence variants using Sanger sequencing and their association with various clinical parameters. Using Sanger sequencing, we sequenced the full ANGPTL8 gene in 283 Arabs, identifying two single nucleotide polymorphisms (SNPs), Rs.892066 and Rs.2278426, in the coding region. Our data show for the first time that Arabs with the heterozygote form of (c.194C > T Rs.2278426) had a higher level of Fasting Blood Glucose (FBG) compared to the CC homozygotes. LDL and HDL levels in these subjects did not differ significantly between the two subgroups. The circulating level of ANGPTL8 did not vary between the two forms. No significant changes were observed between the various forms of the Rs.892066 variant and FBG, LDL or HDL. Our data show for the first time that the heterozygote form of the ANGPTL8 Rs.2278426 variant was associated with a higher FBG level in Arabs, highlighting the importance of these variants in controlling the function of betatrophin.
A novel hemoglobin variant found on the α1 chain: Hb KSVGH (HBA1: p.Lys57_Gly58insSerHisGlySerAlaGlnValLys).

PubMed

Wang, Mei-Chun; Tsai, Kuo-Wang; Chu, Chih-Hsun; Yu, Ming-Sun; Lam, Hing-Chung

2015-01-01

Glycosylated hemoglobin (Hb A1C) is a crucial indicator for the long-term control and the diagnosis of diabetes. However, the presence of hemoglobin (Hb) variants may affect the measured value of Hb A1C and result in an abnormal graph trend and inconsistency between the clinical blood sugar test and Hb A1C values. In this study, laboratory data of 41,267 patients with diabetes were collected. The Hb A1C levels and the graph results were examined. We identified 74 cases containing abnormal Hb A1C graph trends. The conducted blood cell counts and capillary Hb electrophoresis were used to analyze Hb variants. We also determined gene variation for the Hb variants by a sequence approach. Fifteen different types of Hb variants were identified in this study. Among these, we found a novel variant in which the α1 subunit of Hb showed an insertion of 24 nucleotides (nts) between the 56th and 57th residues. We named this novel variant Hb Kaohsiung Veterans General Hospital (Hb KSVGH) (HBA1: p.Lys57_Gly58insSerHisGlySerAlaGlnValLys).
Clinical and molecular characterization of females affected by X-linked retinoschisis.

PubMed

Staffieri, Sandra E; Rose, Loreto; Chang, Andrew; De Roach, John N; McLaren, Terri L; Mackey, David A; Hewitt, Alex W; Lamey, Tina M

2015-01-01

X-linked retinoschisis (XLRS) is a leading cause of juvenile macular degeneration associated with mutations in the RS1 gene. XLRS has a variable expressivity in males and shows no clinical phenotype in carrier females. Clinical and molecular characterization of male and female individuals affected with XLRS in a consanguineous family. Consanguineous Eastern European-Australian family. Four clinically affected and nine unaffected family members were genetically and clinically characterized. Deoxyribonucleic acid (DNA) analysis was conducted by the Australian Inherited Retinal Disease Register and DNA Bank. Clinical and molecular characterization of the causative mutation in a consanguineous family with XLRS. By direct sequencing of the RS1 gene, one pathogenic variant, NM_000330.3: c.304C > T, p.R102W, was identified in all clinically diagnosed individuals analysed. The two females were homozygous for the variant, and the males were hemizygous. Clinical and genetic characterization of affected homozygous females in XLRS affords the rare opportunity to explore the molecular mechanisms of XLRS and the manifestation of these mutations as disease in humans. © 2015 Royal Australian and New Zealand College of Ophthalmologists.
Frizzled-4 Variations Associated with Retinopathy and Intrauterine Growth Retardation: A Potential Marker for Prematurity and Retinopathy.

PubMed

Dailey, Wendy A; Gryc, Wojciech; Garg, Pooja G; Drenser, Kimberly A

2015-09-01

To present the association between mutations affecting the Wnt-signaling receptor protein (FZD4), inherited vitreoretinopathies, and retinopathy of prematurity (ROP). Retrospective analysis of prospective samples at a tertiary referral center. Patients referred to our practice for management of a variety of pediatric vitreoretinopathies were offered participation in an ophthalmic biobank (421 participants with vitreoretinopathies were included in this study). Full-term healthy infants (n = 98) were recruited to the study as controls. Patients with various vitreoretinopathies were prospectively enrolled in an ophthalmic biobank, approved by the Human Investigation Committee at William Beaumont Hospital. Retrospective genetic analysis of the FZD4 gene was performed (Sanger sequencing). Participants with a diagnosis of familial exudative vitreoretinopathy (FEVR), Norrie disease, Coats' disease, bilateral persistent fetal vasculature, and ROP were reviewed for the presence of a FZD4 variant. Data retrieval included status of retinopathy (including staging when possible), gestational age (GA), birth weight (BW) (when available), and family and birth histories. The association of FZD4 variants with the presence of vitreoretinopathy. The sequence variation p.[P33S(;)P168S] is the most prevalent FZD4 variant and is statistically significant for ROP and FEVR (P = 4.6E-04 and P = 2.4E-03, respectively) compared with full-term newborns (P = 1.7E-01). In addition, infants expressing the sequence variation tended to have significantly lower BWs for respective GA (P = 0.04). This suggests that the FZD4 p.[P33S(;)P168S] variant may be a risk factor for retinopathy and restricted intrauterine growth. Testing for FZD4 gene mutations is useful in patients with suspected FEVR and ROP. The relatively high prevalence of the p.[P33S(;)P168S] variant in ROP and intrauterine growth restriction suggests that it also may be a marker for increased risk of developing ROP and preterm birth. 
Copyright © 2015 American Academy of Ophthalmology. Published by Elsevier Inc. All rights reserved.
Identification of missing variants by combining multiple analytic pipelines.

PubMed

Ren, Yingxue; Reddy, Joseph S; Pottier, Cyril; Sarangi, Vivekananda; Tian, Shulan; Sinnwell, Jason P; McDonnell, Shannon K; Biernacka, Joanna M; Carrasquillo, Minerva M; Ross, Owen A; Ertekin-Taner, Nilüfer; Rademakers, Rosa; Hudson, Matthew; Mainzer, Liudmila Sergeevna; Asmann, Yan W

2018-04-16

After decades of identifying risk factors using array-based genome-wide association studies (GWAS), genetic research of complex diseases has shifted to sequencing-based rare variants discovery. This requires large sample sizes for statistical power and has brought up questions about whether the current variant calling practices are adequate for large cohorts. It is well-known that there are discrepancies between variants called by different pipelines, and that using a single pipeline always misses true variants exclusively identifiable by other pipelines. Nonetheless, it is common practice today to call variants by one pipeline due to computational cost and assume that false negative calls are a small percent of total. We analyzed 10,000 exomes from the Alzheimer's Disease Sequencing Project (ADSP) using multiple analytic pipelines consisting of different read aligners and variant calling strategies. We compared variants identified by using two aligners in 50, 100, 200, 500, 1000, and 1952 samples; and compared variants identified by adding single-sample genotyping to the default multi-sample joint genotyping in 50, 100, 500, 2000, 5000 and 10,000 samples. We found that using a single pipeline missed increasing numbers of high-quality variants correlated with sample sizes. By combining two read aligners and two variant calling strategies, we rescued 30% of pass-QC variants at sample size of 2000, and 56% at 10,000 samples. The rescued variants had higher proportions of low frequency (minor allele frequency [MAF] 1-5%) and rare (MAF < 1%) variants, which are the very type of variants of interest. In 660 Alzheimer's disease cases with earlier onset ages of ≤65, 4 out of 13 (31%) previously-published rare pathogenic and protective mutations in APP, PSEN1, and PSEN2 genes were undetected by the default one-pipeline approach but recovered by the multi-pipeline approach. 
Identification of the complete variant set from sequencing data is the prerequisite of genetic association analyses. The current analytic practice of calling genetic variants from sequencing data using a single bioinformatics pipeline is no longer adequate with the increasingly large projects. The number and percentage of quality variants that passed quality filters but are missed by the one-pipeline approach rapidly increased with sample size.
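The idea of "rescuing" variants by combining pipelines can be sketched as a set union over call sets, keyed by position and alleles. This is only an illustration under stated assumptions: the `Variant` key, the function name `rescued_by_combining`, and the toy calls are hypothetical, the abstract does not specify how variants were matched across pipelines or which denominator its percentages use, and real comparisons would also need allele normalisation and quality filtering.

```python
from typing import Iterable, NamedTuple, Set, Tuple

class Variant(NamedTuple):
    """Hypothetical key for matching calls across pipelines."""
    chrom: str
    pos: int
    ref: str
    alt: str

def rescued_by_combining(
    default_calls: Set[Variant],
    other_pipelines: Iterable[Set[Variant]],
) -> Tuple[Set[Variant], float]:
    """Return variants found only by the additional pipelines, and their share
    of the combined call set (an assumed denominator)."""
    combined = set(default_calls)
    for calls in other_pipelines:
        combined |= calls
    rescued = combined - default_calls
    fraction = len(rescued) / len(combined) if combined else 0.0
    return rescued, fraction

# Toy call sets (invented for illustration only)
a = Variant("chr1", 1000, "A", "G")
b = Variant("chr1", 2000, "C", "T")
c = Variant("chr2", 3000, "G", "A")
rescued, fraction = rescued_by_combining({a, b}, [{b, c}])
print(rescued, round(fraction, 2))
```

In this toy case the second pipeline contributes one variant the default pipeline missed, which is one third of the combined set.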
The Rare-Variant Generalized Disequilibrium Test for Association Analysis of Nuclear and Extended Pedigrees with Application to Alzheimer Disease WGS Data.

PubMed

He, Zongxiao; Zhang, Di; Renton, Alan E; Li, Biao; Zhao, Linhai; Wang, Gao T; Goate, Alison M; Mayeux, Richard; Leal, Suzanne M

2017-02-02

Whole-genome and exome sequence data can be cost-effectively generated for the detection of rare-variant (RV) associations in families. Causal variants that aggregate in families usually have larger effect sizes than those found in sporadic cases, so family-based designs can be a more powerful approach than population-based designs. Moreover, some family-based designs are robust to confounding due to population admixture or substructure. We developed a RV extension of the generalized disequilibrium test (GDT) to analyze sequence data obtained from nuclear and extended families. The GDT utilizes genotype differences of all discordant relative pairs to assess associations within a family, and the RV extension combines the single-variant GDT statistic over a genomic region of interest. The RV-GDT has increased power by efficiently incorporating information beyond first-degree relatives and allows for the inclusion of covariates. Using simulated genetic data, we demonstrated that the RV-GDT method has well-controlled type I error rates, even when applied to admixed populations and populations with substructure. It is more powerful than existing family-based RV association methods, particularly for the analysis of extended pedigrees and pedigrees with missing data. We analyzed whole-genome sequence data from families affected by Alzheimer disease to illustrate the application of the RV-GDT. Given the capability of the RV-GDT to adequately control for population admixture or substructure and analyze pedigrees with missing genotype data and its superior power over other family-based methods, it is an effective tool for elucidating the involvement of RVs in the etiology of complex traits. Copyright © 2017 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
An XRCC4 Splice Mutation Associated With Severe Short Stature, Gonadal Failure, and Early-Onset Metabolic Syndrome

PubMed Central

de Bruin, Christiaan; Mericq, Verónica; Andrew, Shayne F.; van Duyvenvoorde, Hermine A.; Verkaik, Nicole S.; Losekoot, Monique; Porollo, Aleksey; Garcia, Hernán; Kuang, Yi; Hanson, Dan; Clayton, Peter; van Gent, Dik C.; Wit, Jan M.; Hwa, Vivian

2015-01-01

Context: Severe short stature can be caused by defects in numerous biological processes including defects in IGF-1 signaling, centromere function, cell cycle control, and DNA damage repair. Many syndromic causes of short stature are associated with medical comorbidities including hypogonadism and microcephaly. Objective: To identify an underlying genetic etiology in two siblings with severe short stature and gonadal failure. Design: Clinical phenotyping, genetic analysis, complemented by in vitro functional studies of the candidate gene. Setting: An academic pediatric endocrinology clinic. Patients or Other Participants: Two adult siblings (male patient [P1] and female patient 2 [P2]) presented with a history of severe postnatal growth failure (adult heights: P1, −6.8 SD score; P2, −4 SD score), microcephaly, primary gonadal failure, and early-onset metabolic syndrome in late adolescence. In addition, P2 developed a malignant gastrointestinal stromal tumor at age 28. Intervention(s): Single nucleotide polymorphism microarray and exome sequencing. Results: Combined microarray analysis and whole exome sequencing of the two affected siblings and one unaffected sister identified a homozygous variant in XRCC4 as the probable candidate variant. Sanger sequencing and mRNA studies revealed a splice variant resulting in an in-frame deletion of 23 amino acids. Primary fibroblasts (P1) showed a DNA damage repair defect. Conclusions: In this study we have identified a novel pathogenic variant in XRCC4, a gene that plays a critical role in non-homologous end-joining DNA repair. This finding expands the spectrum of DNA damage repair syndromes to include XRCC4 deficiency causing severe postnatal growth failure, microcephaly, gonadal failure, metabolic syndrome, and possibly tumor predisposition. PMID:25742519
Leveraging long read sequencing from a single individual to provide a comprehensive resource for benchmarking variant calling methods

PubMed Central

Mu, John C.; Tootoonchi Afshar, Pegah; Mohiyuddin, Marghoob; Chen, Xi; Li, Jian; Bani Asadi, Narges; Gerstein, Mark B.; Wong, Wing H.; Lam, Hugo Y. K.

2015-01-01

A high-confidence, comprehensive human variant set is critical in assessing accuracy of sequencing algorithms, which are crucial in precision medicine based on high-throughput sequencing. Although recent works have attempted to provide such a resource, they still do not encompass all major types of variants including structural variants (SVs). Thus, we leveraged the massive high-quality Sanger sequences from the HuRef genome to construct by far the most comprehensive gold set of a single individual, which was cross validated with deep Illumina sequencing, population datasets, and well-established algorithms. It was a necessary effort to completely reanalyze the HuRef genome as its previously published variants were mostly reported five years ago, suffering from compatibility, organization, and accuracy issues that prevent their direct use in benchmarking. Our extensive analysis and validation resulted in a gold set with high specificity and sensitivity. In contrast to the current gold sets of the NA12878 or HS1011 genomes, our gold set is the first that includes small variants, deletion SVs and insertion SVs up to a hundred thousand base-pairs. We demonstrate the utility of our HuRef gold set to benchmark several published SV detection tools. PMID:26412485
North Carolina macular dystrophy (MCDR1) caused by a novel tandem duplication of the PRDM13 gene

PubMed Central

Sullivan, Lori S.; Wheaton, Dianna K.; Locke, Kirsten G.; Jones, Kaylie D.; Koboldt, Daniel C.; Fulton, Robert S.; Wilson, Richard K.; Blanton, Susan H.; Birch, David G.; Daiger, Stephen P.

2016-01-01

Purpose To identify the underlying cause of disease in a large family with North Carolina macular dystrophy (NCMD). Methods A large four-generation family (RFS355) with an autosomal dominant form of NCMD was ascertained. Family members underwent comprehensive visual function evaluations. Blood or saliva from six affected family members and three unaffected spouses was collected and DNA tested for linkage to the MCDR1 locus on chromosome 6q12. Three affected family members and two unaffected spouses underwent whole exome sequencing (WES) and subsequently, custom capture of the linkage region followed by next-generation sequencing (NGS). Standard PCR and dideoxy sequencing were used to further characterize the mutation. Results Of the 12 eyes examined in six affected individuals, all but two had Gass grade 3 macular degeneration features. Large central excavation of the retinal and choroid layers, referred to as a macular caldera, was seen in an age-independent manner in the grade 3 eyes. The calderas are unique to affected individuals with MCDR1. Genome-wide linkage mapping and haplotype analysis of markers from the chromosome 6q region were consistent with linkage to the MCDR1 locus. Whole exome sequencing and custom-capture NGS failed to reveal any rare coding variants segregating with the phenotype. Analysis of the custom-capture NGS sequencing data for copy number variants uncovered a tandem duplication of approximately 60 kb on chromosome 6q. This region contains two genes, CCNC and PRDM13. The duplication creates a partial copy of CCNC and a complete copy of PRDM13. The duplication was found in all affected members of the family and is not present in any unaffected members. The duplication was not seen in 200 ethnically matched normal chromosomes. 
Conclusions The cause of disease in the original family with MCDR1 and several others has been recently reported to be dysregulation of the PRDM13 gene, caused by either single base substitutions in a DNase 1 hypersensitive site upstream of the CCNC and PRDM13 genes or a tandem duplication of the PRDM13 gene. The duplication found in the RFS355 family is distinct from the previously reported duplication and provides additional support that dysregulation of PRDM13, not CCNC, is the cause of NCMD mapped to the MCDR1 locus. PMID:27777503
North Carolina macular dystrophy (MCDR1) caused by a novel tandem duplication of the PRDM13 gene.

PubMed

Bowne, Sara J; Sullivan, Lori S; Wheaton, Dianna K; Locke, Kirsten G; Jones, Kaylie D; Koboldt, Daniel C; Fulton, Robert S; Wilson, Richard K; Blanton, Susan H; Birch, David G; Daiger, Stephen P

2016-01-01

To identify the underlying cause of disease in a large family with North Carolina macular dystrophy (NCMD). A large four-generation family (RFS355) with an autosomal dominant form of NCMD was ascertained. Family members underwent comprehensive visual function evaluations. Blood or saliva from six affected family members and three unaffected spouses was collected and DNA tested for linkage to the MCDR1 locus on chromosome 6q12. Three affected family members and two unaffected spouses underwent whole exome sequencing (WES) and subsequently, custom capture of the linkage region followed by next-generation sequencing (NGS). Standard PCR and dideoxy sequencing were used to further characterize the mutation. Of the 12 eyes examined in six affected individuals, all but two had Gass grade 3 macular degeneration features. Large central excavation of the retinal and choroid layers, referred to as a macular caldera, was seen in an age-independent manner in the grade 3 eyes. The calderas are unique to affected individuals with MCDR1. Genome-wide linkage mapping and haplotype analysis of markers from the chromosome 6q region were consistent with linkage to the MCDR1 locus. Whole exome sequencing and custom-capture NGS failed to reveal any rare coding variants segregating with the phenotype. Analysis of the custom-capture NGS sequencing data for copy number variants uncovered a tandem duplication of approximately 60 kb on chromosome 6q. This region contains two genes, CCNC and PRDM13. The duplication creates a partial copy of CCNC and a complete copy of PRDM13. The duplication was found in all affected members of the family and is not present in any unaffected members. The duplication was not seen in 200 ethnically matched normal chromosomes.
The cause of disease in the original family with MCDR1 and several others has been recently reported to be dysregulation of the PRDM13 gene, caused by either single base substitutions in a DNase 1 hypersensitive site upstream of the CCNC and PRDM13 genes or a tandem duplication of the PRDM13 gene. The duplication found in the RFS355 family is distinct from the previously reported duplication and provides additional support that dysregulation of PRDM13, not CCNC, is the cause of NCMD mapped to the MCDR1 locus.
Structural analysis of two length variants of the rDNA intergenic spacer from Eruca sativa.

PubMed

Lakshmikumaran, M; Negi, M S

1994-03-01

Restriction enzyme analysis of the rRNA genes of Eruca sativa indicated the presence of many length variants within a single plant and also between different cultivars which is unusual for most crucifers studied so far. Two length variants of the rDNA intergenic spacer (IGS) from a single individual E. sativa (cv. Itsa) plant were cloned and characterized. The complete nucleotide sequences of both the variants (3 kb and 4 kb) were determined. The intergenic spacer contains three families of tandemly repeated DNA sequences denoted as A, B and C. However, the long (4 kb) variant shows the presence of an additional repeat, denoted as D, which is a duplication of a 224 bp sequence just upstream of the putative transcription initiation site. Repeat units belonging to the three different families (A, B and C) were in the size range of 22 to 30 bp. Such short repeat elements are present in the IGS of most of the crucifers analysed so far. Sequence analysis of the variants (3 kb and 4 kb) revealed that the length heterogeneity of the spacer is located at three different regions and is due to the varying copy numbers of repeat units belonging to families A and B. Length variation of the spacer is also due to the presence of a large duplication (D repeats) in the 4 kb variant which is absent in the 3 kb variant. The putative transcription initiation site was identified by comparisons with the rDNA sequences from other plant species.
Next-generation sequencing for genetic testing of familial colorectal cancer syndromes.

PubMed

Simbolo, Michele; Mafficini, Andrea; Agostini, Marco; Pedrazzani, Corrado; Bedin, Chiara; Urso, Emanuele D; Nitti, Donato; Turri, Giona; Scardoni, Maria; Fassan, Matteo; Scarpa, Aldo

2015-01-01

Genetic screening in families with high risk to develop colorectal cancer (CRC) prevents incurable disease and permits personalized therapeutic and follow-up strategies. The advancement of next-generation sequencing (NGS) technologies has revolutionized the throughput of DNA sequencing. A series of 16 probands for either familial adenomatous polyposis (FAP; 8 cases) or hereditary nonpolyposis colorectal cancer (HNPCC; 8 cases) were investigated for intragenic mutations in five CRC familial syndromes-associated genes (APC, MUTYH, MLH1, MSH2, MSH6) applying both a custom multigene Ion AmpliSeq NGS panel and conventional Sanger sequencing. Fourteen pathogenic variants were detected in 13/16 FAP/HNPCC probands (81.3 %); one FAP proband presented two co-existing pathogenic variants, one in APC and one in MUTYH. Thirteen of these 14 pathogenic variants were detected by both NGS and Sanger, while one MSH2 mutation (L280FfsX3) was identified only by Sanger sequencing. This is due to a limitation of the NGS approach in resolving sequences close to or within homopolymeric stretches of DNA. To evaluate the performance of our NGS custom panel we assessed its capability to resolve the DNA sequences corresponding to 2225 pathogenic variants reported in the COSMIC database for APC, MUTYH, MLH1, MSH2, MSH6. Our NGS custom panel resolves the sequences where 2108 (94.7 %) of these variants occur. The remaining 117 mutations reside inside or in close proximity to homopolymer stretches; of these 27 (1.2 %) are imprecisely identified by the software but can be resolved by visual inspection of the region, while the remaining 90 variants (4.0 %) are blind spots. In summary, our custom panel would miss 4 % (90/2225) of pathogenic variants that would need a small set of Sanger sequencing reactions to be solved.
The multiplex NGS approach has the advantage of analyzing multiple genes in multiple samples simultaneously, requiring only a reduced number of Sanger sequences to resolve homopolymeric DNA regions not adequately assessed by NGS. The implementation of NGS approaches in routine diagnostics of familial CRC is cost-effective and significantly reduces diagnostic turnaround times.
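The coverage figures quoted in this abstract can be re-derived from the counts it reports. The following is a minimal sketch of that arithmetic only; the variable and function names are illustrative and do not come from the study.

```python
# Counts as reported in the abstract above.
TOTAL = 2225      # COSMIC pathogenic variants considered
RESOLVED = 2108   # variants lying in sequence the panel resolves
VISUAL = 27       # imprecisely called, recoverable by visual inspection
BLIND = 90        # blind spots that would need Sanger sequencing


def pct(count, total=TOTAL):
    """Return count as a percentage of total, rounded to one decimal."""
    return round(100.0 * count / total, 1)


# The three categories should partition the full variant set.
assert RESOLVED + VISUAL + BLIND == TOTAL

shares = {
    "resolved": pct(RESOLVED),
    "visual_inspection": pct(VISUAL),
    "blind_spots": pct(BLIND),
}
print(shares)
```

The three categories sum to the 2225 variants assessed, and the 117 variants near homopolymer stretches are the 27 plus the 90.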
A splice variant in the ACSL5 gene relates migraine with fatty acid activation in mitochondria

PubMed Central

Matesanz, Fuencisla; Fedetz, María; Barrionuevo, Cristina; Karaky, Mohamad; Catalá-Rabasa, Antonio; Potenciano, Victor; Bello-Morales, Raquel; López-Guerrero, Jose-Antonio; Alcina, Antonio

2016-01-01

Genome-wide association studies (GWAS) in migraine are providing the molecular basis of this heterogeneous disease, but the understanding of its aetiology is still incomplete. Although some biomarkers have currently been accepted for migraine, many more studies are needed to identify new ones. The migraine-associated variant rs12355831:A>G (P = 2 × 10⁻⁶), described in a GWAS of the International Headache Genetic Consortium, is localized in a non-coding sequence with unknown function. We sought to identify the causal variant and the genetic mechanism involved in the migraine risk. To this end, we integrated data of RNA sequences from the Genetic European Variation in Health and Disease (GEUVADIS) and genotypes from 1000 GENOMES of 344 lymphoblastoid cell lines (LCLs), to determine the expression quantitative trait loci (eQTLs) in the region. We found that the migraine-associated variant belongs to a linkage disequilibrium block associated with the expression of an acyl-coenzyme A synthetase 5 (ACSL5) transcript lacking exon 20 (ACSL5-Δ20). We showed by exon-skipping assay a direct causality of rs2256368-G in the exon 20 skipping of approximately 20 to 40% of ACSL5 RNA molecules. In conclusion, we identified the functional variant (rs2256368:A>G) affecting ACSL5 exon 20 skipping, as a causal factor linked to the migraine-associated rs12355831:A>G, suggesting that the activation of long-chain fatty acids by the spliced ACSL5-Δ20 molecules, a mitochondrially located enzyme, is involved in migraine pathology. PMID:27189022
Whole genome comparison between table and wine grapes reveals a comprehensive catalog of structural variants

PubMed Central

2014-01-01

Background Grapevine (Vitis vinifera L.) is the most important Mediterranean fruit crop, used to produce both wine and spirits as well as table grapes and raisins. Wine and table grape cultivars represent two divergent germplasm pools with different origins and domestication history, as well as differential characteristics for berry size, cluster architecture and berry chemical profile, among others. ‘Sultanina’ plays a pivotal role in modern table grape breeding providing the main source of seedlessness. This cultivar is also one of the most planted for fresh consumption and raisin production. Given its importance, we sequenced it and implemented a novel strategy for the de novo assembly of its highly heterozygous genome. Results Our approach produced a draft genome of 466 Mb, recovering 82% of the genes present in the grapevine reference genome; in addition, we identified 240 novel genes. A large number of structural variants and SNPs were identified. Among them, 45 (21 SNPs and 24 INDELs) were experimentally confirmed in ‘Sultanina’ and six SNPs in 23 other table grape varieties. Transposable elements corresponded to ca. 80% of the repetitive sequences involved in structural variants and more than 2,000 genes were affected in their structure by these variants. Some of these genes are likely involved in embryo development, suggesting that they may contribute to seedlessness, a key trait for table grapes. Conclusions This work produced the first structural variants and SNPs catalog for grapevine, constituting a novel and very powerful tool for genomic studies in this key fruit crop, particularly useful to support marker assisted breeding in table grapes. PMID:24397443
VaDiR: an integrated approach to Variant Detection in RNA.

PubMed

Neums, Lisa; Suenaga, Seiji; Beyerlein, Peter; Anders, Sara; Koestler, Devin; Mariani, Andrea; Chien, Jeremy

2018-02-01

Advances in next-generation DNA sequencing technologies are now enabling detailed characterization of sequence variations in cancer genomes. With whole-genome sequencing, variations in coding and non-coding sequences can be discovered. But the cost associated with it is currently limiting its general use in research. Whole-exome sequencing is used to characterize sequence variations in coding regions, but the cost associated with capture reagents and biases in capture rate limit its full use in research. Additional limitations include uncertainty in assigning the functional significance of the mutations when these mutations are observed in the non-coding region or in genes that are not expressed in cancer tissue. We investigated the feasibility of uncovering mutations from expressed genes using RNA sequencing datasets with a method called Variant Detection in RNA (VaDiR) that integrates 3 variant callers, namely: SNPiR, RVBoost, and MuTect2. The combination of all 3 methods, which we called Tier 1 variants, produced the highest precision with true positive mutations from RNA-seq that could be validated at the DNA level. We also found that the integration of Tier 1 variants with those called by MuTect2 and SNPiR produced the highest recall with acceptable precision. Finally, we observed a higher rate of mutation discovery in genes that are expressed at higher levels. Our method, VaDiR, provides a possibility of uncovering mutations from RNA sequencing datasets that could be useful in further functional analysis. In addition, our approach allows orthogonal validation of DNA-based mutation discovery by providing complementary sequence variation analysis from paired RNA/DNA sequencing datasets.
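The tiering idea in this abstract (calls shared by all three callers versus a looser combination) can be illustrated with plain set operations. This is a minimal sketch, not the VaDiR implementation: the variant tuples and truth set are invented toy data, and reading "those called by MuTect2 and SNPiR" as the intersection of those two callers is an assumption.

```python
def tier1(snpir, rvboost, mutect2):
    """Variants reported by all three callers."""
    return set(snpir) & set(rvboost) & set(mutect2)


def tier1_plus_pair(snpir, rvboost, mutect2):
    """Tier 1 plus variants called by both MuTect2 and SNPiR (assumed reading)."""
    return tier1(snpir, rvboost, mutect2) | (set(mutect2) & set(snpir))


def precision_recall(called, truth):
    """Precision and recall of a call set against a validated truth set."""
    called, truth = set(called), set(truth)
    true_pos = len(called & truth)
    precision = true_pos / len(called) if called else 0.0
    recall = true_pos / len(truth) if truth else 0.0
    return precision, recall


# Toy variants as (chromosome, position, ref, alt); all values are made up.
A, B = ("chr1", 100, "A", "G"), ("chr1", 250, "C", "T")
C, D, E = ("chr2", 40, "G", "A"), ("chr2", 90, "T", "C"), ("chr3", 7, "A", "T")
snpir, rvboost, mutect2 = {A, B, C}, {A, D}, {A, B, E}
truth = {A, B}  # stands in for DNA-validated mutations

strict = tier1(snpir, rvboost, mutect2)
loose = tier1_plus_pair(snpir, rvboost, mutect2)
print(precision_recall(strict, truth), precision_recall(loose, truth))
```

In this toy data the strict set keeps precision high but misses one true variant, while the looser set recovers it, which mirrors the precision and recall trade-off the abstract describes.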
Rare variants and autoimmune disease.

PubMed

Massey, Jonathan; Eyre, Steve

2014-09-01

The study of rare variants in monogenic forms of autoimmune disease has offered insight into the aetiology of more complex pathologies. Research in complex autoimmune disease initially focused on sequencing candidate genes, with some early successes, notably in uncovering low-frequency variation associated with Type 1 diabetes mellitus. However, other early examples have proved difficult to replicate, and a recent study across six autoimmune diseases, re-sequencing 25 autoimmune disease-associated genes in large sample sizes, failed to find any associated rare variants. The study of rare and low-frequency variation in autoimmune diseases has been made accessible by the inclusion of such variants on custom genotyping arrays (e.g. Immunochip and Exome arrays). Whole-exome sequencing approaches are now also being utilised to uncover the contribution of rare coding variants to disease susceptibility, severity and treatment response. Other sequencing strategies are starting to uncover the role of regulatory rare variation.

MToolBox: a highly automated pipeline for heteroplasmy annotation and prioritization analysis of human mitochondrial variants in high-throughput sequencing

PubMed Central

Diroma, Maria Angela; Santorsola, Mariangela; Guttà, Cristiano; Gasparre, Giuseppe; Picardi, Ernesto; Pesole, Graziano; Attimonelli, Marcella

2014-01-01

Motivation: The increasing availability of mitochondria-targeted and off-target sequencing data in whole-exome and whole-genome sequencing studies (WXS and WGS) has raised the demand for effective pipelines to accurately measure heteroplasmy and to easily recognize the most functionally important mitochondrial variants among a huge number of candidates. To this end, we developed MToolBox, a highly automated pipeline to reconstruct and analyze human mitochondrial DNA from high-throughput sequencing data. Results: MToolBox implements an effective computational strategy for mitochondrial genome assembly and haplogroup assignment, also including a prioritization analysis of detected variants. MToolBox provides a Variant Call Format file featuring, for the first time, allele-specific heteroplasmy and annotation files with prioritized variants. MToolBox was tested on simulated samples and applied to 1000 Genomes WXS datasets. Availability and implementation: MToolBox package is available at https://sourceforge.net/projects/mtoolbox/. Contact: marcella.attimonelli@uniba.it Supplementary information: Supplementary data are available at Bioinformatics online. PMID:25028726
De novo assembly and next-generation sequencing to analyse full-length gene variants from codon-barcoded libraries.

PubMed

Cho, Namjin; Hwang, Byungjin; Yoon, Jung-ki; Park, Sangun; Lee, Joongoo; Seo, Han Na; Lee, Jeewon; Huh, Sunghoon; Chung, Jinsoo; Bang, Duhee

2015-09-21

Interpreting epistatic interactions is crucial for understanding evolutionary dynamics of complex genetic systems and unveiling structure and function of genetic pathways. Although high resolution mapping of en masse variant libraries enables molecular biologists to address genotype-phenotype relationships, long-read sequencing technology remains indispensable to assess functional relationships between mutations that lie far apart. Here, we introduce JigsawSeq for multiplexed sequence identification of pooled gene variant libraries by combining a codon-based molecular barcoding strategy and de novo assembly of short-read data. We first validate JigsawSeq on small sub-pools and observe high precision and recall under various experimental settings. With extensive simulations, we then apply JigsawSeq to large-scale gene variant libraries to show that our method can be reliably scaled using next-generation sequencing. JigsawSeq may serve as a rapid screening tool for functional genomics and offer the opportunity to explore evolutionary trajectories of protein variants.
A survey of single nucleotide polymorphisms identified from whole-genome sequencing and their functional effect in the porcine genome

USDA-ARS's Scientific Manuscript database

Genetic variants detected from sequence have been used to successfully identify causal variants and map complex traits in several organisms. High and moderate impact variants, those expected to alter or disrupt the protein coded by a gene and those that regulate protein production, likely have a mor...
Whole-exome sequencing supports genetic heterogeneity in childhood apraxia of speech.

PubMed

Worthey, Elizabeth A; Raca, Gordana; Laffin, Jennifer J; Wilk, Brandon M; Harris, Jeremy M; Jakielski, Kathy J; Dimmock, David P; Strand, Edythe A; Shriberg, Lawrence D

2013-10-02

Childhood apraxia of speech (CAS) is a rare, severe, persistent pediatric motor speech disorder with associated deficits in sensorimotor, cognitive, language, learning and affective processes. Among other neurogenetic origins, CAS is the disorder segregating with a mutation in FOXP2 in a widely studied, multigenerational London family. We report the first whole-exome sequencing (WES) findings from a cohort of 10 unrelated participants, ages 3 to 19 years, with well-characterized CAS. As part of a larger study of children and youth with motor speech sound disorders, 32 participants were classified as positive for CAS on the basis of a behavioral classification marker using auditory-perceptual and acoustic methods that quantify the competence, precision and stability of a speaker's speech, prosody and voice. WES of 10 randomly selected participants was completed using the Illumina Genome Analyzer IIx Sequencing System. Image analysis, base calling, demultiplexing, read mapping, and variant calling were performed using Illumina software. Software developed in-house was used for variant annotation, prioritization and interpretation to identify those variants likely to be deleterious to neurodevelopmental substrates of speech-language development. Among potentially deleterious variants, clinically reportable findings of interest occurred on a total of five chromosomes (Chr3, Chr6, Chr7, Chr9 and Chr17), which included six genes either strongly associated with CAS (FOXP1 and CNTNAP2) or associated with disorders with phenotypes overlapping CAS (ATP13A4, CNTNAP1, KIAA0319 and SETX). A total of 8 (80%) of the 10 participants had clinically reportable variants in one or two of the six genes, with variants in ATP13A4, KIAA0319 and CNTNAP2 being the most prevalent. 
Similar to the results reported in emerging WES studies of other complex neurodevelopmental disorders, our findings from this first WES study of CAS are interpreted as support for heterogeneous genetic origins of this pediatric motor speech disorder with multiple genes, pathways and complex interactions. We also submit that our findings illustrate the potential use of WES for both gene identification and case-by-case clinical diagnostics in pediatric motor speech disorders.
Characterization of alanine to valine sequence variants in the Fc region of nivolumab biosimilar produced in Chinese hamster ovary cells.

PubMed

Li, Yantao; Fu, Tuo; Liu, Tao; Guo, Huaizu; Guo, Qingcheng; Xu, Jin; Zhang, Dapeng; Qian, Weizhu; Dai, Jianxin; Li, Bohua; Guo, Yajun; Hou, Sheng; Wang, Hao

2016-07-01

Nivolumab is a therapeutic fully human IgG4 antibody to programmed death 1 (PD-1). In this study, a nivolumab biosimilar, which was produced in our laboratory, was analyzed and characterized. Sequence variants that contain undesired amino acid sequences may cause concern during biosimilar bioprocess development. We found that low levels of sequence variants were detected in the heavy chain of the nivolumab biosimilar by ultra performance liquid chromatography (UPLC) and tandem mass spectrometry. It was further identified with UPLC-MS/MS by IdeS or trypsin digestion. The sequence variant was confirmed through addition of synthetic mutant peptide. Subsequently, the mixing base signal of normal and mutant sequence was detected through DNA sequencing. The relative levels of mutant A424V in the Fc region of the heavy chain have been detected and demonstrated to be 12.25% and 13.54%, via base peak intensity (BPI) and UV chromatography of the tryptic peptide mapping, respectively. A424V variant was also quantified by real-time PCR (RT-PCR) at the DNA and RNA level, which was 19.2% and 16.8%, respectively. The relative content of the mutant was consistent at the DNA, RNA and protein level, indicating that the A424V mutation may have little influence at transcriptional or translational levels. These results demonstrate that orthogonal state-of-the-art techniques such as LC-UV-MS and RT-PCR should be implemented to characterize recombinant proteins and cell lines for development of biosimilars. Our study suggests that it is important to establish an integrated and effective analytical method to monitor and characterize sequence variants during antibody drug development, especially for antibody biosimilar products.
Polymorphisms and variants in the prion protein sequence of European moose (Alces alces), reindeer (Rangifer tarandus), roe deer (Capreolus capreolus) and fallow deer (Dama dama) in Scandinavia

PubMed Central

Wik, Lotta; Mikko, Sofia; Klingeborn, Mikael; Stéen, Margareta; Simonsson, Magnus; Linné, Tommy

2012-01-01

The prion protein (PrP) sequence of European moose, reindeer, roe deer and fallow deer in Scandinavia has high homology to the PrP sequence of North American cervids. Variants in the European moose PrP sequence were found at amino acid position 109 as K or Q. The 109Q variant is unique in the PrP sequence of vertebrates. During the 1980s a wasting syndrome in Swedish moose, Moose Wasting Syndrome (MWS), was described. SNP analysis demonstrated a difference in the observed genotype proportions of the heterozygous Q/K and homozygous Q/Q variants in the MWS animals compared with the healthy animals. In MWS moose the allele frequencies for 109K and 109Q were 0.73 and 0.27, respectively, and for healthy animals 0.69 and 0.31. Both alleles were seen as heterozygotes and homozygotes. In reindeer, PrP sequence variation was demonstrated at codon 176 as D or N and codon 225 as S or Y. The PrP sequences in roe deer and fallow deer were identical with published GenBank sequences. PMID:22441661
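To make the reported allele frequencies easier to relate to genotype proportions, the sketch below computes the proportions expected under Hardy-Weinberg equilibrium. This is an illustrative assumption only: the abstract does not state that Hardy-Weinberg expectations were used, and the function name is hypothetical.

```python
def hardy_weinberg(freq_k):
    """Expected genotype proportions at codon 109, assuming Hardy-Weinberg equilibrium."""
    freq_q = 1.0 - freq_k
    return {
        "K/K": freq_k ** 2,
        "K/Q": 2.0 * freq_k * freq_q,
        "Q/Q": freq_q ** 2,
    }


# Allele frequencies for 109K as reported in the abstract.
mws_expected = hardy_weinberg(0.73)      # Moose Wasting Syndrome animals
healthy_expected = hardy_weinberg(0.69)  # healthy animals

for label, expected in (("MWS", mws_expected), ("healthy", healthy_expected)):
    print(label, {genotype: round(p, 4) for genotype, p in expected.items()})
```

Observed genotype counts could then be compared against these expectations, for example with a chi-square test, to quantify the kind of difference in genotype proportions the abstract mentions.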
Severe myopia with unusual retinal anomalies and Dandy-Walker sequence in two sibs. A distinct new neuro-ocular disorder.

PubMed

de Crecchio, Giuseppe; Cennamo, Gilda; de Leeuw, Nicole; Ventruto, Maria Luisa; Lonardo, Maria Concetta; Friso, Patrizia; Ventruto, Valerio

2013-12-01

We have observed a male and a female, sibs of non-consanguineous parents, affected by severe myopia with characteristic retinal defects and Dandy-Walker variant. The peculiarity of the retinopathy consists of pathological myopia with anomalous vitreal fenestrated membranes in the retinal periphery. We suppose that these associations may configure a new genetic syndrome.
Identification of influenza A pandemic (H1N1) 2009 variants during the first 2009 influenza outbreak in Mexico City.

PubMed

Zepeda, Hector M; Perea-Araujo, Lizbeth; Zarate-Segura, Paola B; Vázquez-Pérez, Joel A; Miliar-García, Angel; Garibay-Orijel, Claudio; Domínguez-López, Aarón; Badillo-Corona, Jesús A; López-Orduña, Eduardo; García-González, Octavio P; Villaseñor-Ruíz, Ignacio; Ahued-Ortega, Armando; Aguilar-Faisal, Leopoldo; Bravo, Jorge; Lara-Padilla, Eleazar; García-Cavazos, Ricardo J

2010-05-01

In March 2009, public health surveillance detected increased numbers of influenza-like illness presenting to hospitals in Mexico City. The aetiological agent was subsequently determined to be a novel influenza A (H1N1) triple reassortant, which has spread worldwide. As a consequence the World Health Organisation has declared the first Influenza pandemic of the 21st century. To describe clinically and molecularly the first outbreak of influenza A pH1N1 (2009) during 1-5 May to establish a baseline of epidemiological data for pH1N1. Also, to monitor for the emergence of antiviral resistance, and mutations affecting virulence and transmissibility. Samples were collected from 751 patients with influenza-like symptoms throughout Mexico City and were tested for influenza A pH1N1 (2009) using real-time PCR. In the samples that were positive for influenza A pH1N1 (2009) fragments from the haemagglutinin (H1) and neuraminidase (N1) genes were sequenced. A total of 203/751 (27%) patients were positive for the pandemic H1N1 (2009) virus (53% male and 47% female). The 0-12-year-old group was the most affected, 85/203 (42%). Sequence analysis showed five new variants of the pandemic H1N1 (2009) virus for NA: G249E (GQ292900), M269I (GQ292892), Y274H (GQ292913), T332A (GQ292933), N344K (GQ292882), and four variants for HA: N461K (GQ293006), K505R (GQ292989), I435V (GQ292995), I527N (GQ292997). We have provided a baseline of epidemiological data from the first outbreak of influenza A pH1N1 (2009) during 1-5 May in Mexico City. The sequencing of partial fragments of the HA and NA genes did not show the presence of previously described mutations affecting known sites of antiviral resistance in seasonal influenza A such as the H275Y (oseltamivir resistance), R293 or N295 etc.
A power set-based statistical selection procedure to locate susceptible rare variants associated with complex traits with sequencing data.

PubMed

Sun, Hokeun; Wang, Shuang

2014-08-15

Existing association methods for rare variants from sequencing data have focused on aggregating variants in a gene or a genetic region because analysing individual rare variants is underpowered. However, these existing rare variant detection methods are not able to identify which rare variants in a gene or a genetic region of all variants are associated with the complex diseases or traits. Once phenotypic associations of a gene or a genetic region are identified, the natural next step in the association study with sequencing data is to locate the susceptible rare variants within the gene or the genetic region. In this article, we propose a power set-based statistical selection procedure that is able to identify the locations of the potentially susceptible rare variants within a disease-related gene or a genetic region. The selection performance of the proposed selection procedure was evaluated through simulation studies, where we demonstrated the feasibility and superior power over several comparable existing methods. In particular, the proposed method is able to handle the mixed effects when both risk and protective variants are present in a gene or a genetic region. The proposed selection procedure was also applied to the sequence data on the ANGPTL gene family from the Dallas Heart Study to identify potentially susceptible rare variants within the trait-related genes. An R package 'rvsel' can be downloaded from http://www.columbia.edu/~sw2206/ and http://statsun.pusan.ac.kr.
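The phrase "power set-based" refers to searching over subsets of the variants in a region. The sketch below shows only that enumeration step in Python; it is not the rvsel procedure. The variant names, the per-variant effects, and the toy score (absolute effect minus a fixed per-variant penalty) are all hypothetical stand-ins for the statistic defined in the paper.

```python
from itertools import chain, combinations


def nonempty_subsets(variants):
    """Yield every non-empty subset of the variants (2**m - 1 subsets for m variants)."""
    items = list(variants)
    return chain.from_iterable(
        combinations(items, size) for size in range(1, len(items) + 1)
    )


def best_subset(variants, score):
    """Return the subset with the highest score under a caller-supplied scoring function."""
    return max(nonempty_subsets(variants), key=score)


# Hypothetical per-variant effects: positive = risk, negative = protective.
effects = {"v1": 0.8, "v2": -0.6, "v3": 0.05}
PENALTY = 0.1  # hypothetical cost for each variant included


def toy_score(subset):
    """Toy score: reward large effects in either direction, penalise subset size."""
    return sum(abs(effects[v]) - PENALTY for v in subset)


subsets = list(nonempty_subsets(effects))
selected = best_subset(effects, toy_score)
print(len(subsets), selected)
```

Using absolute effects keeps risk and protective variants from cancelling, which is the mixed-effects situation the abstract highlights. Because the number of subsets doubles with each added variant, exhaustive enumeration of this kind is only practical for small regions.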
Functional analysis of a large set of BRCA2 exon 7 variants highlights the predictive value of hexamer scores in detecting alterations of exonic splicing regulatory elements.

PubMed

Di Giacomo, Daniela; Gaildrat, Pascaline; Abuli, Anna; Abdat, Julie; Frébourg, Thierry; Tosi, Mario; Martins, Alexandra

2013-11-01

Exonic variants can alter pre-mRNA splicing either by changing splice sites or by modifying splicing regulatory elements. Often these effects are difficult to predict and are only detected by performing RNA analyses. Here, we analyzed, in a minigene assay, 26 variants identified in exon 7 of BRCA2, a cancer predisposition gene. Our results revealed eight new exon skipping mutations in this exon: one directly altering the 5' splice site and seven affecting potential regulatory elements. This brings the number of splicing regulatory mutations detected in BRCA2 exon 7 to a total of 11, a remarkably high number considering the total number of variants reported in this exon (n = 36), all tested in our minigene assay. We then exploited this large set of splicing data to test the predictive value of splicing regulator hexamers' scores recently established by Ke et al. Comparisons of hexamer-based predictions with our experimental data revealed high sensitivity in detecting variants that increased exon skipping, an important feature for prescreening variants before RNA analysis. In conclusion, hexamer scores represent a promising tool for predicting the biological consequences of exonic variants and may have important applications for the interpretation of variants detected by high-throughput sequencing.
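One common way to apply hexamer scores to a single-nucleotide variant is to sum the scores of every hexamer overlapping the variant position in the wild-type and mutant sequences and take the difference. The sketch below illustrates that calculation under stated assumptions: the exact scoring scheme used in the study is not given in the abstract, and the sequence and score table here are invented for illustration.

```python
def hexamers_overlapping(seq, pos):
    """Return all 6-mers of seq that include the 0-based position pos."""
    first_start = max(0, pos - 5)
    last_start = min(pos, len(seq) - 6)
    return [seq[start:start + 6] for start in range(first_start, last_start + 1)]


def delta_hexamer_score(wild_type, pos, alt, scores):
    """Mutant minus wild-type sum of hexamer scores around a substitution.

    Hexamers absent from the score table are treated as neutral (score 0).
    """
    mutant = wild_type[:pos] + alt + wild_type[pos + 1:]
    wt_total = sum(scores.get(h, 0.0) for h in hexamers_overlapping(wild_type, pos))
    mut_total = sum(scores.get(h, 0.0) for h in hexamers_overlapping(mutant, pos))
    return mut_total - wt_total


# Invented example: one enhancer-like hexamer with a made-up positive score.
toy_scores = {"GAAGAA": 1.0}
exon_fragment = "TTGAAGAACTT"  # hypothetical sequence, not BRCA2 exon 7
window = hexamers_overlapping(exon_fragment, 5)
delta = delta_hexamer_score(exon_fragment, 5, "T", toy_scores)
print(window, delta)
```

A negative difference, as in this toy case where the substitution destroys the only scored hexamer, would be read as loss of enhancer-like signal and therefore a candidate for increased exon skipping.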
Group-based variant calling leveraging next-generation supercomputing for large-scale whole-genome sequencing studies.

PubMed

Standish, Kristopher A; Carland, Tristan M; Lockwood, Glenn K; Pfeiffer, Wayne; Tatineni, Mahidhar; Huang, C Chris; Lamberth, Sarah; Cherkas, Yauheniya; Brodmerkel, Carrie; Jaeger, Ed; Smith, Lance; Rajagopal, Gunaretnam; Curran, Mark E; Schork, Nicholas J

2015-09-22

Next-generation sequencing (NGS) technologies have become much more efficient, allowing whole human genomes to be sequenced faster and cheaper than ever before. However, processing the raw sequence reads associated with NGS technologies requires care and sophistication in order to draw compelling inferences about phenotypic consequences of variation in human genomes. It has been shown that different approaches to variant calling from NGS data can lead to different conclusions. Ensuring appropriate accuracy and quality in variant calling can come at a computational cost. We describe our experience implementing and evaluating a group-based approach to calling variants on large numbers of whole human genomes. We explore the influence of many factors that may impact the accuracy and efficiency of group-based variant calling, including group size, the biogeographical backgrounds of the individuals who have been sequenced, and the computing environment used. We make efficient use of the Gordon supercomputer cluster at the San Diego Supercomputer Center by incorporating job-packing and parallelization considerations into our workflow while calling variants on 437 whole human genomes generated as part of a large association study. We ultimately find that our workflow resulted in high-quality variant calls in a computationally efficient manner. We argue that studies like ours should motivate further investigations combining hardware-oriented advances in computing systems with algorithmic developments to tackle emerging 'big data' problems in biomedical research brought on by the expansion of NGS technologies.
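Group size is one of the factors the abstract names. The sketch below shows, under stated assumptions, how a cohort of 437 genomes could be split into fixed-size calling groups; the group size of 25, the sample identifiers, and the function name are hypothetical and are not taken from the study's workflow.

```python
def make_groups(sample_ids, group_size):
    """Split sample identifiers into consecutive groups of at most group_size."""
    if group_size < 1:
        raise ValueError("group_size must be at least 1")
    return [
        sample_ids[start:start + group_size]
        for start in range(0, len(sample_ids), group_size)
    ]


# 437 genomes as in the abstract; identifiers are placeholders.
samples = [f"genome_{index:03d}" for index in range(437)]
groups = make_groups(samples, 25)  # 25 is an arbitrary illustrative group size
print(len(groups), len(groups[-1]))
```

Each group would then be submitted as one joint-calling job, so the group size trades per-job memory and runtime against the number of jobs to pack onto the cluster.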
Whole-exome sequencing identifies common and rare variant metabolic QTLs in a Middle Eastern population.

PubMed

Yousri, Noha A; Fakhro, Khalid A; Robay, Amal; Rodriguez-Flores, Juan L; Mohney, Robert P; Zeriri, Hassina; Odeh, Tala; Kader, Sara Abdul; Aldous, Eman K; Thareja, Gaurav; Kumar, Manish; Al-Shakaki, Alya; Chidiac, Omar M; Mohamoud, Yasmin A; Mezey, Jason G; Malek, Joel A; Crystal, Ronald G; Suhre, Karsten

2018-01-23

Metabolomics-genome-wide association studies (mGWAS) have uncovered many metabolic quantitative trait loci (mQTLs) influencing human metabolic individuality, though predominantly in European cohorts. By combining whole-exome sequencing with a high-resolution metabolomics profiling for a highly consanguineous Middle Eastern population, we discover 21 common variant and 12 functional rare variant mQTLs, of which 45% are novel altogether. We fine-map 10 common variant mQTLs to new metabolite ratio associations, and 11 common variant mQTLs to putative protein-altering variants. This is the first work to report common and rare variant mQTLs linked to diseases and/or pharmacological targets in a consanguineous Arab cohort, with wide implications for precision medicine in the Middle East.
Transposon Variants and Their Effects on Gene Expression in Arabidopsis

PubMed Central

Wang, Xi; Weigel, Detlef; Smith, Lisa M.

2013-01-01

Transposable elements (TEs) make up the majority of many plant genomes. Their transcription and transposition is controlled through siRNAs and epigenetic marks including DNA methylation. To dissect the interplay of siRNA–mediated regulation and TE evolution, and to examine how TE differences affect nearby gene expression, we investigated genome-wide differences in TEs, siRNAs, and gene expression among three Arabidopsis thaliana accessions. Both TE sequence polymorphisms and presence of linked TEs are positively correlated with intraspecific variation in gene expression. The expression of genes within 2 kb of conserved TEs is more stable than that of genes next to variant TEs harboring sequence polymorphisms. Polymorphism levels of TEs and closely linked adjacent genes are positively correlated as well. We also investigated the distribution of 24-nt-long siRNAs, which mediate TE repression. TEs targeted by uniquely mapping siRNAs are on average farther from coding genes, apparently because they more strongly suppress expression of adjacent genes. Furthermore, siRNAs, and especially uniquely mapping siRNAs, are enriched in TE regions missing in other accessions. Thus, targeting by uniquely mapping siRNAs appears to promote sequence deletions in TEs. Overall, our work indicates that siRNA–targeting of TEs may influence removal of sequences from the genome and hence evolution of gene expression in plants. PMID:23408902
Whole exome sequencing identifies genetic variants in inherited thrombocytopenia with secondary qualitative function defects.

PubMed

Johnson, Ben; Lowe, Gillian C; Futterer, Jane; Lordkipanidzé, Marie; MacDonald, David; Simpson, Michael A; Sanchez-Guiú, Isabel; Drake, Sian; Bem, Danai; Leo, Vincenzo; Fletcher, Sarah J; Dawood, Ban; Rivera, José; Allsup, David; Biss, Tina; Bolton-Maggs, Paula Hb; Collins, Peter; Curry, Nicola; Grimley, Charlotte; James, Beki; Makris, Mike; Motwani, Jayashree; Pavord, Sue; Talks, Katherine; Thachil, Jecko; Wilde, Jonathan; Williams, Mike; Harrison, Paul; Gissen, Paul; Mundell, Stuart; Mumford, Andrew; Daly, Martina E; Watson, Steve P; Morgan, Neil V

2016-10-01

Inherited thrombocytopenias are a heterogeneous group of disorders characterized by abnormally low platelet counts which can be associated with abnormal bleeding. Next-generation sequencing has previously been employed in these disorders for the confirmation of suspected genetic abnormalities, and more recently in the discovery of novel disease-causing genes. However, its full potential has not yet been exploited. Over the past 6 years we have sequenced the exomes from 55 patients, including 37 index cases and 18 additional family members, all of whom were recruited to the UK Genotyping and Phenotyping of Platelets study. All patients had inherited or sustained thrombocytopenia of unknown etiology with platelet counts varying from 11×10⁹/L to 186×10⁹/L. Of the 51 patients phenotypically tested, 37 (73%) had an additional secondary qualitative platelet defect. Using whole exome sequencing analysis we have identified "pathogenic" or "likely pathogenic" variants in 46% (17/37) of our index patients with thrombocytopenia. In addition, we report variants of uncertain significance in 12 index cases, including novel candidate genetic variants in previously unreported genes in four index cases. These results demonstrate that whole exome sequencing is an efficient method for elucidating potential pathogenic genetic variants in inherited thrombocytopenia. Whole exome sequencing also has the added benefit of discovering potentially pathogenic genetic variants for further study in novel genes not previously implicated in inherited thrombocytopenia. Copyright © Ferrata Storti Foundation.
Biallelic Variants in UBA5 Link Dysfunctional UFM1 Ubiquitin-like Modifier Pathway to Severe Infantile-Onset Encephalopathy.

PubMed

Muona, Mikko; Ishimura, Ryosuke; Laari, Anni; Ichimura, Yoshinobu; Linnankivi, Tarja; Keski-Filppula, Riikka; Herva, Riitta; Rantala, Heikki; Paetau, Anders; Pöyhönen, Minna; Obata, Miki; Uemura, Takefumi; Karhu, Thomas; Bizen, Norihisa; Takebayashi, Hirohide; McKee, Shane; Parker, Michael J; Akawi, Nadia; McRae, Jeremy; Hurles, Matthew E; Kuismin, Outi; Kurki, Mitja I; Anttonen, Anna-Kaisa; Tanaka, Keiji; Palotie, Aarno; Waguri, Satoshi; Lehesjoki, Anna-Elina; Komatsu, Masaaki

2016-09-01

The ubiquitin fold modifier 1 (UFM1) cascade is a recently identified evolutionarily conserved ubiquitin-like modification system whose function and link to human disease have remained largely uncharacterized. By using exome sequencing in Finnish individuals with severe epileptic syndromes, we identified pathogenic compound heterozygous variants in UBA5, encoding an activating enzyme for UFM1, in two unrelated families. Two additional individuals with biallelic UBA5 variants were identified from the UK-based Deciphering Developmental Disorders study and one from the Northern Finland Intellectual Disability cohort. The affected individuals (n = 9) presented in early infancy with severe irritability, followed by dystonia and stagnation of development. Furthermore, the majority of individuals display postnatal microcephaly and epilepsy and develop spasticity. The affected individuals were compound heterozygous for a missense substitution, c.1111G>A (p.Ala371Thr; allele frequency of 0.28% in Europeans), and a nonsense variant or c.164G>A that encodes an amino acid substitution p.Arg55His, but also affects splicing by facilitating exon 2 skipping, thus also being in effect a loss-of-function allele. Using an in vitro thioester formation assay and cellular analyses, we show that the p.Ala371Thr variant is hypomorphic with attenuated ability to transfer the activated UFM1 to UFC1. Finally, we show that the CNS-specific knockout of Ufm1 in mice causes neonatal death accompanied by microcephaly and apoptosis in specific neurons, further suggesting that the UFM1 system is essential for CNS development and function. Taken together, our data imply that the combination of a hypomorphic p.Ala371Thr variant in trans with a loss-of-function allele in UBA5 underlies a severe infantile-onset encephalopathy. Copyright © 2016 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
ANLN truncation causes a familial fatal acute respiratory distress syndrome in Dalmatian dogs

PubMed Central

Syrjä, Pernilla; Arumilli, Meharji; Järvinen, Anna-Kaisa; Rajamäki, Minna

2017-01-01

Acute respiratory distress syndrome (ARDS) is the leading cause of death in critical care medicine. The syndrome is typified by an exaggerated inflammatory response within the lungs. ARDS has been reported in many species, including dogs. We have previously reported a fatal familial juvenile respiratory disease accompanied by occasional unilateral renal aplasia and hydrocephalus, in Dalmatian dogs. The condition with a suggested recessive mode of inheritance resembles acute exacerbation of usual interstitial pneumonia in man. We combined SNP-based homozygosity mapping of two ARDS-affected Dalmatian dogs and whole genome sequencing of one affected dog to identify a case-specific homozygous nonsense variant, c.31C>T; p.R11* in the ANLN gene. Subsequent analysis of the variant in a total cohort of 188 Dalmatians, including seven cases, indicated complete segregation of the variant with the disease and confirmed an autosomal recessive mode of inheritance. Low carrier frequency of 1.7% was observed in a population cohort. The early nonsense variant results in a nearly complete truncation of the ANLN protein and immunohistochemical analysis of the affected lung tissue demonstrated the lack of the membranous and cytoplasmic staining of ANLN protein in the metaplastic bronchial epithelium. The ANLN gene encodes an anillin actin binding protein with a suggested regulatory role in the integrity of intercellular junctions. Our study suggests that defective ANLN results in abnormal cellular organization of the bronchiolar epithelium, which in turn predisposes to acute respiratory distress. ANLN has been previously linked to a dominant focal segmental glomerulosclerosis in humans without pulmonary defects. However, the lack of similar renal manifestations in the affected Dalmatians suggests a novel ANLN-related pulmonary function and disease association. PMID:28222102
An Evaluation of Different Target Enrichment Methods in Pooled Sequencing Designs for Complex Disease Association Studies

PubMed Central

Day-Williams, Aaron G.; McLay, Kirsten; Drury, Eleanor; Edkins, Sarah; Coffey, Alison J.; Palotie, Aarno; Zeggini, Eleftheria

2011-01-01

Pooled sequencing can be a cost-effective approach to disease variant discovery, but its applicability in association studies remains unclear. We compare sequence enrichment methods coupled to next-generation sequencing in non-indexed pools of 1, 2, 10, 20 and 50 individuals and assess their ability to discover variants and to estimate their allele frequencies. We find that pooled resequencing is most usefully applied as a variant discovery tool due to limitations in estimating allele frequency with high enough accuracy for association studies, and that in-solution hybrid-capture performs best among the enrichment methods examined regardless of pool size. PMID:22069447
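The trade-off described in this abstract, between discovering a variant in a pool and estimating its frequency accurately, can be illustrated with a little arithmetic. The Python sketch below is not the authors' pipeline; it assumes diploid individuals, equal DNA contribution from each individual, and simple binomial sampling of reads, and all function names are hypothetical.

```python
import math


def min_allele_frequency(pool_size: int, ploidy: int = 2) -> float:
    """Expected frequency of a variant carried on exactly one chromosome in the pool."""
    if pool_size <= 0 or ploidy <= 0:
        raise ValueError("pool_size and ploidy must be positive")
    return 1.0 / (ploidy * pool_size)


def pooled_allele_frequency(alt_reads: int, total_reads: int) -> float:
    """Naive frequency estimate: fraction of reads supporting the variant allele."""
    if total_reads <= 0:
        raise ValueError("total_reads must be positive")
    return alt_reads / total_reads


def binomial_standard_error(freq: float, total_reads: int) -> float:
    """Standard error of the naive estimate under binomial read sampling only.

    Assumption: ignores sequencing error and unequal DNA contribution,
    both of which would make the real uncertainty larger.
    """
    if total_reads <= 0:
        raise ValueError("total_reads must be positive")
    return math.sqrt(freq * (1.0 - freq) / total_reads)


if __name__ == "__main__":
    # Pool sizes taken from the abstract; the read depth of 500 is hypothetical.
    for pool_size in (1, 2, 10, 20, 50):
        freq = min_allele_frequency(pool_size)
        se = binomial_standard_error(freq, total_reads=500)
        print(f"pool={pool_size:>2}  singleton freq={freq:.3f}  SE at 500 reads={se:.4f}")
```

Under these assumptions, a singleton in a pool of 50 individuals has an expected frequency of 1%, so at a hypothetical depth of 500 reads only about five reads carry it, and the sampling error alone (roughly 0.45 percentage points) is a large fraction of the estimate.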
Contribution of single amino acid and codon substitutions to the production and secretion of a lipase by Bacillus subtilis.

PubMed

Skoczinski, Pia; Volkenborn, Kristina; Fulton, Alexander; Bhadauriya, Anuseema; Nutschel, Christina; Gohlke, Holger; Knapp, Andreas; Jaeger, Karl-Erich

2017-09-25

Bacillus subtilis produces and secretes proteins in amounts of up to 20 g/l under optimal conditions. However, protein production can be challenging if transcription and cotranslational secretion are negatively affected, or the target protein is degraded by extracellular proteases. This study aims at elucidating the influence of a target protein on its own production by a systematic mutational analysis of the homologous B. subtilis model protein lipase A (LipA). We have covered the full natural diversity of single amino acid substitutions at 155 positions of LipA by site saturation mutagenesis excluding only highly conserved residues and qualitatively and quantitatively screened about 30,000 clones for extracellular LipA production. Identified variants with beneficial effects on production were sequenced and analyzed regarding B. subtilis growth behavior, extracellular lipase activity and amount as well as changes in lipase transcript levels. In total, 26 LipA variants were identified showing an up to twofold increase in either amount or activity of extracellular lipase. These variants harbor single amino acid or codon substitutions that did not substantially affect B. subtilis growth. Subsequent exemplary combination of beneficial single amino acid substitutions revealed an additive effect solely at the level of extracellular lipase amount; however, lipase amount and activity could not be increased simultaneously. Single amino acid and codon substitutions can affect LipA secretion and production by B. subtilis. Several codon-related effects were observed that either enhance lipA transcription or promote a more efficient folding of LipA. Single amino acid substitutions could improve LipA production by increasing its secretion or stability in the culture supernatant. Our findings indicate that optimization of the expression system is not sufficient for efficient protein production in B. subtilis. The sequence of the target protein should also be considered as an optimization target for successful protein production. Our results further suggest that variants with improved properties might be identified much faster and easier if mutagenesis is prioritized towards elements that contribute to enzymatic activity or structural integrity.
Chromosome specific repetitive DNA sequences

DOEpatents

Moyzis, Robert K.; Meyne, Julianne

1991-01-01

A method is provided for determining specific nucleotide sequences useful in forming a probe which can identify specific chromosomes, preferably through in situ hybridization within the cell itself. In one embodiment, chromosome preferential nucleotide sequences are first determined from a library of recombinant DNA clones having families of repetitive sequences. Library clones are identified with a low homology with a sequence of repetitive DNA families to which the first clones respectively belong and variant sequences are then identified by selecting clones having a pattern of hybridization with genomic DNA dissimilar to the hybridization pattern shown by the respective families. In another embodiment, variant sequences are selected from a sequence of a known repetitive DNA family. The selected variant sequence is classified as chromosome specific, chromosome preferential, or chromosome nonspecific. Sequences which are classified as chromosome preferential are further sequenced and regions are identified having a low homology with other regions of the chromosome preferential sequence or with known sequences of other family me This invention is the result of a contract with the Department of Energy (Contract No. W-7405-ENG-36).
Analysis of selected genes associated with cardiomyopathy by next-generation sequencing.

PubMed

Szabadosova, Viktoria; Boronova, Iveta; Ferenc, Peter; Tothova, Iveta; Bernasovska, Jarmila; Zigova, Michaela; Kmec, Jan; Bernasovsky, Ivan

2018-02-01

As the leading cause of congestive heart failure, cardiomyopathy represents a heterogeneous group of heart muscle disorders. Despite considerable progress being made in the genetic diagnosis of cardiomyopathy by detection of the mutations in the most prevalent cardiomyopathy genes, the cause remains unsolved in many patients. High-throughput mutation screening in the disease genes for cardiomyopathy is now possible using target enrichment followed by next-generation sequencing. The aim of the study was to analyze a panel of genes associated with dilated or hypertrophic cardiomyopathy based on previously published results in order to identify the subjects at risk. Next-generation sequencing on the Illumina HiSeq 2500 platform was used to detect sequence variants in 16 individuals diagnosed with dilated or hypertrophic cardiomyopathy. Detected variants were filtered and the functional impact of amino acid changes was predicted by computational programs. DNA samples of the 16 patients were analyzed by whole exome sequencing. We identified six nonsynonymous variants that were predicted to be pathogenic by all prediction software used: rs3744998 (EPG5), rs11551768 (MGME1), rs148374985 (MURC), rs78461695 (PLEC), rs17158558 (RET) and rs2295190 (SYNE1). Two of the analyzed sequence variants had minor allele frequency (MAF)<0.01: rs148374985 (MURC), rs34580776 (MYBPC3). Our data support the potential role of the detected variants in pathogenesis of dilated or hypertrophic cardiomyopathy; however, the possibility that these variants might not be true disease-causing variants but are susceptibility alleles that require additional mutations or injury to cause the clinical phenotype of disease must be considered. © 2017 Wiley Periodicals, Inc.

Variants of beta-glucosidase

DOEpatents

Fidantsef, Ana; Lamsa, Michael; Gorre-Clancy, Brian

2015-07-14

The present invention relates to variants of a parent beta-glucosidase, comprising a substitution at one or more positions corresponding to positions 142, 183, 266, and 703 of amino acids 1 to 842 of SEQ ID NO: 2 or corresponding to positions 142, 183, 266, and 705 of amino acids 1 to 844 of SEQ ID NO: 70, wherein the variant has beta-glucosidase activity. The present invention also relates to nucleotide sequences encoding the variant beta-glucosidases and to nucleic acid constructs, vectors, and host cells comprising the nucleotide sequences.
Variants of beta-glucosidases

DOEpatents

Fidantsef, Ana; Lamsa, Michael; Gorre-Clancy, Brian

2014-10-07

The present invention relates to variants of a parent beta-glucosidase, comprising a substitution at one or more positions corresponding to positions 142, 183, 266, and 703 of amino acids 1 to 842 of SEQ ID NO: 2 or corresponding to positions 142, 183, 266, and 705 of amino acids 1 to 844 of SEQ ID NO: 70, wherein the variant has beta-glucosidase activity. The present invention also relates to nucleotide sequences encoding the variant beta-glucosidases and to nucleic acid constructs, vectors, and host cells comprising the nucleotide sequences.
Variants of beta-glucosidase

DOEpatents

Fidantsef, Ana [Davis, CA]; Lamsa, Michael [Davis, CA]; Gorre-Clancy, Brian [Elk Grove, CA]

2009-12-29

The present invention relates to variants of a parent beta-glucosidase, comprising a substitution at one or more positions corresponding to positions 142, 183, 266, and 703 of amino acids 1 to 842 of SEQ ID NO: 2 or corresponding to positions 142, 183, 266, and 705 of amino acids 1 to 844 of SEQ ID NO: 70, wherein the variant has beta-glucosidase activity. The present invention also relates to nucleotide sequences encoding the variant beta-glucosidases and to nucleic acid constructs, vectors, and host cells comprising the nucleotide sequences.
De novo variants in EBF3 are associated with hypotonia, developmental delay, intellectual disability, and autism.

PubMed

Tanaka, Akemi J; Cho, Megan T; Willaert, Rebecca; Retterer, Kyle; Zarate, Yuri A; Bosanko, Katie; Stefans, Vikki; Oishi, Kimihiko; Williamson, Amy; Wilson, Golder N; Basinger, Alice; Barbaro-Dieber, Tina; Ortega, Lucia; Sorrentino, Susanna; Gabriel, Melissa K; Anderson, Ilse J; Sacoto, Maria J Guillen; Schnur, Rhonda E; Chung, Wendy K

2017-11-01

Using whole-exome sequencing, we identified seven unrelated individuals with global developmental delay, hypotonia, dysmorphic facial features, and an increased frequency of short stature, ataxia, and autism with de novo heterozygous frameshift, nonsense, splice, and missense variants in the Early B-cell Transcription Factor Family Member 3 (EBF3) gene. EBF3 is a member of the collier/olfactory-1/early B-cell factor (COE) family of proteins, which are required for central nervous system (CNS) development. COE proteins are highly evolutionarily conserved and regulate neuronal specification, migration, axon guidance, and dendritogenesis during development and are essential for maintaining neuronal identity in adult neurons. Haploinsufficiency of EBF3 may affect brain development and function, resulting in developmental delay, intellectual disability, and behavioral differences observed in individuals with a deleterious variant in EBF3. © 2017 Tanaka et al.; Published by Cold Spring Harbor Laboratory Press.
Common variants near CAV1 and CAV2 are associated with primary open-angle glaucoma

PubMed Central

Thorleifsson, Gudmar; Walters, G Bragi; Hewitt, Alex W; Masson, Gisli; Helgason, Agnar; DeWan, Andrew; Sigurdsson, Asgeir; Jonasdottir, Adalbjorg; Gudjonsson, Sigurjon A; Magnusson, Kristinn P; Stefansson, Hreinn; Lam, Dennis S C; Tam, Pancy O S; Gudmundsdottir, Gudrun J; Southgate, Laura; Burdon, Kathryn P; Gottfredsdottir, Maria Soffia; Aldred, Micheala A; Mitchell, Paul; St Clair, David; Collier, David A; Tang, Nelson; Sveinsson, Orn; Macgregor, Stuart; Martin, Nicholas G; Cree, Angela J; Gibson, Jane; MacLeod, Alex; Jacob, Aby; Ennis, Sarah; Young, Terri L; Chan, Juliana C N; Karwatowski, Wojciech S S; Hammond, Christopher J; Thordarson, Kristjan; Zhang, Mingzhi; Wadelius, Claes; Lotery, Andrew J; Trembath, Richard C; Pang, Chi Pui; Hoh, Josephine; Craig, Jamie E; Kong, Augustine; Mackey, David A; Jonasson, Fridbert; Thorsteinsdottir, Unnur; Stefansson, Kari

2011-01-01

We conducted a genome-wide association study for primary open-angle glaucoma (POAG) in 1,263 affected individuals (cases) and 34,877 controls from Iceland. We identified a common sequence variant at 7q31 (rs4236601[A], odds ratio (OR) = 1.36, P = 5.0 × 10⁻¹⁰). We then replicated the association in sample sets of 2,175 POAG cases and 2,064 controls from Sweden, the UK and Australia (combined OR = 1.18, P = 0.0015) and in 299 POAG cases and 580 unaffected controls from Hong Kong and Shantou, China (combined OR = 5.42, P = 0.0021). The risk variant identified here is located close to CAV1 and CAV2, both of which are expressed in the trabecular meshwork and retinal ganglion cells that are involved in the pathogenesis of POAG. PMID:20835238
Molecular Epidemiology of Mutations in Antimicrobial Resistance Loci of Pseudomonas aeruginosa Isolates from Airways of Cystic Fibrosis Patients.

PubMed

Greipel, Leonie; Fischer, Sebastian; Klockgether, Jens; Dorda, Marie; Mielke, Samira; Wiehlmann, Lutz; Cramer, Nina; Tümmler, Burkhard

2016-11-01

The chronic airway infections with Pseudomonas aeruginosa in people with cystic fibrosis (CF) are treated with aerosolized antibiotics, oral fluoroquinolones, and/or intravenous combination therapy with aminoglycosides and β-lactam antibiotics. An international strain collection of 361 P. aeruginosa isolates from 258 CF patients seen at 30 CF clinics was examined for mutations in 17 antimicrobial susceptibility and resistance loci that had been identified as hot spots of mutation by genome sequencing of serial isolates from a single CF clinic. Combinatorial amplicon sequencing of pooled PCR products identified 1,112 sequence variants that were not present in the genomes of representative strains of the 20 most common clones of the global P. aeruginosa population. A high frequency of singular coding variants was seen in spuE, mexA, gyrA, rpoB, fusA1, mexZ, mexY, oprD, ampD, parR, parS, and envZ (amgS), reflecting the pressure upon P. aeruginosa in lungs of CF patients to generate novel protein variants. The proportion of nonneutral amino acid exchanges was high. Of the 17 loci, mexA, mexZ, and pagL were most frequently affected by independent stop mutations. Private and de novo mutations seem to play a pivotal role in the response of P. aeruginosa populations to the antimicrobial load and the individual CF host. Copyright © 2016, American Society for Microbiology. All Rights Reserved.
Mutations in DSTYK and dominant urinary tract malformations.

PubMed

Sanna-Cherchi, Simone; Sampogna, Rosemary V; Papeta, Natalia; Burgess, Katelyn E; Nees, Shannon N; Perry, Brittany J; Choi, Murim; Bodria, Monica; Liu, Yan; Weng, Patricia L; Lozanovski, Vladimir J; Verbitsky, Miguel; Lugani, Francesca; Sterken, Roel; Paragas, Neal; Caridi, Gianluca; Carrea, Alba; Dagnino, Monica; Materna-Kiryluk, Anna; Santamaria, Giuseppe; Murtas, Corrado; Ristoska-Bojkovska, Nadica; Izzi, Claudia; Kacak, Nilgun; Bianco, Beatrice; Giberti, Stefania; Gigante, Maddalena; Piaggio, Giorgio; Gesualdo, Loreto; Vukic, Durdica Kosuljandic; Vukojevic, Katarina; Saraga-Babic, Mirna; Saraga, Marijan; Gucev, Zoran; Allegri, Landino; Latos-Bielenska, Anna; Casu, Domenica; State, Matthew; Scolari, Francesco; Ravazzolo, Roberto; Kiryluk, Krzysztof; Al-Awqati, Qais; D'Agati, Vivette D; Drummond, Iain A; Tasic, Velibor; Lifton, Richard P; Ghiggeri, Gian Marco; Gharavi, Ali G

2013-08-15

Congenital abnormalities of the kidney and the urinary tract are the most common cause of pediatric kidney failure. These disorders are highly heterogeneous, and the etiologic factors are poorly understood. We performed genomewide linkage analysis and whole-exome sequencing in a family with an autosomal dominant form of congenital abnormalities of the kidney or urinary tract (seven affected family members). We also performed a sequence analysis in 311 unrelated patients, as well as histologic and functional studies. Linkage analysis identified five regions of the genome that were shared among all affected family members. Exome sequencing identified a single, rare, deleterious variant within these linkage intervals, a heterozygous splice-site mutation in the dual serine-threonine and tyrosine protein kinase gene (DSTYK). This variant, which resulted in aberrant splicing of messenger RNA, was present in all affected family members. Additional, independent DSTYK mutations, including nonsense and splice-site mutations, were detected in 7 of 311 unrelated patients. DSTYK is highly expressed in the maturing epithelia of all major organs, localizing to cell membranes. Knockdown in zebrafish resulted in developmental defects in multiple organs, which suggested loss of fibroblast growth factor (FGF) signaling. Consistent with this finding is the observation that DSTYK colocalizes with FGF receptors in the ureteric bud and metanephric mesenchyme. DSTYK knockdown in human embryonic kidney cells inhibited FGF-stimulated phosphorylation of extracellular-signal-regulated kinase (ERK), the principal signal downstream of receptor tyrosine kinases. We detected independent DSTYK mutations in 2.3% of patients with congenital abnormalities of the kidney or urinary tract, a finding that suggests that DSTYK is a major determinant of human urinary tract development, downstream of FGF signaling. (Funded by the National Institutes of Health and others.).
Mutations in DSTYK and Dominant Urinary Tract Malformations

PubMed Central

Sanna-Cherchi, Simone; Nees, Shannon N.; Perry, Brittany J.; Choi, Murim; Bodria, Monica; Liu, Yan; Weng, Patricia L.; Lozanovski, Vladimir J.; Verbitsky, Miguel; Lugani, Francesca; Sterken, Roel; Paragas, Neal; Caridi, Gianluca; Carrea, Alba; Dagnino, Monica; Materna-Kiryluk, Anna; Santamaria, Giuseppe; Murtas, Corrado; Ristoska-Bojkovska, Nadica; Izzi, Claudia; Kacak, Nilgun; Bianco, Beatrice; Giberti, Stefania; Gigante, Maddalena; Piaggio, Giorgio; Gesualdo, Loreto; Vukic, Durdica Kosuljandic; Vukojevic, Katarina; Saraga-Babic, Mirna; Saraga, Marijan; Gucev, Zoran; Allegri, Landino; Latos-Bielenska, Anna; Casu, Domenica; State, Matthew; Scolari, Francesco; Ravazzolo, Roberto; Kiryluk, Krzysztof; Al-Awqati, Qais; D'Agati, Vivette D.; Drummond, Iain A.; Tasic, Velibor; Lifton, Richard P.; Ghiggeri, Gian Marco; Gharavi, Ali G.

2013-01-01

BACKGROUND Congenital abnormalities of the kidney and the urinary tract are the most common cause of pediatric kidney failure. These disorders are highly heterogeneous, and the etiologic factors are poorly understood. METHODS We performed genomewide linkage analysis and whole-exome sequencing in a family with an autosomal dominant form of congenital abnormalities of the kidney or urinary tract (seven affected family members). We also performed a sequence analysis in 311 unrelated patients, as well as histologic and functional studies. RESULTS Linkage analysis identified five regions of the genome that were shared among all affected family members. Exome sequencing identified a single, rare, deleterious variant within these linkage intervals, a heterozygous splice-site mutation in the dual serine–threonine and tyrosine protein kinase gene (DSTYK). This variant, which resulted in aberrant splicing of messenger RNA, was present in all affected family members. Additional, independent DSTYK mutations, including nonsense and splice-site mutations, were detected in 7 of 311 unrelated patients. DSTYK is highly expressed in the maturing epithelia of all major organs, localizing to cell membranes. Knockdown in zebrafish resulted in developmental defects in multiple organs, which suggested loss of fibroblast growth factor (FGF) signaling. Consistent with this finding is the observation that DSTYK colocalizes with FGF receptors in the ureteric bud and metanephric mesenchyme. DSTYK knockdown in human embryonic kidney cells inhibited FGF-stimulated phosphorylation of extracellular-signal-regulated kinase (ERK), the principal signal downstream of receptor tyrosine kinases. CONCLUSIONS We detected independent DSTYK mutations in 2.3% of patients with congenital abnormalities of the kidney or urinary tract, a finding that suggests that DSTYK is a major determinant of human urinary tract development, downstream of FGF signaling. (Funded by the National Institutes of Health and others.) PMID:23862974
Dominant KCNA2 mutation causes episodic ataxia and pharmacoresponsive epilepsy.

PubMed

Corbett, Mark A; Bellows, Susannah T; Li, Melody; Carroll, Renée; Micallef, Silvana; Carvill, Gemma L; Myers, Candace T; Howell, Katherine B; Maljevic, Snezana; Lerche, Holger; Gazina, Elena V; Mefford, Heather C; Bahlo, Melanie; Berkovic, Samuel F; Petrou, Steven; Scheffer, Ingrid E; Gecz, Jozef

2016-11-08

To identify the genetic basis of a family segregating episodic ataxia, infantile seizures, and heterogeneous epilepsies and to study the phenotypic spectrum of KCNA2 mutations. A family with 7 affected individuals over 3 generations underwent detailed phenotyping. Whole genome sequencing was performed on a mildly affected grandmother and her grandson with epileptic encephalopathy (EE). Segregating variants were filtered and prioritized based on functional annotations. The effects of the mutation on channel function were analyzed in vitro by voltage clamp assay and in silico by molecular modeling. KCNA2 was sequenced in 35 probands with heterogeneous phenotypes. The 7 family members had episodic ataxia (5), self-limited infantile seizures (5), evolving to genetic generalized epilepsy (4), focal seizures (2), and EE (1). They had a segregating novel mutation in the shaker type voltage-gated potassium channel KCNA2 (CCDS_827.1: c.765_773del; p.255_257del). A rare missense SCN2A (rs200884216) variant was also found in 2 affected siblings and their unaffected mother. The p.255_257del mutation caused dominant negative loss of channel function. Molecular modeling predicted repositioning of critical arginine residues in the voltage-sensing domain. KCNA2 sequencing revealed 1 de novo mutation (CCDS_827.1: c.890G>A; p.Arg297Gln) in a girl with EE, ataxia, and tremor. A KCNA2 mutation caused dominantly inherited episodic ataxia, mild infantile-onset seizures, and later generalized and focal epilepsies in the setting of normal intellect. This observation expands the KCNA2 phenotypic spectrum from EE often associated with chronic ataxia, reflecting the marked variation in severity observed in many ion channel disorders. © 2016 American Academy of Neurology.
Evaluation of the X-Linked High-Grade Myopia Locus (MYP1) with Cone Dysfunction and Color Vision Deficiencies

PubMed Central

Metlapally, Ravikanth; Michaelides, Michel; Bulusu, Anuradha; Li, Yi-Ju; Schwartz, Marianne; Rosenberg, Thomas; Hunt, David M.; Moore, Anthony T.; Züchner, Stephan; Rickman, Catherine Bowes; Young, Terri L.

2014-01-01

Purpose X-linked high myopia with mild cone dysfunction and color vision defects has been mapped to chromosome Xq28 (MYP1 locus). CXorf2/TEX28 is a nested, intercalated gene within the red-green opsin cone pigment gene tandem array on Xq28. The authors investigated whether TEX28 gene alterations were associated with the Xq28-linked myopia phenotype. Genomic DNA from five pedigrees (with high myopia and either protanopia or deuteranopia) that mapped to Xq28 were screened for TEX28 copy number variations (CNVs) and sequence variants. Methods To examine for CNVs, ultra-high resolution array-comparative genomic hybridization (array-CGH) assays were performed comparing the subject genomic DNA with control samples (two pairs from two pedigrees). Opsin or TEX28 gene-targeted quantitative real-time gene expression assays (comparative CT method) were performed to validate the array-CGH findings. All exons of TEX28, including intron/exon boundaries, were amplified and sequenced using standard techniques. Results Array-CGH findings revealed predicted duplications in affected patient samples. Although only three copies of TEX28 were previously reported within the opsin array, quantitative real-time analysis of the TEX28 targeted assay of affected male or carrier female individuals in these pedigrees revealed either fewer (one) or more (four or five) copies than did related and control unaffected individuals. Sequence analysis of TEX28 did not reveal any variants associated with the disease status. Conclusions CNVs have been proposed to play a role in disease inheritance and susceptibility as they affect gene dosage. TEX28 gene CNVs appear to be associated with the MYP1 X-linked myopia phenotypes. PMID:19098318
Whole-exome sequencing reveals genetic variants associated with chronic kidney disease characterized by tubulointerstitial damages in North Central Region, Sri Lanka.

PubMed

Nanayakkara, Shanika; Senevirathna, S T M L D; Parahitiyawa, Nipuna B; Abeysekera, Tilak; Chandrajith, Rohana; Ratnatunga, Neelakanthi; Hitomi, Toshiaki; Kobayashi, Hatasu; Harada, Kouji H; Koizumi, Akio

2015-09-01

The familial clustering observed in chronic kidney disease of uncertain etiology (CKDu) characterized by tubulointerstitial damage in the North Central Region of Sri Lanka strongly suggests the involvement of genetic factors in its pathogenesis. The objective of the present study is to use whole-exome sequencing to identify the genetic variants associated with CKDu. Whole-exome sequencing of eight CKDu cases and eight controls was performed, followed by direct sequencing of candidate loci in 301 CKDu cases and 276 controls. The association study revealed rs34970857 (c.658G>A/p.V220M), located in the KCNA10 gene encoding a voltage-gated K channel, as the most promising SNP, with the highest odds ratio of 1.74. Four rare variants were identified in LAMB2, the gene encoding laminin beta2, which is known to cause congenital nephrotic syndrome when mutated. Three of the four variants in LAMB2 were novel variants found exclusively in cases. These genetic investigations provide strong evidence of genetic susceptibility to CKDu and suggest that several rare variants associated with CKDu may be present in this population.
HGVS Recommendations for the Description of Sequence Variants: 2016 Update.

PubMed

den Dunnen, Johan T; Dalgleish, Raymond; Maglott, Donna R; Hart, Reece K; Greenblatt, Marc S; McGowan-Jordan, Jean; Roux, Anne-Francoise; Smith, Timothy; Antonarakis, Stylianos E; Taschner, Peter E M

2016-06-01

The consistent and unambiguous description of sequence variants is essential to report and exchange information on the analysis of a genome. In particular, DNA diagnostics critically depends on accurate and standardized description and sharing of the variants detected. The sequence variant nomenclature system proposed in 2000 by the Human Genome Variation Society has been widely adopted and has developed into an internationally accepted standard. The recommendations are currently commissioned through a Sequence Variant Description Working Group (SVD-WG) operating under the auspices of three international organizations: the Human Genome Variation Society (HGVS), the Human Variome Project (HVP), and the Human Genome Organization (HUGO). Requests for modifications and extensions go through the SVD-WG following a standard procedure including a community consultation step. Version numbers are assigned to the nomenclature system to allow users to specify the version used in their variant descriptions. Here, we present the current recommendations, HGVS version 15.11, and briefly summarize the changes that were made since the 2000 publication. Most focus has been on removing inconsistencies and tightening definitions allowing automatic data processing. An extensive version of the recommendations is available online, at http://www.HGVS.org/varnomen. © 2016 WILEY PERIODICALS, INC.
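The point about tightening definitions to allow automatic data processing can be illustrated with a small sketch. It assumes only the simplest case, a coding-DNA substitution written in the style `c.658G>A`; the helper name `parse_substitution` and the pattern are hypothetical illustrations, not part of the HGVS recommendations or of any existing library, and every other variant type is deliberately left unhandled.

```python
import re

# Hypothetical pattern: matches only simple coding-DNA substitutions
# such as "c.658G>A" (position, reference base, alternate base).
SUBSTITUTION = re.compile(r"^c\.(\d+)([ACGT])>([ACGT])$")


def parse_substitution(description):
    """Return position/ref/alt for a simple substitution, or None otherwise."""
    # Tolerate stray spaces around ">" as sometimes seen in extracted text.
    match = SUBSTITUTION.match(description.replace(" ", ""))
    if match is None:
        return None
    position, ref, alt = match.groups()
    return {"position": int(position), "ref": ref, "alt": alt}


print(parse_substitution("c.658G>A"))
# prints {'position': 658, 'ref': 'G', 'alt': 'A'}
```

A description that does not fit this narrow pattern returns `None` rather than a guess, which is the behavior one would usually want from a strict parser.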
Exome Sequencing Identifies Potential Risk Variants for Mendelian Disorders at High Prevalence in Qatar

PubMed Central

Rodriguez-Flores, Juan L.; Fakhro, Khalid; Hackett, Neil R.; Salit, Jacqueline; Fuller, Jennifer; Agosto-Perez, Francisco; Gharbiah, Maey; Malek, Joel A.; Zirie, Mahmoud; Jayyousi, Amin; Badii, Ramin; Al-Marri, Ajayeb Al-Nabet; Chouchane, Lotfi; Stadler, Dora J.; Hunter-Zinck, Haley; Mezey, Jason G.; Crystal, Ronald G.

2013-01-01

Exome sequencing of families of related individuals has been highly successful in identifying genetic polymorphisms responsible for Mendelian disorders. Here, we demonstrate the value of the reverse approach, where we use exome sequencing of a sample of unrelated individuals to analyze allele frequencies of known causal mutations for Mendelian diseases. We sequenced the exomes of 100 individuals representing the three major genetic subgroups of the Qatari population (Q1 Bedouin, Q2 Persian-South Asian, Q3 African) and identified 37 variants in 33 genes with effects on 36 clinically significant Mendelian diseases. These include variants not present in 1000 Genomes and variants at high frequency when compared to 1000 Genomes populations. Several of these Mendelian variants were only segregating in one Qatari subpopulation, where the observed subpopulation specificity trends were confirmed in an independent population of 386 Qataris. Pre-marital genetic screening in Qatar tests for only 4 out of the 37, such that this study provides a set of Mendelian disease variants with potential impact on the epidemiological profile of the population that could be incorporated into the testing program if further experimental and clinical characterization confirms high penetrance. PMID:24123366
Analysis of intra-host genetic diversity of Prunus necrotic ringspot virus (PNRSV) using amplicon next generation sequencing

PubMed Central

Constable, Fiona E.; Nancarrow, Narelle; Plummer, Kim M.; Rodoni, Brendan

2017-01-01

PCR amplicon next generation sequencing (NGS) analysis offers a broadly applicable and targeted approach to detect populations of both high- and low-frequency virus variants in one or more plant samples. In this study, amplicon NGS was used to explore the diversity of the tripartite genome virus, Prunus necrotic ringspot virus (PNRSV) from 53 PNRSV-infected trees using amplicons from conserved gene regions of each of PNRSV RNA1, RNA2 and RNA3. Sequencing of the amplicons from 53 PNRSV-infected trees revealed differing levels of polymorphism across the three different components of the PNRSV genome with a total number of 5040, 2083 and 5486 sequence variants observed for RNA1, RNA2 and RNA3 respectively. The RNA2 had the lowest diversity of sequences compared to RNA1 and RNA3, reflecting the lack of flexibility tolerated by the replicase gene that is encoded by this RNA component. Distinct PNRSV phylo-groups, consisting of closely related clusters of sequence variants, were observed in each of PNRSV RNA1, RNA2 and RNA3. Most plant samples had a single phylo-group for each RNA component. Haplotype network analysis showed that smaller clusters of PNRSV sequence variants were genetically connected to the largest sequence variant cluster within a phylo-group of each RNA component. Some plant samples had sequence variants occurring in multiple PNRSV phylo-groups in at least one of each RNA and these phylo-groups formed distinct clades that represent PNRSV genetic strains. Variants within the same phylo-group of each Prunus plant sample had ≥97% similarity and phylo-groups within a Prunus plant sample and between samples had ≤97% similarity. Based on the analysis of diversity, a definition of a PNRSV genetic strain was proposed. The proposed definition was applied to determine the number of PNRSV genetic strains in each of the plant samples and the complexity in defining genetic strains in multipartite genome viruses was explored. PMID:28632759
A Multiple-Sequence Variant of the Multiple-Baseline Design: A Strategy for Analysis of Sequence Effects and Treatment Comparison.

ERIC Educational Resources Information Center

Noell, George H.; Gresham, Frank M.

2001-01-01

Describes design logic and potential uses of a variant of the multiple-baseline design. The multiple-baseline multiple-sequence (MBL-MS) consists of multiple-baseline designs that are interlaced with one another and include all possible sequences of treatments. The MBL-MS design appears to be primarily useful for comparison of treatments taking…
Molecular Cytogenetics Guides Massively Parallel Sequencing of a Radiation-Induced Chromosome Translocation in Human Cells.

PubMed

Cornforth, Michael N; Anur, Pavana; Wang, Nicholas; Robinson, Erin; Ray, F Andrew; Bedford, Joel S; Loucas, Bradford D; Williams, Eli S; Peto, Myron; Spellman, Paul; Kollipara, Rahul; Kittler, Ralf; Gray, Joe W; Bailey, Susan M

2018-05-11

Chromosome rearrangements are large-scale structural variants that are recognized drivers of oncogenic events in cancers of all types. Cytogenetics allows for their rapid, genome-wide detection, but does not provide gene-level resolution. Massively parallel sequencing (MPS) promises DNA sequence-level characterization of the specific breakpoints involved, but is strongly influenced by bioinformatics filters that affect detection efficiency. We sought to characterize the breakpoint junctions of chromosomal translocations and inversions in the clonal derivatives of human cells exposed to ionizing radiation. Here, we describe the first successful use of DNA paired-end analysis to locate and sequence across the breakpoint junctions of a radiation-induced reciprocal translocation. The analyses employed, with varying degrees of success, several well-known bioinformatics algorithms, a task made difficult by the involvement of repetitive DNA sequences. As for underlying mechanisms, the results of Sanger sequencing suggested that the translocation in question was likely formed via microhomology-mediated non-homologous end joining (mmNHEJ). To our knowledge, this represents the first use of MPS to characterize the breakpoint junctions of a radiation-induced chromosomal translocation in human cells. Curiously, these same approaches were unsuccessful when applied to the analysis of inversions previously identified by directional genomic hybridization (dGH). We conclude that molecular cytogenetics continues to provide critical guidance for structural variant discovery, validation and in "tuning" analysis filters to enable robust breakpoint identification at the base pair level.
Genomic variation in macrophage-cultured European porcine reproductive and respiratory syndrome virus Olot/91 revealed using ultra-deep next generation sequencing.

PubMed

Lu, Zen H; Brown, Alexander; Wilson, Alison D; Calvert, Jay G; Balasch, Monica; Fuentes-Utrilla, Pablo; Loecherbach, Julia; Turner, Frances; Talbot, Richard; Archibald, Alan L; Ait-Ali, Tahar

2014-03-04

Porcine Reproductive and Respiratory Syndrome (PRRS) is a disease of major economic impact worldwide. The etiologic agent of this disease is the PRRS virus (PRRSV). Increasing evidence suggests that microevolution within a coexisting quasispecies population can give rise to high sequence heterogeneity in PRRSV. We developed a pipeline based on the ultra-deep next generation sequencing approach to first construct the complete genome of a European PRRSV, strain Olot/91, cultured on macrophages and then capture the rare variants representative of the mixed quasispecies population. Olot/91 differs from the reference Lelystad strain by about 5% and a total of 88 variants, with frequencies as low as 1%, were detected in the mixed population. These variants included 16 non-synonymous variants concentrated in the genes encoding structural and nonstructural proteins, including Glycoprotein 2a and 5. Using an ultra-deep sequencing methodology, the complete genome of Olot/91 was constructed without any prior knowledge of the sequence. Rare variants that constitute minor fractions of the heterogeneous PRRSV population could successfully be detected to allow further exploration of microevolutionary events.
A rare variant of the mtDNA HVS1 sequence in the hairs of Napoléon's family.

PubMed

Lucotte, Gérard

2010-10-04

This paper describes the finding of a rare variant in the sequence of the hypervariable segment (HVS1) of mitochondrial DNA (mtDNA) extracted from two preserved hairs, authenticated as belonging to the French Emperor Napoléon I (Napoléon Bonaparte). This rare variant is a mutation that changes the base C to T at position 16,184 (16184C→T), and it constitutes the only mutation found in this HVS1 sequence. This mutation is rare, because it was not found in a reference database (P < 0.05). In a personal database (M. Pala) comprising 37,000 different sequences, the 16184C→T mutation was found in only three samples; thus in this database the mutation frequency was about 0.00008 (3/37,000, or roughly 0.008%). This mutation 16184C→T was also the only variant found subsequently in the HVS1 sequences of mtDNAs extracted from Napoléon's mother (Letizia) and from his youngest sister (Caroline), confirming that this mutation is maternally inherited. This 16184C→T variant could be used for genetic verification to authenticate any doubtful material and determine whether it should indeed be attributed to Napoléon.
A rare variant of the mtDNA HVS1 sequence in the hairs of Napoléon's family

PubMed Central

2010-01-01

This paper describes the finding of a rare variant in the sequence of the hypervariable segment (HVS1) of mitochondrial DNA (mtDNA) extracted from two preserved hairs, authenticated as belonging to the French Emperor Napoléon I (Napoléon Bonaparte). This rare variant is a mutation that changes the base C to T at position 16,184 (16184C→T), and it constitutes the only mutation found in this HVS1 sequence. This mutation is rare, because it was not found in a reference database (P < 0.05). In a personal database (M. Pala) comprising 37,000 different sequences, the 16184C→T mutation was found in only three samples; thus in this database the mutation frequency was about 0.00008 (3/37,000, or roughly 0.008%). This mutation 16184C→T was also the only variant found subsequently in the HVS1 sequences of mtDNAs extracted from Napoléon's mother (Letizia) and from his youngest sister (Caroline), confirming that this mutation is maternally inherited. This 16184C→T variant could be used for genetic verification to authenticate any doubtful material and determine whether it should indeed be attributed to Napoléon. PMID:21092341
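The frequency in the record above follows directly from the two counts it reports: three occurrences among 37,000 sequences. A minimal sketch of that arithmetic, with a hypothetical helper name `mutation_frequency` introduced only for illustration, makes the distinction between a proportion and a percentage explicit.

```python
def mutation_frequency(occurrences, database_size):
    """Return the observed frequency as a proportion (not a percentage)."""
    return occurrences / database_size


# Counts taken from the abstract: 3 samples out of 37,000 sequences.
frequency = mutation_frequency(3, 37_000)

print(f"proportion: {frequency:.5f}")        # prints proportion: 0.00008
print(f"percent:    {frequency * 100:.4f}%")  # prints percent:    0.0081%
```

Expressed as a proportion the value is about 0.00008; expressed as a percentage it is about 0.008%.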
Whole genome sequencing and imputation in isolated populations identify genetic associations with medically-relevant complex traits

PubMed Central

Southam, Lorraine; Gilly, Arthur; Süveges, Dániel; Farmaki, Aliki-Eleni; Schwartzentruber, Jeremy; Tachmazidou, Ioanna; Matchan, Angela; Rayner, Nigel W.; Tsafantakis, Emmanouil; Karaleftheri, Maria; Xue, Yali; Dedoussis, George; Zeggini, Eleftheria

2017-01-01

Next-generation association studies can be empowered by sequence-based imputation and by studying founder populations. Here we report ∼9.5 million variants from whole-genome sequencing (WGS) of a Cretan-isolated population, and show enrichment of rare and low-frequency variants with predicted functional consequences. We use a WGS-based imputation approach utilizing 10,422 reference haplotypes to perform genome-wide association analyses and observe 17 genome-wide significant, independent signals, including replicating evidence for association at eight novel low-frequency variant signals. Two novel cardiometabolic associations are at lead variants unique to the founder population sequences: chr16:70790626 (high-density lipoprotein levels beta −1.71 (SE 0.25), P=1.57 × 10^−11, effect allele frequency (EAF) 0.006); and rs145556679 (triglyceride levels beta −1.13 (SE 0.17), P=2.53 × 10^−11, EAF 0.013). Our findings add empirical support to the contribution of low-frequency variants in complex traits, demonstrate the advantage of including population-specific sequences in imputation panels and exemplify the power gains afforded by population isolates. PMID:28548082
