Byers, Helen; Wallis, Yvonne; van Veen, Elke M; Lalloo, Fiona; Reay, Kim; Smith, Philip; Wallace, Andrew J; Bowers, Naomi; Newman, William G; Evans, D Gareth
2016-11-01
The sensitivity of testing BRCA1 and BRCA2 remains unresolved as the frequency of deep intronic splicing variants has not been defined in high-risk familial breast/ovarian cancer families. This variant category is reported at significant frequency in other tumour predisposition genes, including NF1 and MSH2. We carried out comprehensive whole gene RNA analysis on 45 high-risk breast/ovary and male breast cancer families with no identified pathogenic variant on exonic sequencing and copy number analysis of BRCA1/2. In addition, we undertook variant screening of a 10-gene high/moderate risk breast/ovarian cancer panel by next-generation sequencing. DNA testing identified the causative variant in 50/56 (89%) breast/ovarian/male breast cancer families with Manchester scores of ≥50 with two variants being confirmed to affect splicing on RNA analysis. RNA sequencing of BRCA1/BRCA2 on 45 individuals from high-risk families identified no deep intronic variants and did not suggest loss of RNA expression as a cause of lost sensitivity. Panel testing in 42 samples identified a known RAD51D variant, a high-risk ATM variant in another breast ovary family and a truncating CHEK2 mutation. Current exonic sequencing and copy number analysis variant detection methods of BRCA1/2 have high sensitivity in high-risk breast/ovarian cancer families. Sequence analysis of RNA does not identify any variants undetected by current analysis of BRCA1/2. However, RNA analysis clarified the pathogenicity of variants of unknown significance detected by current methods. The low diagnostic uplift achieved through sequence analysis of the other known breast/ovarian cancer susceptibility genes indicates that further high-risk genes remain to be identified.
USDA-ARS?s Scientific Manuscript database
Copy number variation (CNV) is an important type of genetic variation contributing to phenotypic differences among mammals and may serve as an alternative molecular marker to single nucleotide polymorphism (SNP) for genome-wide association study (GWAS). Recently, GWAS analysis using CNV has been app...
Copy Number Variation across European Populations
Chen, Wanting; Hayward, Caroline; Wright, Alan F.; Hicks, Andrew A.; Vitart, Veronique; Knott, Sara; Wild, Sarah H.; Pramstaller, Peter P.; Wilson, James F.; Rudan, Igor; Porteous, David J.
2011-01-01
Genome analysis provides a powerful approach to test for evidence of genetic variation within and between geographical regions and local populations. Copy number variants which comprise insertions, deletions and duplications of genomic sequence provide one such convenient and informative source. Here, we investigate copy number variants from genome wide scans of single nucleotide polymorphisms in three European population isolates, the island of Vis in Croatia, the islands of Orkney in Scotland and the South Tyrol in Italy. We show that whereas the overall copy number variant frequencies are similar between populations, their distribution is highly specific to the population of origin, a finding which is supported by evidence for increased kinship correlation for specific copy number variants within populations. PMID:21829696
Zhuo, L; Reed, K M; Phillips, R B
1995-06-01
Variation in the intergenic spacer (IGS) of the ribosomal DNA (rDNA) of lake trout (Salvelinus namaycush) was examined. Digestion of genomic DNA with restriction enzymes showed that almost every individual had a unique combination of length variants with most of this variation occurring within rather than between populations. Sequence analysis of a 2.3 kilobase (kb) EcoRI-DraI fragment spanning the 3' end of the 28S coding region and approximately 1.8 kb of the IGS revealed two blocks of repetitive DNA. Putative transcriptional termination sites were found approximately 220 bases (b) downstream from the end of the 28S coding region. Comparison of the 2.3-kb fragments with two longer (3.1 kb) fragments showed that the major difference in length resulted from variation in the number of short (89 b) repeats located 3' to the putative terminator. Repeat units within a single nucleolus organizer region (NOR) appeared relatively homogeneous and genetic analysis found variants to be stably inherited. A comparison of the number of spacer-length variants with the number of NORs found that the number of length variants per individual was always less than the number of NORs. Examination of spacer variants in five populations showed that populations with more NORs had more spacer variants, indicating that variants are present at different rDNA sites on nonhomologous chromosomes.
USDA-ARS?s Scientific Manuscript database
Copy number variation (CNV) is an important type of genetic variation contributing to phenotypic differences among mammals and may serve as an alternative molecular marker to single nucleotide polymorphism (SNP) for genome-wide association study (GWAS). Recently, GWAS analysis using CNV has been app...
Korean Variant Archive (KOVA): a reference database of genetic variations in the Korean population.
Lee, Sangmoon; Seo, Jihae; Park, Jinman; Nam, Jae-Yong; Choi, Ahyoung; Ignatius, Jason S; Bjornson, Robert D; Chae, Jong-Hee; Jang, In-Jin; Lee, Sanghyuk; Park, Woong-Yang; Baek, Daehyun; Choi, Murim
2017-06-27
Despite efforts to interrogate human genome variation through large-scale databases, systematic preference toward populations of Caucasian descendants has resulted in unintended reduction of power in studying non-Caucasians. Here we report a compilation of coding variants from 1,055 healthy Korean individuals (KOVA; Korean Variant Archive). The samples were sequenced to a mean depth of 75x, yielding 101 singleton variants per individual. Population genetics analysis demonstrates that the Korean population is a distinct ethnic group comparable to other discrete ethnic groups in Africa and Europe, providing a rationale for such independent genomic datasets. Indeed, KOVA conferred 22.8% increased variant filtering power in addition to Exome Aggregation Consortium (ExAC) when used on Korean exomes. Functional assessment of nonsynonymous variant supported the presence of purifying selection in Koreans. Analysis of copy number variants detected 5.2 deletions and 10.3 amplifications per individual with an increased fraction of novel variants among smaller and rarer copy number variable segments. We also report a list of germline variants that are associated with increased tumor susceptibility. This catalog can function as a critical addition to the pre-existing variant databases in pursuing genetic studies of Korean individuals.
Using whole-exome sequencing to identify variants inherited from mosaic parents
Rios, Jonathan J; Delgado, Mauricio R
2015-01-01
Whole-exome sequencing (WES) has allowed the discovery of genes and variants causing rare human disease. This is often achieved by comparing nonsynonymous variants between unrelated patients, and particularly for sporadic or recessive disease, often identifies a single or few candidate genes for further consideration. However, despite the potential for this approach to elucidate the genetic cause of rare human disease, a majority of patients fail to realize a genetic diagnosis using standard exome analysis methods. Although genetic heterogeneity contributes to the difficulty of exome sequence analysis between patients, it remains plausible that rare human disease is not caused by de novo or recessive variants. Multiple human disorders have been described for which the variant was inherited from a phenotypically normal mosaic parent. Here we highlight the potential for exome sequencing to identify a reasonable number of candidate genes when dominant disease variants are inherited from a mosaic parent. We show the power of WES to identify a limited number of candidate genes using this disease model and how sequence coverage affects identification of mosaic variants by WES. We propose this analysis as an alternative to discover genetic causes of rare human disorders for which typical WES approaches fail to identify likely pathogenic variants. PMID:24986828
Di Giacomo, Daniela; Gaildrat, Pascaline; Abuli, Anna; Abdat, Julie; Frébourg, Thierry; Tosi, Mario; Martins, Alexandra
2013-11-01
Exonic variants can alter pre-mRNA splicing either by changing splice sites or by modifying splicing regulatory elements. Often these effects are difficult to predict and are only detected by performing RNA analyses. Here, we analyzed, in a minigene assay, 26 variants identified in the exon 7 of BRCA2, a cancer predisposition gene. Our results revealed eight new exon skipping mutations in this exon: one directly altering the 5' splice site and seven affecting potential regulatory elements. This brings the number of splicing regulatory mutations detected in BRCA2 exon 7 to a total of 11, a remarkably high number considering the total number of variants reported in this exon (n = 36), all tested in our minigene assay. We then exploited this large set of splicing data to test the predictive value of splicing regulator hexamers' scores recently established by Ke et al. (). Comparisons of hexamer-based predictions with our experimental data revealed high sensitivity in detecting variants that increased exon skipping, an important feature for prescreening variants before RNA analysis. In conclusion, hexamer scores represent a promising tool for predicting the biological consequences of exonic variants and may have important applications for the interpretation of variants detected by high-throughput sequencing. © 2013 WILEY PERIODICALS, INC.
Norton, Nadine; Li, Duanxiang; Rampersaud, Evadnie; Morales, Ana; Martin, Eden R; Zuchner, Stephan; Guo, Shengru; Gonzalez, Michael; Hedges, Dale J; Robertson, Peggy D; Krumm, Niklas; Nickerson, Deborah A; Hershberger, Ray E
2013-04-01
BACKGROUND- Familial dilated cardiomyopathy (DCM) is a genetically heterogeneous disease with >30 known genes. TTN truncating variants were recently implicated in a candidate gene study to cause 25% of familial and 18% of sporadic DCM cases. METHODS AND RESULTS- We used an unbiased genome-wide approach using both linkage analysis and variant filtering across the exome sequences of 48 individuals affected with DCM from 17 families to identify genetic cause. Linkage analysis ranked the TTN region as falling under the second highest genome-wide multipoint linkage peak, multipoint logarithm of odds, 1.59. We identified 6 TTN truncating variants carried by individuals affected with DCM in 7 of 17 DCM families (logarithm of odds, 2.99); 2 of these 7 families also had novel missense variants that segregated with disease. Two additional novel truncating TTN variants did not segregate with DCM. Nucleotide diversity at the TTN locus, including missense variants, was comparable with 5 other known DCM genes. The average number of missense variants in the exome sequences from the DCM cases or the ≈5400 cases from the Exome Sequencing Project was ≈23 per individual. The average number of TTN truncating variants in the Exome Sequencing Project was 0.014 per individual. We also identified a region (chr9q21.11-q22.31) with no known DCM genes with a maximum heterogeneity logarithm of odds score of 1.74. CONCLUSIONS- These data suggest that TTN truncating variants contribute to DCM cause. However, the lack of segregation of all identified TTN truncating variants illustrates the challenge of determining variant pathogenicity even with full exome sequencing.
Mitsui, Jun; Fukuda, Yoko; Azuma, Kyo; Tozaki, Hirokazu; Ishiura, Hiroyuki; Takahashi, Yuji; Goto, Jun; Tsuji, Shoji
2010-07-01
We have recently found that multiple rare variants of the glucocerebrosidase gene (GBA) confer a robust risk for Parkinson disease, supporting the 'common disease-multiple rare variants' hypothesis. To develop an efficient method of identifying rare variants in a large number of samples, we applied multiplexed resequencing using a next-generation sequencer to identification of rare variants of GBA. Sixteen sets of pooled DNAs from six pooled DNA samples were prepared. Each set of pooled DNAs was subjected to polymerase chain reaction to amplify the target gene (GBA) covering 6.5 kb, pooled into one tube with barcode indexing, and then subjected to extensive sequence analysis using the SOLiD System. Individual samples were also subjected to direct nucleotide sequence analysis. With the optimization of data processing, we were able to extract all the variants from 96 samples with acceptable rates of false-positive single-nucleotide variants.
Walker, Logan C; Marquart, Louise; Pearson, John F; Wiggins, George A R; O'Mara, Tracy A; Parsons, Michael T; Barrowdale, Daniel; McGuffog, Lesley; Dennis, Joe; Benitez, Javier; Slavin, Thomas P; Radice, Paolo; Frost, Debra; Godwin, Andrew K; Meindl, Alfons; Schmutzler, Rita Katharina; Isaacs, Claudine; Peshkin, Beth N; Caldes, Trinidad; Hogervorst, Frans BL; Lazaro, Conxi; Jakubowska, Anna; Montagna, Marco; Chen, Xiaoqing; Offit, Kenneth; Hulick, Peter J; Andrulis, Irene L; Lindblom, Annika; Nussbaum, Robert L; Nathanson, Katherine L; Chenevix-Trench, Georgia; Antoniou, Antonis C; Couch, Fergus J; Spurdle, Amanda B
2017-01-01
Genome-wide studies of patients carrying pathogenic variants (mutations) in BRCA1 or BRCA2 have reported strong associations between single-nucleotide polymorphisms (SNPs) and cancer risk. To conduct the first genome-wide association analysis of copy-number variants (CNVs) with breast or ovarian cancer risk in a cohort of 2500 BRCA1 pathogenic variant carriers, CNV discovery was performed using multiple calling algorithms and Illumina 610k SNP array data from a previously published genome-wide association study. Our analysis, which focused on functionally disruptive genomic deletions overlapping gene regions, identified a number of loci associated with risk of breast or ovarian cancer for BRCA1 pathogenic variant carriers. Despite only including putative deletions called by at least two or more algorithms, detection of selected CNVs by ancillary molecular technologies only confirmed 40% of predicted common (>1% allele frequency) variants. These include four loci that were associated (unadjusted P<0.05) with breast cancer (GTF2H2, ZNF385B, NAALADL2 and PSG5), and two loci associated with ovarian cancer (CYP2A7 and OR2A1). An interesting finding from this study was an association of a validated CNV deletion at the CYP2A7 locus (19q13.2) with decreased ovarian cancer risk (relative risk=0.50, P=0.007). Genomic analysis found this deletion coincides with a region displaying strong regulatory potential in ovarian tissue, but not in breast epithelial cells. This study highlighted the need to verify CNVs in vitro, but also provides evidence that experimentally validated CNVs (with plausible biological consequences) can modify risk of breast or ovarian cancer in BRCA1 pathogenic variant carriers. PMID:28145423
A survey of tools for variant analysis of next-generation genome sequencing data
Pabinger, Stephan; Dander, Andreas; Fischer, Maria; Snajder, Rene; Sperk, Michael; Efremova, Mirjana; Krabichler, Birgit; Speicher, Michael R.; Zschocke, Johannes
2014-01-01
Recent advances in genome sequencing technologies provide unprecedented opportunities to characterize individual genomic landscapes and identify mutations relevant for diagnosis and therapy. Specifically, whole-exome sequencing using next-generation sequencing (NGS) technologies is gaining popularity in the human genetics community due to the moderate costs, manageable data amounts and straightforward interpretation of analysis results. While whole-exome and, in the near future, whole-genome sequencing are becoming commodities, data analysis still poses significant challenges and led to the development of a plethora of tools supporting specific parts of the analysis workflow or providing a complete solution. Here, we surveyed 205 tools for whole-genome/whole-exome sequencing data analysis supporting five distinct analytical steps: quality assessment, alignment, variant identification, variant annotation and visualization. We report an overview of the functionality, features and specific requirements of the individual tools. We then selected 32 programs for variant identification, variant annotation and visualization, which were subjected to hands-on evaluation using four data sets: one set of exome data from two patients with a rare disease for testing identification of germline mutations, two cancer data sets for testing variant callers for somatic mutations, copy number variations and structural variations, and one semi-synthetic data set for testing identification of copy number variations. Our comprehensive survey and evaluation of NGS tools provides a valuable guideline for human geneticists working on Mendelian disorders, complex diseases and cancers. PMID:23341494
Rare-Variant Association Analysis: Study Designs and Statistical Tests
Lee, Seunggeung; Abecasis, Gonçalo R.; Boehnke, Michael; Lin, Xihong
2014-01-01
Despite the extensive discovery of trait- and disease-associated common variants, much of the genetic contribution to complex traits remains unexplained. Rare variants can explain additional disease risk or trait variability. An increasing number of studies are underway to identify trait- and disease-associated rare variants. In this review, we provide an overview of statistical issues in rare-variant association studies with a focus on study designs and statistical tests. We present the design and analysis pipeline of rare-variant studies and review cost-effective sequencing designs and genotyping platforms. We compare various gene- or region-based association tests, including burden tests, variance-component tests, and combined omnibus tests, in terms of their assumptions and performance. Also discussed are the related topics of meta-analysis, population-stratification adjustment, genotype imputation, follow-up studies, and heritability due to rare variants. We provide guidelines for analysis and discuss some of the challenges inherent in these studies and future research directions. PMID:24995866
Wang, Jia-Chi; Boyar, Fatih Z
2016-01-01
Chromosomal microarray analysis (CMA) has been recommended and practiced routinely in the large reference laboratories of U.S.A. as the first-tier test for the postnatal evaluation of individuals with intellectual disability, autism spectrum disorders, and/or multiple congenital anomalies. Using CMA as a diagnostic tool and without a routine setting of fluorescence in situ hybridization with labeled bacterial artificial chromosome probes (BAC-FISH) in the large reference laboratories becomes a challenge in the characterization of chromosome 9 pericentric region. This region has a very complex genomic structure and contains a variety of heterochromatic and euchromatic polymorphic variants. These variants were usually studied by G-banding, C-banding and BAC-FISH analysis. Chromosomal microarray analysis (CMA) was not recommended since it may lead to false positive results. Here, we presented a cohort of four cases, in which high-resolution CMA was used as the first-tier test or simultaneously with G-banding analysis on the proband to identify pathogenic copy number variants (CNVs) in the whole genome. CMA revealed large pathogenic CNVs from chromosome 9 in 3 cases which also revealed different G-banding patterns between the two chromosome 9 homologues. Although we demonstrated that high-resolution CMA played an important role in the identification of pathogenic copy number variants in chromosome 9 pericentric regions, the lack of BAC-FISH analysis or other useful tools renders significant challenges in the characterization of chromosome 9 pericentric regions. None; it is not a clinical trial, and the cases were retrospectively collected and analyzed.
Chiu, Chi-yang; Jung, Jeesun; Wang, Yifan; Weeks, Daniel E.; Wilson, Alexander F.; Bailey-Wilson, Joan E.; Amos, Christopher I.; Mills, James L.; Boehnke, Michael; Xiong, Momiao; Fan, Ruzong
2016-01-01
In this paper, extensive simulations are performed to compare two statistical methods to analyze multiple correlated quantitative phenotypes: (1) approximate F-distributed tests of multivariate functional linear models (MFLM) and additive models of multivariate analysis of variance (MANOVA), and (2) Gene Association with Multiple Traits (GAMuT) for association testing of high-dimensional genotype data. It is shown that approximate F-distributed tests of MFLM and MANOVA have higher power and are more appropriate for major gene association analysis (i.e., scenarios in which some genetic variants have relatively large effects on the phenotypes); GAMuT has higher power and is more appropriate for analyzing polygenic effects (i.e., effects from a large number of genetic variants each of which contributes a small amount to the phenotypes). MFLM and MANOVA are very flexible and can be used to perform association analysis for: (i) rare variants, (ii) common variants, and (iii) a combination of rare and common variants. Although GAMuT was designed to analyze rare variants, it can be applied to analyze a combination of rare and common variants and it performs well when (1) the number of genetic variants is large and (2) each variant contributes a small amount to the phenotypes (i.e., polygenes). MFLM and MANOVA are fixed effect models which perform well for major gene association analysis. GAMuT can be viewed as an extension of sequence kernel association tests (SKAT). Both GAMuT and SKAT are more appropriate for analyzing polygenic effects and they perform well not only in the rare variant case, but also in the case of a combination of rare and common variants. Data analyses of European cohorts and the Trinity Students Study are presented to compare the performance of the two methods. PMID:27917525
Gayarre, Javier; Martín-Gimeno, Paloma; Osorio, Ana; Paumard, Beatriz; Barroso, Alicia; Fernández, Victoria; de la Hoya, Miguel; Rojo, Alejandro; Caldés, Trinidad; Palacios, José; Urioste, Miguel; Benítez, Javier; García, María J
2017-09-26
Despite a high prevalence of deleterious missense variants, most studies of RAD51C ovarian cancer susceptibility gene only provide in silico pathogenicity predictions of missense changes. We identified a novel deleterious RAD51C missense variant (p.Arg312Trp) in a high-risk family, and propose a criteria to prioritise RAD51C missense changes qualifying for functional analysis. To evaluate pathogenicity of p.Arg312Trp variant we used sequence homology, loss of heterozygosity (LOH) and segregation analysis, and a comprehensive functional characterisation. To define a functional-analysis prioritisation criteria, we used outputs for the known functionally confirmed deleterious and benign RAD51C missense changes from nine pathogenicity prediction algorithms. The p.Arg312Trp variant failed to correct mitomycin and olaparib hypersensitivity and to complement abnormal RAD51C foci formation according to functional assays, which altogether with LOH and segregation data demonstrated deleteriousness. Prioritisation criteria were based on the number of predictors providing a deleterious output, with a minimum of 5 to qualify for testing and a PredictProtein score greater than 33 to assign high-priority indication. Our study points to a non-negligible number of RAD51C missense variants likely to impair protein function, provides a guideline to prioritise and encourage their selection for functional analysis and anticipates that reference laboratories should have available resources to conduct such assays.
Ho Duy, Binh; Zhytnik, Lidiia; Maasalu, Katre; Kändla, Ivo; Prans, Ele; Reimann, Ene; Märtson, Aare; Kõks, Sulev
2016-08-12
The genetics of osteogenesis imperfecta (OI) have not been studied in a Vietnamese population before. We performed mutational analysis of the COL1A1 and COL1A2 genes in 91 unrelated OI patients of Vietnamese origin. We then systematically characterized the mutation profiles of these two genes which are most commonly related to OI. Genomic DNA was extracted from EDTA-preserved blood according to standard high-salt extraction methods. Sequence analysis and pathogenic variant identification was performed with Mutation Surveyor DNA variant analysis software. Prediction of the pathogenicity of mutations was conducted using Alamut Visual software. The presence of variants was checked against Dalgleish's osteogenesis imperfecta mutation database. The sample consisted of 91 unrelated osteogenesis imperfecta patients. We identified 54 patients with COL1A1/2 pathogenic variants; 33 with COL1A1 and 21 with COL1A2. Two patients had multiple pathogenic variants. Seventeen novel COL1A1 and 10 novel COL1A2 variants were identified. The majority of identified COL1A1/2 pathogenic variants occurred in a glycine substitution (36/56, 64.3 %), usually serine (23/36, 63.9 %). We found two pathogenic variants of the COL1A1 gene c.2461G > A (p.Gly821Ser) in four unrelated patients and one, c.2005G > A (p.Ala669Thr), in two unrelated patients. Our data showed a lower number of collagen OI pathogenic variants in Vietnamese patients compared to reported rates for Asian populations. The OI mutational profile of the Vietnamese population is unique and related to the presence of a high number of recessive mutations in non-collagenous OI genes. Further analysis of OI patients negative for collagen mutations, is required.
[Fine mapping of complex disease susceptibility loci].
Song, Qingfeng; Zhang, Hongxing; Ma, Yilong; Zhou, Gangqiao
2014-01-01
Genome-wide association studies (GWAS) using single nucleotide polymorphism (SNP) markers have identified more than 3800 susceptibility loci for more than 660 diseases or traits. However, the most significantly associated variants or causative variants in these loci and their biological functions have remained to be clarified. These causative variants can help to elucidate the pathogenesis and discover new biomarkers of complex diseases. One of the main goals in the post-GWAS era is to identify the causative variants and susceptibility genes, and clarify their functional aspects by fine mapping. For common variants, imputation or re-sequencing based strategies were implemented to increase the number of analyzed variants and help to identify the most significantly associated variants. In addition, functional element, expression quantitative trait locus (eQTL) and haplotype analyses were performed to identify functional common variants and susceptibility genes. For rare variants, fine mapping was carried out by re-sequencing, rare haplotype analysis, family-based analysis, burden test, etc.This review summarizes the strategies and problems for fine mapping.
Tabb, Keri L.; Hellwege, Jacklyn N.; Palmer, Nicholette D.; Dimitrov, Latchezar; Sajuthi, Satria; Taylor, Kent D.; NG, Maggie C.Y.; Hawkins, Gregory A.; Chen, Yii-Der Ida; Brown, W. Mark; McWilliams, David; Williams, Adrienne; Lorenzo, Carlos; Norris, Jill M.; Long, Jirong; Rotter, Jerome I.; Curran, Joanne E.; Blangero, John; Wagenknecht, Lynne E.; Langefeld, Carl D.; Bowden, Donald W.
2017-01-01
Summary Family-based methods are a potentially powerful tool to identify trait-defining genetic variants in extended families, particularly when used to complement conventional association analysis. We utilized two-point linkage analysis and single variant association analysis to evaluate whole exome sequencing (WES) data from 1,205 Hispanic Americans (78 families) from the Insulin Resistance Atherosclerosis Family Study. WES identified 211,612 variants above the minor allele frequency threshold of ≥0.005. These variants were tested for linkage and/or association with 50 cardiometabolic traits after quality control checks. Two-point linkage analysis yielded 10,580,600 LOD scores with 1,148 LOD scores ≥3, 183 LOD scores ≥4, and 29 LOD scores ≥5. The maximal novel LOD score was 5.50 for rs2289043:T>C, in UNC5C with subcutaneous adipose tissue volume. Association analysis identified 13 variants attaining genome-wide significance (p<5×10-08), with the strongest association between rs651821:C>T in APOA5, and triglyceride levels (p=3.67×10-10). Overall, there was a 5.2-fold increase in the number of informative variants detected by WES compared to exome chip analysis in this population, nearly 30% of which were novel variants relative to dbSNP build 138. Thus, integration of results from two-point linkage and single-variant association analysis from WES data enabled identification of novel signals potentially contributing to cardiometabolic traits. PMID:28067407
Poisson Approximation-Based Score Test for Detecting Association of Rare Variants.
Fang, Hongyan; Zhang, Hong; Yang, Yaning
2016-07-01
Genome-wide association study (GWAS) has achieved great success in identifying genetic variants, but the nature of GWAS has determined its inherent limitations. Under the common disease rare variants (CDRV) hypothesis, the traditional association analysis methods commonly used in GWAS for common variants do not have enough power for detecting rare variants with a limited sample size. As a solution to this problem, pooling rare variants by their functions provides an efficient way for identifying susceptible genes. Rare variant typically have low frequencies of minor alleles, and the distribution of the total number of minor alleles of the rare variants can be approximated by a Poisson distribution. Based on this fact, we propose a new test method, the Poisson Approximation-based Score Test (PAST), for association analysis of rare variants. Two testing methods, namely, ePAST and mPAST, are proposed based on different strategies of pooling rare variants. Simulation results and application to the CRESCENDO cohort data show that our methods are more powerful than the existing methods. © 2016 John Wiley & Sons Ltd/University College London.
Karyotype versus microarray testing for genetic abnormalities after stillbirth.
Reddy, Uma M; Page, Grier P; Saade, George R; Silver, Robert M; Thorsten, Vanessa R; Parker, Corette B; Pinar, Halit; Willinger, Marian; Stoll, Barbara J; Heim-Hall, Josefine; Varner, Michael W; Goldenberg, Robert L; Bukowski, Radek; Wapner, Ronald J; Drews-Botsch, Carolyn D; O'Brien, Barbara M; Dudley, Donald J; Levy, Brynn
2012-12-06
Genetic abnormalities have been associated with 6 to 13% of stillbirths, but the true prevalence may be higher. Unlike karyotype analysis, microarray analysis does not require live cells, and it detects small deletions and duplications called copy-number variants. The Stillbirth Collaborative Research Network conducted a population-based study of stillbirth in five geographic catchment areas. Standardized postmortem examinations and karyotype analyses were performed. A single-nucleotide polymorphism array was used to detect copy-number variants of at least 500 kb in placental or fetal tissue. Variants that were not identified in any of three databases of apparently unaffected persons were then classified into three groups: probably benign, clinical significance unknown, or pathogenic. We compared the results of karyotype and microarray analyses of samples obtained after delivery. In our analysis of samples from 532 stillbirths, microarray analysis yielded results more often than did karyotype analysis (87.4% vs. 70.5%, P<0.001) and provided better detection of genetic abnormalities (aneuploidy or pathogenic copy-number variants, 8.3% vs. 5.8%; P=0.007). Microarray analysis also identified more genetic abnormalities among 443 antepartum stillbirths (8.8% vs. 6.5%, P=0.02) and 67 stillbirths with congenital anomalies (29.9% vs. 19.4%, P=0.008). As compared with karyotype analysis, microarray analysis provided a relative increase in the diagnosis of genetic abnormalities of 41.9% in all stillbirths, 34.5% in antepartum stillbirths, and 53.8% in stillbirths with anomalies. Microarray analysis is more likely than karyotype analysis to provide a genetic diagnosis, primarily because of its success with nonviable tissue, and is especially valuable in analyses of stillbirths with congenital anomalies or in cases in which karyotype results cannot be obtained. (Funded by the Eunice Kennedy Shriver National Institute of Child Health and Human Development.).
Schoeman, Elizna M; Lopez, Genghis H; McGowan, Eunike C; Millard, Glenda M; O'Brien, Helen; Roulis, Eileen V; Liew, Yew-Wah; Martin, Jacqueline R; McGrath, Kelli A; Powley, Tanya; Flower, Robert L; Hyland, Catherine A
2017-04-01
Blood group single nucleotide polymorphism genotyping probes for a limited range of polymorphisms. This study investigated whether massively parallel sequencing (also known as next-generation sequencing), with a targeted exome strategy, provides an extended blood group genotype and the extent to which massively parallel sequencing correctly genotypes in homologous gene systems, such as RH and MNS. Donor samples (n = 28) that were extensively phenotyped and genotyped using single nucleotide polymorphism typing, were analyzed using the TruSight One Sequencing Panel and MiSeq platform. Genes for 28 protein-based blood group systems, GATA1, and KLF1 were analyzed. Copy number variation analysis was used to characterize complex structural variants in the GYPC and RH systems. The average sequencing depth per target region was 66.2 ± 39.8. Each sample harbored on average 43 ± 9 variants, of which 10 ± 3 were used for genotyping. For the 28 samples, massively parallel sequencing variant sequences correctly matched expected sequences based on single nucleotide polymorphism genotyping data. Copy number variation analysis defined the Rh C/c alleles and complex RHD hybrids. Hybrid RHD*D-CE-D variants were correctly identified, but copy number variation analysis did not confidently distinguish between D and CE exon deletion versus rearrangement. The targeted exome sequencing strategy employed extended the range of blood group genotypes detected compared with single nucleotide polymorphism typing. This single-test format included detection of complex MNS hybrid cases and, with copy number variation analysis, defined RH hybrid genes along with the RHCE*C allele hitherto difficult to resolve by variant detection. The approach is economical compared with whole-genome sequencing and is suitable for a red blood cell reference laboratory setting. © 2017 AABB.
Decision Variants for the Automatic Determination of Optimal Feature Subset in RF-RFE.
Chen, Qi; Meng, Zhaopeng; Liu, Xinyi; Jin, Qianguo; Su, Ran
2018-06-15
Feature selection, which identifies a set of most informative features from the original feature space, has been widely used to simplify the predictor. Recursive feature elimination (RFE), as one of the most popular feature selection approaches, is effective in data dimension reduction and efficiency increase. A ranking of features, as well as candidate subsets with the corresponding accuracy, is produced through RFE. The subset with highest accuracy (HA) or a preset number of features (PreNum) are often used as the final subset. However, this may lead to a large number of features being selected, or if there is no prior knowledge about this preset number, it is often ambiguous and subjective regarding final subset selection. A proper decision variant is in high demand to automatically determine the optimal subset. In this study, we conduct pioneering work to explore the decision variant after obtaining a list of candidate subsets from RFE. We provide a detailed analysis and comparison of several decision variants to automatically select the optimal feature subset. Random forest (RF)-recursive feature elimination (RF-RFE) algorithm and a voting strategy are introduced. We validated the variants on two totally different molecular biology datasets, one for a toxicogenomic study and the other one for protein sequence analysis. The study provides an automated way to determine the optimal feature subset when using RF-RFE.
Validation and Implementation of BRCA1/2 Variant Screening in Ovarian Tumor Tissue.
de Jonge, Marthe M; Ruano, Dina; van Eijk, Ronald; van der Stoep, Nienke; Nielsen, Maartje; Wijnen, Juul T; Ter Haar, Natalja T; Baalbergen, Astrid; Bos, Monique E M M; Kagie, Marjolein J; Vreeswijk, Maaike P G; Gaarenstroom, Katja N; Kroep, Judith R; Smit, Vincent T H B M; Bosse, Tjalling; van Wezel, Tom; van Asperen, Christi J
2018-06-21
BRCA1/2 variant analysis in tumor tissue could streamline the referral of patients with epithelial ovarian, fallopian tube, or primary peritoneal cancer to genetic counselors and select patients who benefit most from targeted treatment. We investigated the sensitivity of BRCA1/2 variant analysis in formalin-fixed, paraffin-embedded tumor tissue using a combination of next-generation sequencing and copy number variant multiplex ligation-dependent probe amplification. After optimization using a training cohort of known BRCA1/2 mutation carriers, validation was performed in a prospective cohort (Clinical implementation Of BRCA1/2 screening in ovarian tumor tissue: COBRA-cohort) in which screening of BRCA1/2 tumor DNA and leukocyte germline DNA was performed in parallel. BRCA1 promoter hypermethylation and pedigree analysis were also performed. In the training cohort 45 of 46 germline BRCA1/2 variants were detected (sensitivity 98%). In the COBRA cohort (n=62), all six germline variants were identified (sensitivity 100%), together with five somatic BRCA1/2 variants and eight cases with BRCA1 promoter hypermethylation. In four BRCA1/2 variant-negative patients, surveillance or prophylactic management options were offered based on positive family histories. We conclude that BRCA1/2 formalin-fixed, paraffin-embedded tumor tissue analysis reliably detects BRCA1/2 variants. When taking family history of BRCA1/2 variant-negative patients into account, tumor BRCA1/2 variant screening allows more efficient selection of epithelial ovarian cancer patients for genetic counselling and simultaneously selects patients who benefit most from targeted treatment. Copyright © 2018. Published by Elsevier Inc.
Bailey-Wilson, Joan E.; Brennan, Jennifer S.; Bull, Shelley B; Culverhouse, Robert; Kim, Yoonhee; Jiang, Yuan; Jung, Jeesun; Li, Qing; Lamina, Claudia; Liu, Ying; Mägi, Reedik; Niu, Yue S.; Simpson, Claire L.; Wang, Libo; Yilmaz, Yildiz E.; Zhang, Heping; Zhang, Zhaogong
2012-01-01
Group 14 of Genetic Analysis Workshop 17 examined several issues related to analysis of complex traits using DNA sequence data. These issues included novel methods for analyzing rare genetic variants in an aggregated manner (often termed collapsing rare variants), evaluation of various study designs to increase power to detect effects of rare variants, and the use of machine learning approaches to model highly complex heterogeneous traits. Various published and novel methods for analyzing traits with extreme locus and allelic heterogeneity were applied to the simulated quantitative and disease phenotypes. Overall, we conclude that power is (as expected) dependent on locus-specific heritability or contribution to disease risk, large samples will be required to detect rare causal variants with small effect sizes, extreme phenotype sampling designs may increase power for smaller laboratory costs, methods that allow joint analysis of multiple variants per gene or pathway are more powerful in general than analyses of individual rare variants, population-specific analyses can be optimal when different subpopulations harbor private causal mutations, and machine learning methods may be useful for selecting subsets of predictors for follow-up in the presence of extreme locus heterogeneity and large numbers of potential predictors. PMID:22128066
Zhang, Jimmy F; James, Francis; Shukla, Anju; Girisha, Katta M; Paciorkowski, Alex R
2017-06-27
We built India Allele Finder, an online searchable database and command line tool, that gives researchers access to variant frequencies of Indian Telugu individuals, using publicly available fastq data from the 1000 Genomes Project. Access to appropriate population-based genomic variant annotation can accelerate the interpretation of genomic sequencing data. In particular, exome analysis of individuals of Indian descent will identify population variants not reflected in European exomes, complicating genomic analysis for such individuals. India Allele Finder offers improved ease-of-use to investigators seeking to identify and annotate sequencing data from Indian populations. We describe the use of India Allele Finder to identify common population variants in a disease quartet whole exome dataset, reducing the number of candidate single nucleotide variants from 84 to 7. India Allele Finder is freely available to investigators to annotate genomic sequencing data from Indian populations. Use of India Allele Finder allows efficient identification of population variants in genomic sequencing data, and is an example of a population-specific annotation tool that simplifies analysis and encourages international collaboration in genomics research.
2013-01-01
Background Characterising genetic diversity through the analysis of massively parallel sequencing (MPS) data offers enormous potential to significantly improve our understanding of the genetic basis for observed phenotypes, including predisposition to and progression of complex human disease. Great challenges remain in resolving genetic variants that are genuine from the millions of artefactual signals. Results FAVR is a suite of new methods designed to work with commonly used MPS analysis pipelines to assist in the resolution of some of the issues related to the analysis of the vast amount of resulting data, with a focus on relatively rare genetic variants. To the best of our knowledge, no equivalent method has previously been described. The most important and novel aspect of FAVR is the use of signatures in comparator sequence alignment files during variant filtering, and annotation of variants potentially shared between individuals. The FAVR methods use these signatures to facilitate filtering of (i) platform and/or mapping-specific artefacts, (ii) common genetic variants, and, where relevant, (iii) artefacts derived from imbalanced paired-end sequencing, as well as annotation of genetic variants based on evidence of co-occurrence in individuals. We applied conventional variant calling applied to whole-exome sequencing datasets, produced using both SOLiD and TruSeq chemistries, with or without downstream processing by FAVR methods. We demonstrate a 3-fold smaller rare single nucleotide variant shortlist with no detected reduction in sensitivity. This analysis included Sanger sequencing of rare variant signals not evident in dbSNP131, assessment of known variant signal preservation, and comparison of observed and expected rare variant numbers across a range of first cousin pairs. The principles described herein were applied in our recent publication identifying XRCC2 as a new breast cancer risk gene and have been made publically available as a suite of software tools. Conclusions FAVR is a platform-agnostic suite of methods that significantly enhances the analysis of large volumes of sequencing data for the study of rare genetic variants and their influence on phenotypes. PMID:23441864
Anasagasti, Ander; Barandika, Olatz; Irigoyen, Cristina; Benitez, Bruno A; Cooper, Breanna; Cruchaga, Carlos; López de Munain, Adolfo; Ruiz-Ederra, Javier
2013-11-01
Retinitis Pigmentosa (RP) involves a group of genetically determined retinal diseases caused by a large number of mutations that result in rod photoreceptor cell death followed by gradual death of cone cells. Most cases of RP are monogenic, with more than 80 associated genes identified so far. The high number of genes and variants involved in RP, among other factors, is making the molecular characterization of RP a real challenge for many patients. Although HRM has been used for the analysis of isolated variants or single RP genes, as far as we are concerned, this is the first study that uses HRM analysis for a high-throughput screening of several RP genes. Our main goal was to test the suitability of HRM analysis as a genetic screening technique in RP, and to compare its performance with two of the most widely used NGS platforms, Illumina and PGM-Ion Torrent technologies. RP patients (n = 96) were clinically diagnosed at the Ophthalmology Department of Donostia University Hospital, Spain. We analyzed a total of 16 RP genes that meet the following inclusion criteria: 1) size: genes with transcripts of less than 4 kb; 2) number of exons: genes with up to 22 exons; and 3) prevalence: genes reported to account for, at least, 0.4% of total RP cases worldwide. For comparison purposes, RHO gene was also sequenced with Illumina (GAII; Illumina), Ion semiconductor technologies (PGM; Life Technologies) and Sanger sequencing (ABI 3130xl platform; Applied Biosystems). Detected variants were confirmed in all cases by Sanger sequencing and tested for co-segregation in the family of affected probands. We identified a total of 65 genetic variants, 15 of which (23%) were novel, in 49 out of 96 patients. Among them, 14 (4 novel) are probable disease-causing genetic variants in 7 RP genes, affecting 15 patients. Our HRM analysis-based study, proved to be a cost-effective and rapid method that provides an accurate identification of genetic RP variants. This approach is effective for medium sized (<4 kb transcript) RP genes, which constitute over 80% of the total of known RP genes.
Anasagasti, Ander; Barandika, Olatz; Irigoyen, Cristina; Benitez, Bruno A; Cooper, Breanna; Cruchaga, Carlos; López de Munain, Adolfo; Ruiz-Ederra, Javier
2013-10-24
Retinitis Pigmentosa (RP) involves a group of genetically determined retinal diseases caused by a large number of mutations that result in rod photoreceptor cell death followed by gradual death of cone cells. Most cases of RP are monogenic, with more than 80 associated genes identified so far. The high number of genes and variants involved in RP, among other factors, is making the molecular characterization of RP a real challenge for many patients. Although HRM has been used for the analysis of isolated variants or single RP genes, as far as we are concerned, this is the first study that uses HRM analysis for a high-throughput screening of several RP genes. Our main goal was to test the suitability of HRM analysis as a genetic screening technique in RP, and to compare its performance with two of the most widely used NGS platforms, Illumina and PGM-Ion Torrent technologies. RP patients (n=96) were clinically diagnosed at the Ophthalmology Department of Donostia University Hospital, Spain. We analyzed a total of 16 RP genes that meet the following inclusion criteria: 1) size: genes with transcripts of less than 4 kb; 2) number of exons: genes with up to 22 exons; and 3) prevalence: genes reported to account for, at least, 0.4 % of total RP cases worldwide. For comparison purposes, RHO gene was also sequenced with Illumina (GAII; Illumina), Ion semiconductor technologies (PGM; Life Technologies) and Sanger sequencing (ABI 3130xl platform; Applied Biosystems). Detected variants were confirmed in all cases by Sanger sequencing and tested for co-segregation in the family of affected probands. We identified a total of 65 genetic variants, 15 of which (23%) were novel, in 49 out of 96 patients. Among them, 14 (4 novel) are probable disease-causing genetic variants in 7 RP genes, affecting 15 patients. Our HRM analysis-based study, proved to be a cost-effective and rapid method that provides an accurate identification of genetic RP variants. This approach is effective for medium sized (<4 kb transcript) RP genes, which constitute over 80% of the total of known RP genes. © 2013 Published by Elsevier Ltd.
Use of allele scores as instrumental variables for Mendelian randomization
Burgess, Stephen; Thompson, Simon G
2013-01-01
Background An allele score is a single variable summarizing multiple genetic variants associated with a risk factor. It is calculated as the total number of risk factor-increasing alleles for an individual (unweighted score), or the sum of weights for each allele corresponding to estimated genetic effect sizes (weighted score). An allele score can be used in a Mendelian randomization analysis to estimate the causal effect of the risk factor on an outcome. Methods Data were simulated to investigate the use of allele scores in Mendelian randomization where conventional instrumental variable techniques using multiple genetic variants demonstrate ‘weak instrument’ bias. The robustness of estimates using the allele score to misspecification (for example non-linearity, effect modification) and to violations of the instrumental variable assumptions was assessed. Results Causal estimates using a correctly specified allele score were unbiased with appropriate coverage levels. The estimates were generally robust to misspecification of the allele score, but not to instrumental variable violations, even if the majority of variants in the allele score were valid instruments. Using a weighted rather than an unweighted allele score increased power, but the increase was small when genetic variants had similar effect sizes. Naive use of the data under analysis to choose which variants to include in an allele score, or for deriving weights, resulted in substantial biases. Conclusions Allele scores enable valid causal estimates with large numbers of genetic variants. The stringency of criteria for genetic variants in Mendelian randomization should be maintained for all variants in an allele score. PMID:24062299
Cady, Janet; Allred, Peggy; Bali, Taha; Pestronk, Alan; Goate, Alison; Miller, Timothy M; Mitra, Robi D; Ravits, John; Harms, Matthew B; Baloh, Robert H
2015-01-01
To define the genetic landscape of amyotrophic lateral sclerosis (ALS) and assess the contribution of possible oligogenic inheritance, we aimed to comprehensively sequence 17 known ALS genes in 391 ALS patients from the United States. Targeted pooled-sample sequencing was used to identify variants in 17 ALS genes. Fragment size analysis was used to define ATXN2 and C9ORF72 expansion sizes. Genotype-phenotype correlations were made with individual variants and total burden of variants. Rare variant associations for risk of ALS were investigated at both the single variant and gene level. A total of 64.3% of familial and 27.8% of sporadic subjects carried potentially pathogenic novel or rare coding variants identified by sequencing or an expanded repeat in C9ORF72 or ATXN2; 3.8% of subjects had variants in >1 ALS gene, and these individuals had disease onset 10 years earlier (p = 0.0046) than subjects with variants in a single gene. The number of potentially pathogenic coding variants did not influence disease duration or site of onset. Rare and potentially pathogenic variants in known ALS genes are present in >25% of apparently sporadic and 64% of familial patients, significantly higher than previous reports using less comprehensive sequencing approaches. A significant number of subjects carried variants in >1 gene, which influenced the age of symptom onset and supports oligogenic inheritance as relevant to disease pathogenesis. © 2014 American Neurological Association.
Copy number variants in patients with short stature
van Duyvenvoorde, Hermine A; Lui, Julian C; Kant, Sarina G; Oostdijk, Wilma; Gijsbers, Antoinet CJ; Hoffer, Mariëtte JV; Karperien, Marcel; Walenkamp, Marie JE; Noordam, Cees; Voorhoeve, Paul G; Mericq, Verónica; Pereira, Alberto M; Claahsen-van de Grinten, Hedi L; van Gool, Sandy A; Breuning, Martijn H; Losekoot, Monique; Baron, Jeffrey; Ruivenkamp, Claudia AL; Wit, Jan M
2014-01-01
Height is a highly heritable and classic polygenic trait. Recent genome-wide association studies (GWAS) have revealed that at least 180 genetic variants influence adult height. However, these variants explain only about 10% of the phenotypic variation in height. Genetic analysis of short individuals can lead to the discovery of novel rare gene defects with a large effect on growth. In an effort to identify novel genes associated with short stature, genome-wide analysis for copy number variants (CNVs), using single-nucleotide polymorphism arrays, in 162 patients (149 families) with short stature was performed. Segregation analysis was performed if possible, and genes in CNVs were compared with information from GWAS, gene expression in rodents' growth plates and published information. CNVs were detected in 40 families. In six families, a known cause of short stature was found (SHOX deletion or duplication, IGF1R deletion), in two combined with a de novo potentially pathogenic CNV. Thirty-three families had one or more potentially pathogenic CNVs (n=40). In 24 of these families, segregation analysis could be performed, identifying three de novo CNVs and nine CNVs segregating with short stature. Four were located near loci associated with height in GWAS (ADAMTS17, TULP4, PRKG2/BMP3 and PAPPA). Besides six CNVs known to be causative for short stature, 40 CNVs with possible pathogenicity were identified. Segregation studies and bioinformatics analysis suggested various potential candidate genes. PMID:24065112
Chung, Dongjun; Kim, Hang J; Zhao, Hongyu
2017-02-01
Genome-wide association studies (GWAS) have identified tens of thousands of genetic variants associated with hundreds of phenotypes and diseases, which have provided clinical and medical benefits to patients with novel biomarkers and therapeutic targets. However, identification of risk variants associated with complex diseases remains challenging as they are often affected by many genetic variants with small or moderate effects. There has been accumulating evidence suggesting that different complex traits share common risk basis, namely pleiotropy. Recently, several statistical methods have been developed to improve statistical power to identify risk variants for complex traits through a joint analysis of multiple GWAS datasets by leveraging pleiotropy. While these methods were shown to improve statistical power for association mapping compared to separate analyses, they are still limited in the number of phenotypes that can be integrated. In order to address this challenge, in this paper, we propose a novel statistical framework, graph-GPA, to integrate a large number of GWAS datasets for multiple phenotypes using a hidden Markov random field approach. Application of graph-GPA to a joint analysis of GWAS datasets for 12 phenotypes shows that graph-GPA improves statistical power to identify risk variants compared to statistical methods based on smaller number of GWAS datasets. In addition, graph-GPA also promotes better understanding of genetic mechanisms shared among phenotypes, which can potentially be useful for the development of improved diagnosis and therapeutics. The R implementation of graph-GPA is currently available at https://dongjunchung.github.io/GGPA/.
Kinoti, Wycliff M; Constable, Fiona E; Nancarrow, Narelle; Plummer, Kim M; Rodoni, Brendan
2017-01-01
PCR amplicon next generation sequencing (NGS) analysis offers a broadly applicable and targeted approach to detect populations of both high- or low-frequency virus variants in one or more plant samples. In this study, amplicon NGS was used to explore the diversity of the tripartite genome virus, Prunus necrotic ringspot virus (PNRSV) from 53 PNRSV-infected trees using amplicons from conserved gene regions of each of PNRSV RNA1, RNA2 and RNA3. Sequencing of the amplicons from 53 PNRSV-infected trees revealed differing levels of polymorphism across the three different components of the PNRSV genome with a total number of 5040, 2083 and 5486 sequence variants observed for RNA1, RNA2 and RNA3 respectively. The RNA2 had the lowest diversity of sequences compared to RNA1 and RNA3, reflecting the lack of flexibility tolerated by the replicase gene that is encoded by this RNA component. Distinct PNRSV phylo-groups, consisting of closely related clusters of sequence variants, were observed in each of PNRSV RNA1, RNA2 and RNA3. Most plant samples had a single phylo-group for each RNA component. Haplotype network analysis showed that smaller clusters of PNRSV sequence variants were genetically connected to the largest sequence variant cluster within a phylo-group of each RNA component. Some plant samples had sequence variants occurring in multiple PNRSV phylo-groups in at least one of each RNA and these phylo-groups formed distinct clades that represent PNRSV genetic strains. Variants within the same phylo-group of each Prunus plant sample had ≥97% similarity and phylo-groups within a Prunus plant sample and between samples had less ≤97% similarity. Based on the analysis of diversity, a definition of a PNRSV genetic strain was proposed. The proposed definition was applied to determine the number of PNRSV genetic strains in each of the plant samples and the complexity in defining genetic strains in multipartite genome viruses was explored.
Wang, Shaolin; Yang, Zhongli; Ma, Jennie Z.; Payne, Thomas J.; Li, Ming D
2013-01-01
Through linkage analysis, candidate gene approach, and genome-wide association studies (GWAS), many genetic susceptibility factors for substance dependence have been discovered, such as the alcohol dehydrogenase gene (ALDH2) for alcohol dependence (AD) and nicotinic acetylcholine receptor (nAChR) subunit variants on chromosomes 8 and 15 for nicotine dependence (ND). However, these confirmed genetic factors contribute only a small portion of the heritability responsible for each addiction. Among many potential factors, rare variants in those identified and unidentified susceptibility genes are supposed to contribute greatly to the missing heritability. Several studies focusing on rare variants have been conducted by taking advantage of next-generation sequencing technologies, which revealed that some rare variants of nAChR subunits are associated with ND in both genetic and functional studies. However, these studies investigated variants for only a small number of genes and need to be expanded to broad regions/genes in a larger population. This review presents an update on recently developed methods for rare-variant identification and association analysis and on studies focused on rare-variant discovery and function related to addictions. PMID:23990377
Keogh, Michael J; Wei, Wei; Wilson, Ian; Coxhead, Jon; Ryan, Sarah; Rollinson, Sara; Griffin, Helen; Kurzawa-Akanbi, Marzena; Santibanez-Koref, Mauro; Talbot, Kevin; Turner, Martin R; McKenzie, Chris-Anne; Troakes, Claire; Attems, Johannes; Smith, Colin; Al Sarraj, Safa; Morris, Chris M; Ansorge, Olaf; Pickering-Brown, Stuart; Ironside, James W; Chinnery, Patrick F
2017-01-01
Given the central role of genetic factors in the pathogenesis of common neurodegenerative disorders, it is critical that mechanistic studies in human tissue are interpreted in a genetically enlightened context. To address this, we performed exome sequencing and copy number variant analysis on 1511 frozen human brains with a diagnosis of Alzheimer's disease (AD, n = 289), frontotemporal dementia/amyotrophic lateral sclerosis (FTD/ALS, n = 252), Creutzfeldt-Jakob disease (CJD, n = 239), Parkinson's disease (PD, n = 39), dementia with Lewy bodies (DLB, n = 58), other neurodegenerative, vascular, or neurogenetic disorders (n = 266), and controls with no significant neuropathology (n = 368). Genomic DNA was extracted from brain tissue in all cases before exome sequencing (Illumina Nextera 62 Mb capture) with variants called by FreeBayes; copy number variant (CNV) analysis (Illumina HumanOmniExpress-12 BeadChip); C9orf72 repeat expansion detection; and APOE genotyping. Established or likely pathogenic heterozygous, compound heterozygous, or homozygous variants, together with the C9orf72 hexanucleotide repeat expansions and a copy number gain of APP, were found in 61 brains. In addition to known risk alleles in 349 brains (23.9% of 1461 undergoing exome sequencing), we saw an association between rare variants in GRN and DLB. Rare CNVs were found in <1.5% of brains, including copy number gains of PRPH that were overrepresented in AD. Clinical, pathological, and genetic data are available, enabling the retrieval of specific frozen brains through the UK Medical Research Council Brain Banks Network. This allows direct access to pathological and control human brain tissue based on an individual's genetic architecture, thus enabling the functional validation of known genetic risk factors and potentially pathogenic alleles identified in future studies. © 2017 Keogh et al.; Published by Cold Spring Harbor Laboratory Press.
Larson, Nicholas B.; Berardi, Cecilia; Decker, Paul A.; Wassel, Christina L.; Kirsch, Phillip S.; Pankow, James S.; Sale, Michele M.; de Andrade, Mariza; Sicotte, Hugues; Tang, Weihong; Hanson, Naomi Q.; Tsai, Michael Y.; Taylor, Kent D.; Bielinski, Suzette J.
2015-01-01
Summary Hepatocyte growth factor (HGF) is a mesenchyme-derived pleiotropic factor that regulates cell growth, motility, mitogenesis, and morphogenesis in a variety of cells, and increased serum levels of HGF have been linked to a number of clinical and subclinical cardiovascular disease phenotypes. However, little is currently known regarding what genetic factors influence HGF levels, despite evidence of substantial genetic contributions to HGF variation. Based upon ethnicity-stratified single-variant association analysis and trans-ethnic meta-analysis of 6201 participants of the Multi-Ethnic Study of Atherosclerosis (MESA), we discovered five statistically significant common and low-frequency variants: HGF missense polymorphism rs5745687 (p.E299K) as well as four variants (rs16844364, rs4690098, rs114303452, rs3748034) within or in proximity to HGFAC. We also identified two significant ethnicity-specific gene-level associations (A1BG in African Americans; FASN in Chinese Americans) based upon low-frequency/rare variants, while meta-analysis of gene-level results identified a significant association for HGFAC. However, identified single-variant associations explained modest proportions of the total trait variation and were not significantly associated with coronary artery calcium or coronary heart disease. Our findings indicate genetic factors influencing circulating HGF levels may be complex and ethnically diverse. PMID:25998175
Sapkota, Yadav; Vivo, Immaculata De; Steinthorsdottir, Valgerdur; Fassbender, Amelie; Bowdler, Lisa; Buring, Julie E; Edwards, Todd L; Jones, Sarah; O, Dorien; Peterse, Daniëlle; Rexrode, Kathryn M; Ridker, Paul M; Schork, Andrew J; Thorleifsson, Gudmar; Wallace, Leanne M; Kraft, Peter; Morris, Andrew P; Nyholt, Dale R; Edwards, Digna R Velez; Nyegaard, Mette; D'Hooghe, Thomas; Chasman, Daniel I; Stefansson, Kari; Missmer, Stacey A; Montgomery, Grant W
2017-09-12
Genome-wide association (GWA) studies have identified 19 independent common risk loci for endometriosis. Most of the GWA variants are non-coding and the genes responsible for the association signals have not been identified. Herein, we aimed to assess the potential role of protein-modifying variants in endometriosis using exome-array genotyping in 7164 cases and 21005 controls, and a replication set of 1840 cases and 129016 controls of European ancestry. Results in the discovery sample identified significant evidence for association with coding variants in single-variant (rs1801232-CUBN) and gene-level (CIITA and PARP4) meta-analyses, but these did not survive replication. In the combined analysis, there was genome-wide significant evidence for rs13394619 (P = 2.3 × 10 -9 ) in GREB1 at 2p25.1 - a locus previously identified in a GWA meta-analysis of European and Japanese samples. Despite sufficient power, our results did not identify any protein-modifying variants (MAF > 0.01) with moderate or large effect sizes in endometriosis, although these variants may exist in non-European populations or in high-risk families. The results suggest continued discovery efforts should focus on genotyping large numbers of surgically-confirmed endometriosis cases and controls, and/or sequencing high-risk families to identify novel rare variants to provide greater insights into the molecular pathogenesis of the disease.
Pulmonary Nontuberculous Mycobacterial Infection. A Multisystem, Multigenic Disease.
Szymanski, Eva P; Leung, Janice M; Fowler, Cedar J; Haney, Carissa; Hsu, Amy P; Chen, Fei; Duggal, Priya; Oler, Andrew J; McCormack, Ryan; Podack, Eckhard; Drummond, Rebecca A; Lionakis, Michail S; Browne, Sarah K; Prevots, D Rebecca; Knowles, Michael; Cutting, Gary; Liu, Xinyue; Devine, Scott E; Fraser, Claire M; Tettelin, Hervé; Olivier, Kenneth N; Holland, Steven M
2015-09-01
The clinical features of patients infected with pulmonary nontuberculous mycobacteria (PNTM) are well described, but the genetic components of infection susceptibility are not. To examine genetic variants in patients with PNTM, their unaffected family members, and a control group. Whole-exome sequencing was done on 69 white patients with PNTM and 18 of their white unaffected family members. We performed a candidate gene analysis using immune, cystic fibrosis transmembrance conductance regulator (CFTR), cilia, and connective tissue gene sets. The numbers of patients, family members, and control subjects with variants in each category were compared, as was the average number of variants per person. A significantly higher number of patients with PNTM than the other subjects had low-frequency, protein-affecting variants in immune, CFTR, cilia, and connective tissue categories (35, 26, 90, and 90%, respectively). Patients with PNTM also had significantly more cilia and connective tissue variants per person than did control subjects (2.47 and 2.55 compared with 1.38 and 1.40, respectively; P = 1.4 × 10(-6) and P = 2.7 × 10(-8), respectively). Patients with PNTM had an average of 5.26 variants across all categories (1.98 in control subjects; P = 2.8 × 10(-17)), and they were more likely than control subjects to have variants in multiple categories. We observed similar results for family members without PNTM infection, with the exception of the immune category. Patients with PNTM have more low-frequency, protein-affecting variants in immune, CFTR, cilia, and connective tissue genes than their unaffected family members and control subjects. We propose that PNTM infection is a multigenic disease in which combinations of variants across gene categories, plus environmental exposures, increase susceptibility to the infection.
Clinical Applications of Molecular Genetic Discoveries
Marian, A.J.
2015-01-01
Genome-wide association studies (GWAS) of complex traits have mapped more than 15,000 common single nucleotide variants (SNVs). Likewise, applications of massively parallel nucleic acid sequencing technologies often referred to as Next Generation Sequencing, to molecular genetic studies of complex traits have catalogued a large number of rare variants (population frequency of <0.01) in cases with complex traits. Moreover, high throughput nucleic acid sequencing, variant burden analysis, and linkage studies are illuminating the presence of large number of SNVs in cases and families with single gene disorders. The plethora of the genetic variants has exposed the formidable challenge of identifying the causal and pathogenic variants from the enormous number of innocuous common and rare variants that exist in the population as well as in an individual genome. The arduous task of identifying the causal and pathogenic variants is further compounded by the pleiotropic effects of the variants, complexity of cis and trans interactions in the genome, variability in phenotypic expression of the disease, as well as phenotypic plasticity, and the multifarious determinants of the phenotype. Population genetic studies offer the initial roadmaps and have the potential to elucidate novel pathways involved in the pathogenesis of the disease. However, the genome of an individual is unique, rendering unambiguous identification of the causal or pathogenic variant in a single individual exceedingly challenging. Yet, the focus of the practice of medicine is on the individual, as Sir William Osler elegantly expressed in his insightful quotation: “The good physician treats the disease; the great physician treats the patient who has the disease.” The daunting task facing physicians, patients, and researchers alike is to apply the modern genetic discoveries to care of the individual with or at risk of the disease. PMID:26548329
Hu, Hao; Wienker, Thomas F; Musante, Luciana; Kalscheuer, Vera M; Kahrizi, Kimia; Najmabadi, Hossein; Ropers, H Hilger
2014-12-01
Next-generation sequencing has greatly accelerated the search for disease-causing defects, but even for experts the data analysis can be a major challenge. To facilitate the data processing in a clinical setting, we have developed a novel medical resequencing analysis pipeline (MERAP). MERAP assesses the quality of sequencing, and has optimized capacity for calling variants, including single-nucleotide variants, insertions and deletions, copy-number variation, and other structural variants. MERAP identifies polymorphic and known causal variants by filtering against public domain databases, and flags nonsynonymous and splice-site changes. MERAP uses a logistic model to estimate the causal likelihood of a given missense variant. MERAP considers the relevant information such as phenotype and interaction with known disease-causing genes. MERAP compares favorably with GATK, one of the widely used tools, because of its higher sensitivity for detecting indels, its easy installation, and its economical use of computational resources. Upon testing more than 1,200 individuals with mutations in known and novel disease genes, MERAP proved highly reliable, as illustrated here for five families with disease-causing variants. We believe that the clinical implementation of MERAP will expedite the diagnostic process of many disease-causing defects. © 2014 WILEY PERIODICALS, INC.
Barrett, Karlene T; Rodikova, Ekaterina; Weese-Mayer, Debra E; Rand, Casey M; Marazita, Mary L; Cooper, Margaret E; Berry-Kravis, Elizabeth M; Bech-Hansen, N Torben; Wilson, Richard J A
2013-12-01
Stress peptide, pituitary adenylate cyclase-activating polypeptide (PACAP), has been implicated in sudden infant death syndrome (SIDS). The aim of this exploratory study was to determine whether variants in the gene encoding the PACAP-specific receptor, PAC1, are associated with SIDS in Caucasian and African American infants. Polymerase chain reaction and Sanger DNA sequencing was used to compare variants in the 5'-untranslated region, exons and intron-exon boundaries of the PAC1 gene in 96 SIDS cases and 96 race- and gender-matched controls. The intron 3 variant, A/G: rs758995 (variant 'h'), and the intron 6 variant, C/T: rs10081254 (variant 'n'), were significantly associated with SIDS in Caucasians and African Americans, respectively (p < 0.05). Also associated with SIDS were interactions between the variants rs2302475 (variant 'i') in PAC1 and rs8192597 and rs2856966 in PACAP among Caucasians (p < 0.02) and rs2267734 (variant 'q') in PAC1 and rs1893154 in PACAP among African Americans (p < 0.01). However, none of these differences survived post hoc analysis. Overall, this study does not support a strong association between variants in the PAC1 gene and SIDS; however, a number of potential associations between race-specific variants and SIDS were identified that warrant targeted investigations in future studies. ©2013 Foundation Acta Paediatrica. Published by John Wiley & Sons Ltd.
Antanaviciute, Agne; Watson, Christopher M; Harrison, Sally M; Lascelles, Carolina; Crinnion, Laura; Markham, Alexander F; Bonthron, David T; Carr, Ian M
2015-12-01
Exome sequencing has become a de facto standard method for Mendelian disease gene discovery in recent years, yet identifying disease-causing mutations among thousands of candidate variants remains a non-trivial task. Here we describe a new variant prioritization tool, OVA (ontology variant analysis), in which user-provided phenotypic information is exploited to infer deeper biological context. OVA combines a knowledge-based approach with a variant-filtering framework. It reduces the number of candidate variants by considering genotype and predicted effect on protein sequence, and scores the remainder on biological relevance to the query phenotype.We take advantage of several ontologies in order to bridge knowledge across multiple biomedical domains and facilitate computational analysis of annotations pertaining to genes, diseases, phenotypes, tissues and pathways. In this way, OVA combines information regarding molecular and physical phenotypes and integrates both human and model organism data to effectively prioritize variants. By assessing performance on both known and novel disease mutations, we show that OVA performs biologically meaningful candidate variant prioritization and can be more accurate than another recently published candidate variant prioritization tool. OVA is freely accessible at http://dna2.leeds.ac.uk:8080/OVA/index.jsp. Supplementary data are available at Bioinformatics online. umaan@leeds.ac.uk. © The Author 2015. Published by Oxford University Press.
Le Gall, Jessica; Nizon, Mathilde; Pichon, Olivier; Andrieux, Joris; Audebert-Bellanger, Séverine; Baron, Sabine; Beneteau, Claire; Bilan, Frédéric; Boute, Odile; Busa, Tiffany; Cormier-Daire, Valérie; Ferec, Claude; Fradin, Mélanie; Gilbert-Dussardier, Brigitte; Jaillard, Sylvie; Jønch, Aia; Martin-Coignard, Dominique; Mercier, Sandra; Moutton, Sébastien; Rooryck, Caroline; Schaefer, Elise; Vincent, Marie; Sanlaville, Damien; Le Caignec, Cédric; Jacquemont, Sébastien; David, Albert; Isidor, Bertrand
2017-08-01
Sex chromosome aneuploidies (SCA) is a group of conditions in which individuals have an abnormal number of sex chromosomes. SCA, such as Klinefelter's syndrome, XYY syndrome, and Triple X syndrome are associated with a large range of neurological outcome. Another genetic event such as another cytogenetic abnormality may explain a part of this variable expressivity. In this study, we have recruited fourteen patients with intellectual disability or developmental delay carrying SCA associated with a copy-number variant (CNV). In our cohort (four patients 47,XXY, four patients 47,XXX, and six patients 47,XYY), seven patients were carrying a pathogenic CNV, two a likely pathogenic CNV and five a variant of uncertain significance. Our analysis suggests that CNV might be considered as an additional independent genetic factor for intellectual disability and developmental delay for patients with SCA and neurodevelopmental disorder.
Current state-of-art of STR sequencing in forensic genetics.
Alonso, Antonio; Barrio, Pedro A; Müller, Petra; Köcher, Steffi; Berger, Burkhard; Martin, Pablo; Bodner, Martin; Willuweit, Sascha; Parson, Walther; Roewer, Lutz; Budowle, Bruce
2018-05-11
The current state of validation and implementation strategies of MPS technology for the analysis of STR markers for forensic genetics use is described, covering the topics of the current catalogue of commercial MPS-STR panels, leading MPS-platforms, and MPS-STR data analysis tools. In addition, the developmental and internal validation studies carried out to date to evaluate reliability, sensitivity, mixture analysis, concordance, and the ability to analyze challenged samples are summarized. The results of various MPS-STR population studies that showed a large number of new STR sequence variants that increase the power of discrimination in several forensically-relevant loci are also presented. Finally, various initiatives developed by several international projects and standardization (or guidelines) groups to facilitate application of MPS technology for STR marker analyses are discussed in regard to promoting a standard STR sequence nomenclature, performing population studies to detect sequence variants, and developing a universal system to translate sequence variants into a simple STR nomenclature (numbers and letters) compatible with national STR databases. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.
Burgess, Stephen; Zuber, Verena; Valdes-Marquez, Elsa; Sun, Benjamin B; Hopewell, Jemma C
2017-12-01
Mendelian randomization uses genetic variants to make causal inferences about the effect of a risk factor on an outcome. With fine-mapped genetic data, there may be hundreds of genetic variants in a single gene region any of which could be used to assess this causal relationship. However, using too many genetic variants in the analysis can lead to spurious estimates and inflated Type 1 error rates. But if only a few genetic variants are used, then the majority of the data is ignored and estimates are highly sensitive to the particular choice of variants. We propose an approach based on summarized data only (genetic association and correlation estimates) that uses principal components analysis to form instruments. This approach has desirable theoretical properties: it takes the totality of data into account and does not suffer from numerical instabilities. It also has good properties in simulation studies: it is not particularly sensitive to varying the genetic variants included in the analysis or the genetic correlation matrix, and it does not have greatly inflated Type 1 error rates. Overall, the method gives estimates that are less precise than those from variable selection approaches (such as using a conditional analysis or pruning approach to select variants), but are more robust to seemingly arbitrary choices in the variable selection step. Methods are illustrated by an example using genetic associations with testosterone for 320 genetic variants to assess the effect of sex hormone related pathways on coronary artery disease risk, in which variable selection approaches give inconsistent inferences. © 2017 The Authors Genetic Epidemiology Published by Wiley Periodicals, Inc.
Diroma, Maria Angela; Santorsola, Mariangela; Guttà, Cristiano; Gasparre, Giuseppe; Picardi, Ernesto; Pesole, Graziano; Attimonelli, Marcella
2014-01-01
Motivation: The increasing availability of mitochondria-targeted and off-target sequencing data in whole-exome and whole-genome sequencing studies (WXS and WGS) has risen the demand of effective pipelines to accurately measure heteroplasmy and to easily recognize the most functionally important mitochondrial variants among a huge number of candidates. To this purpose, we developed MToolBox, a highly automated pipeline to reconstruct and analyze human mitochondrial DNA from high-throughput sequencing data. Results: MToolBox implements an effective computational strategy for mitochondrial genomes assembling and haplogroup assignment also including a prioritization analysis of detected variants. MToolBox provides a Variant Call Format file featuring, for the first time, allele-specific heteroplasmy and annotation files with prioritized variants. MToolBox was tested on simulated samples and applied on 1000 Genomes WXS datasets. Availability and implementation: MToolBox package is available at https://sourceforge.net/projects/mtoolbox/. Contact: marcella.attimonelli@uniba.it Supplementary information: Supplementary data are available at Bioinformatics online. PMID:25028726
Application of Nexus copy number software for CNV detection and analysis.
Darvishi, Katayoon
2010-04-01
Among human structural genomic variation, copy number variants (CNVs) are the most frequently known component, comprised of gains/losses of DNA segments that are generally 1 kb in length or longer. Array-based comparative genomic hybridization (aCGH) has emerged as a powerful tool for detecting genomic copy number variants (CNVs). With the rapid increase in the density of array technology and with the adaptation of new high-throughput technology, a reliable and computationally scalable method for accurate mapping of recurring DNA copy number aberrations has become a main focus in research. Here we introduce Nexus Copy Number software, a platform-independent tool, to analyze the output files of all types of commercial and custom-made comparative genomic hybridization (CGH) and single-nucleotide polymorphism (SNP) arrays, such as those manufactured by Affymetrix, Agilent Technologies, Illumina, and Roche NimbleGen. It also supports data generated by various array image-analysis software tools such as GenePix, ImaGene, and BlueFuse. (c) 2010 by John Wiley & Sons, Inc.
Kimura, Hiroki; Tsuboi, Daisuke; Wang, Chenyao; Kushima, Itaru; Koide, Takayoshi; Ikeda, Masashi; Iwayama, Yoshimi; Toyota, Tomoko; Yamamoto, Noriko; Kunimoto, Shohko; Nakamura, Yukako; Yoshimi, Akira; Banno, Masahiro; Xing, Jingrui; Takasaki, Yuto; Yoshida, Mami; Aleksic, Branko; Uno, Yota; Okada, Takashi; Iidaka, Tetsuya; Inada, Toshiya; Suzuki, Michio; Ujike, Hiroshi; Kunugi, Hiroshi; Kato, Tadafumi; Yoshikawa, Takeo; Iwata, Nakao; Kaibuchi, Kozo; Ozaki, Norio
2015-01-01
Background: Nuclear distribution E homolog 1 (NDE1), located within chromosome 16p13.11, plays an essential role in microtubule organization, mitosis, and neuronal migration and has been suggested by several studies of rare copy number variants to be a promising schizophrenia (SCZ) candidate gene. Recently, increasing attention has been paid to rare single-nucleotide variants (SNVs) discovered by deep sequencing of candidate genes, because such SNVs may have large effect sizes and their functional analysis may clarify etiopathology. Methods and Results: We conducted mutation screening of NDE1 coding exons using 433 SCZ and 145 pervasive developmental disorders samples in order to identify rare single nucleotide variants with a minor allele frequency ≤5%. We then performed genetic association analysis using a large number of unrelated individuals (3554 SCZ, 1041 bipolar disorder [BD], and 4746 controls). Among the discovered novel rare variants, we detected significant associations between SCZ and S214F (P = .039), and between BD and R234C (P = .032). Furthermore, functional assays showed that S214F affected axonal outgrowth and the interaction between NDE1 and YWHAE (14-3-3 epsilon; a neurodevelopmental regulator). Conclusions: This study strengthens the evidence for association between rare variants within NDE1 and SCZ, and may shed light into the molecular mechanisms underlying this severe psychiatric disorder. PMID:25332407
Analysis of CHRNA7 rare variants in autism spectrum disorder susceptibility.
Bacchelli, Elena; Battaglia, Agatino; Cameli, Cinzia; Lomartire, Silvia; Tancredi, Raffaella; Thomson, Susanne; Sutcliffe, James S; Maestrini, Elena
2015-04-01
Chromosome 15q13.3 recurrent microdeletions are causally associated with a wide range of phenotypes, including autism spectrum disorder (ASD), seizures, intellectual disability, and other psychiatric conditions. Whether the reciprocal microduplication is pathogenic is less certain. CHRNA7, encoding for the alpha7 subunit of the neuronal nicotinic acetylcholine receptor, is considered the likely culprit gene in mediating neurological phenotypes in 15q13.3 deletion cases. To assess if CHRNA7 rare variants confer risk to ASD, we performed copy number variant analysis and Sanger sequencing of the CHRNA7 coding sequence in a sample of 135 ASD cases. Sequence variation in this gene remains largely unexplored, given the existence of a fusion gene, CHRFAM7A, which includes a nearly identical partial duplication of CHRNA7. Hence, attempts to sequence coding exons must distinguish between CHRNA7 and CHRFAM7A, making next-generation sequencing approaches unreliable for this purpose. A CHRNA7 microduplication was detected in a patient with autism and moderate cognitive impairment; while no rare damaging variants were identified in the coding region, we detected rare variants in the promoter region, previously described to functionally reduce transcription. This study represents the first sequence variant analysis of CHRNA7 in a sample of idiopathic autism. © 2015 Wiley Periodicals, Inc.
Skunk and Raccoon Rabies in the Eastern United States: Temporal and Spatial Analysis
Curns, Aaron T.; Rupprecht, Charles E.; Hanlon, Cathleen A.; Krebs, John W.; Childs, James E.
2003-01-01
Since 1981, an epizootic of raccoon rabies has spread throughout the eastern United States. A concomitant increase in reported rabies cases in skunks has raised concerns that an independent maintenance cycle of rabies virus in skunks could become established, affecting current strategies of wildlife rabies control programs. Rabies surveillance data from 1981 through 2000 obtained from the health departments of 11 eastern states were used to analyze temporal and spatial characteristics of rabies epizootics in each species. Spatial analysis indicated that epizootics in raccoons and skunks moved in a similar direction from 1990 to 2000. Temporal regression analysis showed that the number of rabid raccoons predicted the number of rabid skunks through time, with a 1-month lag. In areas where the raccoon rabies virus variant is enzootic, spatio-temporal analysis does not provide evidence that this rabies virus variant is currently cycling independently among skunks. PMID:14519253
Constable, Fiona E.; Nancarrow, Narelle; Plummer, Kim M.; Rodoni, Brendan
2017-01-01
PCR amplicon next generation sequencing (NGS) analysis offers a broadly applicable and targeted approach to detect populations of both high- or low-frequency virus variants in one or more plant samples. In this study, amplicon NGS was used to explore the diversity of the tripartite genome virus, Prunus necrotic ringspot virus (PNRSV) from 53 PNRSV-infected trees using amplicons from conserved gene regions of each of PNRSV RNA1, RNA2 and RNA3. Sequencing of the amplicons from 53 PNRSV-infected trees revealed differing levels of polymorphism across the three different components of the PNRSV genome with a total number of 5040, 2083 and 5486 sequence variants observed for RNA1, RNA2 and RNA3 respectively. The RNA2 had the lowest diversity of sequences compared to RNA1 and RNA3, reflecting the lack of flexibility tolerated by the replicase gene that is encoded by this RNA component. Distinct PNRSV phylo-groups, consisting of closely related clusters of sequence variants, were observed in each of PNRSV RNA1, RNA2 and RNA3. Most plant samples had a single phylo-group for each RNA component. Haplotype network analysis showed that smaller clusters of PNRSV sequence variants were genetically connected to the largest sequence variant cluster within a phylo-group of each RNA component. Some plant samples had sequence variants occurring in multiple PNRSV phylo-groups in at least one of each RNA and these phylo-groups formed distinct clades that represent PNRSV genetic strains. Variants within the same phylo-group of each Prunus plant sample had ≥97% similarity and phylo-groups within a Prunus plant sample and between samples had less ≤97% similarity. Based on the analysis of diversity, a definition of a PNRSV genetic strain was proposed. The proposed definition was applied to determine the number of PNRSV genetic strains in each of the plant samples and the complexity in defining genetic strains in multipartite genome viruses was explored. PMID:28632759
Using high-resolution variant frequencies to empower clinical genome interpretation.
Whiffin, Nicola; Minikel, Eric; Walsh, Roddy; O'Donnell-Luria, Anne H; Karczewski, Konrad; Ing, Alexander Y; Barton, Paul J R; Funke, Birgit; Cook, Stuart A; MacArthur, Daniel; Ware, James S
2017-10-01
PurposeWhole-exome and whole-genome sequencing have transformed the discovery of genetic variants that cause human Mendelian disease, but discriminating pathogenic from benign variants remains a daunting challenge. Rarity is recognized as a necessary, although not sufficient, criterion for pathogenicity, but frequency cutoffs used in Mendelian analysis are often arbitrary and overly lenient. Recent very large reference datasets, such as the Exome Aggregation Consortium (ExAC), provide an unprecedented opportunity to obtain robust frequency estimates even for very rare variants.MethodsWe present a statistical framework for the frequency-based filtering of candidate disease-causing variants, accounting for disease prevalence, genetic and allelic heterogeneity, inheritance mode, penetrance, and sampling variance in reference datasets.ResultsUsing the example of cardiomyopathy, we show that our approach reduces by two-thirds the number of candidate variants under consideration in the average exome, without removing true pathogenic variants (false-positive rate<0.001).ConclusionWe outline a statistically robust framework for assessing whether a variant is "too common" to be causative for a Mendelian disorder of interest. We present precomputed allele frequency cutoffs for all variants in the ExAC dataset.
Wala, Jeremiah; Zhang, Cheng-Zhong; Meyerson, Matthew; Beroukhim, Rameen
2016-07-01
We developed VariantBam, a C ++ read filtering and profiling tool for use with BAM, CRAM and SAM sequencing files. VariantBam provides a flexible framework for extracting sequencing reads or read-pairs that satisfy combinations of rules, defined by any number of genomic intervals or variant sites. We have implemented filters based on alignment data, sequence motifs, regional coverage and base quality. For example, VariantBam achieved a median size reduction ratio of 3.1:1 when applied to 10 lung cancer whole genome BAMs by removing large tags and selecting for only high-quality variant-supporting reads and reads matching a large dictionary of sequence motifs. Thus VariantBam enables efficient storage of sequencing data while preserving the most relevant information for downstream analysis. VariantBam and full documentation are available at github.com/jwalabroad/VariantBam rameen@broadinstitute.org Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Tang, Rongying; Prosser, Debra O.; Love, Donald R.
2016-01-01
The increasing diagnostic use of gene sequencing has led to an expanding dataset of novel variants that lie within consensus splice junctions. The challenge for diagnostic laboratories is the evaluation of these variants in order to determine if they affect splicing or are merely benign. A common evaluation strategy is to use in silico analysis, and it is here that a number of programmes are available online; however, currently, there are no consensus guidelines on the selection of programmes or protocols to interpret the prediction results. Using a collection of 222 pathogenic mutations and 50 benign polymorphisms, we evaluated the sensitivity and specificity of four in silico programmes in predicting the effect of each variant on splicing. The programmes comprised Human Splice Finder (HSF), Max Entropy Scan (MES), NNSplice, and ASSP. The MES and ASSP programmes gave the highest performance based on Receiver Operator Curve analysis, with an optimal cut-off of score reduction of 10%. The study also showed that the sensitivity of prediction is affected by the level of conservation of individual positions, with in silico predictions for variants at positions −4 and +7 within consensus splice sites being largely uninformative. PMID:27313609
Fejzo, Marlena Schoenberg; Myhre, Ronny; Colodro-Conde, Lucía; MacGibbon, Kimber W; Sinsheimer, Janet S; Reddy, M V Prasad Linga; Pajukanta, Päivi; Nyholt, Dale R; Wright, Margaret J; Martin, Nicholas G; Engel, Stephanie M; Medland, Sarah E; Magnus, Per; Mullin, Patrick M
2017-01-05
Hyperemesis Gravidarum (HG), severe nausea/vomiting in pregnancy (NVP), can cause poor maternal/fetal outcomes. Genetic predisposition suggests the genetic component is essential in discovering an etiology. We performed whole-exome sequencing of 5 families followed by analysis of variants in 584 cases/431 controls. Variants in RYR2 segregated with disease in 2 families. The novel variant L3277R was not found in any case/control. The rare variant, G1886S was more common in cases (p = 0.046) and extreme cases (p = 0.023). Replication of G1886S using Norwegian/Australian data was supportive. Common variants rs790899 and rs1891246 were significantly associated with HG and weight loss. Copy-number analysis revealed a deletion in a patient. RYR2 encodes an intracellular calcium release channel involved in vomiting, cyclic-vomiting syndrome, and is a thyroid hormone target gene. Additionally, RYR2 is a downstream drug target of Inderal, used to treat HG and CVS. Thus, herein we provide genetic evidence for a pathway and therapy for HG. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Yang, Jinliang; Jiang, Haiying; Yeh, Cheng-Ting; Yu, Jianming; Jeddeloh, Jeffrey A; Nettleton, Dan; Schnable, Patrick S
2015-11-01
Although approaches for performing genome-wide association studies (GWAS) are well developed, conventional GWAS requires high-density genotyping of large numbers of individuals from a diversity panel. Here we report a method for performing GWAS that does not require genotyping of large numbers of individuals. Instead XP-GWAS (extreme-phenotype GWAS) relies on genotyping pools of individuals from a diversity panel that have extreme phenotypes. This analysis measures allele frequencies in the extreme pools, enabling discovery of associations between genetic variants and traits of interest. This method was evaluated in maize (Zea mays) using the well-characterized kernel row number trait, which was selected to enable comparisons between the results of XP-GWAS and conventional GWAS. An exome-sequencing strategy was used to focus sequencing resources on genes and their flanking regions. A total of 0.94 million variants were identified and served as evaluation markers; comparisons among pools showed that 145 of these variants were statistically associated with the kernel row number phenotype. These trait-associated variants were significantly enriched in regions identified by conventional GWAS. XP-GWAS was able to resolve several linked QTL and detect trait-associated variants within a single gene under a QTL peak. XP-GWAS is expected to be particularly valuable for detecting genes or alleles responsible for quantitative variation in species for which extensive genotyping resources are not available, such as wild progenitors of crops, orphan crops, and other poorly characterized species such as those of ecological interest. © 2015 The Authors The Plant Journal published by Society for Experimental Biology and John Wiley & Sons Ltd.
Quantitative trait nucleotide analysis using Bayesian model selection.
Blangero, John; Goring, Harald H H; Kent, Jack W; Williams, Jeff T; Peterson, Charles P; Almasy, Laura; Dyer, Thomas D
2005-10-01
Although much attention has been given to statistical genetic methods for the initial localization and fine mapping of quantitative trait loci (QTLs), little methodological work has been done to date on the problem of statistically identifying the most likely functional polymorphisms using sequence data. In this paper we provide a general statistical genetic framework, called Bayesian quantitative trait nucleotide (BQTN) analysis, for assessing the likely functional status of genetic variants. The approach requires the initial enumeration of all genetic variants in a set of resequenced individuals. These polymorphisms are then typed in a large number of individuals (potentially in families), and marker variation is related to quantitative phenotypic variation using Bayesian model selection and averaging. For each sequence variant a posterior probability of effect is obtained and can be used to prioritize additional molecular functional experiments. An example of this quantitative nucleotide analysis is provided using the GAW12 simulated data. The results show that the BQTN method may be useful for choosing the most likely functional variants within a gene (or set of genes). We also include instructions on how to use our computer program, SOLAR, for association analysis and BQTN analysis.
Torrezan, Giovana T; de Almeida, Fernanda G Dos Santos R; Figueiredo, Márcia C P; Barros, Bruna D de Figueiredo; de Paula, Cláudia A A; Valieris, Renan; de Souza, Jorge E S; Ramalho, Rodrigo F; da Silva, Felipe C C; Ferreira, Elisa N; de Nóbrega, Amanda F; Felicio, Paula S; Achatz, Maria I; de Souza, Sandro J; Palmero, Edenir I; Carraro, Dirce M
2018-01-01
Pathogenic variants in known breast cancer (BC) predisposing genes explain only about 30% of Hereditary Breast Cancer (HBC) cases, whereas the underlying genetic factors for most families remain unknown. Here, we used whole-exome sequencing (WES) to identify genetic variants associated to HBC in 17 patients of Brazil with familial BC and negative for causal variants in major BC risk genes ( BRCA1/2, TP53 , and CHEK2 c.1100delC). First, we searched for rare variants in 27 known HBC genes and identified two patients harboring truncating pathogenic variants in ATM and BARD1 . For the remaining 15 negative patients, we found a substantial vast number of rare genetic variants. Thus, for selecting the most promising variants we used functional-based variant prioritization, followed by NGS validation, analysis in a control group, cosegregation analysis in one family and comparison with previous WES studies, shrinking our list to 23 novel BC candidate genes, which were evaluated in an independent cohort of 42 high-risk BC patients. Rare and possibly damaging variants were identified in 12 candidate genes in this cohort, including variants in DNA repair genes ( ERCC1 and SXL4 ) and other cancer-related genes ( NOTCH2, ERBB2, MST1R , and RAF1 ). Overall, this is the first WES study applied for identifying novel genes associated to HBC in Brazilian patients, in which we provide a set of putative BC predisposing genes. We also underpin the value of using WES for assessing the complex landscape of HBC susceptibility, especially in less characterized populations.
Pillai, Suja; Gopalan, Vinod; Lo, Chung Y; Liew, Victor; Smith, Robert A; Lam, Alfred King Y
2017-02-01
The goal of this pilot study was to develop a customized, cost-effective amplicon panel (Ampliseq) for target sequencing in a cohort of patients with sporadic phaeochromocytoma/paraganglioma. Phaeochromocytoma/paragangliomas from 25 patients were analysed by targeted next-generation sequencing approach using an Ion Torrent PGM instrument. Primers for 15 target genes (NF1, RET, VHL, SDHA, SDHB, SDHC, SDHD, SDHAF2, TMEM127, MAX, MEN1, KIF1Bβ, EPAS1, CDKN2 & PHD2) were designed using ion ampliseq designer. Ion Reporter software and Ingenuity® Variant Analysis™ software (www.ingenuity.com/variants) from Ingenuity Systems were used to analysis these results. Overall, 713 variants were identified. The variants identified from the Ion Reporter ranged from 64 to 161 per patient. Single nucleotide variants (SNV) were the most common. Further annotation with the help of Ingenuity variant analysis revealed 29 of these 713variants were deletions. Of these, six variants were non-pathogenic and four were likely to be pathogenic. The remaining 19 variants were of uncertain significance. The most frequently altered gene in the cohort was KIF1B followed by NF1. Novel KIF1B pathogenic variant c.3375+1G>A was identified. The mutation was noted in a patient with clinically confirmed neurofibromatosis. Chromosome 1 showed the presence of maximum number of variants. Use of targeted next-generation sequencing is a sensitive method for the detecting genetic changes in patients with phaeochromocytoma/paraganglioma. The precise detection of these genetic changes helps in understanding the pathogenesis of these tumours. Copyright © 2016 Elsevier Inc. All rights reserved.
Zhu, Yun; Fan, Ruzong; Xiong, Momiao
2017-01-01
Investigating the pleiotropic effects of genetic variants can increase statistical power, provide important information to achieve deep understanding of the complex genetic structures of disease, and offer powerful tools for designing effective treatments with fewer side effects. However, the current multiple phenotype association analysis paradigm lacks breadth (number of phenotypes and genetic variants jointly analyzed at the same time) and depth (hierarchical structure of phenotype and genotypes). A key issue for high dimensional pleiotropic analysis is to effectively extract informative internal representation and features from high dimensional genotype and phenotype data. To explore correlation information of genetic variants, effectively reduce data dimensions, and overcome critical barriers in advancing the development of novel statistical methods and computational algorithms for genetic pleiotropic analysis, we proposed a new statistic method referred to as a quadratically regularized functional CCA (QRFCCA) for association analysis which combines three approaches: (1) quadratically regularized matrix factorization, (2) functional data analysis and (3) canonical correlation analysis (CCA). Large-scale simulations show that the QRFCCA has a much higher power than that of the ten competing statistics while retaining the appropriate type 1 errors. To further evaluate performance, the QRFCCA and ten other statistics are applied to the whole genome sequencing dataset from the TwinsUK study. We identify a total of 79 genes with rare variants and 67 genes with common variants significantly associated with the 46 traits using QRFCCA. The results show that the QRFCCA substantially outperforms the ten other statistics. PMID:29040274
Xu, Bin; Woodroffe, Abigail; Rodriguez-Murillo, Laura; Roos, J Louw; van Rensburg, Elizabeth J; Abecasis, Gonçalo R; Gogos, Joseph A; Karayiorgou, Maria
2009-09-29
To elucidate the genetic architecture of familial schizophrenia we combine linkage analysis with studies of fine-level chromosomal variation in families recruited from the Afrikaner population in South Africa. We demonstrate that individually rare inherited copy number variants (CNVs) are more frequent in cases with familial schizophrenia as compared to unaffected controls and affect almost exclusively genic regions. Interestingly, we find that while the prevalence of rare structural variants is similar in familial and sporadic cases, the type of variants is markedly different. In addition, using a high-density linkage scan with a panel of nearly 2,000 markers, we identify a region on chromosome 13q34 that shows genome-wide significant linkage to schizophrenia and show that in the families not linked to this locus, there is evidence for linkage to chromosome 1p36. No causative CNVs were identified in either locus. Overall, our results from approaches designed to detect risk variants with relatively low frequency and high penetrance in a well-defined and relatively homogeneous population, provide strong empirical evidence supporting the notion that multiple genetic variants, including individually rare ones, that affect many different genes contribute to the genetic risk of familial schizophrenia. They also highlight differences in the genetic architecture of the familial and sporadic forms of the disease.
Structural analysis of two length variants of the rDNA intergenic spacer from Eruca sativa.
Lakshmikumaran, M; Negi, M S
1994-03-01
Restriction enzyme analysis of the rRNA genes of Eruca sativa indicated the presence of many length variants within a single plant and also between different cultivars which is unusual for most crucifers studied so far. Two length variants of the rDNA intergenic spacer (IGS) from a single individual E. sativa (cv. Itsa) plant were cloned and characterized. The complete nucleotide sequences of both the variants (3 kb and 4 kb) were determined. The intergenic spacer contains three families of tandemly repeated DNA sequences denoted as A, B and C. However, the long (4 kb) variant shows the presence of an additional repeat, denoted as D, which is a duplication of a 224 bp sequence just upstream of the putative transcription initiation site. Repeat units belonging to the three different families (A, B and C) were in the size range of 22 to 30 bp. Such short repeat elements are present in the IGS of most of the crucifers analysed so far. Sequence analysis of the variants (3 kb and 4 kb) revealed that the length heterogeneity of the spacer is located at three different regions and is due to the varying copy numbers of repeat units belonging to families A and B. Length variation of the spacer is also due to the presence of a large duplication (D repeats) in the 4 kb variant which is absent in the 3 kb variant. The putative transcription initiation site was identified by comparisons with the rDNA sequences from other plant species.
Clan Genomics and the Complex Architecture of Human Disease
Belmont, John W.; Boerwinkle, Eric
2013-01-01
Human diseases are caused by alleles that encompass the full range of variant types, from single-nucleotide changes to copy-number variants, and these variations span a broad frequency spectrum, from the very rare to the common. The picture emerging from analysis of whole-genome sequences, the 1000 Genomes Project pilot studies, and targeted genomic sequencing derived from very large sample sizes reveals an abundance of rare and private variants. One implication of this realization is that recent mutation may have a greater influence on disease susceptibility or protection than is conferred by variations that arose in distant ancestors. PMID:21962505
Hakenberg, Jörg; Cheng, Wei-Yi; Thomas, Philippe; Wang, Ying-Chih; Uzilov, Andrew V; Chen, Rong
2016-01-08
Data from a plethora of high-throughput sequencing studies is readily available to researchers, providing genetic variants detected in a variety of healthy and disease populations. While each individual cohort helps gain insights into polymorphic and disease-associated variants, a joint perspective can be more powerful in identifying polymorphisms, rare variants, disease-associations, genetic burden, somatic variants, and disease mechanisms. We have set up a Reference Variant Store (RVS) containing variants observed in a number of large-scale sequencing efforts, such as 1000 Genomes, ExAC, Scripps Wellderly, UK10K; various genotyping studies; and disease association databases. RVS holds extensive annotations pertaining to affected genes, functional impacts, disease associations, and population frequencies. RVS currently stores 400 million distinct variants observed in more than 80,000 human samples. RVS facilitates cross-study analysis to discover novel genetic risk factors, gene-disease associations, potential disease mechanisms, and actionable variants. Due to its large reference populations, RVS can also be employed for variant filtration and gene prioritization. A web interface to public datasets and annotations in RVS is available at https://rvs.u.hpc.mssm.edu/.
Dadaev, Tokhir; Saunders, Edward J; Newcombe, Paul J; Anokian, Ezequiel; Leongamornlert, Daniel A; Brook, Mark N; Cieza-Borrella, Clara; Mijuskovic, Martina; Wakerell, Sarah; Olama, Ali Amin Al; Schumacher, Fredrick R; Berndt, Sonja I; Benlloch, Sara; Ahmed, Mahbubl; Goh, Chee; Sheng, Xin; Zhang, Zhuo; Muir, Kenneth; Govindasami, Koveela; Lophatananon, Artitaya; Stevens, Victoria L; Gapstur, Susan M; Carter, Brian D; Tangen, Catherine M; Goodman, Phyllis; Thompson, Ian M; Batra, Jyotsna; Chambers, Suzanne; Moya, Leire; Clements, Judith; Horvath, Lisa; Tilley, Wayne; Risbridger, Gail; Gronberg, Henrik; Aly, Markus; Nordström, Tobias; Pharoah, Paul; Pashayan, Nora; Schleutker, Johanna; Tammela, Teuvo L J; Sipeky, Csilla; Auvinen, Anssi; Albanes, Demetrius; Weinstein, Stephanie; Wolk, Alicja; Hakansson, Niclas; West, Catharine; Dunning, Alison M; Burnet, Neil; Mucci, Lorelei; Giovannucci, Edward; Andriole, Gerald; Cussenot, Olivier; Cancel-Tassin, Géraldine; Koutros, Stella; Freeman, Laura E Beane; Sorensen, Karina Dalsgaard; Orntoft, Torben Falck; Borre, Michael; Maehle, Lovise; Grindedal, Eli Marie; Neal, David E; Donovan, Jenny L; Hamdy, Freddie C; Martin, Richard M; Travis, Ruth C; Key, Tim J; Hamilton, Robert J; Fleshner, Neil E; Finelli, Antonio; Ingles, Sue Ann; Stern, Mariana C; Rosenstein, Barry; Kerns, Sarah; Ostrer, Harry; Lu, Yong-Jie; Zhang, Hong-Wei; Feng, Ninghan; Mao, Xueying; Guo, Xin; Wang, Guomin; Sun, Zan; Giles, Graham G; Southey, Melissa C; MacInnis, Robert J; FitzGerald, Liesel M; Kibel, Adam S; Drake, Bettina F; Vega, Ana; Gómez-Caamaño, Antonio; Fachal, Laura; Szulkin, Robert; Eklund, Martin; Kogevinas, Manolis; Llorca, Javier; Castaño-Vinyals, Gemma; Penney, Kathryn L; Stampfer, Meir; Park, Jong Y; Sellers, Thomas A; Lin, Hui-Yi; Stanford, Janet L; Cybulski, Cezary; Wokolorczyk, Dominika; Lubinski, Jan; Ostrander, Elaine A; Geybels, Milan S; Nordestgaard, Børge G; Nielsen, Sune F; Weisher, Maren; Bisbjerg, Rasmus; Røder, Martin Andreas; Iversen, Peter; Brenner, Hermann; Cuk, Katarina; Holleczek, Bernd; Maier, Christiane; Luedeke, Manuel; Schnoeller, Thomas; Kim, Jeri; Logothetis, Christopher J; John, Esther M; Teixeira, Manuel R; Paulo, Paula; Cardoso, Marta; Neuhausen, Susan L; Steele, Linda; Ding, Yuan Chun; De Ruyck, Kim; De Meerleer, Gert; Ost, Piet; Razack, Azad; Lim, Jasmine; Teo, Soo-Hwang; Lin, Daniel W; Newcomb, Lisa F; Lessel, Davor; Gamulin, Marija; Kulis, Tomislav; Kaneva, Radka; Usmani, Nawaid; Slavov, Chavdar; Mitev, Vanio; Parliament, Matthew; Singhal, Sandeep; Claessens, Frank; Joniau, Steven; Van den Broeck, Thomas; Larkin, Samantha; Townsend, Paul A; Aukim-Hastie, Claire; Gago-Dominguez, Manuela; Castelao, Jose Esteban; Martinez, Maria Elena; Roobol, Monique J; Jenster, Guido; van Schaik, Ron H N; Menegaux, Florence; Truong, Thérèse; Koudou, Yves Akoli; Xu, Jianfeng; Khaw, Kay-Tee; Cannon-Albright, Lisa; Pandha, Hardev; Michael, Agnieszka; Kierzek, Andrzej; Thibodeau, Stephen N; McDonnell, Shannon K; Schaid, Daniel J; Lindstrom, Sara; Turman, Constance; Ma, Jing; Hunter, David J; Riboli, Elio; Siddiq, Afshan; Canzian, Federico; Kolonel, Laurence N; Le Marchand, Loic; Hoover, Robert N; Machiela, Mitchell J; Kraft, Peter; Freedman, Matthew; Wiklund, Fredrik; Chanock, Stephen; Henderson, Brian E; Easton, Douglas F; Haiman, Christopher A; Eeles, Rosalind A; Conti, David V; Kote-Jarai, Zsofia
2018-06-11
Prostate cancer is a polygenic disease with a large heritable component. A number of common, low-penetrance prostate cancer risk loci have been identified through GWAS. Here we apply the Bayesian multivariate variable selection algorithm JAM to fine-map 84 prostate cancer susceptibility loci, using summary data from a large European ancestry meta-analysis. We observe evidence for multiple independent signals at 12 regions and 99 risk signals overall. Only 15 original GWAS tag SNPs remain among the catalogue of candidate variants identified; the remainder are replaced by more likely candidates. Biological annotation of our credible set of variants indicates significant enrichment within promoter and enhancer elements, and transcription factor-binding sites, including AR, ERG and FOXA1. In 40 regions at least one variant is colocalised with an eQTL in prostate cancer tissue. The refined set of candidate variants substantially increase the proportion of familial relative risk explained by these known susceptibility regions, which highlights the importance of fine-mapping studies and has implications for clinical risk profiling.
Genetic Candidate Variants in Two Multigenerational Families with Childhood Apraxia of Speech
Wijsman, Ellen M.; Nato, Alejandro Q.; Matsushita, Mark M.; Chapman, Kathy L.; Stanaway, Ian B.; Wolff, John; Oda, Kaori; Gabo, Virginia B.; Raskind, Wendy H.
2016-01-01
Childhood apraxia of speech (CAS) is a severe and socially debilitating form of speech sound disorder with suspected genetic involvement, but the genetic etiology is not yet well understood. Very few known or putative causal genes have been identified to date, e.g., FOXP2 and BCL11A. Building a knowledge base of the genetic etiology of CAS will make it possible to identify infants at genetic risk and motivate the development of effective very early intervention programs. We investigated the genetic etiology of CAS in two large multigenerational families with familial CAS. Complementary genomic methods included Markov chain Monte Carlo linkage analysis, copy-number analysis, identity-by-descent sharing, and exome sequencing with variant filtering. No overlaps in regions with positive evidence of linkage between the two families were found. In one family, linkage analysis detected two chromosomal regions of interest, 5p15.1-p14.1, and 17p13.1-q11.1, inherited separately from the two founders. Single-point linkage analysis of selected variants identified CDH18 as a primary gene of interest and additionally, MYO10, NIPBL, GLP2R, NCOR1, FLCN, SMCR8, NEK8, and ANKRD12, possibly with additive effects. Linkage analysis in the second family detected five regions with LOD scores approaching the highest values possible in the family. A gene of interest was C4orf21 (ZGRF1) on 4q25-q28.2. Evidence for previously described causal copy-number variations and validated or suspected genes was not found. Results are consistent with a heterogeneous CAS etiology, as is expected in many neurogenic disorders. Future studies will investigate genome variants in these and other families with CAS. PMID:27120335
Geerlings, M J; Volokhina, E B; de Jong, E K; van de Kar, N; Pauper, M; Hoyng, C B; van den Heuvel, L P; den Hollander, A I
2018-06-11
Genetic alterations in the complement system have been linked to a variety of diseases, including atypical hemolytic uremic syndrome (aHUS), C3 glomerulopathy (C3G), and age-related macular degeneration (AMD). We performed sequence analysis of the complement genes CFH, CFI, and C3 in 866 aHUS/C3G and 697 AMD patients. In total we identified 505 low frequency alleles, representing 121 unique variants, of which 51 are novel. CFH contained the largest number of unique low frequency variants (n=64; 53%), followed by C3 (n=32; 26%) and CFI (n=25; 21%). A substantial number of variants were found in both patients groups (n=48; 40%), while 41 (34%) variants were found only in aHUS/C3G and 32 (26%) variants were AMD-specific. Genotype-phenotype correlations between the disease groups identified a higher frequency of protein-altering alleles in SCR20 of Factor H (FH), and in the serine protease domain of Factor I (FI) in aHUS/C3G patients. In AMD a higher frequency of protein-altering alleles was observed in SCR3, SCR5 and SCR7 of FH, the SRCR domain of FI, and in the MG3 domain of C3. In conclusion, we observed a substantial overlap of variants between aHUS/C3G and AMD, however, there is a distinct clustering of variants within specific domains. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.
Canver, Matthew C; Lessard, Samuel; Pinello, Luca; Wu, Yuxuan; Ilboudo, Yann; Stern, Emily N; Needleman, Austen J; Galactéros, Frédéric; Brugnara, Carlo; Kutlar, Abdullah; McKenzie, Colin; Reid, Marvin; Chen, Diane D; Das, Partha Pratim; A Cole, Mitchel; Zeng, Jing; Kurita, Ryo; Nakamura, Yukio; Yuan, Guo-Cheng; Lettre, Guillaume; Bauer, Daniel E; Orkin, Stuart H
2017-04-01
Cas9-mediated, high-throughput, saturating in situ mutagenesis permits fine-mapping of function across genomic segments. Disease- and trait-associated variants identified in genome-wide association studies largely cluster at regulatory loci. Here we demonstrate the use of multiple designer nucleases and variant-aware library design to interrogate trait-associated regulatory DNA at high resolution. We developed a computational tool for the creation of saturating-mutagenesis libraries with single or multiple nucleases with incorporation of variants. We applied this methodology to the HBS1L-MYB intergenic region, which is associated with red-blood-cell traits, including fetal hemoglobin levels. This approach identified putative regulatory elements that control MYB expression. Analysis of genomic copy number highlighted potential false-positive regions, thus emphasizing the importance of off-target analysis in the design of saturating-mutagenesis experiments. Together, these data establish a widely applicable high-throughput and high-resolution methodology to identify minimal functional sequences within large disease- and trait-associated regions.
Monroe, Glen R; Kappen, Isabelle FPM; Stokman, Marijn F; Terhal, Paulien A; van den Boogaard, Marie-José H; Savelberg, Sanne MC; van der Veken, Lars T; van Es, Robert JJ; Lens, Susanne M; Hengeveld, Rutger C; Creton, Marijn A; Janssen, Nard G; Mink van der Molen, Aebele B; Ebbeling, Michelle B; Giles, Rachel H; Knoers, Nine V; van Haaften, Gijs
2016-01-01
The oral-facial-digital (OFD) syndromes comprise a group of related disorders with a combination of oral, facial and digital anomalies. Variants in several ciliary genes have been associated with subtypes of OFD syndrome, yet in most OFD patients the underlying cause remains unknown. We investigated the molecular basis of disease in two brothers with OFD type II, Mohr syndrome, by performing single-nucleotide polymorphism (SNP)-array analysis on the brothers and their healthy parents to identify homozygous regions and candidate genes. Subsequently, we performed whole-exome sequencing (WES) on the family. Using WES, we identified compound heterozygous variants c.[464G>C][1226G>A] in NIMA (Never in Mitosis Gene A)-Related Kinase 1 (NEK1). The novel variant c.464G>C disturbs normal splicing in an essential region of the kinase domain. The nonsense variant c.1226G>A, p.(Trp409*), results in nonsense-associated alternative splicing, removing the first coiled-coil domain of NEK1. Candidate variants were confirmed with Sanger sequencing and alternative splicing assessed with cDNA analysis. Immunocytochemistry was used to assess cilia number and length. Patient-derived fibroblasts showed severely reduced ciliation compared with control fibroblasts (18.0 vs 48.9%, P<0.0001), but showed no significant difference in cilia length. In conclusion, we identified compound heterozygous deleterious variants in NEK1 in two brothers with Mohr syndrome. Ciliation in patient fibroblasts is drastically reduced, consistent with a ciliary defect pathogenesis. Our results establish NEK1 variants involved in the etiology of a subset of patients with OFD syndrome type II and support the consideration of including (routine) NEK1 analysis in patients suspected of OFD. PMID:27530628
Monroe, Glen R; Kappen, Isabelle Fpm; Stokman, Marijn F; Terhal, Paulien A; van den Boogaard, Marie-José H; Savelberg, Sanne Mc; van der Veken, Lars T; van Es, Robert Jj; Lens, Susanne M; Hengeveld, Rutger C; Creton, Marijn A; Janssen, Nard G; Mink van der Molen, Aebele B; Ebbeling, Michelle B; Giles, Rachel H; Knoers, Nine V; van Haaften, Gijs
2016-12-01
The oral-facial-digital (OFD) syndromes comprise a group of related disorders with a combination of oral, facial and digital anomalies. Variants in several ciliary genes have been associated with subtypes of OFD syndrome, yet in most OFD patients the underlying cause remains unknown. We investigated the molecular basis of disease in two brothers with OFD type II, Mohr syndrome, by performing single-nucleotide polymorphism (SNP)-array analysis on the brothers and their healthy parents to identify homozygous regions and candidate genes. Subsequently, we performed whole-exome sequencing (WES) on the family. Using WES, we identified compound heterozygous variants c.[464G>C];[1226G>A] in NIMA (Never in Mitosis Gene A)-Related Kinase 1 (NEK1). The novel variant c.464G>C disturbs normal splicing in an essential region of the kinase domain. The nonsense variant c.1226G>A, p.(Trp409*), results in nonsense-associated alternative splicing, removing the first coiled-coil domain of NEK1. Candidate variants were confirmed with Sanger sequencing and alternative splicing assessed with cDNA analysis. Immunocytochemistry was used to assess cilia number and length. Patient-derived fibroblasts showed severely reduced ciliation compared with control fibroblasts (18.0 vs 48.9%, P<0.0001), but showed no significant difference in cilia length. In conclusion, we identified compound heterozygous deleterious variants in NEK1 in two brothers with Mohr syndrome. Ciliation in patient fibroblasts is drastically reduced, consistent with a ciliary defect pathogenesis. Our results establish NEK1 variants involved in the etiology of a subset of patients with OFD syndrome type II and support the consideration of including (routine) NEK1 analysis in patients suspected of OFD.
HFE gene C282Y variant is associated with colorectal cancer in Caucasians: a meta-analysis.
Chen, Weidong; Zhao, Hua; Li, Tiegang; Yao, Hongliang
2013-08-01
The HFE gene has been suggested to play an important role in the pathogenesis of colorectal cancer. However, the results have been conflicting. In this study, we performed a meta-analysis to clarify the association of HFE gene C282Y variant with colorectal cancer. PubMed and Embase were retrieved to identify the potential literature. Pooled odds ratio (OR) with 95 % confidence interval (CI) was calculated using fixed- or random-effects model. A total of eight papers including nine studies (7,588 colorectal cancer cases and 81,571 controls) for HFE gene C282Y variant were included in the meta-analysis. The result indicated that HFE gene C282Y variant was significantly associated with colorectal cancer under recessive model (OR = 2.00, 95 % CI = 1.32-3.04), with no evidence of between-study heterogeneity (I (2) = 0.2 %, p = 0.432). Further subgroup analysis by number of cases suggested the effect was significant in studies with more than 500 cases (OR = 2.51, 95 % CI = 1.58-3.98, I (2) = 0.0 %, p = 0.921), but not in studies with less than 500 cases (OR = 0.75, 95 % CI = 0.28-1.97, I (2) = 0.0 %, p = 0.622). The current meta-analysis supported the positive association of HFE gene C282Y variant with colorectal cancer. Further large-scale studies with the consideration for gene-gene/gene-environment interactions should be conducted to investigate the association.
Bravo-Alonso, Irene; Navarrete, Rosa; Arribas-Carreira, Laura; Perona, Almudena; Abia, David; Couce, María Luz; García-Cazorla, Angels; Morais, Ana; Domingo, Rosario; Ramos, María Antonia; Swanson, Michael A; Van Hove, Johan L K; Ugarte, Magdalena; Pérez, Belén; Pérez-Cerdá, Celia; Rodríguez-Pombo, Pilar
2017-06-01
The rapid analysis of genomic data is providing effective mutational confirmation in patients with clinical and biochemical hallmarks of a specific disease. This is the case for nonketotic hyperglycinemia (NKH), a Mendelian disorder causing seizures in neonates and early-infants, primarily due to mutations in the GLDC gene. However, understanding the impact of missense variants identified in this gene is a major challenge for the application of genomics into clinical practice. Herein, a comprehensive functional and structural analysis of 19 GLDC missense variants identified in a cohort of 26 NKH patients was performed. Mutant cDNA constructs were expressed in COS7 cells followed by enzymatic assays and Western blot analysis of the GCS P-protein to assess the residual activity and mutant protein stability. Structural analysis, based on molecular modeling of the 3D structure of GCS P-protein, was also performed. We identify hypomorphic variants that produce attenuated phenotypes with improved prognosis of the disease. Structural analysis allows us to interpret the effects of mutations on protein stability and catalytic activity, providing molecular evidence for clinical outcome and disease severity. Moreover, we identify an important number of mutants whose loss-of-functionality is associated with instability and, thus, are potential targets for rescue using folding therapeutic approaches. © 2017 Wiley Periodicals, Inc.
Xu, Bin; Woodroffe, Abigail; Rodriguez-Murillo, Laura; Roos, J. Louw; van Rensburg, Elizabeth J.; Abecasis, Gonçalo R.; Gogos, Joseph A.; Karayiorgou, Maria
2009-01-01
To elucidate the genetic architecture of familial schizophrenia we combine linkage analysis with studies of fine-level chromosomal variation in families recruited from the Afrikaner population in South Africa. We demonstrate that individually rare inherited copy number variants (CNVs) are more frequent in cases with familial schizophrenia as compared to unaffected controls and affect almost exclusively genic regions. Interestingly, we find that while the prevalence of rare structural variants is similar in familial and sporadic cases, the type of variants is markedly different. In addition, using a high-density linkage scan with a panel of nearly 2,000 markers, we identify a region on chromosome 13q34 that shows genome-wide significant linkage to schizophrenia and show that in the families not linked to this locus, there is evidence for linkage to chromosome 1p36. No causative CNVs were identified in either locus. Overall, our results from approaches designed to detect risk variants with relatively low frequency and high penetrance in a well-defined and relatively homogeneous population, provide strong empirical evidence supporting the notion that multiple genetic variants, including individually rare ones, that affect many different genes contribute to the genetic risk of familial schizophrenia. They also highlight differences in the genetic architecture of the familial and sporadic forms of the disease. PMID:19805367
Identification of pathogen genomic variants through an integrated pipeline
2014-01-01
Background Whole-genome sequencing represents a powerful experimental tool for pathogen research. We present methods for the analysis of small eukaryotic genomes, including a streamlined system (called Platypus) for finding single nucleotide and copy number variants as well as recombination events. Results We have validated our pipeline using four sets of Plasmodium falciparum drug resistant data containing 26 clones from 3D7 and Dd2 background strains, identifying an average of 11 single nucleotide variants per clone. We also identify 8 copy number variants with contributions to resistance, and report for the first time that all analyzed amplification events are in tandem. Conclusions The Platypus pipeline provides malaria researchers with a powerful tool to analyze short read sequencing data. It provides an accurate way to detect SNVs using known software packages, and a novel methodology for detection of CNVs, though it does not currently support detection of small indels. We have validated that the pipeline detects known SNVs in a variety of samples while filtering out spurious data. We bundle the methods into a freely available package. PMID:24589256
NASA Astrophysics Data System (ADS)
Juchum, Fabrício Sacramento; Costa, Marco Antônio; Amorim, André Márcio; Corrêa, Ronan Xavier
2008-11-01
Caesalpinia echinata (brazilwood or Pernambuco wood) comprises a complex of three morphological leaf variants, characterized by differences in the number and size of the pinnae and leaflets, and occurring in allopatric and sympatric populations. The present study evaluates the utility of the chloroplast DNA trnL intron in a phylogenetic analysis of the three leaf variants along with other species of Caesalpinia and generic relatives. Our study supports the hypothesis that the name C. echinata designates a species complex and provides evidence that one of the forms, the highly divergent C. echinata large-leafleted variant, represents a distinct taxon.
Larson, Nicholas B; McDonnell, Shannon; Cannon Albright, Lisa; Teerlink, Craig; Stanford, Janet; Ostrander, Elaine A; Isaacs, William B; Xu, Jianfeng; Cooney, Kathleen A; Lange, Ethan; Schleutker, Johanna; Carpten, John D; Powell, Isaac; Bailey-Wilson, Joan E; Cussenot, Olivier; Cancel-Tassin, Geraldine; Giles, Graham G; MacInnis, Robert J; Maier, Christiane; Whittemore, Alice S; Hsieh, Chih-Lin; Wiklund, Fredrik; Catalona, William J; Foulkes, William; Mandal, Diptasri; Eeles, Rosalind; Kote-Jarai, Zsofia; Ackerman, Michael J; Olson, Timothy M; Klein, Christopher J; Thibodeau, Stephen N; Schaid, Daniel J
2017-05-01
Next-generation sequencing technologies have afforded unprecedented characterization of low-frequency and rare genetic variation. Due to low power for single-variant testing, aggregative methods are commonly used to combine observed rare variation within a single gene. Causal variation may also aggregate across multiple genes within relevant biomolecular pathways. Kernel-machine regression and adaptive testing methods for aggregative rare-variant association testing have been demonstrated to be powerful approaches for pathway-level analysis, although these methods tend to be computationally intensive at high-variant dimensionality and require access to complete data. An additional analytical issue in scans of large pathway definition sets is multiple testing correction. Gene set definitions may exhibit substantial genic overlap, and the impact of the resultant correlation in test statistics on Type I error rate control for large agnostic gene set scans has not been fully explored. Herein, we first outline a statistical strategy for aggregative rare-variant analysis using component gene-level linear kernel score test summary statistics as well as derive simple estimators of the effective number of tests for family-wise error rate control. We then conduct extensive simulation studies to characterize the behavior of our approach relative to direct application of kernel and adaptive methods under a variety of conditions. We also apply our method to two case-control studies, respectively, evaluating rare variation in hereditary prostate cancer and schizophrenia. Finally, we provide open-source R code for public use to facilitate easy application of our methods to existing rare-variant analysis results. © 2017 WILEY PERIODICALS, INC.
The functional spectrum of low-frequency coding variation.
Marth, Gabor T; Yu, Fuli; Indap, Amit R; Garimella, Kiran; Gravel, Simon; Leong, Wen Fung; Tyler-Smith, Chris; Bainbridge, Matthew; Blackwell, Tom; Zheng-Bradley, Xiangqun; Chen, Yuan; Challis, Danny; Clarke, Laura; Ball, Edward V; Cibulskis, Kristian; Cooper, David N; Fulton, Bob; Hartl, Chris; Koboldt, Dan; Muzny, Donna; Smith, Richard; Sougnez, Carrie; Stewart, Chip; Ward, Alistair; Yu, Jin; Xue, Yali; Altshuler, David; Bustamante, Carlos D; Clark, Andrew G; Daly, Mark; DePristo, Mark; Flicek, Paul; Gabriel, Stacey; Mardis, Elaine; Palotie, Aarno; Gibbs, Richard
2011-09-14
Rare coding variants constitute an important class of human genetic variation, but are underrepresented in current databases that are based on small population samples. Recent studies show that variants altering amino acid sequence and protein function are enriched at low variant allele frequency, 2 to 5%, but because of insufficient sample size it is not clear if the same trend holds for rare variants below 1% allele frequency. The 1000 Genomes Exon Pilot Project has collected deep-coverage exon-capture data in roughly 1,000 human genes, for nearly 700 samples. Although medical whole-exome projects are currently afoot, this is still the deepest reported sampling of a large number of human genes with next-generation technologies. According to the goals of the 1000 Genomes Project, we created effective informatics pipelines to process and analyze the data, and discovered 12,758 exonic SNPs, 70% of them novel, and 74% below 1% allele frequency in the seven population samples we examined. Our analysis confirms that coding variants below 1% allele frequency show increased population-specificity and are enriched for functional variants. This study represents a large step toward detecting and interpreting low frequency coding variation, clearly lays out technical steps for effective analysis of DNA capture data, and articulates functional and population properties of this important class of genetic variation.
Chen, Fang; He, Jing; Zhang, Jianqi; Chen, Gary K.; Thomas, Venetta; Ambrosone, Christine B.; Bandera, Elisa V.; Berndt, Sonja I.; Bernstein, Leslie; Blot, William J.; Cai, Qiuyin; Carpten, John; Casey, Graham; Chanock, Stephen J.; Cheng, Iona; Chu, Lisa; Deming, Sandra L.; Driver, W. Ryan; Goodman, Phyllis; Hayes, Richard B.; Hennis, Anselm J. M.; Hsing, Ann W.; Hu, Jennifer J.; Ingles, Sue A.; John, Esther M.; Kittles, Rick A.; Kolb, Suzanne; Leske, M. Cristina; Monroe, Kristine R.; Murphy, Adam; Nemesure, Barbara; Neslund-Dudas, Christine; Nyante, Sarah; Ostrander, Elaine A; Press, Michael F.; Rodriguez-Gil, Jorge L.; Rybicki, Ben A.; Schumacher, Fredrick; Stanford, Janet L.; Signorello, Lisa B.; Strom, Sara S.; Stevens, Victoria; Van Den Berg, David; Wang, Zhaoming; Witte, John S.; Wu, Suh-Yuh; Yamamura, Yuko; Zheng, Wei; Ziegler, Regina G.; Stram, Alexander H.; Kolonel, Laurence N.; Marchand, Loïc Le; Henderson, Brian E.; Haiman, Christopher A.; Stram, Daniel O.
2015-01-01
Height has an extremely polygenic pattern of inheritance. Genome-wide association studies (GWAS) have revealed hundreds of common variants that are associated with human height at genome-wide levels of significance. However, only a small fraction of phenotypic variation can be explained by the aggregate of these common variants. In a large study of African-American men and women (n = 14,419), we genotyped and analyzed 966,578 autosomal SNPs across the entire genome using a linear mixed model variance components approach implemented in the program GCTA (Yang et al Nat Genet 2010), and estimated an additive heritability of 44.7% (se: 3.7%) for this phenotype in a sample of evidently unrelated individuals. While this estimated value is similar to that given by Yang et al in their analyses, we remain concerned about two related issues: (1) whether in the complete absence of hidden relatedness, variance components methods have adequate power to estimate heritability when a very large number of SNPs are used in the analysis; and (2) whether estimation of heritability may be biased, in real studies, by low levels of residual hidden relatedness. We addressed the first question in a semi-analytic fashion by directly simulating the distribution of the score statistic for a test of zero heritability with and without low levels of relatedness. The second question was addressed by a very careful comparison of the behavior of estimated heritability for both observed (self-reported) height and simulated phenotypes compared to imputation R2 as a function of the number of SNPs used in the analysis. These simulations help to address the important question about whether today's GWAS SNPs will remain useful for imputing causal variants that are discovered using very large sample sizes in future studies of height, or whether the causal variants themselves will need to be genotyped de novo in order to build a prediction model that ultimately captures a large fraction of the variability of height, and by implication other complex phenotypes. Our overall conclusions are that when study sizes are quite large (5,000 or so) the additive heritability estimate for height is not apparently biased upwards using the linear mixed model; however there is evidence in our simulation that a very large number of causal variants (many thousands) each with very small effect on phenotypic variance will need to be discovered to fill the gap between the heritability explained by known versus unknown causal variants. We conclude that today's GWAS data will remain useful in the future for causal variant prediction, but that finding the causal variants that need to be predicted may be extremely laborious. PMID:26125186
Chen, Fang; He, Jing; Zhang, Jianqi; Chen, Gary K; Thomas, Venetta; Ambrosone, Christine B; Bandera, Elisa V; Berndt, Sonja I; Bernstein, Leslie; Blot, William J; Cai, Qiuyin; Carpten, John; Casey, Graham; Chanock, Stephen J; Cheng, Iona; Chu, Lisa; Deming, Sandra L; Driver, W Ryan; Goodman, Phyllis; Hayes, Richard B; Hennis, Anselm J M; Hsing, Ann W; Hu, Jennifer J; Ingles, Sue A; John, Esther M; Kittles, Rick A; Kolb, Suzanne; Leske, M Cristina; Millikan, Robert C; Monroe, Kristine R; Murphy, Adam; Nemesure, Barbara; Neslund-Dudas, Christine; Nyante, Sarah; Ostrander, Elaine A; Press, Michael F; Rodriguez-Gil, Jorge L; Rybicki, Ben A; Schumacher, Fredrick; Stanford, Janet L; Signorello, Lisa B; Strom, Sara S; Stevens, Victoria; Van Den Berg, David; Wang, Zhaoming; Witte, John S; Wu, Suh-Yuh; Yamamura, Yuko; Zheng, Wei; Ziegler, Regina G; Stram, Alexander H; Kolonel, Laurence N; Le Marchand, Loïc; Henderson, Brian E; Haiman, Christopher A; Stram, Daniel O
2015-01-01
Height has an extremely polygenic pattern of inheritance. Genome-wide association studies (GWAS) have revealed hundreds of common variants that are associated with human height at genome-wide levels of significance. However, only a small fraction of phenotypic variation can be explained by the aggregate of these common variants. In a large study of African-American men and women (n = 14,419), we genotyped and analyzed 966,578 autosomal SNPs across the entire genome using a linear mixed model variance components approach implemented in the program GCTA (Yang et al Nat Genet 2010), and estimated an additive heritability of 44.7% (se: 3.7%) for this phenotype in a sample of evidently unrelated individuals. While this estimated value is similar to that given by Yang et al in their analyses, we remain concerned about two related issues: (1) whether in the complete absence of hidden relatedness, variance components methods have adequate power to estimate heritability when a very large number of SNPs are used in the analysis; and (2) whether estimation of heritability may be biased, in real studies, by low levels of residual hidden relatedness. We addressed the first question in a semi-analytic fashion by directly simulating the distribution of the score statistic for a test of zero heritability with and without low levels of relatedness. The second question was addressed by a very careful comparison of the behavior of estimated heritability for both observed (self-reported) height and simulated phenotypes compared to imputation R2 as a function of the number of SNPs used in the analysis. These simulations help to address the important question about whether today's GWAS SNPs will remain useful for imputing causal variants that are discovered using very large sample sizes in future studies of height, or whether the causal variants themselves will need to be genotyped de novo in order to build a prediction model that ultimately captures a large fraction of the variability of height, and by implication other complex phenotypes. Our overall conclusions are that when study sizes are quite large (5,000 or so) the additive heritability estimate for height is not apparently biased upwards using the linear mixed model; however there is evidence in our simulation that a very large number of causal variants (many thousands) each with very small effect on phenotypic variance will need to be discovered to fill the gap between the heritability explained by known versus unknown causal variants. We conclude that today's GWAS data will remain useful in the future for causal variant prediction, but that finding the causal variants that need to be predicted may be extremely laborious.
Lopez-Doriga, Adriana; Feliubadaló, Lídia; Menéndez, Mireia; Lopez-Doriga, Sergio; Morón-Duran, Francisco D; del Valle, Jesús; Tornero, Eva; Montes, Eva; Cuesta, Raquel; Campos, Olga; Gómez, Carolina; Pineda, Marta; González, Sara; Moreno, Victor; Capellá, Gabriel; Lázaro, Conxi
2014-03-01
Next-generation sequencing (NGS) has revolutionized genomic research and is set to have a major impact on genetic diagnostics thanks to the advent of benchtop sequencers and flexible kits for targeted libraries. Among the main hurdles in NGS are the difficulty of performing bioinformatic analysis of the huge volume of data generated and the high number of false positive calls that could be obtained, depending on the NGS technology and the analysis pipeline. Here, we present the development of a free and user-friendly Web data analysis tool that detects and filters sequence variants, provides coverage information, and allows the user to customize some basic parameters. The tool has been developed to provide accurate genetic analysis of targeted sequencing of common high-risk hereditary cancer genes using amplicon libraries run in a GS Junior System. The Web resource is linked to our own mutation database, to assist in the clinical classification of identified variants. We believe that this tool will greatly facilitate the use of the NGS approach in routine laboratories.
2007-10-01
AD_________________ Award Number: DAMD17-03-1-0774 TITLE: CHEK2 *1100delC Variant and BRCA1/2...NUMBER CHEK2 *1100delC Variant and BRCA1/2-Negative Familial Breast Cancer - A Family- Based Genetic Association Study 5b. GRANT NUMBER DAMD17...association between the CHEK2 *1100delC gene variant and breast cancer among BRCA1/2-negative families. Vital to DNA replication and normal growth of breast
A Likelihood-Based Framework for Association Analysis of Allele-Specific Copy Numbers.
Hu, Y J; Lin, D Y; Sun, W; Zeng, D
2014-10-01
Copy number variants (CNVs) and single nucleotide polymorphisms (SNPs) co-exist throughout the human genome and jointly contribute to phenotypic variations. Thus, it is desirable to consider both types of variants, as characterized by allele-specific copy numbers (ASCNs), in association studies of complex human diseases. Current SNP genotyping technologies capture the CNV and SNP information simultaneously via fluorescent intensity measurements. The common practice of calling ASCNs from the intensity measurements and then using the ASCN calls in downstream association analysis has important limitations. First, the association tests are prone to false-positive findings when differential measurement errors between cases and controls arise from differences in DNA quality or handling. Second, the uncertainties in the ASCN calls are ignored. We present a general framework for the integrated analysis of CNVs and SNPs, including the analysis of total copy numbers as a special case. Our approach combines the ASCN calling and the association analysis into a single step while allowing for differential measurement errors. We construct likelihood functions that properly account for case-control sampling and measurement errors. We establish the asymptotic properties of the maximum likelihood estimators and develop EM algorithms to implement the corresponding inference procedures. The advantages of the proposed methods over the existing ones are demonstrated through realistic simulation studies and an application to a genome-wide association study of schizophrenia. Extensions to next-generation sequencing data are discussed.
Yu, Hui; Zhang, Victor Wei; Stray-Pedersen, Asbjørg; Hanson, Imelda Celine; Forbes, Lisa R; de la Morena, M Teresa; Chinn, Ivan K; Gorman, Elizabeth; Mendelsohn, Nancy J; Pozos, Tamara; Wiszniewski, Wojciech; Nicholas, Sarah K; Yates, Anne B; Moore, Lindsey E; Berge, Knut Erik; Sorte, Hanne; Bayer, Diana K; ALZahrani, Daifulah; Geha, Raif S; Feng, Yanming; Wang, Guoli; Orange, Jordan S; Lupski, James R; Wang, Jing; Wong, Lee-Jun
2016-10-01
Primary immunodeficiency diseases (PIDDs) are inherited disorders of the immune system. The most severe form, severe combined immunodeficiency (SCID), presents with profound deficiencies of T cells, B cells, or both at birth. If not treated promptly, affected patients usually do not live beyond infancy because of infections. Genetic heterogeneity of SCID frequently delays the diagnosis; a specific diagnosis is crucial for life-saving treatment and optimal management. We developed a next-generation sequencing (NGS)-based multigene-targeted panel for SCID and other severe PIDDs requiring rapid therapeutic actions in a clinical laboratory setting. The target gene capture/NGS assay provides an average read depth of approximately 1000×. The deep coverage facilitates simultaneous detection of single nucleotide variants and exonic copy number variants in one comprehensive assessment. Exons with insufficient coverage (<20× read depth) or high sequence homology (pseudogenes) are complemented by amplicon-based sequencing with specific primers to ensure 100% coverage of all targeted regions. Analysis of 20 patient samples with low T-cell receptor excision circle numbers on newborn screening or a positive family history or clinical suspicion of SCID or other severe PIDD identified deleterious mutations in 14 of them. Identified pathogenic variants included both single nucleotide variants and exonic copy number variants, such as hemizygous nonsense, frameshift, and missense changes in IL2RG; compound heterozygous changes in ATM, RAG1, and CIITA; homozygous changes in DCLRE1C and IL7R; and a heterozygous nonsense mutation in CHD7. High-throughput deep sequencing analysis with complete clinical validation greatly increases the diagnostic yield of severe primary immunodeficiency. Establishing a molecular diagnosis enables early immune reconstitution through prompt therapeutic intervention and guides management for improved long-term quality of life. Copyright © 2016 American Academy of Allergy, Asthma & Immunology. Published by Elsevier Inc. All rights reserved.
Dimensionality Assessment of Ordered Polytomous Items with Parallel Analysis
ERIC Educational Resources Information Center
Timmerman, Marieke E.; Lorenzo-Seva, Urbano
2011-01-01
Parallel analysis (PA) is an often-recommended approach for assessment of the dimensionality of a variable set. PA is known in different variants, which may yield different dimensionality indications. In this article, the authors considered the most appropriate PA procedure to assess the number of common factors underlying ordered polytomously…
The Cohesive Population Genetics of Molecular Drive
Ohta, Tomoko; Dover, Gabriel A.
1984-01-01
The long-term population genetics of multigene families is influenced by several biased and unbiased mechanisms of nonreciprocal exchanges (gene conversion, unequal exchanges, transposition) between member genes, often distributed on several chromosomes. These mechanisms cause fluctuations in the copy number of variant genes in an individual and lead to a gradual replacement of an original family of n genes (A) in N number of individuals by a variant gene (a). The process for spreading a variant gene through a family and through a population is called molecular drive. Consideration of the known slow rates of nonreciprocal exchanges predicts that the population variance in the copy number of gene a per individual is small at any given generation during molecular drive. Genotypes at a given generation are expected only to range over a small section of all possible genotypes from one extreme (n number of A) to the other (n number of a). A theory is developed for estimating the size of the population variance by using the concept of identity coefficients. In particular, the variance in the course of spreading of a single mutant gene of a multigene family was investigated in detail, and the theory of identity coefficients at the state of steady decay of genetic variability proved to be useful. Monte Carlo simulations and numerical analysis based on realistic rates of exchange in families of known size reveal the correctness of the theoretical prediction and also assess the effect of bias in turnover. The population dynamics of molecular drive in gradually increasing the mean copy number of a variant gene without the generation of a large variance (population cohesion) is of significance regarding potential interactions between natural selection and molecular drive. PMID:6500260
The cohesive population genetics of molecular drive.
Ohta, T; Dover, G A
1984-10-01
The long-term population genetics of multigene families is influenced by several biased and unbiased mechanisms of nonreciprocal exchanges (gene conversion, unequal exchanges, transposition) between member genes, often distributed on several chromosomes. These mechanisms cause fluctuations in the copy number of variant genes in an individual and lead to a gradual replacement of an original family of n genes (A) in N number of individuals by a variant gene (a). The process for spreading a variant gene through a family and through a population is called molecular drive. Consideration of the known slow rates of nonreciprocal exchanges predicts that the population variance in the copy number of gene a per individual is small at any given generation during molecular drive. Genotypes at a given generation are expected only to range over a small section of all possible genotypes from one extreme (n number of A) to the other (n number of a). A theory is developed for estimating the size of the population variance by using the concept of identity coefficients. In particular, the variance in the course of spreading of a single mutant gene of a multigene family was investigated in detail, and the theory of identity coefficients at the state of steady decay of genetic variability proved to be useful. Monte Carlo simulations and numerical analysis based on realistic rates of exchange in families of known size reveal the correctness of the theoretical prediction and also assess the effect of bias in turnover. The population dynamics of molecular drive in gradually increasing the mean copy number of a variant gene without the generation of a large variance (population cohesion) is of significance regarding potential interactions between natural selection and molecular drive.
Krawitz, Peter M; Schiska, Daniela; Krüger, Ulrike; Appelt, Sandra; Heinrich, Verena; Parkhomchuk, Dmitri; Timmermann, Bernd; Millan, Jose M; Robinson, Peter N; Mundlos, Stefan; Hecht, Jochen; Gross, Manfred
2014-01-01
Usher syndrome is an autosomal recessive disorder characterized both by deafness and blindness. For the three clinical subtypes of Usher syndrome causal mutations in altogether 12 genes and a modifier gene have been identified. Due to the genetic heterogeneity of Usher syndrome, the molecular analysis is predestined for a comprehensive and parallelized analysis of all known genes by next-generation sequencing (NGS) approaches. We describe here the targeted enrichment and deep sequencing for exons of Usher genes and compare the costs and workload of this approach compared to Sanger sequencing. We also present a bioinformatics analysis pipeline that allows us to detect single-nucleotide variants, short insertions and deletions, as well as copy number variations of one or more exons on the same sequence data. Additionally, we present a flexible in silico gene panel for the analysis of sequence variants, in which newly identified genes can easily be included. We applied this approach to a cohort of 44 Usher patients and detected biallelic pathogenic mutations in 35 individuals and monoallelic mutations in eight individuals of our cohort. Thirty-nine of the sequence variants, including two heterozygous deletions comprising several exons of USH2A, have not been reported so far. Our NGS-based approach allowed us to assess single-nucleotide variants, small indels, and whole exon deletions in a single test. The described diagnostic approach is fast and cost-effective with a high molecular diagnostic yield. PMID:25333064
Krawitz, Peter M; Schiska, Daniela; Krüger, Ulrike; Appelt, Sandra; Heinrich, Verena; Parkhomchuk, Dmitri; Timmermann, Bernd; Millan, Jose M; Robinson, Peter N; Mundlos, Stefan; Hecht, Jochen; Gross, Manfred
2014-09-01
Usher syndrome is an autosomal recessive disorder characterized both by deafness and blindness. For the three clinical subtypes of Usher syndrome causal mutations in altogether 12 genes and a modifier gene have been identified. Due to the genetic heterogeneity of Usher syndrome, the molecular analysis is predestined for a comprehensive and parallelized analysis of all known genes by next-generation sequencing (NGS) approaches. We describe here the targeted enrichment and deep sequencing for exons of Usher genes and compare the costs and workload of this approach compared to Sanger sequencing. We also present a bioinformatics analysis pipeline that allows us to detect single-nucleotide variants, short insertions and deletions, as well as copy number variations of one or more exons on the same sequence data. Additionally, we present a flexible in silico gene panel for the analysis of sequence variants, in which newly identified genes can easily be included. We applied this approach to a cohort of 44 Usher patients and detected biallelic pathogenic mutations in 35 individuals and monoallelic mutations in eight individuals of our cohort. Thirty-nine of the sequence variants, including two heterozygous deletions comprising several exons of USH2A, have not been reported so far. Our NGS-based approach allowed us to assess single-nucleotide variants, small indels, and whole exon deletions in a single test. The described diagnostic approach is fast and cost-effective with a high molecular diagnostic yield.
Adaptation and major chromosomal changes in populations of Saccharomyces cerevisiae.
Adams, J; Puskas-Rozsa, S; Simlar, J; Wilke, C M
1992-07-01
Thirteen independent populations of Saccharomyces cerevisiae (nine haploid and four diploid) were maintained in continuous culture for up to approximately 1000 generations, with growth limited by the concentration of organic phosphates in medium buffered at pH 6. Analysis of clones isolated from these populations showed that a number (17) of large-scale chromosomal-length variants and rearrangements were present in the populations at their termination. Nine of the 16 yeast chromosomes were involved in such changes. Few of the changes could be explained by copy-number increases in the structural loci for acid phosphatase. Several considerations concerning the nature and frequency of the chromosome-length variants observed lead us to conclude that they are selectively advantageous.
Implication of common and disease specific variants in CLU, CR1, and PICALM.
Ferrari, Raffaele; Moreno, Jorge H; Minhajuddin, Abu T; O'Bryant, Sid E; Reisch, Joan S; Barber, Robert C; Momeni, Parastoo
2012-08-01
Two recent genome-wide association studies (GWAS) for late onset Alzheimer's disease (LOAD) revealed 3 new genes: clusterin (CLU), phosphatidylinositol binding clathrin assembly protein (PICALM), and complement receptor 1 (CR1). In order to evaluate association with these genome-wide association study-identified genes and to isolate the variants contributing to the pathogenesis of LOAD, we genotyped the top single nucleotide polymorphisms (SNPs), rs11136000 (CLU), rs3818361 (CR1), and rs3851179 (PICALM), and sequenced the entire coding regions of these genes in our cohort of 342 LOAD patients and 277 control subjects. We confirmed the association of rs3851179 (PICALM) (p = 7.4 × 10(-3)) with the disease status. Through sequencing we identified 18 variants in CLU, 3 of which were found exclusively in patients; 8 variants (out of 65) in CR1 gene were only found in patients and the 16 variants identified in PICALM gene were present in both patients and controls. In silico analysis of the variants in PICALM did not predict any damaging effect on the protein. The haplotype analysis of the variants in each gene predicted a common haplotype when the 3 single nucleotide polymorphisms rs11136000 (CLU), rs3818361 (CR1), and rs3851179 (PICALM), respectively, were included. For each gene the haplotype structure and size differed between patients and controls. In conclusion, we confirmed association of CLU, CR1, and PICALM genes with the disease status in our cohort through identification of a number of disease-specific variants among patients through the sequencing of the coding region of these genes. Published by Elsevier Inc.
Preliminary spectrum of genetic variants in familial hypercholesterolemia in Argentina.
Bañares, Virginia G; Corral, Pablo; Medeiros, Ana Margarida; Araujo, María Beatriz; Lozada, Alfredo; Bustamante, Juan; Cerretini, Roxana; López, Graciela; Bourbon, Mafalda; Schreier, Laura E
Familial hypercholesterolemia (FH) is a genetic disorder characterized by elevated low-density lipoprotein cholesterol and early cardiovascular disease. As cardiovascular disease is a leading cause of mortality in Argentina, early identification of patients with FH is of great public health importance. The aim of our study was to identify families with FH and to approximate to the characterization of the genetic spectrum mutations of FH in Argentina. Thirty-three not related index cases were selected with clinical diagnosis of FH. Genetic analysis was performed by sequencing, multiplex ligation-dependent probe amplification, and bioinformatics tools. Twenty genetic variants were identified among 24 cases (73%), 95% on the low-density lipoprotein receptor gene. The only variant on APOB was the R3527Q. Four were novel variants: c.-135C>A, c.170A>C p.(Asp57Ala), c.684G>C p.(Glu228Asp), and c.1895A>T p.(Asn632Ile); the bioinformatics' analysis revealed clear destabilizing effects for 2 of them. The exon 14 presented the highest number of variants (32%). Four variants were observed in more than 1 case and the c.2043C>A p.(Cys681*) was carried by 18% of index cases. Two true homozygotes, 3 compound heterozygotes, and 1 double heterozygote were identified. This study characterizes for the first time in Argentina genetic variants associated with FH and suggest that the allelic heterogeneity of the FH in the country could have 1 relative common low-density lipoprotein receptor mutation. This knowledge is important for the genotype-phenotype correlation and for optimizing both cholesterol-lowering therapies and mutational analysis protocols. In addition, these data contribute to the understanding of the molecular basis of FH in Argentina. Copyright © 2017 National Lipid Association. Published by Elsevier Inc. All rights reserved.
Whole-genome sequencing and genetic variant analysis of a Quarter Horse mare.
Doan, Ryan; Cohen, Noah D; Sawyer, Jason; Ghaffari, Noushin; Johnson, Charlie D; Dindot, Scott V
2012-02-17
The catalog of genetic variants in the horse genome originates from a few select animals, the majority originating from the Thoroughbred mare used for the equine genome sequencing project. The purpose of this study was to identify genetic variants, including single nucleotide polymorphisms (SNPs), insertion/deletion polymorphisms (INDELs), and copy number variants (CNVs) in the genome of an individual Quarter Horse mare sequenced by next-generation sequencing. Using massively parallel paired-end sequencing, we generated 59.6 Gb of DNA sequence from a Quarter Horse mare resulting in an average of 24.7X sequence coverage. Reads were mapped to approximately 97% of the reference Thoroughbred genome. Unmapped reads were de novo assembled resulting in 19.1 Mb of new genomic sequence in the horse. Using a stringent filtering method, we identified 3.1 million SNPs, 193 thousand INDELs, and 282 CNVs. Genetic variants were annotated to determine their impact on gene structure and function. Additionally, we genotyped this Quarter Horse for mutations of known diseases and for variants associated with particular traits. Functional clustering analysis of genetic variants revealed that most of the genetic variation in the horse's genome was enriched in sensory perception, signal transduction, and immunity and defense pathways. This is the first sequencing of a horse genome by next-generation sequencing and the first genomic sequence of an individual Quarter Horse mare. We have increased the catalog of genetic variants for use in equine genomics by the addition of novel SNPs, INDELs, and CNVs. The genetic variants described here will be a useful resource for future studies of genetic variation regulating performance traits and diseases in equids.
A novel recurrent mutation in MITF predisposes to familial and sporadic melanoma
Yokoyama, Satoru; Woods, Susan L.; Boyle, Glen M.; Aoude, Lauren G.; MacGregor, Stuart; Zismann, Victoria; Gartside, Michael; Cust, Anne E.; Haq, Rizwan; Harland, Mark; Taylor, John C.; Duffy, David L.; Holohan, Kelly; Dutton-Regester, Ken; Palmer, Jane M.; Bonazzi, Vanessa; Stark, Mitchell S.; Symmons, Judith; Law, Matthew H.; Schmidt, Christopher; Lanagan, Cathy; O’Connor, Linda; Holland, Elizabeth A.; Schmid, Helen; Maskiell, Judith A.; Jetann, Jodie; Ferguson, Megan; Jenkins, Mark A.; Kefford, Richard F.; Giles, Graham G.; Armstrong, Bruce K.; Aitken, Joanne F.; Hopper, John L.; Whiteman, David C.; Pharoah, Paul D.; Easton, Douglas F.; Dunning, Alison M.; Newton-Bishop, Julia A.; Montgomery, Grant W.; Martin, Nicholas G.; Mann, Graham J.; Bishop, D. Timothy; Tsao, Hensin; Trent, Jeffrey M.; Fisher, David E.; Hayward, Nicholas K.; Brown, Kevin M.
2012-01-01
So far, two familial melanoma genes have been identified, accounting for a minority of genetic risk in families. Mutations in CDKN2A account for approximately 40% of familial cases1, and predisposing mutations in CDK4 have been reported in a very small number of melanoma kindreds2. To identify other familial melanoma genes, here we conducted whole-genome sequencing of probands from several melanoma families, identifying one individual carrying a novel germline variant (coding DNA sequence c.G1075A; protein sequence p.E318K; rs149617956) in the melanoma-lineage-specific oncogene microphthalmia-associated transcription factor (MITF). Although the variant co-segregated with melanoma in some but not all cases in the family, linkage analysis of 31 families subsequently identified to carry the variant generated a log odds ratio (lod) score of 2.7 under a dominant model, indicating E318K as a possible intermediate risk variant. Consistent with this, the E318K variant was significantly associated with melanoma in a large Australian case–control sample. Likewise, it was similarly associated in an independent case–control sample from the United Kingdom. In the Australian sample, the variant allele was significantly over-represented in cases with a family history of melanoma, multiple primary melanomas, or both. The variant allele was also associated with increased naevus count and non-blue eye colour. Functional analysis of E318K showed that MITF encoded by the variant allele had impaired sumoylation and differentially regulated several MITF targets. These data indicate that MITF is a melanoma-predisposition gene and highlight the utility of whole-genome sequencing to identify novel rare variants associated with disease susceptibility. PMID:22080950
RefCNV: Identification of Gene-Based Copy Number Variants Using Whole Exome Sequencing.
Chang, Lun-Ching; Das, Biswajit; Lih, Chih-Jian; Si, Han; Camalier, Corinne E; McGregor, Paul M; Polley, Eric
2016-01-01
With rapid advances in DNA sequencing technologies, whole exome sequencing (WES) has become a popular approach for detecting somatic mutations in oncology studies. The initial intent of WES was to characterize single nucleotide variants, but it was observed that the number of sequencing reads that mapped to a genomic region correlated with the DNA copy number variants (CNVs). We propose a method RefCNV that uses a reference set to estimate the distribution of the coverage for each exon. The construction of the reference set includes an evaluation of the sources of variability in the coverage distribution. We observed that the processing steps had an impact on the coverage distribution. For each exon, we compared the observed coverage with the expected normal coverage. Thresholds for determining CNVs were selected to control the false-positive error rate. RefCNV prediction correlated significantly (r = 0.96-0.86) with CNV measured by digital polymerase chain reaction for MET (7q31), EGFR (7p12), or ERBB2 (17q12) in 13 tumor cell lines. The genome-wide CNV analysis showed a good overall correlation (Spearman's coefficient = 0.82) between RefCNV estimation and publicly available CNV data in Cancer Cell Line Encyclopedia. RefCNV also showed better performance than three other CNV estimation methods in genome-wide CNV analysis.
Santos, Sara; Bastos, Estela; Baptista, Cláudia S.; Sá, Daniela; Caloustian, Christophe; Guedes-Pinto, Henrique; Gärtner, Fátima; Gut, Ivo G.; Chaves, Raquel
2012-01-01
The human ERBB2 proto-oncogene is widely considered a key gene involved in human breast cancer onset and progression. Among spontaneous tumors, mammary tumors are the most frequent cause of cancer death in cats and second most frequent in humans. In fact, naturally occurring tumors in domestic animals, more particularly cat mammary tumors, have been proposed as a good model for human breast cancer, but critical genetic and molecular information is still scarce. The aims of this study include the analysis of the cat ERBB2 gene partial sequences (between exon 17 and 20) in order to characterize a normal and a mammary lesion heterogeneous populations. Cat genomic DNA was extracted from normal frozen samples (n = 16) and from frozen and formalin-fixed paraffin-embedded mammary lesion samples (n = 41). We amplified and sequenced two cat ERBB2 DNA fragments comprising exons 17 to 20. It was possible to identify five sequence variants and six haplotypes in the total population. Two sequence variants and two haplotypes show to be specific for cat mammary tumor samples. Bioinformatics analysis predicts that four of the sequence variants can produce alternative transcripts or activate cryptic splicing sites. Also, a possible association was identified between clinicopathological traits and the variant haplotypes. As far as we know, this is the first attempt to examine ERBB2 genetic variations in cat mammary genome and its possible association with the onset and progression of cat mammary tumors. The demonstration of a possible association between primary tumor size (one of the two most important prognostic factors) and the number of masses with the cat ERBB2 variant haplotypes reveal the importance of the analysis of this gene in veterinary medicine. PMID:22489125
Gonzaga-Jauregui, Claudia; Harel, Tamar; Gambin, Tomasz; Kousi, Maria; Griffin, Laurie B.; Francescatto, Ludmila; Ozes, Burcak; Karaca, Ender; Jhangiani, Shalini; Bainbridge, Matthew N.; Lawson, Kim S.; Pehlivan, Davut; Okamoto, Yuji; Withers, Marjorie; Mancias, Pedro; Slavotinek, Anne; Reitnauer, Pamela J; Goksungur, Meryem T.; Shy, Michael; Crawford, Thomas O.; Koenig, Michel; Willer, Jason; Flores, Brittany N.; Pediaditrakis, Igor; Us, Onder; Wiszniewski, Wojciech; Parman, Yesim; Antonellis, Anthony; Muzny, Donna M.; Katsanis, Nicholas; Battaloglu, Esra; Boerwinkle, Eric; Gibbs, Richard A.; Lupski, James R.
2015-01-01
Charcot-Marie-Tooth (CMT) disease is a clinically and genetically heterogeneous distal symmetric polyneuropathy. Whole-exome sequencing (WES) of 40 individuals from 37 unrelated families with CMT-like peripheral neuropathy refractory to molecular diagnosis identified apparent causal mutations in ~45% (17/37) of families. Three candidate disease genes are proposed, supported by a combination of genetic and in vivo studies. Aggregate analysis of mutation data revealed a significantly increased number of rare variants across 58 neuropathy associated genes in subjects versus controls; confirmed in a second ethnically discrete neuropathy cohort, suggesting mutation burden potentially contributes to phenotypic variability. Neuropathy genes shown to have highly penetrant Mendelizing variants (HMPVs) and implicated by burden in families were shown to interact genetically in a zebrafish assay exacerbating the phenotype established by the suppression of single genes. Our findings suggest that the combinatorial effect of rare variants contributes to disease burden and variable expressivity. PMID:26257172
Hasumi, Hisashi; Furuya, Mitsuko; Tatsuno, Kenji; Yamamoto, Shogo; Baba, Masaya; Hasumi, Yukiko; Isono, Yasuhiro; Suzuki, Kae; Jikuya, Ryosuke; Otake, Shinji; Muraoka, Kentaro; Osaka, Kimito; Hayashi, Narihiko; Makiyama, Kazuhide; Miyoshi, Yasuhide; Kondo, Keiichi; Nakaigawa, Noboru; Kawahara, Takashi; Izumi, Koji; Teranishi, Junichi; Yumura, Yasushi; Uemura, Hiroji; Nagashima, Yoji; Metwalli, Adam R; Schmidt, Laura S; Aburatani, Hiroyuki; Linehan, W Marston; Yao, Masahiro
2018-05-14
Birt-Hogg-Dubé (BHD) syndrome is a hereditary kidney cancer syndrome, which predisposes patients to develop kidney cancer, cutaneous fibrofolliculomas and pulmonary cysts. The responsible gene FLCN is a tumor suppressor for kidney cancer which plays an important role in energy homeostasis through the regulation of mitochondrial oxidative metabolism. However, the process by which FLCN-deficiency leads to renal tumorigenesis is unclear. In order to clarify molecular pathogenesis of BHD-associated kidney cancer, we conducted whole-exome sequencing analysis using next-generation sequencing technology as well as metabolite analysis using LC/MS and GC/MS. Whole-exome sequencing analysis of BHD-associated kidney cancer revealed that copy number variations (CNV) of BHD-associated kidney cancer are considerably different from those already reported in sporadic cases. In somatic variant analysis, very few variants were commonly observed in BHD-associated kidney cancer; however, variants in chromatin remodeling genes were frequently observed in BHD-associated kidney cancer (17/29 tumors, 59%). Metabolite analysis of BHD-associated kidney cancer revealed metabolic reprogramming towards upregulated redox regulation which may neutralize reactive oxygen species potentially produced from mitochondria with increased respiratory capacity under FLCN-deficiency. BHD-associated kidney cancer displays unique molecular characteristics which are completely different from sporadic kidney cancer, providing mechanistic insight into tumorigenesis under FLCN-deficiency as well as a foundation for development of novel therapeutics for kidney cancer.
Preconception Carrier Screening by Genome Sequencing: Results from the Clinical Laboratory.
Punj, Sumit; Akkari, Yassmine; Huang, Jennifer; Yang, Fei; Creason, Allison; Pak, Christine; Potter, Amiee; Dorschner, Michael O; Nickerson, Deborah A; Robertson, Peggy D; Jarvik, Gail P; Amendola, Laura M; Schleit, Jennifer; Simpson, Dana Kostiner; Rope, Alan F; Reiss, Jacob; Kauffman, Tia; Gilmore, Marian J; Himes, Patricia; Wilfond, Benjamin; Goddard, Katrina A B; Richards, C Sue
2018-06-07
Advances in sequencing technologies permit the analysis of a larger selection of genes for preconception carrier screening. The study was designed as a sequential carrier screen using genome sequencing to analyze 728 gene-disorder pairs for carrier and medically actionable conditions in 131 women and their partners (n = 71) who were planning a pregnancy. We report here on the clinical laboratory results from this expanded carrier screening program. Variants were filtered and classified using the latest American College of Medical Genetics and Genomics (ACMG) guideline; only pathogenic and likely pathogenic variants were confirmed by orthologous methods before being reported. Novel missense variants were classified as variants of uncertain significance. We reported 304 variants in 202 participants. Twelve carrier couples (12/71 couples tested) were identified for common conditions; eight were carriers for hereditary hemochromatosis. Although both known and novel variants were reported, 48% of all reported variants were missense. For novel splice-site variants, RNA-splicing assays were performed to aid in classification. We reported ten copy-number variants and five variants in non-coding regions. One novel variant was reported in F8, associated with hemophilia A; prenatal testing showed that the male fetus harbored this variant and the neonate suffered a life-threatening hemorrhage which was anticipated and appropriately managed. Moreover, 3% of participants had variants that were medically actionable. Compared with targeted mutation screening, genome sequencing improves the sensitivity of detecting clinically significant variants. While certain novel variant interpretation remains challenging, the ACMG guidelines are useful to classify variants in a healthy population. Copyright © 2018 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
Kundu, Kunal; Pal, Lipika R; Yin, Yizhou; Moult, John
2017-09-01
The use of gene panel sequence for diagnostic and prognostic testing is now widespread, but there are so far few objective tests of methods to interpret these data. We describe the design and implementation of a gene panel sequencing data analysis pipeline (VarP) and its assessment in a CAGI4 community experiment. The method was applied to clinical gene panel sequencing data of 106 patients, with the goal of determining which of 14 disease classes each patient has and the corresponding causative variant(s). The disease class was correctly identified for 36 cases, including 10 where the original clinical pipeline did not find causative variants. For a further seven cases, we found strong evidence of an alternative disease to that tested. Many of the potentially causative variants are missense, with no previous association with disease, and these proved the hardest to correctly assign pathogenicity or otherwise. Post analysis showed that three-dimensional structure data could have helped for up to half of these cases. Over-reliance on HGMD annotation led to a number of incorrect disease assignments. We used a largely ad hoc method to assign probabilities of pathogenicity for each variant, and there is much work still to be done in this area. © 2017 The Authors. **Human Mutation published by Wiley Periodicals, Inc.
A novel EML4-ALK variant: exon 6 of EML4 fused to exon 19 of ALK.
Penzel, Roland; Schirmacher, Peter; Warth, Arne
2012-07-01
Cytotoxic chemotherapy remains the mainstay of treatment for most patients with advanced disease. Recently, anaplastic lymphoma kinase (ALK) expression as a major target for successful treatment with ALK inhibitors was detected in a subset of non-small-cell lung carcinomas, usually as a result of echinoderm microtubule-associated protein-like 4 (EML4)-ALK rearrangements. Although the chromosomal breakpoint within the EML4 gene varied, the breakpoint within ALK was most frequently reported within intron 19 or rarely in exon 20. Therefore, the different EML4-ALK variants so far contain the same 3' portion of ALK starting with exon 20. Here, we report a novel EML4-ALK variant detected by reverse transcription polymerase chain reaction analysis. Subsequent sequencing revealed an EML4-ALK fusion variant in which exon 6 of EML4 was fused to exon 19 of ALK. It occurred in a predominant solid pulmonary adenocarcinoma of a 65-year-old woman with a clear split signal of ALK in fluorescence in situ hybridization analysis and a weakly homogeneous ALK expression in immunohistochemical staining. Because of the growing number of fusion variants a primary reverse transcription polymerase chain reaction-based screening for ALK-positive non-small-cell lung carcinoma patients may not be sufficient for predictive diagnostics but transcript-based approaches and sequencing of ALK fusion variants might finally contribute to an optimized selection of patients.
Dynamic response analysis of structure under time-variant interval process model
NASA Astrophysics Data System (ADS)
Xia, Baizhan; Qin, Yuan; Yu, Dejie; Jiang, Chao
2016-10-01
Due to the aggressiveness of the environmental factor, the variation of the dynamic load, the degeneration of the material property and the wear of the machine surface, parameters related with the structure are distinctly time-variant. Typical model for time-variant uncertainties is the random process model which is constructed on the basis of a large number of samples. In this work, we propose a time-variant interval process model which can be effectively used to deal with time-variant uncertainties with limit information. And then two methods are presented for the dynamic response analysis of the structure under the time-variant interval process model. The first one is the direct Monte Carlo method (DMCM) whose computational burden is relative high. The second one is the Monte Carlo method based on the Chebyshev polynomial expansion (MCM-CPE) whose computational efficiency is high. In MCM-CPE, the dynamic response of the structure is approximated by the Chebyshev polynomials which can be efficiently calculated, and then the variational range of the dynamic response is estimated according to the samples yielded by the Monte Carlo method. To solve the dependency phenomenon of the interval operation, the affine arithmetic is integrated into the Chebyshev polynomial expansion. The computational effectiveness and efficiency of MCM-CPE is verified by two numerical examples, including a spring-mass-damper system and a shell structure.
Pathway-based variant enrichment analysis on the example of dilated cardiomyopathy.
Backes, Christina; Meder, Benjamin; Lai, Alan; Stoll, Monika; Rühle, Frank; Katus, Hugo A; Keller, Andreas
2016-01-01
Genome-wide association (GWA) studies have significantly contributed to the understanding of human genetic variation and its impact on clinical traits. Frequently only a limited number of highly significant associations were considered as biologically relevant. Increasingly, network analysis of affected genes is used to explore the potential role of the genetic background on disease mechanisms. Instead of first determining affected genes or calculating scores for genes and performing pathway analysis on the gene level, we integrated both steps and directly calculated enrichment on the genetic variant level. The respective approach has been tested on dilated cardiomyopathy (DCM) GWA data as showcase. To compute significance values, 5000 permutation tests were carried out and p values were adjusted for multiple testing. For 282 KEGG pathways, we computed variant enrichment scores and significance values. Of these, 65 were significant. Surprisingly, we discovered the "nucleotide excision repair" and "tuberculosis" pathways to be most significantly associated with DCM (p = 10(-9)). The latter pathway is driven by genes of the HLA-D antigen group, a finding that closely resembles previous discoveries made by expression quantitative trait locus analysis in the context of DCM-GWA. Next, we implemented a sub-network-based analysis, which searches for affected parts of KEGG, however, independent on the pre-defined pathways. Here, proteins of the contractile apparatus of cardiac cells as well as the FAS sub-network were found to be affected by common polymorphisms in DCM. In this work, we performed enrichment analysis directly on variants, leveraging the potential to discover biological information in thousands of published GWA studies. The applied approach is cutoff free and considers a ranked list of genetic variants as input.
Design of DNA pooling to allow incorporation of covariates in rare variants analysis.
Guan, Weihua; Li, Chun
2014-01-01
Rapid advances in next-generation sequencing technologies facilitate genetic association studies of an increasingly wide array of rare variants. To capture the rare or less common variants, a large number of individuals will be needed. However, the cost of a large scale study using whole genome or exome sequencing is still high. DNA pooling can serve as a cost-effective approach, but with a potential limitation that the identity of individual genomes would be lost and therefore individual characteristics and environmental factors could not be adjusted in association analysis, which may result in power loss and a biased estimate of genetic effect. For case-control studies, we propose a design strategy for pool creation and an analysis strategy that allows covariate adjustment, using multiple imputation technique. Simulations show that our approach can obtain reasonable estimate for genotypic effect with only slight loss of power compared to the much more expensive approach of sequencing individual genomes. Our design and analysis strategies enable more powerful and cost-effective sequencing studies of complex diseases, while allowing incorporation of covariate adjustment.
Yoon, Cindy W; Kim, Young-Eun; Seo, Sang Won; Ki, Chang-Seok; Choi, Seong Hye; Kim, Jong-Won; Na, Duk L
2015-08-01
Although cerebral autosomal dominant arteriopathy with subcortical infarcts and leukoencephalopathy (CADASIL) is thought to be a common form of hereditary subcortical vascular cognitive impairment (SVCI), there is little data on the frequency of NOTCH3 variants in SVCI patients. We prospectively screened for NOTCH3 variants in consecutive SVCI patients who underwent brain magnetic resonance imaging and amyloid positron emission tomography as well as sequence analysis for mutational hotspots in the NOTCH3 gene. Among 117 patients with SVCI, 16 patients had either known mutations or variants of unknown significance in the NOTCH3 gene. There were no differences in clinical and neuroimaging features between SVCI patients with and without NOTCH3 variants, only except for a higher number of deep microbleeds in SVCI patients with NOTCH3 variants. Our findings suggest that there is a phenotypic entity of NOTCH3 variant that is similar to that of sporadic SVCI but not of typical CADASIL. Notably, 2 SVCI patients with NOTCH3 mutations showed significant amyloid burden, which challenges the prevailing concept that CADASIL represents the genetic model of pure small vessel disease. Copyright © 2015 Elsevier Inc. All rights reserved.
Krämer, Andreas; Shah, Sohela; Rebres, Robert Anthony; Tang, Susan; Richards, Daniel Rene
2017-08-11
Next-generation sequencing is widely used to identify disease-causing variants in patients with rare genetic disorders. Identifying those variants from whole-genome or exome data can be both scientifically challenging and time consuming. A significant amount of time is spent on variant annotation, and interpretation. Fully or partly automated solutions are therefore needed to streamline and scale this process. We describe Phenotype Driven Ranking (PDR), an algorithm integrated into Ingenuity Variant Analysis, that uses observed patient phenotypes to prioritize diseases and genes in order to expedite causal-variant discovery. Our method is based on a network of phenotype-disease-gene relationships derived from the QIAGEN Knowledge Base, which allows for efficient computational association of phenotypes to implicated diseases, and also enables scoring and ranking. We have demonstrated the utility and performance of PDR by applying it to a number of clinical rare-disease cases, where the true causal gene was known beforehand. It is also shown that PDR compares favorably to a representative alternative tool.
Park, Ji Soo; Nam, Eun Ji; Park, Hyung Seok; Han, Jung Woo; Lee, Jung-Yun; Kim, Jieun; Kim, Tae Il; Lee, Seung-Tae
2017-10-01
Comparison of variant frequencies in the general population has become an essential part of the American College of Medical Genetics and Genomics (ACMG) standards and guidelines for interpreting sequence variants. We determined the optimal number of relevant ethnic controls that should be used to accurately calculate the odds ratio (OR) of genetic variants. Using the ACMG guidelines, we reclassified BRCA1 and BRCA2 mutations and variants of unknown significance in 745 Korean patients susceptible to hereditary breast and ovarian cancer compared with 1,314 Korean population controls. We observed that the ORs were falsely inflated when we analyzed several variants using non-Korean population data. Our simulation indicated that the number of controls needed for the lower limit of a 95% confidence interval to exceed 1.0 varied according to the frequency of the variant in each patient group, with more than 820 controls needed for a variant existing in 1% of cases. Using a sufficient number of relevant population data, we could efficiently classify variants and identified the BRCA1 p.Leu1780Pro mutation as a possible pathogenic founder mutation in Korean patients. Our study suggests that BRCA1 p.Leu1780Pro is a novel pathogenic mutation found in Korean patients. We also determined the optimal number of relevant ethnic controls needed for accurate variant classification according to the ACMG guidelines.
Yi, SoJeong; An, Hyungmi; Lee, Howard; Lee, Sangin; Ieiri, Ichiro; Lee, Youngjo; Cho, Joo-Youn; Hirota, Takeshi; Fukae, Masato; Yoshida, Kenji; Nagatsuka, Shinichiro; Kimura, Miyuki; Irie, Shin; Sugiyama, Yuichi; Shin, Dong Wan; Lim, Kyoung Soo; Chung, Jae-Yong; Yu, Kyung-Sang; Jang, In-Jin
2014-10-01
Interethnic differences in genetic polymorphism in genes encoding drug-metabolizing enzymes and transporters are one of the major factors that cause ethnic differences in drug response. This study aimed to investigate genetic polymorphisms in genes involved in drug metabolism, transport, and excretion among Korean, Japanese, and Chinese populations, the three major East Asian ethnic groups. The frequencies of 1936 variants representing 225 genes encoding drug-metabolizing enzymes and transporters were determined from 786 healthy participants (448 Korean, 208 Japanese, and 130 Chinese) using the Affymetrix Drug-Metabolizing Enzymes and Transporters Plus microarray. To compare allele or genotype frequencies in the high-dimensional data among the three East Asian ethnic groups, multiple testing, principal component analysis (PCA), and regularized multinomial logit model through least absolute shrinkage and selection operator were used. On microarray analysis, 1071 of 1936 variants (>50% of markers) were found to be monomorphic. In a large number of genetic variants, the fixation index and Pearson's correlation coefficient of minor allele frequencies were less than 0.034 and greater than 0.95, respectively, among the three ethnic groups. PCA identified 47 genetic variants with multiple testing, but was unable to discriminate ethnic groups by the first three components. Multinomial least absolute shrinkage and selection operator analysis identified 269 genetic variants that showed different frequencies among the three ethnic groups. However, none of those variants distinguished between the three ethnic groups during subsequent PCA. Korean, Japanese, and Chinese populations are not pharmacogenetically distant from one another, at least with regard to drug disposition, metabolism, and elimination.
SG-ADVISER CNV: copy-number variant annotation and interpretation.
Erikson, Galina A; Deshpande, Neha; Kesavan, Balachandar G; Torkamani, Ali
2015-09-01
Copy-number variants have been associated with a variety of diseases, especially cancer, autism, schizophrenia, and developmental delay. The majority of clinically relevant events occur de novo, necessitating the interpretation of novel events. In this light, we present the Scripps Genome ADVISER CNV annotation pipeline and Web server, which aims to fill the gap between copy number variant detection and interpretation by performing in-depth annotations and functional predictions for copy number variants. The Scripps Genome ADVISER CNV suite includes a Web server interface to a high-performance computing environment for calculations of annotations and a table-based user interface that allows for the execution of numerous annotation-based variant filtration strategies and statistics. The annotation results include details regarding location, impact on the coding portion of genes, allele frequency information (including allele frequencies from the Scripps Wellderly cohort), and overlap information with other reference data sets (including ClinVar, DGV, DECIPHER). A summary variant classification is produced (ADVISER score) based on the American College of Medical Genetics and Genomics scoring guidelines. We demonstrate >90% sensitivity/specificity for detection of pathogenic events. Scripps Genome ADVISER CNV is designed to allow users with no prior bioinformatics expertise to manipulate large volumes of copy-number variant data. Scripps Genome ADVISER CNV is available at http://genomics.scripps.edu/ADVISER/.
Identifying genetic variants that affect viability in large cohorts
Berisa, Tomaz; Day, Felix R.; Perry, John R. B.
2017-01-01
A number of open questions in human evolutionary genetics would become tractable if we were able to directly measure evolutionary fitness. As a step towards this goal, we developed a method to examine whether individual genetic variants, or sets of genetic variants, currently influence viability. The approach consists in testing whether the frequency of an allele varies across ages, accounting for variation in ancestry. We applied it to the Genetic Epidemiology Research on Adult Health and Aging (GERA) cohort and to the parents of participants in the UK Biobank. Across the genome, we found only a few common variants with large effects on age-specific mortality: tagging the APOE ε4 allele and near CHRNA3. These results suggest that when large, even late-onset effects are kept at low frequency by purifying selection. Testing viability effects of sets of genetic variants that jointly influence 1 of 42 traits, we detected a number of strong signals. In participants of the UK Biobank of British ancestry, we found that variants that delay puberty timing are associated with a longer parental life span (P~6.2 × 10−6 for fathers and P~2.0 × 10−3 for mothers), consistent with epidemiological studies. Similarly, variants associated with later age at first birth are associated with a longer maternal life span (P~1.4 × 10−3). Signals are also observed for variants influencing cholesterol levels, risk of coronary artery disease (CAD), body mass index, as well as risk of asthma. These signals exhibit consistent effects in the GERA cohort and among participants of the UK Biobank of non-British ancestry. We also found marked differences between males and females, most notably at the CHRNA3 locus, and variants associated with risk of CAD and cholesterol levels. Beyond our findings, the analysis serves as a proof of principle for how upcoming biomedical data sets can be used to learn about selection effects in contemporary humans. PMID:28873088
Abrahams, M-R; Anderson, J A; Giorgi, E E; Seoighe, C; Mlisana, K; Ping, L-H; Athreya, G S; Treurnicht, F K; Keele, B F; Wood, N; Salazar-Gonzalez, J F; Bhattacharya, T; Chu, H; Hoffman, I; Galvin, S; Mapanje, C; Kazembe, P; Thebus, R; Fiscus, S; Hide, W; Cohen, M S; Karim, S Abdool; Haynes, B F; Shaw, G M; Hahn, B H; Korber, B T; Swanstrom, R; Williamson, C
2009-04-01
Identifying the specific genetic characteristics of successfully transmitted variants may prove central to the development of effective vaccine and microbicide interventions. Although human immunodeficiency virus transmission is associated with a population bottleneck, the extent to which different factors influence the diversity of transmitted viruses is unclear. We estimate here the number of transmitted variants in 69 heterosexual men and women with primary subtype C infections. From 1,505 env sequences obtained using a single genome amplification approach we show that 78% of infections involved single variant transmission and 22% involved multiple variant transmissions (median of 3). We found evidence for mutations selected for cytotoxic-T-lymphocyte or antibody escape and a high prevalence of recombination in individuals infected with multiple variants representing another potential escape pathway in these individuals. In a combined analysis of 171 subtype B and C transmission events, we found that infection with more than one variant does not follow a Poisson distribution, indicating that transmission of individual virions cannot be seen as independent events, each occurring with low probability. While most transmissions resulted from a single infectious unit, multiple variant transmissions represent a significant fraction of transmission events, suggesting that there may be important mechanistic differences between these groups that are not yet understood.
Comprehensive genetic testing for female and male infertility using next-generation sequencing.
Patel, Bonny; Parets, Sasha; Akana, Matthew; Kellogg, Gregory; Jansen, Michael; Chang, Chihyu; Cai, Ying; Fox, Rebecca; Niknazar, Mohammad; Shraga, Roman; Hunter, Colby; Pollock, Andrew; Wisotzkey, Robert; Jaremko, Malgorzata; Bisignano, Alex; Puig, Oscar
2018-05-19
To develop a comprehensive genetic test for female and male infertility in support of medical decisions during assisted reproductive technology (ART) protocols. We developed a next-generation sequencing (NGS) gene panel consisting of 87 genes including promoters, 5' and 3' untranslated regions, exons, and selected introns. In addition, sex chromosome aneuploidies and Y chromosome microdeletions were analyzed concomitantly using the same panel. The NGS panel was analytically validated by retrospective analysis of 118 genomic DNA samples with known variants in loci representative of female and male infertility. Our results showed analytical accuracy of > 99%, with > 98% sensitivity for single-nucleotide variants (SNVs) and > 91% sensitivity for insertions/deletions (indels). Clinical sensitivity was assessed with samples containing variants representative of male and female infertility, and it was 100% for SNVs/indels, CFTR IVS8-5T variants, sex chromosome aneuploidies, and copy number variants (CNVs) and > 93% for Y chromosome microdeletions. Cost analysis shows potential savings when comparing this single NGS assay with the standard approach, which includes multiple assays. A single, comprehensive, NGS panel can simplify the ordering process for healthcare providers, reduce turnaround time, and lower the overall cost of testing for genetic assessment of infertility in females and males, while maintaining accuracy.
Hackmann, Karl; Kuhlee, Franziska; Betcheva-Krajcir, Elitza; Kahlert, Anne-Karin; Mackenroth, Luisa; Klink, Barbara; Di Donato, Nataliya; Tzschach, Andreas; Kast, Karin; Wimberger, Pauline; Schrock, Evelin; Rump, Andreas
2016-10-01
Detection of predisposing copy number variants (CNV) in 330 families affected with hereditary breast and ovarian cancer (HBOC). In order to complement mutation detection with Illumina's TruSight Cancer panel, we designed a customized high-resolution 8 × 60k array for CGH (aCGH) that covers all 94 genes from the panel. Copy number variants with immediate clinical relevance were detected in 12 families (3.6%). Besides 3 known CNVs in CHEK2, RAD51C, and BRCA1, we identified 3 novel pathogenic CNVs in BRCA1 (deletion of exons 4-13, deletion of exons 12-18) and ATM (deletion exons 57-63) plus an intragenic duplication of BRCA2 (exons 3-11) and an intronic BRCA1 variant with unknown pathogenicity. The precision of high-resolution aCGH enabled straight forward breakpoint amplification of a BRCA1 deletion which subsequently allowed for fast and economic CNV verification in family members of the index patient. Furthermore, we used our aCGH data to validate an algorithm that was able to detect all identified copy number changes from next-generation sequencing (NGS) data. Copy number detection is a mandatory analysis in HBOC families at least if no predisposing mutations were found by sequencing. Currently, high-resolution array CGH is our first choice of method of analysis due to unmatched detection precision. Although it seems possible to detect CNV from sequencing data, there currently is no satisfying tool to do so in a routine diagnostic setting.
Rosenthal, E T; Bowles, K R; Pruss, D; van Kan, A; Vail, P J; McElroy, H; Wenstrup, R J
2015-12-01
Based on current consensus guidelines and standard practice, many genetic variants detected in clinical testing are classified as disease causing based on their predicted impact on the normal expression or function of the gene in the absence of additional data. However, our laboratory has identified a subset of such variants in hereditary cancer genes for which compelling contradictory evidence emerged after the initial evaluation following the first observation of the variant. Three representative examples of variants in BRCA1, BRCA2 and MSH2 that are predicted to disrupt splicing, prematurely truncate the protein, or remove the start codon were evaluated for pathogenicity by analyzing clinical data with multiple classification algorithms. Available clinical data for all three variants contradicts the expected pathogenic classification. These variants illustrate potential pitfalls associated with standard approaches to variant classification as well as the challenges associated with monitoring data, updating classifications, and reporting potentially contradictory interpretations to the clinicians responsible for translating test outcomes to appropriate clinical action. It is important to address these challenges now as the model for clinical testing moves toward the use of large multi-gene panels and whole exome/genome analysis, which will dramatically increase the number of genetic variants identified. © 2015 The Authors. Clinical Genetics published by John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Whole genome sequences of a male and female supercentenarian, ages greater than 114 years.
Sebastiani, Paola; Riva, Alberto; Montano, Monty; Pham, Phillip; Torkamani, Ali; Scherba, Eugene; Benson, Gary; Milton, Jacqueline N; Baldwin, Clinton T; Andersen, Stacy; Schork, Nicholas J; Steinberg, Martin H; Perls, Thomas T
2011-01-01
Supercentenarians (age 110+ years old) generally delay or escape age-related diseases and disability well beyond the age of 100 and this exceptional survival is likely to be influenced by a genetic predisposition that includes both common and rare genetic variants. In this report, we describe the complete genomic sequences of male and female supercentenarians, both age >114 years old. We show that: (1) the sequence variant spectrum of these two individuals' DNA sequences is largely comparable to existing non-supercentenarian genomes; (2) the two individuals do not appear to carry most of the well-established human longevity enabling variants already reported in the literature; (3) they have a comparable number of known disease-associated variants relative to most human genomes sequenced to-date; (4) approximately 1% of the variants these individuals possess are novel and may point to new genes involved in exceptional longevity; and (5) both individuals are enriched for coding variants near longevity-associated variants that we discovered through a large genome-wide association study. These analyses suggest that there are both common and rare longevity-associated variants that may counter the effects of disease-predisposing variants and extend lifespan. The continued analysis of the genomes of these and other rare individuals who have survived to extremely old ages should provide insight into the processes that contribute to the maintenance of health during extreme aging.
Whole Genome Sequences of a Male and Female Supercentenarian, Ages Greater than 114 Years
Sebastiani, Paola; Riva, Alberto; Montano, Monty; Pham, Phillip; Torkamani, Ali; Scherba, Eugene; Benson, Gary; Milton, Jacqueline N.; Baldwin, Clinton T.; Andersen, Stacy; Schork, Nicholas J.; Steinberg, Martin H.; Perls, Thomas T.
2012-01-01
Supercentenarians (age 110+ years old) generally delay or escape age-related diseases and disability well beyond the age of 100 and this exceptional survival is likely to be influenced by a genetic predisposition that includes both common and rare genetic variants. In this report, we describe the complete genomic sequences of male and female supercentenarians, both age >114 years old. We show that: (1) the sequence variant spectrum of these two individuals’ DNA sequences is largely comparable to existing non-supercentenarian genomes; (2) the two individuals do not appear to carry most of the well-established human longevity enabling variants already reported in the literature; (3) they have a comparable number of known disease-associated variants relative to most human genomes sequenced to-date; (4) approximately 1% of the variants these individuals possess are novel and may point to new genes involved in exceptional longevity; and (5) both individuals are enriched for coding variants near longevity-associated variants that we discovered through a large genome-wide association study. These analyses suggest that there are both common and rare longevity-associated variants that may counter the effects of disease-predisposing variants and extend lifespan. The continued analysis of the genomes of these and other rare individuals who have survived to extremely old ages should provide insight into the processes that contribute to the maintenance of health during extreme aging. PMID:22303384
Clinicopathologic features and management of blastoid variant of mantle cell lymphoma.
Shrestha, Rajesh; Bhatt, Vijaya Raj; Guru Murthy, Guru Subramanian; Armitage, James O
2015-01-01
The blastoid variant of mantle cell lymphoma (MCL), which accounts for less than one-third of MCL, may arise de novo or as a transformation from the classical form of MCL. Blastoid variant, which predominantly involves men in their sixth decade, has frequent extranodal involvement (40-60%), stage IV disease (up to 85%) and central nervous system (CNS) involvement. Diagnosis relies on morphological features and is challenging. Immunophenotyping may display CD23 and CD10 positivity and CD5 negativity in a subset. Genetic analysis demonstrates an increased number of complex genetic alterations. Blastoid variant responds poorly to conventional chemotherapy and has a short duration of response. Although the optimal therapy remains to be established, CNS prophylaxis and the use of aggressive immunochemotherapy followed by autologous stem cell transplant may prolong the remission rate and survival. Further studies are crucial to expand our understanding of this disease entity and improve the clinical outcome.
Bogucki, Artur J
2014-01-01
The knee joint is a bicondylar hinge two-level joint with six degrees of freedom. The location of the functional axis of flexion-extension motion is still a subject of research and discussions. During the swing phase, the femoral condyles do not have direct contact with the tibial articular surfaces and the intra-articular space narrows with increasing weight bearing. The geometry of knee movements is determined by the shape of articular surfaces. A digital recording of the gait of a healthy volunteer was analysed. In the first experimental variant, the subject was wearing a knee orthosis controlling flexion and extension with a hinge-type single-axis joint. In the second variant, the examination involved a hinge-type double-axis orthosis. Statistical analysis involved mathematically calculated values of displacement P. Scatter graphs with a fourth-order polynomial trend line with a confidence interval of 0.95 due to noise were prepared for each experimental variant. In Variant 1, the average displacement was 15.1 mm, the number of tests was 43, standard deviation was 8.761, and the confidence interval was 2.2. The maximum value of displacement was 30.9 mm and the minimum value was 0.7 mm. In Variant 2, the average displacement was 13.4 mm, the number of tests was 44, standard deviation was 7.275, and the confidence interval was 1.8. The maximum value of displacement was 30.2 mm and the minimum value was 3.4 mm. An analysis of moving averages for both experimental variants revealed that displacement trends for both types of orthosis were compatible from the mid-stance to the mid-swing phase. 1. The method employed in the experiment allows for determining the alignment between the axis of the knee joint and that of shin and thigh orthoses. 2. Migration of the single and double-axis orthoses during the gait cycle exceeded 3 cm. 3. During weight bearing, the double-axis orthosis was positioned more correctly. 4. The study results may be helpful in designing new hinge-type knee joints.
Li, Man; Li, Yong; Weeks, Olivia; Mijatovic, Vladan; Teumer, Alexander; Huffman, Jennifer E; Tromp, Gerard; Fuchsberger, Christian; Gorski, Mathias; Lyytikäinen, Leo-Pekka; Nutile, Teresa; Sedaghat, Sanaz; Sorice, Rossella; Tin, Adrienne; Yang, Qiong; Ahluwalia, Tarunveer S; Arking, Dan E; Bihlmeyer, Nathan A; Böger, Carsten A; Carroll, Robert J; Chasman, Daniel I; Cornelis, Marilyn C; Dehghan, Abbas; Faul, Jessica D; Feitosa, Mary F; Gambaro, Giovanni; Gasparini, Paolo; Giulianini, Franco; Heid, Iris; Huang, Jinyan; Imboden, Medea; Jackson, Anne U; Jeff, Janina; Jhun, Min A; Katz, Ronit; Kifley, Annette; Kilpeläinen, Tuomas O; Kumar, Ashish; Laakso, Markku; Li-Gao, Ruifang; Lohman, Kurt; Lu, Yingchang; Mägi, Reedik; Malerba, Giovanni; Mihailov, Evelin; Mohlke, Karen L; Mook-Kanamori, Dennis O; Robino, Antonietta; Ruderfer, Douglas; Salvi, Erika; Schick, Ursula M; Schulz, Christina-Alexandra; Smith, Albert V; Smith, Jennifer A; Traglia, Michela; Yerges-Armstrong, Laura M; Zhao, Wei; Goodarzi, Mark O; Kraja, Aldi T; Liu, Chunyu; Wessel, Jennifer; Boerwinkle, Eric; Borecki, Ingrid B; Bork-Jensen, Jette; Bottinger, Erwin P; Braga, Daniele; Brandslund, Ivan; Brody, Jennifer A; Campbell, Archie; Carey, David J; Christensen, Cramer; Coresh, Josef; Crook, Errol; Curhan, Gary C; Cusi, Daniele; de Boer, Ian H; de Vries, Aiko P J; Denny, Joshua C; Devuyst, Olivier; Dreisbach, Albert W; Endlich, Karlhans; Esko, Tõnu; Franco, Oscar H; Fulop, Tibor; Gerhard, Glenn S; Glümer, Charlotte; Gottesman, Omri; Grarup, Niels; Gudnason, Vilmundur; Hansen, Torben; Harris, Tamara B; Hayward, Caroline; Hocking, Lynne; Hofman, Albert; Hu, Frank B; Husemoen, Lise Lotte N; Jackson, Rebecca D; Jørgensen, Torben; Jørgensen, Marit E; Kähönen, Mika; Kardia, Sharon L R; König, Wolfgang; Kooperberg, Charles; Kriebel, Jennifer; Launer, Lenore J; Lauritzen, Torsten; Lehtimäki, Terho; Levy, Daniel; Linksted, Pamela; Linneberg, Allan; Liu, Yongmei; Loos, Ruth J F; Lupo, Antonio; Meisinger, Christine; Melander, Olle; Metspalu, Andres; Mitchell, Paul; Nauck, Matthias; Nürnberg, Peter; Orho-Melander, Marju; Parsa, Afshin; Pedersen, Oluf; Peters, Annette; Peters, Ulrike; Polasek, Ozren; Porteous, David; Probst-Hensch, Nicole M; Psaty, Bruce M; Qi, Lu; Raitakari, Olli T; Reiner, Alex P; Rettig, Rainer; Ridker, Paul M; Rivadeneira, Fernando; Rossouw, Jacques E; Schmidt, Frank; Siscovick, David; Soranzo, Nicole; Strauch, Konstantin; Toniolo, Daniela; Turner, Stephen T; Uitterlinden, André G; Ulivi, Sheila; Velayutham, Dinesh; Völker, Uwe; Völzke, Henry; Waldenberger, Melanie; Wang, Jie Jin; Weir, David R; Witte, Daniel; Kuivaniemi, Helena; Fox, Caroline S; Franceschini, Nora; Goessling, Wolfram; Köttgen, Anna; Chu, Audrey Y
2017-03-01
Genome-wide association studies have identified >50 common variants associated with kidney function, but these variants do not fully explain the variation in eGFR. We performed a two-stage meta-analysis of associations between genotypes from the Illumina exome array and eGFR on the basis of serum creatinine (eGFRcrea) among participants of European ancestry from the CKDGen Consortium ( n Stage1 : 111,666; n Stage2 : 48,343). In single-variant analyses, we identified single nucleotide polymorphisms at seven new loci associated with eGFRcrea ( PPM1J , EDEM3, ACP1, SPEG, EYA4, CYP1A1 , and ATXN2L ; P Stage1 <3.7×10 -7 ), of which most were common and annotated as nonsynonymous variants. Gene-based analysis identified associations of functional rare variants in three genes with eGFRcrea, including a novel association with the SOS Ras/Rho guanine nucleotide exchange factor 2 gene, SOS2 ( P =5.4×10 -8 by sequence kernel association test). Experimental follow-up in zebrafish embryos revealed changes in glomerular gene expression and renal tubule morphology in the embryonic kidney of acp1- and sos2 -knockdowns. These developmental abnormalities associated with altered blood clearance rate and heightened prevalence of edema. This study expands the number of loci associated with kidney function and identifies novel genes with potential roles in kidney formation. Copyright © 2017 by the American Society of Nephrology.
Incidence of numerical variants and transitional lumbosacral vertebrae on whole-spine MRI.
Tins, Bernhard J; Balain, Birender
2016-04-01
This study sets out to prospectively investigate the incidence of transitional vertebrae and numerical variants of the spine. Over a period of 28 months, MRIs of the whole spine were prospectively evaluated for the presence of transitional lumbosacral vertebrae and numerical variants of the spine. MRI of the whole spine was evaluated in 420 patients, comprising 211 female and 209 male subjects. Two patients had more complex anomalies. Lumbosacral transitional vertebrae were seen in 12 patients: eight sacralised L5 (3 male, 5 female) and four lumbarised S1 (3 male, 1 female). The incidence of transitional vertebrae was approximately 3.3. % (14/418). Thirty-two (7.7 %) of 418 patients had numerical variants of mobile vertebrae of the spine without transitional vertebrae. The number of mobile vertebrae was increased by one in 18 patients (12 male, 6 female), and the number was decreased by one in 14 patients (4 male, 10 female). Numerical variants of the spine are common, and were found to be almost 2.5 times as frequent as transitional lumbosacral vertebrae in the study population. Only whole-spine imaging can identify numerical variants and the anatomical nature of transitional vertebrae. The tendency is toward an increased number of mobile vertebrae in men and a decreased number in women. Main messages • Numerical variants of the spine are more common than transitional vertebrae. • Spinal numerical variants can be reliably identified only with whole-spine imaging. • Increased numbers of vertebrae are more common in men than women. • Transitional lumbosacral vertebrae occurred in about 3.3 % of the study population. • The incidence of numerical variants of the spine was about 7.7 %.
Metzger, Julia; Philipp, Ute; Lopes, Maria Susana; da Camara Machado, Artur; Felicetti, Michela; Silvestrelli, Maurizio; Distl, Ottmar
2013-07-18
Copy number variants (CNVs) have been shown to play an important role in genetic diversity of mammals and in the development of many complex phenotypic traits. The aim of this study was to perform a standard comparative evaluation of CNVs in horses using three different CNV detection programs and to identify genomic regions associated with body size in horses. Analysis was performed using the Illumina Equine SNP50 genotyping beadchip for 854 horses. CNVs were detected by three different algorithms, CNVPartition, PennCNV and QuantiSNP. Comparative analysis revealed 50 CNVs that affected 153 different genes mainly involved in sensory perception, signal transduction and cellular components. Genome-wide association analysis for body size showed highly significant deleted regions on ECA1, ECA8 and ECA9. Homologous regions to the detected CNVs on ECA1 and ECA9 have also been shown to be correlated with human height. Comparative analysis of CNV detection algorithms was useful to increase the specificity of CNV detection but had certain limitations dependent on the detection tool. GWAS revealed genome-wide associated CNVs for body size in horses.
Gutiérrez, Orlando M; Judd, Suzanne E; Irvin, Marguerite R; Zhi, Degui; Limdi, Nita; Palmer, Nicholette D; Rich, Stephen S; Sale, Michèle M; Freedman, Barry I
2016-04-01
Two independent coding variants in the apolipoprotein L1 gene (APOL1), G1 and G2, strongly associate with nephropathy in African Americans; associations with cardiovascular disease are more controversial. Although APOL1 binds plasma high-density lipoproteins (HDLs), data on APOL1 risk variant associations with HDL subfractions are sparse. Two APOL1 G1 single nucleotide polymorphisms and the G2 insertion/deletion polymorphism were genotyped in 2010 Reasons for Geographic and Racial Differences in Stroke (REGARDS) Study participants with nuclear magnetic resonance spectroscopy-based lipoprotein subfraction measurements. Linear regression was used to model associations between numbers of APOL1 G1/G2 risk variants and HDL subfractions, adjusting for demographic, clinical and ancestral covariates. Female sex and higher percentage of African ancestry were positively associated with the number of APOL1 G1/G2 risk alleles. In the unadjusted analysis, mean (standard error) small HDL concentrations (μmol/L) for participants with zero, one and two G1/G2 risk alleles were 19.0 (0.2), 19.7 (0.2) and 19.9 (0.4), respectively (P = 0.02). Adjustment for age, sex, diabetes and African ancestry did not change the results but strengthened the statistical significance (P = 0.004). No significant differences in large or medium HDL, very low-density lipoprotein or low-density lipoprotein particle concentrations were observed by APOL1 genotype. Greater numbers of APOL1 G1/G2 risk alleles were associated with higher small HDL particle concentrations in African Americans. These results may suggest novel areas of investigation to uncover reasons for the association between APOL1 risk variants with adverse outcomes in African Americans. © The Author 2015. Published by Oxford University Press on behalf of ERA-EDTA. All rights reserved.
Gutiérrez, Orlando M.; Judd, Suzanne E.; Irvin, Marguerite R.; Zhi, Degui; Limdi, Nita; Palmer, Nicholette D.; Rich, Stephen S.; Sale, Michèle M.; Freedman, Barry I.
2016-01-01
Background Two independent coding variants in the apolipoprotein L1 gene (APOL1), G1 and G2, strongly associate with nephropathy in African Americans; associations with cardiovascular disease are more controversial. Although APOL1 binds plasma high-density lipoproteins (HDLs), data on APOL1 risk variant associations with HDL subfractions are sparse. Methods Two APOL1 G1 single nucleotide polymorphisms and the G2 insertion/deletion polymorphism were genotyped in 2010 Reasons for Geographic and Racial Differences in Stroke (REGARDS) Study participants with nuclear magnetic resonance spectroscopy-based lipoprotein subfraction measurements. Linear regression was used to model associations between numbers of APOL1 G1/G2 risk variants and HDL subfractions, adjusting for demographic, clinical and ancestral covariates. Results Female sex and higher percentage of African ancestry were positively associated with the number of APOL1 G1/G2 risk alleles. In the unadjusted analysis, mean (standard error) small HDL concentrations (μmol/L) for participants with zero, one and two G1/G2 risk alleles were 19.0 (0.2), 19.7 (0.2) and 19.9 (0.4), respectively (P = 0.02). Adjustment for age, sex, diabetes and African ancestry did not change the results but strengthened the statistical significance (P = 0.004). No significant differences in large or medium HDL, very low-density lipoprotein or low-density lipoprotein particle concentrations were observed by APOL1 genotype. Conclusions Greater numbers of APOL1 G1/G2 risk alleles were associated with higher small HDL particle concentrations in African Americans. These results may suggest novel areas of investigation to uncover reasons for the association between APOL1 risk variants with adverse outcomes in African Americans. PMID:26152403
ERIC Educational Resources Information Center
Napoli, Eleonora; Russo, Serena; Casula, Laura; Alesi, Viola; Amendola, Filomena Alessandra; Angioni, Adriano; Novelli, Antonio; Valeri, Giovanni; Menghini, Deny; Vicari, Stefano
2018-01-01
Copy-number variants (CNVs) are associated with susceptibility to autism spectrum disorder (ASD). To detect the presence of CNVs, we conducted an array-comparative genomic hybridization (array-CGH) analysis in 133 children with "essential" ASD phenotype. Genetic analyses documented that 12 children had causative CNVs (C-CNVs), 29…
Fast Bayesian Inference of Copy Number Variants using Hidden Markov Models with Wavelet Compression
Wiedenhoeft, John; Brugel, Eric; Schliep, Alexander
2016-01-01
By integrating Haar wavelets with Hidden Markov Models, we achieve drastically reduced running times for Bayesian inference using Forward-Backward Gibbs sampling. We show that this improves detection of genomic copy number variants (CNV) in array CGH experiments compared to the state-of-the-art, including standard Gibbs sampling. The method concentrates computational effort on chromosomal segments which are difficult to call, by dynamically and adaptively recomputing consecutive blocks of observations likely to share a copy number. This makes routine diagnostic use and re-analysis of legacy data collections feasible; to this end, we also propose an effective automatic prior. An open source software implementation of our method is available at http://schlieplab.org/Software/HaMMLET/ (DOI: 10.5281/zenodo.46262). This paper was selected for oral presentation at RECOMB 2016, and an abstract is published in the conference proceedings. PMID:27177143
Jeuken, Judith; Sijben, Angelique; Alenda, Cristina; Rijntjes, Jos; Dekkers, Marieke; Boots-Sprenger, Sandra; McLendon, Roger; Wesseling, Pieter
2009-10-01
Epidermal growth factor receptor (EGFR) is commonly affected in cancer, generally in the form of an increase in DNA copy number and/or as mutation variants [e.g., EGFR variant III (EGFRvIII), an in-frame deletion of exons 2-7]. While detection of EGFR aberrations can be expected to be relevant for glioma patients, such analysis has not yet been implemented in a routine setting, also because feasible and robust assays were lacking. We evaluated multiplex ligation-dependent probe amplification (MLPA) for detection of EGFR amplification and EGFRvIII in DNA of a spectrum of 216 diffuse gliomas. EGFRvIII detection was verified at the protein level by immunohistochemistry and at the RNA level using the conventionally used endpoint RT-PCR as well as a newly developed quantitative RT-PCR. Compared to these techniques, the DNA-based MLPA assay for EGFR/EGFRvIII analysis tested showed 100% sensitivity and specificity. We conclude that MLPA is a robust assay for detection of EGFR/EGFRvIII aberrations. While the exact diagnostic, prognostic and predictive value of such EGFR testing remains to be seen, MLPA has great potential as it can reliably and relatively easily be performed on routinely processed (formalin-fixed, paraffin-embedded) tumor tissue in combination with testing for other relevant glioma markers.
Carrier screening in the era of expanding genetic technology.
Arjunan, Aishwarya; Litwack, Karen; Collins, Nick; Charrow, Joel
2016-12-01
The Center for Jewish Genetics provides genetic education and carrier screening to individuals of Jewish descent. Carrier screening has traditionally been performed by targeted mutation analysis for founder mutations with an enzyme assay for Tay-Sachs carrier detection. The development of next-generation sequencing (NGS) allows for higher detection rates regardless of ethnicity. Here, we explore differences in carrier detection rates between genotyping and NGS in a primarily Jewish population. Peripheral blood samples or saliva samples were obtained from 506 individuals. All samples were analyzed by sequencing, targeted genotyping, triplet-repeat detection, and copy-number analysis; the analyses were carried out at Counsyl. Of 506 individuals screened, 288 were identified as carriers of at least 1 condition and 8 couples were carriers for the same disorder. A total of 434 pathogenic variants were identified. Three hundred twelve variants would have been detected via genotyping alone. Although no additional mutations were detected by NGS in diseases routinely screened for in the Ashkenazi Jewish population, 26.5% of carrier results and 2 carrier couples would have been missed without NGS in the larger panel. In a primarily Jewish population, NGS reveals a larger number of pathogenic variants and provides individuals with valuable information for family planning.Genet Med 18 12, 1214-1217.
Large-scale exploratory genetic analysis of cognitive impairment in Parkinson's disease.
Mata, Ignacio F; Johnson, Catherine O; Leverenz, James B; Weintraub, Daniel; Trojanowski, John Q; Van Deerlin, Vivianna M; Ritz, Beate; Rausch, Rebecca; Factor, Stewart A; Wood-Siverio, Cathy; Quinn, Joseph F; Chung, Kathryn A; Peterson-Hiller, Amie L; Espay, Alberto J; Revilla, Fredy J; Devoto, Johnna; Yearout, Dora; Hu, Shu-Ching; Cholerton, Brenna A; Montine, Thomas J; Edwards, Karen L; Zabetian, Cyrus P
2017-08-01
Cognitive impairment is a common and disabling problem in Parkinson's disease (PD). Identification of genetic variants that influence the presence or severity of cognitive deficits in PD might provide a clearer understanding of the pathophysiology underlying this important nonmotor feature. We genotyped 1105 PD patients from the PD Cognitive Genetics Consortium for 249,336 variants using the NeuroX array. Participants underwent assessments of learning and memory (Hopkins Verbal Learning Test-Revised [HVLT-R]), working memory/executive function (Letter-Number Sequencing and Trail Making Test [TMT] A and B), language processing (semantic and phonemic verbal fluency), visuospatial abilities (Benton Judgment of Line Orientation [JoLO]), and global cognitive function (Montreal Cognitive Assessment). For common variants, we used linear regression to test for association between genotype and cognitive performance with adjustment for important covariates. Rare variants were analyzed using the optimal unified sequence kernel association test. The significance threshold was defined as a false discovery rate-corrected p-value (P FDR ) of 0.05. Eighteen common variants in 13 genomic regions exceeded the significance threshold for one of the cognitive tests. These included GBA rs2230288 (E326K; P FDR = 2.7 × 10 -4 ) for JoLO, PARP4 rs9318600 (P FDR = 0.006), and rs9581094 (P FDR = 0.006) for HVLT-R total recall, and MTCL1 rs34877994 (P FDR = 0.01) for TMT B-A. Analysis of rare variants did not yield any significant gene regions. We have conducted the first large-scale PD cognitive genetics analysis and nominated several new putative susceptibility genes for cognitive impairment in PD. These results will require replication in independent PD cohorts. Published by Elsevier Inc.
Hintzsche, Jennifer; Kim, Jihye; Yadav, Vinod; Amato, Carol; Robinson, Steven E; Seelenfreund, Eric; Shellman, Yiqun; Wisell, Joshua; Applegate, Allison; McCarter, Martin; Box, Neil; Tentler, John; De, Subhajyoti
2016-01-01
Objective Currently, there is a disconnect between finding a patient’s relevant molecular profile and predicting actionable therapeutics. Here we develop and implement the Integrating Molecular Profiles with Actionable Therapeutics (IMPACT) analysis pipeline, linking variants detected from whole-exome sequencing (WES) to actionable therapeutics. Methods and materials The IMPACT pipeline contains 4 analytical modules: detecting somatic variants, calling copy number alterations, predicting drugs against deleterious variants, and analyzing tumor heterogeneity. We tested the IMPACT pipeline on whole-exome sequencing data in The Cancer Genome Atlas (TCGA) lung adenocarcinoma samples with known EGFR mutations. We also used IMPACT to analyze melanoma patient tumor samples before treatment, after BRAF-inhibitor treatment, and after BRAF- and MEK-inhibitor treatment. Results IMPACT Food and Drug Administration (FDA) correctly identified known EGFR mutations in the TCGA lung adenocarcinoma samples. IMPACT linked these EGFR mutations to the appropriate FDA-approved EGFR inhibitors. For the melanoma patient samples, we identified NRAS p.Q61K as an acquired resistance mutation to BRAF-inhibitor treatment. We also identified CDKN2A deletion as a novel acquired resistance mutation to BRAFi/MEKi inhibition. The IMPACT analysis pipeline predicts these somatic variants to actionable therapeutics. We observed the clonal dynamic in the tumor samples after various treatments. We showed that IMPACT not only helped in successful prioritization of clinically relevant variants but also linked these variations to possible targeted therapies. Conclusion IMPACT provides a new bioinformatics strategy to delineate candidate somatic variants and actionable therapies. This approach can be applied to other patient tumor samples to discover effective drug targets for personalized medicine. IMPACT is publicly available at http://tanlab.ucdenver.edu/IMPACT. PMID:27026619
Hintzsche, Jennifer; Kim, Jihye; Yadav, Vinod; Amato, Carol; Robinson, Steven E; Seelenfreund, Eric; Shellman, Yiqun; Wisell, Joshua; Applegate, Allison; McCarter, Martin; Box, Neil; Tentler, John; De, Subhajyoti; Robinson, William A; Tan, Aik Choon
2016-07-01
Currently, there is a disconnect between finding a patient's relevant molecular profile and predicting actionable therapeutics. Here we develop and implement the Integrating Molecular Profiles with Actionable Therapeutics (IMPACT) analysis pipeline, linking variants detected from whole-exome sequencing (WES) to actionable therapeutics. The IMPACT pipeline contains 4 analytical modules: detecting somatic variants, calling copy number alterations, predicting drugs against deleterious variants, and analyzing tumor heterogeneity. We tested the IMPACT pipeline on whole-exome sequencing data in The Cancer Genome Atlas (TCGA) lung adenocarcinoma samples with known EGFR mutations. We also used IMPACT to analyze melanoma patient tumor samples before treatment, after BRAF-inhibitor treatment, and after BRAF- and MEK-inhibitor treatment. IMPACT Food and Drug Administration (FDA) correctly identified known EGFR mutations in the TCGA lung adenocarcinoma samples. IMPACT linked these EGFR mutations to the appropriate FDA-approved EGFR inhibitors. For the melanoma patient samples, we identified NRAS p.Q61K as an acquired resistance mutation to BRAF-inhibitor treatment. We also identified CDKN2A deletion as a novel acquired resistance mutation to BRAFi/MEKi inhibition. The IMPACT analysis pipeline predicts these somatic variants to actionable therapeutics. We observed the clonal dynamic in the tumor samples after various treatments. We showed that IMPACT not only helped in successful prioritization of clinically relevant variants but also linked these variations to possible targeted therapies. IMPACT provides a new bioinformatics strategy to delineate candidate somatic variants and actionable therapies. This approach can be applied to other patient tumor samples to discover effective drug targets for personalized medicine.IMPACT is publicly available at http://tanlab.ucdenver.edu/IMPACT. © The Author 2016. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Protein variants in Hiroshima and Nagasaki: tales of two cities.
Neel, J V; Satoh, C; Smouse, P; Asakawa, J; Takahashi, N; Goriki, K; Fujita, M; Kageoka, T; Hazama, R
1988-01-01
The results of 1,465,423 allele product determinations based on blood samples from Hiroshima and Nagasaki, involving 30 different proteins representing 32 different gene products, are analyzed in a variety of ways, with the following conclusions: (1) Sibships and their parents are included in the sample. Our analysis reveals that statistical procedures designed to reduce the sample to equivalent independent genomes do not in population comparisons compensate for the familial cluster effect of rare variants. Accordingly, the data set was reduced to one representative of each sibship (937,427 allele products). (2) Both chi 2-type contrasts and a genetic distance measure (delta) reveal that rare variants (P less than .01) are collectively as effective as polymorphisms in establishing genetic differences between the two cities. (3) We suggest that rare variants that individually exhibit significant intercity differences are probably the legacy of tribal private polymorphisms that occurred during prehistoric times. (4) Despite the great differences in the known histories of the two cities, both the overall frequency of rare variants and the number of different rare variants are essentially identical in the two cities. (5) The well-known differences in locus variability are confirmed, now after adjustment for sample size differences for the various locus products; in this large series we failed to detect variants at only three of 29 loci for which sample size exceeded 23,000. (6) The number of alleles identified per locus correlates positively with subunit molecular weight. (7) Loci supporting genetic polymorphisms are characterized by more rare variants than are loci at which polymorphisms were not encountered. (8) Loci whose products do not appear to be essential for health support more variants than do loci the absence of whose product is detrimental to health. (9) There is a striking excess of rare variants over the expectation under the neutral mutation/drift/equilibrium theory. We suggest that this finding is primarily due to the relatively recent (in genetic time) agglomeration of previously separated tribal populations; efforts to test for agreement with the expectations of this theory by using data from modern cosmopolitan populations are exercises in futility. (10) All of these findings should characterize DNA variants in exons as more data become available, since the finding are the protein expression of such variants. PMID:3195587
Protein variants in Hiroshima and Nagasaki: tales of two cities.
Neel, J V; Satoh, C; Smouse, P; Asakawa, J; Takahashi, N; Goriki, K; Fujita, M; Kageoka, T; Hazama, R
1988-12-01
The results of 1,465,423 allele product determinations based on blood samples from Hiroshima and Nagasaki, involving 30 different proteins representing 32 different gene products, are analyzed in a variety of ways, with the following conclusions: (1) Sibships and their parents are included in the sample. Our analysis reveals that statistical procedures designed to reduce the sample to equivalent independent genomes do not in population comparisons compensate for the familial cluster effect of rare variants. Accordingly, the data set was reduced to one representative of each sibship (937,427 allele products). (2) Both chi 2-type contrasts and a genetic distance measure (delta) reveal that rare variants (P less than .01) are collectively as effective as polymorphisms in establishing genetic differences between the two cities. (3) We suggest that rare variants that individually exhibit significant intercity differences are probably the legacy of tribal private polymorphisms that occurred during prehistoric times. (4) Despite the great differences in the known histories of the two cities, both the overall frequency of rare variants and the number of different rare variants are essentially identical in the two cities. (5) The well-known differences in locus variability are confirmed, now after adjustment for sample size differences for the various locus products; in this large series we failed to detect variants at only three of 29 loci for which sample size exceeded 23,000. (6) The number of alleles identified per locus correlates positively with subunit molecular weight. (7) Loci supporting genetic polymorphisms are characterized by more rare variants than are loci at which polymorphisms were not encountered. (8) Loci whose products do not appear to be essential for health support more variants than do loci the absence of whose product is detrimental to health. (9) There is a striking excess of rare variants over the expectation under the neutral mutation/drift/equilibrium theory. We suggest that this finding is primarily due to the relatively recent (in genetic time) agglomeration of previously separated tribal populations; efforts to test for agreement with the expectations of this theory by using data from modern cosmopolitan populations are exercises in futility. (10) All of these findings should characterize DNA variants in exons as more data become available, since the finding are the protein expression of such variants.
Brancaleoni, Valentina; Granata, Francesca; Missineo, Pasquale; Fustinoni, Silvia; Graziadei, Giovanna; Di Pierro, Elena
2018-06-13
Alterations in the ferrochelatase gene (FECH) are the basis of the phenotypic expressions in erythropoietic protoporphyria. The phenotype is due to the presence of a mutation in the FECH gene associated in trans to the c.315-48 T > C variant in the intron 3. The latter is able to increase the physiological quota of alternative splicing events in the intron 3. Other two variants in the FECH gene (c.1-252A > G and c.68-23C > T) have been found to be associated to the intron 3 variant in some populations and together, they constitute a haplotype (ACT/GTC), but eventually, their role in the alternative splicing event has never been elucidated. The absolute number of the aberrantly spliced FECH mRNA molecules and the absolute expression of the FECH gene were evaluated by digital PCR technique in a comprehensive cohort. The number of splicing events that rose in the presence of the c.315-48 T > C variant, both in the heterozygous and homozygous condition was reported for the first time. Also, the percentage of the inserted FECH mRNA increased, even doubled in the T/C cases, compared to T/T cases. The constant presence of variants in the promoter and intron 2 did not influence or modulate the aberrant splicing. The results of FECH gene expression suggested that the homozygosity for the c.315-48 T > C variant could be considered pathological. Thus, this study identified the homozygotes for the c.315-48 T > C variant as pathological. By extension, when the samples were categorised according to the haplotypes, the GTC haplotype in homozygosis was pathological. Copyright © 2018 Elsevier Inc. All rights reserved.
Pellegrino, Renata; Kavakli, Ibrahim Halil; Goel, Namni; Cardinale, Christopher J; Dinges, David F; Kuna, Samuel T; Maislin, Greg; Van Dongen, Hans P A; Tufik, Sergio; Hogenesch, John B; Hakonarson, Hakon; Pack, Allan I
2014-08-01
Earlier work described a mutation in DEC2 also known as BHLHE41 (basic helix-loophelix family member e41) as causal in a family of short sleepers, who needed just 6 h sleep per night. We evaluated whether there were other variants of this gene in two well-phenotyped cohorts. Sequencing of the BHLHE41 gene, electroencephalographic data, and delta power analysis and functional studies using cell-based luciferase. We identified new variants of the BHLHE41 gene in two cohorts who had either acute sleep deprivation (n = 200) or chronic partial sleep deprivation (n = 217). One variant, Y362H, at another location in the same exon occurred in one twin in a dizygotic twin pair and was associated with reduced sleep duration, less recovery sleep following sleep deprivation, and fewer performance lapses during sleep deprivation than the homozygous twin. Both twins had almost identical amounts of non rapid eye movement (NREM) sleep. This variant reduced the ability of BHLHE41 to suppress CLOCK/BMAL1 and NPAS2/BMAL1 transactivation in vitro. Another variant in the same exome had no effect on sleep or response to sleep deprivation and no effect on CLOCK/BMAL1 transactivation. Random mutagenesis identified a number of other variants of BHLHE41 that affect its function. There are a number of mutations of BHLHE41. Mutations reduce total sleep while maintaining NREM sleep and provide resistance to the effects of sleep loss. Mutations that affect sleep also modify the normal inhibition of BHLHE41 of CLOCK/BMAL1 transactivation. Thus, clock mechanisms are likely involved in setting sleep length and the magnitude of sleep homeostasis. Pellegrino R, Kavakli IH, Goel N, Cardinale CJ, Dinges DF, Kuna ST, Maislin G, Van Dongen HP, Tufik S, Hogenesch JB, Hakonarson H, Pack AI. A novel BHLHE41 variant is associated with short sleep and resistance to sleep deprivation in humans. SLEEP 2014;37(8):1327-1336.
Zhao, Linlu; Bracken, Michael B.; DeWan, Andrew T.
2013-01-01
Summary A genome-wide association study was undertaken to identify maternal single nucleotide polymorphisms (SNPs) and copy-number variants (CNVs) associated with preeclampsia. Case-control analysis was performed on 1070 Afro-Caribbean (n=21 cases and 1049 controls) and 723 Hispanic (n=62 cases and 661 controls) mothers and 1257 mothers of European ancestry (n=50 cases and 1207 controls) from the Hyperglycemia and Adverse Pregnancy Outcome (HAPO) study. European ancestry subjects were genotyped on Illumina Human610-Quad and Afro-Caribbean and Hispanic subjects were genotyped on Illumina Human1M-Duo BeadChip microarrays. Genome-wide SNP data were analyzed using PLINK. CNVs were called using three detection algorithms (GNOSIS, PennCNV, and QuantiSNP), merged using CNVision, and then screened using stringent criteria. SNP and CNV findings were compared to those of the Study of Pregnancy Hypertension in Iowa (SOPHIA), an independent preeclampsia case-control dataset of Caucasian mothers (n=177 cases and 116 controls). A list of top SNPs were identified for each of the HAPO ethnic groups, but none reached Bonferroni-corrected significance. Novel candidate CNVs showing enrichment among preeclampsia cases were also identified in each of the three ethnic groups. Several variants were suggestively replicated in SOPHIA. The discovered SNPs and copy-number variable regions present interesting candidate genetic variants for preeclampsia that warrant further replication and investigation. PMID:23551011
Pfundt, Rolph; del Rosario, Marisol; Vissers, Lisenka E.L.M.; Kwint, Michael P.; Janssen, Irene M.; de Leeuw, Nicole; Yntema, Helger G.; Nelen, Marcel R.; Lugtenberg, Dorien; Kamsteeg, Erik-Jan; Wieskamp, Nienke; Stegmann, Alexander P.A.; Stevens, Servi J.C.; Rodenburg, Richard J.T.; Simons, Annet; Mensenkamp, Arjen R.; Rinne, Tuula; Gilissen, Christian; Scheffer, Hans; Veltman, Joris A.; Hehir-Kwa, Jayne Y.
2017-01-01
Purpose: Copy-number variation is a common source of genomic variation and an important genetic cause of disease. Microarray-based analysis of copy-number variants (CNVs) has become a first-tier diagnostic test for patients with neurodevelopmental disorders, with a diagnostic yield of 10–20%. However, for most other genetic disorders, the role of CNVs is less clear and most diagnostic genetic studies are generally limited to the study of single-nucleotide variants (SNVs) and other small variants. With the introduction of exome and genome sequencing, it is now possible to detect both SNVs and CNVs using an exome- or genome-wide approach with a single test. Methods: We performed exome-based read-depth CNV screening on data from 2,603 patients affected by a range of genetic disorders for which exome sequencing was performed in a diagnostic setting. Results: In total, 123 clinically relevant CNVs ranging in size from 727 bp to 15.3 Mb were detected, which resulted in 51 conclusive diagnoses and an overall increase in diagnostic yield of ~2% (ranging from 0 to –5.8% per disorder). Conclusions: This study shows that CNVs play an important role in a broad range of genetic disorders and that detection via exome-based CNV profiling results in an increase in the diagnostic yield without additional testing, bringing us closer to single-test genomics. Genet Med advance online publication 27 October 2016 PMID:28574513
Screening for common copy-number variants in cancer genes.
Tyson, Jess; Majerus, Tamsin M O; Walker, Susan; Armour, John A L
2010-12-01
For most cases of colorectal cancer that arise without a family history of the disease, it is proposed that an appreciable heritable component of predisposition is the result of contributions from many loci. Although progress has been made in identifying single nucleotide variants associated with colorectal cancer risk, the involvement of low-penetrance copy number variants is relatively unexplored. We have used multiplex amplifiable probe hybridization (MAPH) in a fourfold multiplex (QuadMAPH), positioned at an average resolution of one probe per 2 kb, to screen a total of 1.56 Mb of genomic DNA for copy number variants around the genes APC, AXIN1, BRCA1, BRCA2, CTNNB1, HRAS, MLH1, MSH2, and TP53. Two deletion events were detected, one upstream of MLH1 in a control individual and the other in APC in a colorectal cancer patient, but these do not seem to correspond to copy number polymorphisms with measurably high population frequencies. In summary, by means of our QuadMAPH assay, copy number measurement data were of sufficient resolution and accuracy to detect any copy number variants with high probability. However, this study has demonstrated a very low incidence of deletion and duplication variants within intronic and flanking regions of these nine genes, in both control individuals and colorectal cancer patients. Copyright © 2010 Elsevier Inc. All rights reserved.
HGVS Recommendations for the Description of Sequence Variants: 2016 Update.
den Dunnen, Johan T; Dalgleish, Raymond; Maglott, Donna R; Hart, Reece K; Greenblatt, Marc S; McGowan-Jordan, Jean; Roux, Anne-Francoise; Smith, Timothy; Antonarakis, Stylianos E; Taschner, Peter E M
2016-06-01
The consistent and unambiguous description of sequence variants is essential to report and exchange information on the analysis of a genome. In particular, DNA diagnostics critically depends on accurate and standardized description and sharing of the variants detected. The sequence variant nomenclature system proposed in 2000 by the Human Genome Variation Society has been widely adopted and has developed into an internationally accepted standard. The recommendations are currently commissioned through a Sequence Variant Description Working Group (SVD-WG) operating under the auspices of three international organizations: the Human Genome Variation Society (HGVS), the Human Variome Project (HVP), and the Human Genome Organization (HUGO). Requests for modifications and extensions go through the SVD-WG following a standard procedure including a community consultation step. Version numbers are assigned to the nomenclature system to allow users to specify the version used in their variant descriptions. Here, we present the current recommendations, HGVS version 15.11, and briefly summarize the changes that were made since the 2000 publication. Most focus has been on removing inconsistencies and tightening definitions allowing automatic data processing. An extensive version of the recommendations is available online, at http://www.HGVS.org/varnomen. © 2016 WILEY PERIODICALS, INC.
Genome-wide association study yields variants at 20p12.2 that associate with urinary bladder cancer.
Rafnar, Thorunn; Sulem, Patrick; Thorleifsson, Gudmar; Vermeulen, Sita H; Helgason, Hannes; Saemundsdottir, Jona; Gudjonsson, Sigurjon A; Sigurdsson, Asgeir; Stacey, Simon N; Gudmundsson, Julius; Johannsdottir, Hrefna; Alexiusdottir, Kristin; Petursdottir, Vigdis; Nikulasson, Sigfus; Geirsson, Gudmundur; Jonsson, Thorvaldur; Aben, Katja K H; Grotenhuis, Anne J; Verhaegh, Gerald W; Dudek, Aleksandra M; Witjes, J Alfred; van der Heijden, Antoine G; Vrieling, Alina; Galesloot, Tessel E; De Juan, Ana; Panadero, Angeles; Rivera, Fernando; Hurst, Carolyn; Bishop, D Timothy; Sak, Sei C; Choudhury, Ananya; Teo, Mark T W; Arici, Cecilia; Carta, Angela; Toninelli, Elena; de Verdier, Petra; Rudnai, Peter; Gurzau, Eugene; Koppova, Kvetoslava; van der Keur, Kirstin A; Lurkin, Irene; Goossens, Mieke; Kellen, Eliane; Guarrera, Simonetta; Russo, Alessia; Critelli, Rossana; Sacerdote, Carlotta; Vineis, Paolo; Krucker, Clémentine; Zeegers, Maurice P; Gerullis, Holger; Ovsiannikov, Daniel; Volkert, Frank; Hengstler, Jan G; Selinski, Silvia; Magnusson, Olafur T; Masson, Gisli; Kong, Augustine; Gudbjartsson, Daniel; Lindblom, Annika; Zwarthoff, Ellen; Porru, Stefano; Golka, Klaus; Buntinx, Frank; Matullo, Giuseppe; Kumar, Rajiv; Mayordomo, José I; Steineck, D Gunnar; Kiltie, Anne E; Jonsson, Eirikur; Radvanyi, François; Knowles, Margaret A; Thorsteinsdottir, Unnur; Kiemeney, Lambertus A; Stefansson, Kari
2014-10-15
Genome-wide association studies (GWAS) of urinary bladder cancer (UBC) have yielded common variants at 12 loci that associate with risk of the disease. We report here the results of a GWAS of UBC including 1670 UBC cases and 90 180 controls, followed by replication analysis in additional 5266 UBC cases and 10 456 controls. We tested a dataset containing 34.2 million variants, generated by imputation based on whole-genome sequencing of 2230 Icelanders. Several correlated variants at 20p12, represented by rs62185668, show genome-wide significant association with UBC after combining discovery and replication results (OR = 1.19, P = 1.5 × 10(-11) for rs62185668-A, minor allele frequency = 23.6%). The variants are located in a non-coding region approximately 300 kb upstream from the JAG1 gene, an important component of the Notch signaling pathways that may be oncogenic or tumor suppressive in several forms of cancer. Our results add to the growing number of UBC risk variants discovered through GWAS. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Barr, Norman; Ruiz-Arce, Raul; Obregón, Oscar; De Leon, Rosita; Foster, Nelson; Reuter, Chris; Boratynski, Theodore; Vacek, Don
2013-02-01
The utility of the cytochrome oxidase I (COI) DNA sequence used for DNA barcoding and a Sequence Characterized Amplified Region for diagnosing boll weevil, Anthonomus grandis Boheman, variants was evaluated. Maximum likelihood analysis of COI DNA sequences from 154 weevils collected from the United States and Mexico supports previous evidence for limited gene flow between weevil populations on wild cotton and commercial cotton in northern Mexico and southern United States. The wild cotton populations represent a variant of the species called the thurberia weevil, which is not regarded as a significant pest. The 31 boll weevil COI haplotypes observed in the study form two distinct haplogroups (A and B) that are supported by five fixed nucleotide differences and a phylogenetic analysis. Although wild and commercial cotton populations are closely associated with specific haplogroups, there is not a fixed difference between the thurberia weevil variant and other populations. The Sequence Characterized Amplified Region marker generated a larger number of inconclusive results than the COI gene but also supported evidence of shared genotypes between wild and commercial cotton weevil populations. These methods provide additional markers that can assist in the identification of pest weevil populations but not definitively diagnose samples.
Hibar, Derrek P; Stein, Jason L; Ryles, April B; Kohannim, Omid; Jahanshad, Neda; Medland, Sarah E; Hansell, Narelle K; McMahon, Katie L; de Zubicaray, Greig I; Montgomery, Grant W; Martin, Nicholas G; Wright, Margaret J; Saykin, Andrew J; Jack, Clifford R; Weiner, Michael W; Toga, Arthur W; Thompson, Paul M
2013-06-01
Deficits in lentiform nucleus volume and morphometry are implicated in a number of genetically influenced disorders, including Parkinson's disease, schizophrenia, and ADHD. Here we performed genome-wide searches to discover common genetic variants associated with differences in lentiform nucleus volume in human populations. We assessed structural MRI scans of the brain in two large genotyped samples: the Alzheimer's Disease Neuroimaging Initiative (ADNI; N = 706) and the Queensland Twin Imaging Study (QTIM; N = 639). Statistics of association from each cohort were combined meta-analytically using a fixed-effects model to boost power and to reduce the prevalence of false positive findings. We identified a number of associations in and around the flavin-containing monooxygenase (FMO) gene cluster. The most highly associated SNP, rs1795240, was located in the FMO3 gene; after meta-analysis, it showed genome-wide significant evidence of association with lentiform nucleus volume (P MA = 4.79 × 10(-8)). This commonly-carried genetic variant accounted for 2.68 % and 0.84 % of the trait variability in the ADNI and QTIM samples, respectively, even though the QTIM sample was on average 50 years younger. Pathway enrichment analysis revealed significant contributions of this gene to the cytochrome P450 pathway, which is involved in metabolizing numerous therapeutic drugs for pain, seizures, mania, depression, anxiety, and psychosis. The genetic variants we identified provide replicated, genome-wide significant evidence for the FMO gene cluster's involvement in lentiform nucleus volume differences in human populations.
Chen, Pei; Jou, Yuh-Shan; Fann, Cathy S J; Chen, Jaw-Wen; Chung, Chia-Min; Lin, Chin-Yu; Wu, Sheng-Yeu; Kang, Mei-Jyh; Chen, Ying-Chuang; Jong, Yuh-Shiun; Lo, Huey-Ming; Kang, Chih-Sen; Chen, Chien-Chung; Chang, Huan-Cheng; Huang, Nai-Kuei; Wu, Yi-Lin; Pan, Wen-Harn
2009-01-01
Previously, we observed that young-onset hypertension was independently associated with elevated plasma triglyceride(s) (TG) levels to a greater extent than other metabolic risk factors. Thus, focusing on the endophenotype--hypertension combined with elevated TG--we designed a family-based haplotype association study to explore its genetic connection with novel genetic variants of lipoprotein lipase gene (LPL), which encodes a major lipid metabolizing enzyme. Young-onset hypertension probands and their families were recruited, numbering 1,002 individuals from 345 families. Single-nucleotide polymorphism discovery for LPL, linkage disequilibrium (LD) analysis, transmission disequilibrium tests (TDT), bin construction, haplotype TDT association and logistic regression analysis were performed. We found that the CC- haplotype (i) spanning from intron 2 to intron 4 and the ACATT haplotype (ii) spanning from intron 5 to intron 6 were significantly associated with hypertension-related phenotypes: hypertension (ii, P=0.05), elevated TG (i, P=0.01), and hypertension combined with elevated TG (i, P=0.001; ii, P<0.0001), according to TDT. The risk of this hypertension subtype increased with the number of risk haplotypes in the two loci, using logistic regression model after adjusting within-family correlation. The relationships between LPL variants and hypertension-related disorders were also confirmed by an independent association study. Finally, we showed a trend that individuals with homozygous risk haplotypes had decreased LPL expression after a fatty meal, as opposed to those with protective haplotypes. In conclusion, this study strongly suggests that two LPL intronic variants may be associated with development of the hypertension endophenotype with elevated TG. Copyright 2008 Wiley-Liss, Inc.
Prescott, Natalie J.; Lehne, Benjamin; Stone, Kristina; Lee, James C.; Taylor, Kirstin; Knight, Jo; Papouli, Efterpi; Mirza, Muddassar M.; Simpson, Michael A.; Spain, Sarah L.; Lu, Grace; Fraternali, Franca; Bumpstead, Suzannah J.; Gray, Emma; Amar, Ariella; Bye, Hannah; Green, Peter; Chung-Faye, Guy; Hayee, Bu’Hussain; Pollok, Richard; Satsangi, Jack; Parkes, Miles; Barrett, Jeffrey C.; Mansfield, John C.; Sanderson, Jeremy; Lewis, Cathryn M.; Weale, Michael E.; Schlitt, Thomas; Mathew, Christopher G.
2015-01-01
The contribution of rare coding sequence variants to genetic susceptibility in complex disorders is an important but unresolved question. Most studies thus far have investigated a limited number of genes from regions which contain common disease associated variants. Here we investigate this in inflammatory bowel disease by sequencing the exons and proximal promoters of 531 genes selected from both genome-wide association studies and pathway analysis in pooled DNA panels from 474 cases of Crohn’s disease and 480 controls. 80 variants with evidence of association in the sequencing experiment or with potential functional significance were selected for follow up genotyping in 6,507 IBD cases and 3,064 population controls. The top 5 disease associated variants were genotyped in an extension panel of 3,662 IBD cases and 3,639 controls, and tested for association in a combined analysis of 10,147 IBD cases and 7,008 controls. A rare coding variant p.G454C in the BTNL2 gene within the major histocompatibility complex was significantly associated with increased risk for IBD (p = 9.65x10−10, OR = 2.3[95% CI = 1.75–3.04]), but was independent of the known common associated CD and UC variants at this locus. Rare (<1%) and low frequency (1–5%) variants in 3 additional genes showed suggestive association (p<0.005) with either an increased risk (ARIH2 c.338-6C>T) or decreased risk (IL12B p.V298F, and NICN p.H191R) of IBD. These results provide additional insights into the involvement of the inhibition of T cell activation in the development of both sub-phenotypes of inflammatory bowel disease. We suggest that although rare coding variants may make a modest overall contribution to complex disease susceptibility, they can inform our understanding of the molecular pathways that contribute to pathogenesis. PMID:25671699
A Comparison of Variant Calling Pipelines Using Genome in a Bottle as a Reference
2015-01-01
High-throughput sequencing, especially of exomes, is a popular diagnostic tool, but it is difficult to determine which tools are the best at analyzing this data. In this study, we use the NIST Genome in a Bottle results as a novel resource for validation of our exome analysis pipeline. We use six different aligners and five different variant callers to determine which pipeline, of the 30 total, performs the best on a human exome that was used to help generate the list of variants detected by the Genome in a Bottle Consortium. Of these 30 pipelines, we found that Novoalign in conjunction with GATK UnifiedGenotyper exhibited the highest sensitivity while maintaining a low number of false positives for SNVs. However, it is apparent that indels are still difficult for any pipeline to handle with none of the tools achieving an average sensitivity higher than 33% or a Positive Predictive Value (PPV) higher than 53%. Lastly, as expected, it was found that aligners can play as vital a role in variant detection as variant callers themselves. PMID:26539496
Genetic variations in NADPH-CYP450 oxidoreductase in a Czech Slavic cohort
Tomková, Mária; Panda, Satya Prakash; Šeda, Ondřej; Baxová, Alice; Hůlková, Martina; Masters, Bettie Sue Siler; Martásek, Pavel
2015-01-01
Background Gene polymorphisms encoding the enzyme NADPH–cytochrome P450 oxidoreductase (POR) contribute to inter-individual differences in drug response. Aim To estimate polymorphic allele frequencies of the POR gene in a Czech Slavic population. Materials & Methods The gene POR was analyzed in 322 Czech Slavic individuals from a control cohort by sequencing and HRM analysis. Results Twenty-five SNP genetic variations were identified. Of these variants, 7 were new, unreported SNPs, including two SNPs in the 5´flanking region (g.4965 C>T and g.4994 G>T), one intronic variant (c.1899 −20C>T), one synonymous SNP (p.20Ala=) and three nonsynonymous SNPs (p.Thr29Ser, p.Pro384Leu and p.Thr529Met). The p.Pro384Leu variant exhibited reduced enzymatic activities compared to wild type. Conclusion New POR variant identification indicates that the number of uncommon variants might be specific for each subpopulation being investigated, particularly germane to the singular role that POR plays in providing reducing equivalents to all CYPs in the endoplasmic reticulum. PMID:25712184
Population sequencing reveals breed and sub-species specific CNVs in cattle
USDA-ARS?s Scientific Manuscript database
Individualized copy number variation (CNV) maps have highlighted the need for population surveys of cattle to detect rare and common variants. While SNP and comparative genomic hybridization (CGH) arrays have provided preliminary data, next-generation sequence (NGS) data analysis offers an increased...
Mutation analysis of FANCD2, BRIP1/BACH1, LMO4 and SFN in familial breast cancer.
Lewis, Aaron G; Flanagan, James; Marsh, Anna; Pupo, Gulietta M; Mann, Graham; Spurdle, Amanda B; Lindeman, Geoffrey J; Visvader, Jane E; Brown, Melissa A; Chenevix-Trench, Georgia
2005-01-01
Mutations in known predisposition genes account for only about a third of all multiple-case breast cancer families. We hypothesized that germline mutations in FANCD2, BRIP1/BACH1, LMO4 and SFN may account for some of the unexplained multiple-case breast cancer families. The families used in this study were ascertained through the Kathleen Cuningham Foundation Consortium for Research into Familial Breast Cancer (kConFab). Denaturing high performance liquid chromatography (DHPLC) analysis of the coding regions of these four genes was conducted in the youngest affected cases of 30 to 267 non-BRCA1/2 breast cancer families. In addition, a further 399 index cases were also screened for mutations in two functionally significant regions of the FANCD2 gene and 253 index cases were screened for two previously reported mutations in BACH1 (p. P47A and p. M299I). DHPLC analysis of FANCD2 identified six silent exonic variants, and a large number of intronic variants, which tagged two common haplotypes. One protein truncating variant was found in BRIP1/BACH1, as well as four missense variants, a silent change and a variant in the 3' untranslated region. No missense or splice site mutations were found in LMO4 or SFN. Analysis of the missense, silent and frameshift variants of FANCD2 and BACH1 in relatives of the index cases, and in a panel of controls, found no evidence suggestive of pathogenicity. There is no evidence that highly penetrant exonic or splice site mutations in FANCD2, BRIP1/BACH1, LMO4 or SFN contribute to familial breast cancer. Large scale association studies will be necessary to determine whether any of the polymorphisms or haplotypes identified in these genes contributes to breast cancer risk.
Siggs, Owen M; Javadiyan, Shari; Sharma, Shiwani; Souzeau, Emmanuelle; Lower, Karen M; Taranath, Deepa A; Black, Jo; Pater, John; Willoughby, John G; Burdon, Kathryn P; Craig, Jamie E
2017-01-01
Congenital cataract is a rare but severe paediatric visual impediment, often caused by variants in one of several crystallin genes that produce the bulk of structural proteins in the lens. Here we describe a pedigree with autosomal dominant isolated congenital cataract and linkage to the crystallin gene cluster on chromosome 22. No rare single nucleotide variants or short indels were identified by exome sequencing, yet copy number variant analysis revealed a duplication spanning both CRYBB1 and CRYBA4. While the CRYBA4 duplication was complete, the CRYBB1 duplication was not, with the duplicated CRYBB1 product predicted to create a gain of function allele. This association suggests a new genetic mechanism for the development of isolated congenital cataract. PMID:28272538
Ataxia telangiectasia presenting as dopa-responsive cervical dystonia
Mohire, Mahavir D.; Schneider, Susanne A.; Stamelou, Maria; Wood, Nicholas W.; Bhatia, Kailash P.
2013-01-01
Objective: To identify the cause of cervical dopa-responsive dystonia (DRD) in a Muslim Indian family inherited in an apparently autosomal recessive fashion, as previously described in this journal. Methods: Previous testing for mutations in the genes known to cause DRD (GCH1, TH, and SPR) had been negative. Whole exome sequencing was performed on all 3 affected individuals for whom DNA was available to identify potentially pathogenic shared variants. Genotyping data obtained for all 3 affected individuals using the OmniExpress single nucleotide polymorphism chip (Illumina, San Diego, CA) were used to perform linkage analysis, autozygosity mapping, and copy number variation analysis. Sanger sequencing was used to confirm all variants. Results: After filtering of the variants, exome sequencing revealed 2 genes harboring potentially pathogenic compound heterozygous variants (ATM and LRRC16A). Of these, the variants in ATM segregated perfectly with the cervical DRD. Both mutations detected in ATM have been shown to be pathogenic, and α-fetoprotein, a marker of ataxia telangiectasia, was increased in all affected individuals. Conclusion: Biallelic mutations in ATM can cause DRD, and mutations in this gene should be considered in the differential diagnosis of unexplained DRD, particularly if the dystonia is cervical and if there is a recessive family history. ATM has previously been reported to cause isolated cervical dystonia, but never, to our knowledge, DRD. Individuals with dystonia related to ataxia telangiectasia may benefit from a trial of levodopa. PMID:23946315
Kondo, Naoshi; Bessho, Hiroaki; Honda, Shigeru; Negi, Akira
2011-02-01
To investigate whether the Y402H variant in the complement factor H gene is associated with age-related macular degeneration (AMD) in Asian populations. Meta-analysis of previous publications. Case-control groups of subjects with AMD and controls from 13 association studies. We performed a meta-analysis of the association between Y402H and AMD in Asian populations using data available from 13 case-control studies involving 3973 subjects. Summary odds ratios (ORs) and 95% confidence intervals (CIs) were estimated using fixed- and random-effects models. The Q-statistic test was used to assess heterogeneity, and Egger's test was used to evaluate publication bias. Sensitivity analysis, cumulative meta-analysis, and meta-regression analysis were also performed. Allele and genotype frequencies of the Y402H variant. The Y402H variant showed a significant summary OR of 1.97 (95% CI, 1.54-2.52; P<0.001; allelic contrast model) per allele. Possession of at least 1 copy of the C allele increased the disease risk by 1.97-fold (95% CI, 1.63-2.39; P<0.001; dominant model) and accounted for 8.8% of the attributable risk of AMD in Asian populations. Sensitivity analysis indicated the robustness of our findings, and evidence of publication bias was not observed in our meta-analysis. Meta-regression analysis indicated no significant effect of baseline study characteristics on the summary effect size. Cumulative meta-analysis revealed that the summary ORs were stable and the 95% CIs narrowed with the accumulation of data over time. Our analysis provides substantial evidence that the Y402H variant is significantly associated with AMD in Asian populations. Our results expand the number of confirmed AMD susceptibility loci for Asians populations, which provide a better understanding of the genetic architecture underlying disease susceptibility and may advance the potential for preclinical prediction in future genetic tests by a combined evaluation of inherited susceptibility with previously established loci. Copyright © 2011 American Academy of Ophthalmology. Published by Elsevier Inc. All rights reserved.
Genetic Structures of Copy Number Variants Revealed by Genotyping Single Sperm
Luo, Minjie; Cui, Xiangfeng; Fredman, David; Brookes, Anthony J.; Azaro, Marco A.; Greenawalt, Danielle M.; Hu, Guohong; Wang, Hui-Yun; Tereshchenko, Irina V.; Lin, Yong; Shentu, Yue; Gao, Richeng; Shen, Li; Li, Honghua
2009-01-01
Background Copy number variants (CNVs) occupy a significant portion of the human genome and may have important roles in meiotic recombination, human genome evolution and gene expression. Many genetic diseases may be underlain by CNVs. However, because of the presence of their multiple copies, variability in copy numbers and the diploidy of the human genome, detailed genetic structure of CNVs cannot be readily studied by available techniques. Methodology/Principal Findings Single sperm samples were used as the primary subjects for the study so that CNV haplotypes in the sperm donors could be studied individually. Forty-eight CNVs characterized in a previous study were analyzed using a microarray-based high-throughput genotyping method after multiplex amplification. Seventeen single nucleotide polymorphisms (SNPs) were also included as controls. Two single-base variants, either allelic or paralogous, could be discriminated for all markers. Microarray data were used to resolve SNP alleles and CNV haplotypes, to quantitatively assess the numbers and compositions of the paralogous segments in each CNV haplotype. Conclusions/Significance This is the first study of the genetic structure of CNVs on a large scale. Resulting information may help understand evolution of the human genome, gain insight into many genetic processes, and discriminate between CNVs and SNPs. The highly sensitive high-throughput experimental system with haploid sperm samples as subjects may be used to facilitate detailed large-scale CNV analysis. PMID:19384415
Cheng, Hanyin; Dharmadhikari, Avinash V; Varland, Sylvia; Ma, Ning; Domingo, Deepti; Kleyner, Robert; Rope, Alan F; Yoon, Margaret; Stray-Pedersen, Asbjørg; Posey, Jennifer E; Crews, Sarah R; Eldomery, Mohammad K; Akdemir, Zeynep Coban; Lewis, Andrea M; Sutton, Vernon R; Rosenfeld, Jill A; Conboy, Erin; Agre, Katherine; Xia, Fan; Walkiewicz, Magdalena; Longoni, Mauro; High, Frances A; van Slegtenhorst, Marjon A; Mancini, Grazia M S; Finnila, Candice R; van Haeringen, Arie; den Hollander, Nicolette; Ruivenkamp, Claudia; Naidu, Sakkubai; Mahida, Sonal; Palmer, Elizabeth E; Murray, Lucinda; Lim, Derek; Jayakar, Parul; Parker, Michael J; Giusto, Stefania; Stracuzzi, Emanuela; Romano, Corrado; Beighley, Jennifer S; Bernier, Raphael A; Küry, Sébastien; Nizon, Mathilde; Corbett, Mark A; Shaw, Marie; Gardner, Alison; Barnett, Christopher; Armstrong, Ruth; Kassahn, Karin S; Van Dijck, Anke; Vandeweyer, Geert; Kleefstra, Tjitske; Schieving, Jolanda; Jongmans, Marjolijn J; de Vries, Bert B A; Pfundt, Rolph; Kerr, Bronwyn; Rojas, Samantha K; Boycott, Kym M; Person, Richard; Willaert, Rebecca; Eichler, Evan E; Kooy, R Frank; Yang, Yaping; Wu, Joseph C; Lupski, James R; Arnesen, Thomas; Cooper, Gregory M; Chung, Wendy K; Gecz, Jozef; Stessman, Holly A F; Meng, Linyan; Lyon, Gholson J
2018-05-03
N-alpha-acetylation is a common co-translational protein modification that is essential for normal cell function in humans. We previously identified the genetic basis of an X-linked infantile lethal Mendelian disorder involving a c.109T>C (p.Ser37Pro) missense variant in NAA10, which encodes the catalytic subunit of the N-terminal acetyltransferase A (NatA) complex. The auxiliary subunit of the NatA complex, NAA15, is the dimeric binding partner for NAA10. Through a genotype-first approach with whole-exome or genome sequencing (WES/WGS) and targeted sequencing analysis, we identified and phenotypically characterized 38 individuals from 33 unrelated families with 25 different de novo or inherited, dominantly acting likely gene disrupting (LGD) variants in NAA15. Clinical features of affected individuals with LGD variants in NAA15 include variable levels of intellectual disability, delayed speech and motor milestones, and autism spectrum disorder. Additionally, mild craniofacial dysmorphology, congenital cardiac anomalies, and seizures are present in some subjects. RNA analysis in cell lines from two individuals showed degradation of the transcripts with LGD variants, probably as a result of nonsense-mediated decay. Functional assays in yeast confirmed a deleterious effect for two of the LGD variants in NAA15. Further supporting a mechanism of haploinsufficiency, individuals with copy-number variant (CNV) deletions involving NAA15 and surrounding genes can present with mild intellectual disability, mild dysmorphic features, motor delays, and decreased growth. We propose that defects in NatA-mediated N-terminal acetylation (NTA) lead to variable levels of neurodevelopmental disorders in humans, supporting the importance of the NatA complex in normal human development. Copyright © 2018 American Society of Human Genetics. All rights reserved.
Fernandez-San Jose, Patricia; Liu, Yichuan; March, Michael; Pellegrino, Renata; Golhar, Ryan; Corton, Marta; Blanco-Kelly, Fiona; López-Molina, Maria Isabel; García-Sandoval, Blanca; Guo, Yiran; Tian, Lifeng; Liu, Xuanzhu; Guan, Liping; Zhang, Jianguo; Keating, Brendan; Xu, Xun
2015-01-01
This study aimed to identify the genetics underlying dominant forms of inherited retinal dystrophies using whole exome sequencing (WES) in six families extensively screened for known mutations or genes. Thirty-eight individuals were subjected to WES. Causative variants were searched among single nucleotide variants (SNVs) and insertion/deletion variants (indels) and whenever no potential candidate emerged, copy number variant (CNV) analysis was performed. Variants or regions harboring a candidate variant were prioritized and segregation of the variant with the disease was further assessed using Sanger sequencing in case of SNVs and indels, and quantitative PCR (qPCR) for CNVs. SNV and indel analysis led to the identification of a previously reported mutation in PRPH2. Two additional mutations linked to different forms of retinal dystrophies were identified in two families: a known frameshift deletion in RPGR, a gene responsible for X-linked retinitis pigmentosa and p.Ser163Arg in C1QTNF5 associated with Late-Onset Retinal Degeneration. A novel heterozygous deletion spanning the entire region of PRPF31 was also identified in the affected members of a fourth family, which was confirmed with qPCR. This study allowed the identification of the genetic cause of the retinal dystrophy and the establishment of a correct diagnosis in four families, including a large heterozygous deletion in PRPF31, typically considered one of the pitfalls of this method. Since all findings in this study are restricted to known genes, we propose that targeted sequencing using gene-panel is an optimal first approach for the genetic screening and that once known genetic causes are ruled out, WES might be used to uncover new genes involved in inherited retinal dystrophies. PMID:26197217
A selective splicing variant of hepcidin mRNA in hepatocellular carcinoma cell lines
DOE Office of Scientific and Technical Information (OSTI.GOV)
Toki, Yasumichi; Sasaki, Katsunori, E-mail: k-sasaki@asahikawa-med.ac.jp; Tanaka, Hiroki
2016-08-05
Hepcidin is a main regulator of iron metabolism, of which abnormal expression affects intestinal absorption and reticuloendothelial sequestration of iron by interacting with ferroportin. It is also noted that abnormal iron accumulation is one of the key factors to facilitate promotion and progression of cancer including hepatoma. By RT-PCR/agarose gel electrophoresis of hepcidin mRNA in a hepatocellular carcinoma cell line HLF, a smaller mRNA band was shown in addition to the wild-type hepcidin mRNA. From sequencing analysis, this additional band was a selective splicing variant of hepcidin mRNA lacking exon 2 of HAMP gene, producing the transcript that encodes truncatedmore » peptide lacking 20 amino acids at the middle of preprohepcidin. In the present study, we used the digital PCR, because such a small amount of variant mRNA was difficult to quantitate by the conventional RT-PCR amplification. Among seven hepatoma-derived cell lines, six cell lines have significant copy numbers of this variant mRNA, but not in one cell line. In the transient transfection analysis of variant-type hepcidin cDNA, truncated preprohepcidin has a different character comparing with native preprohepcidin: its product is insensitive to digestion, and secreted into the medium as a whole preprohepcidin form without maturation. Loss or reduction of function of HAMP gene by aberrantly splicing may be a suitable phenomenon to obtain the proliferating advantage of hepatoma cells. - Highlights: • An aberrant splicing variant of hepcidin mRNA lacking exon 2 of HAMP gene. • Absolute quantification of hepcidin mRNA by digital PCR amplification. • Hepatoma-derived cell lines have significant copies of variant-type hepcidin mRNA. • Truncated preprohepcidin is secreted from cells without posttranslational cleavage.« less
Al-Bustan, Suzanne A; Al-Serri, Ahmad; Annice, Babitha G; Alnaqeeb, Majed A; Al-Kandari, Wafa Y; Dashti, Mohammed
2018-01-01
The role interethnic genetic differences play in plasma lipid level variation across populations is a global health concern. Several genes involved in lipid metabolism and transport are strong candidates for the genetic association with lipid level variation especially lipoprotein lipase (LPL). The objective of this study was to re-sequence the full LPL gene in Kuwaiti Arabs, analyse the sequence variation and identify variants that could attribute to variation in plasma lipid levels for further genetic association. Samples (n = 100) of an Arab ethnic group from Kuwait were analysed for sequence variation by Sanger sequencing across the 30 Kb LPL gene and its flanking sequences. A total of 293 variants including 252 single nucleotide polymorphisms (SNPs) and 39 insertions/deletions (InDels) were identified among which 47 variants (32 SNPs and 15 InDels) were novel to Kuwaiti Arabs. This study is the first to report sequence data and analysis of frequencies of variants at the LPL gene locus in an Arab ethnic group with a novel "rare" variant (LPL:g.18704C>A) significantly associated to HDL (B = -0.181; 95% CI (-0.357, -0.006); p = 0.043), TG (B = 0.134; 95% CI (0.004-0.263); p = 0.044) and VLDL (B = 0.131; 95% CI (-0.001-0.263); p = 0.043) levels. Sequence variation in Kuwaiti Arabs was compared to other populations and was found to be similar with regards to the number of SNPs, InDels and distribution of the number of variants across the LPL gene locus and minor allele frequency (MAF). Moreover, comparison of the identified variants and their MAF with other reports provided a list of 46 potential variants across the LPL gene to be considered for future genetic association studies. The findings warrant further investigation into the association of g.18704C>A with lipid levels in other ethnic groups and with clinical manifestations of dyslipidemia.
Al-Serri, Ahmad; Annice, Babitha G.; Alnaqeeb, Majed A.; Al-Kandari, Wafa Y.; Dashti, Mohammed
2018-01-01
The role interethnic genetic differences play in plasma lipid level variation across populations is a global health concern. Several genes involved in lipid metabolism and transport are strong candidates for the genetic association with lipid level variation especially lipoprotein lipase (LPL). The objective of this study was to re-sequence the full LPL gene in Kuwaiti Arabs, analyse the sequence variation and identify variants that could attribute to variation in plasma lipid levels for further genetic association. Samples (n = 100) of an Arab ethnic group from Kuwait were analysed for sequence variation by Sanger sequencing across the 30 Kb LPL gene and its flanking sequences. A total of 293 variants including 252 single nucleotide polymorphisms (SNPs) and 39 insertions/deletions (InDels) were identified among which 47 variants (32 SNPs and 15 InDels) were novel to Kuwaiti Arabs. This study is the first to report sequence data and analysis of frequencies of variants at the LPL gene locus in an Arab ethnic group with a novel “rare” variant (LPL:g.18704C>A) significantly associated to HDL (B = -0.181; 95% CI (-0.357, -0.006); p = 0.043), TG (B = 0.134; 95% CI (0.004–0.263); p = 0.044) and VLDL (B = 0.131; 95% CI (-0.001–0.263); p = 0.043) levels. Sequence variation in Kuwaiti Arabs was compared to other populations and was found to be similar with regards to the number of SNPs, InDels and distribution of the number of variants across the LPL gene locus and minor allele frequency (MAF). Moreover, comparison of the identified variants and their MAF with other reports provided a list of 46 potential variants across the LPL gene to be considered for future genetic association studies. The findings warrant further investigation into the association of g.18704C>A with lipid levels in other ethnic groups and with clinical manifestations of dyslipidemia. PMID:29438437
The Effects of a BDNF Val66Met Polymorphism on Posttraumatic Stress Disorder: A Meta-Analysis.
Bountress, Kaitlin E; Bacanu, Silviu-Alin; Tomko, Rachel L; Korte, Kristina J; Hicks, Terrell; Sheerin, Christina; Lind, Mackenzie J; Marraccini, Marisa; Nugent, Nicole; Amstadter, Ananda B
2018-06-06
Given evidence that posttraumatic stress disorder (PTSD) is moderately heritable, a number of studies utilizing candidate gene approaches have attempted to examine the potential contributions of theoretically relevant genetic variation. Some of these studies have found sup port for a brain-derived neurotrophic factor (BDNF) variant, Val66Met, in the risk of developing PTSD, while others have failed to find this link. This study sought to reconcile these conflicting findings using a meta-analysis framework. Analyses were also used to determine whether there is significant heterogeneity in the link between this variant and PTSD. We conducted a systematic review of the literature on BDNF and PTSD from the PsycINFO and PubMed databases. A total of 11 studies were included in the analysis. Findings indicate a marginally significant effect of the BDNF Val66Met variant on PTSD (p < 0.1). However, of the 11 studies included, only 2 suggested an effect with a non-zero confidence interval, one of which showed a z score of 3.31. We did not find any evidence for heterogeneity. Findings from this meta-analytic investigation of the published literature provide little support for the Val66Met variant of BDNF as a predictor of PTSD. Future well-powered agnostic genome-wide association studies with more refined phenotyping are needed to clarify genetic influences on PTSD. © 2018 S. Karger AG, Basel.
Legge, S E; Hamshere, M L; Ripke, S; Pardinas, A F; Goldstein, J I; Rees, E; Richards, A L; Leonenko, G; Jorskog, L F; Chambert, K D; Collier, D A; Genovese, G; Giegling, I; Holmans, P; Jonasdottir, A; Kirov, G; McCarroll, S A; MacCabe, J H; Mantripragada, K; Moran, J L; Neale, B M; Stefansson, H; Rujescu, D; Daly, M J; Sullivan, P F; Owen, M J; O'Donovan, M C; Walters, J T R
2017-10-01
The antipsychotic clozapine is uniquely effective in the management of schizophrenia; however, its use is limited by its potential to induce agranulocytosis. The causes of this, and of its precursor neutropenia, are largely unknown, although genetic factors have an important role. We sought risk alleles for clozapine-associated neutropenia in a sample of 66 cases and 5583 clozapine-treated controls, through a genome-wide association study (GWAS), imputed human leukocyte antigen (HLA) alleles, exome array and copy-number variation (CNV) analyses. We then combined associated variants in a meta-analysis with data from the Clozapine-Induced Agranulocytosis Consortium (up to 163 cases and 7970 controls). In the largest combined sample to date, we identified a novel association with rs149104283 (odds ratio (OR)=4.32, P=1.79 × 10 -8 ), intronic to transcripts of SLCO1B3 and SLCO1B7, members of a family of hepatic transporter genes previously implicated in adverse drug reactions including simvastatin-induced myopathy and docetaxel-induced neutropenia. Exome array analysis identified gene-wide associations of uncommon non-synonymous variants within UBAP2 and STARD9. We additionally provide independent replication of a previously identified variant in HLA-DQB1 (OR=15.6, P=0.015, positive predictive value=35.1%). These results implicate biological pathways through which clozapine may act to cause this serious adverse effect.
Legge, S E; Hamshere, M L; Ripke, S; Pardinas, A F; Goldstein, J I; Rees, E; Richards, A L; Leonenko, G; Jorskog, L F; Goldstein, Jacqueline I; Jarskog, L Fredrik; Hilliard, Chris; Alfirevic, Ana; Duncan, Laramie; Fourches, Denis; Huang, Hailiang; Lek, Monkol; Neale, Benjamin M; Ripke, Stephan; Shianna, Kevin; Szatkiewicz, Jin P; Tropsha, Alexander; van den Oord, Edwin JCG; Cascorbi, Ingolf; Dettling, Michael; Gazit, Ephraim; Goff, Donald C; Holden, Arthur L; Kelly, Deanna L; Malhotra, Anil K; Nielsen, Jimmi; Pirmohamed, Munir; Rujescu, Dan; Werge, Thomas; Levy, Deborah L; Josiassen, Richard C; Kennedy, James L; Lieberman, Jeffrey A; Daly, Mark J; Sullivan, Patrick F; Chambert, K D; Collier, D A; Genovese, G; Giegling, I; Holmans, P; Jonasdottir, A; Kirov, G; McCarroll, S A; MacCabe, J H; Mantripragada, K; Moran, J L; Neale, B M; Stefansson, H; Rujescu, D; Daly, M J; Sullivan, P F; Owen, M J; O'Donovan, M C; Walters, J T R
2017-01-01
The antipsychotic clozapine is uniquely effective in the management of schizophrenia; however, its use is limited by its potential to induce agranulocytosis. The causes of this, and of its precursor neutropenia, are largely unknown, although genetic factors have an important role. We sought risk alleles for clozapine-associated neutropenia in a sample of 66 cases and 5583 clozapine-treated controls, through a genome-wide association study (GWAS), imputed human leukocyte antigen (HLA) alleles, exome array and copy-number variation (CNV) analyses. We then combined associated variants in a meta-analysis with data from the Clozapine-Induced Agranulocytosis Consortium (up to 163 cases and 7970 controls). In the largest combined sample to date, we identified a novel association with rs149104283 (odds ratio (OR)=4.32, P=1.79 × 10−8), intronic to transcripts of SLCO1B3 and SLCO1B7, members of a family of hepatic transporter genes previously implicated in adverse drug reactions including simvastatin-induced myopathy and docetaxel-induced neutropenia. Exome array analysis identified gene-wide associations of uncommon non-synonymous variants within UBAP2 and STARD9. We additionally provide independent replication of a previously identified variant in HLA-DQB1 (OR=15.6, P=0.015, positive predictive value=35.1%). These results implicate biological pathways through which clozapine may act to cause this serious adverse effect. PMID:27400856
Population sequencing reveals breed and sub-species specific CNVs in cattle
USDA-ARS?s Scientific Manuscript database
Individualized copy number variation (CNV) maps have highlighted the need for population surveys of cattle to detect the rare and common variants. While SNP and comparative genomic hybridization (CGH) arrays have provided preliminary data, next-generation sequence (NGS) data analysis offers an incre...
The genomic and transcriptomic architecture of 2,000 breast tumours reveals novel subgroups.
Curtis, Christina; Shah, Sohrab P; Chin, Suet-Feung; Turashvili, Gulisa; Rueda, Oscar M; Dunning, Mark J; Speed, Doug; Lynch, Andy G; Samarajiwa, Shamith; Yuan, Yinyin; Gräf, Stefan; Ha, Gavin; Haffari, Gholamreza; Bashashati, Ali; Russell, Roslin; McKinney, Steven; Langerød, Anita; Green, Andrew; Provenzano, Elena; Wishart, Gordon; Pinder, Sarah; Watson, Peter; Markowetz, Florian; Murphy, Leigh; Ellis, Ian; Purushotham, Arnie; Børresen-Dale, Anne-Lise; Brenton, James D; Tavaré, Simon; Caldas, Carlos; Aparicio, Samuel
2012-04-18
The elucidation of breast cancer subgroups and their molecular drivers requires integrated views of the genome and transcriptome from representative numbers of patients. We present an integrated analysis of copy number and gene expression in a discovery and validation set of 997 and 995 primary breast tumours, respectively, with long-term clinical follow-up. Inherited variants (copy number variants and single nucleotide polymorphisms) and acquired somatic copy number aberrations (CNAs) were associated with expression in ~40% of genes, with the landscape dominated by cis- and trans-acting CNAs. By delineating expression outlier genes driven in cis by CNAs, we identified putative cancer genes, including deletions in PPP2R2A, MTAP and MAP2K4. Unsupervised analysis of paired DNA–RNA profiles revealed novel subgroups with distinct clinical outcomes, which reproduced in the validation cohort. These include a high-risk, oestrogen-receptor-positive 11q13/14 cis-acting subgroup and a favourable prognosis subgroup devoid of CNAs. Trans-acting aberration hotspots were found to modulate subgroup-specific gene networks, including a TCR deletion-mediated adaptive immune response in the ‘CNA-devoid’ subgroup and a basal-specific chromosome 5 deletion-associated mitotic network. Our results provide a novel molecular stratification of the breast cancer population, derived from the impact of somatic CNAs on the transcriptome.
Genetic variations in NADPH-CYP450 oxidoreductase in a Czech Slavic cohort.
Tomková, Mária; Panda, Satya Prakash; Šeda, Ondřej; Baxová, Alice; Hůlková, Martina; Siler Masters, Bettie Sue; Martásek, Pavel
2015-01-01
Estimating polymorphic allele frequencies of the NADPH-CYP450 oxidoreductase (POR) gene in a Czech Slavic population. The POR gene was analyzed in 322 individuals from a control cohort by sequencing and high resolution melting analysis. We identified seven unreported SNP genetic variations, including two SNPs in the 5' flanking region (g.4965C>T and g.4994G>T), one intronic variant (c.1899-20C>T), one synonymous SNP (p.20Ala=) and three nonsynonymous SNPs (p.Thr29Ser, p.Pro384Leu and p.Thr529Met). The p.Pro384Leu variant exhibited reduced enzymatic activities compared with wild-type. New POR variant identification indicates the number of uncommon variants might be specific for each subpopulation being investigated, particularly germane to the singular role that POR plays in providing reducing equivalents to all CYP450s in the endoplasmic reticulum. Original submitted 15 September 2014; Revision submitted 17 November 2014.
Johnson, Emma C; Border, Richard; Melroy-Greif, Whitney E; de Leeuw, Christiaan A; Ehringer, Marissa A; Keller, Matthew C
2017-11-15
A recent analysis of 25 historical candidate gene polymorphisms for schizophrenia in the largest genome-wide association study conducted to date suggested that these commonly studied variants were no more associated with the disorder than would be expected by chance. However, the same study identified other variants within those candidate genes that demonstrated genome-wide significant associations with schizophrenia. As such, it is possible that variants within historic schizophrenia candidate genes are associated with schizophrenia at levels above those expected by chance, even if the most-studied specific polymorphisms are not. The present study used association statistics from the largest schizophrenia genome-wide association study conducted to date as input to a gene set analysis to investigate whether variants within schizophrenia candidate genes are enriched for association with schizophrenia. As a group, variants in the most-studied candidate genes were no more associated with schizophrenia than were variants in control sets of noncandidate genes. While a small subset of candidate genes did appear to be significantly associated with schizophrenia, these genes were not particularly noteworthy given the large number of more strongly associated noncandidate genes. The history of schizophrenia research should serve as a cautionary tale to candidate gene investigators examining other phenotypes: our findings indicate that the most investigated candidate gene hypotheses of schizophrenia are not well supported by genome-wide association studies, and it is likely that this will be the case for other complex traits as well. Copyright © 2017 Society of Biological Psychiatry. Published by Elsevier Inc. All rights reserved.
Lessons learned from additional research analyses of unsolved clinical exome cases.
Eldomery, Mohammad K; Coban-Akdemir, Zeynep; Harel, Tamar; Rosenfeld, Jill A; Gambin, Tomasz; Stray-Pedersen, Asbjørg; Küry, Sébastien; Mercier, Sandra; Lessel, Davor; Denecke, Jonas; Wiszniewski, Wojciech; Penney, Samantha; Liu, Pengfei; Bi, Weimin; Lalani, Seema R; Schaaf, Christian P; Wangler, Michael F; Bacino, Carlos A; Lewis, Richard Alan; Potocki, Lorraine; Graham, Brett H; Belmont, John W; Scaglia, Fernando; Orange, Jordan S; Jhangiani, Shalini N; Chiang, Theodore; Doddapaneni, Harsha; Hu, Jianhong; Muzny, Donna M; Xia, Fan; Beaudet, Arthur L; Boerwinkle, Eric; Eng, Christine M; Plon, Sharon E; Sutton, V Reid; Gibbs, Richard A; Posey, Jennifer E; Yang, Yaping; Lupski, James R
2017-03-21
Given the rarity of most single-gene Mendelian disorders, concerted efforts of data exchange between clinical and scientific communities are critical to optimize molecular diagnosis and novel disease gene discovery. We designed and implemented protocols for the study of cases for which a plausible molecular diagnosis was not achieved in a clinical genomics diagnostic laboratory (i.e. unsolved clinical exomes). Such cases were recruited to a research laboratory for further analyses, in order to potentially: (1) accelerate novel disease gene discovery; (2) increase the molecular diagnostic yield of whole exome sequencing (WES); and (3) gain insight into the genetic mechanisms of disease. Pilot project data included 74 families, consisting mostly of parent-offspring trios. Analyses performed on a research basis employed both WES from additional family members and complementary bioinformatics approaches and protocols. Analysis of all possible modes of Mendelian inheritance, focusing on both single nucleotide variants (SNV) and copy number variant (CNV) alleles, yielded a likely contributory variant in 36% (27/74) of cases. If one includes candidate genes with variants identified within a single family, a potential contributory variant was identified in a total of ~51% (38/74) of cases enrolled in this pilot study. The molecular diagnosis was achieved in 30/63 trios (47.6%). Besides this, the analysis workflow yielded evidence for pathogenic variants in disease-associated genes in 4/6 singleton cases (66.6%), 1/1 multiplex family involving three affected siblings, and 3/4 (75%) quartet families. Both the analytical pipeline and the collaborative efforts between the diagnostic and research laboratories provided insights that allowed recent disease gene discoveries (PURA, TANGO2, EMC1, GNB5, ATAD3A, and MIPEP) and increased the number of novel genes, defined in this study as genes identified in more than one family (DHX30 and EBF3). An efficient genomics pipeline in which clinical sequencing in a diagnostic laboratory is followed by the detailed reanalysis of unsolved cases in a research environment, supplemented with WES data from additional family members, and subject to adjuvant bioinformatics analyses including relaxed variant filtering parameters in informatics pipelines, can enhance the molecular diagnostic yield and provide mechanistic insights into Mendelian disorders. Implementing these approaches requires collaborative clinical molecular diagnostic and research efforts.
Abrahams, M.-R.; Anderson, J. A.; Giorgi, E. E.; Seoighe, C.; Mlisana, K.; Ping, L.-H.; Athreya, G. S.; Treurnicht, F. K.; Keele, B. F.; Wood, N.; Salazar-Gonzalez, J. F.; Bhattacharya, T.; Chu, H.; Hoffman, I.; Galvin, S.; Mapanje, C.; Kazembe, P.; Thebus, R.; Fiscus, S.; Hide, W.; Cohen, M. S.; Karim, S. Abdool; Haynes, B. F.; Shaw, G. M.; Hahn, B. H.; Korber, B. T.; Swanstrom, R.; Williamson, C.
2009-01-01
Identifying the specific genetic characteristics of successfully transmitted variants may prove central to the development of effective vaccine and microbicide interventions. Although human immunodeficiency virus transmission is associated with a population bottleneck, the extent to which different factors influence the diversity of transmitted viruses is unclear. We estimate here the number of transmitted variants in 69 heterosexual men and women with primary subtype C infections. From 1,505 env sequences obtained using a single genome amplification approach we show that 78% of infections involved single variant transmission and 22% involved multiple variant transmissions (median of 3). We found evidence for mutations selected for cytotoxic-T-lymphocyte or antibody escape and a high prevalence of recombination in individuals infected with multiple variants representing another potential escape pathway in these individuals. In a combined analysis of 171 subtype B and C transmission events, we found that infection with more than one variant does not follow a Poisson distribution, indicating that transmission of individual virions cannot be seen as independent events, each occurring with low probability. While most transmissions resulted from a single infectious unit, multiple variant transmissions represent a significant fraction of transmission events, suggesting that there may be important mechanistic differences between these groups that are not yet understood. PMID:19193811
Green, Luke R; Lucidarme, Jay; Dave, Neelam; Chan, Hannah; Clark, Stephen; Borrow, Ray; Bayliss, Christopher D
2018-06-27
A recombinant NadA protein is one of the four major protective antigens of 4C-MenB (Bexsero®), a vaccine developed for serogroup B Neisseria meningitidis (MenB). The Meningococcal Antigen Typing System (MATS) is utilised as a high throughput assay for assessing the invasive MenB strain coverage of 4C-MenB. Where present, the nadA gene is subject to phase variable changes in transcription due to a 5'TAAA repeat tract located in a regulatory region. The promoter-containing intergenic region sequences (IGR) and 5'TAAA repeat numbers were determined for 906 invasive meningococcal disease isolates possessing the nadA gene. Exclusion of the 5'TAAA repeats reduced the number of IGR alleles from 82 to 23. Repeat numbers were associated with low and high levels of NadA expression by Western blotting and ELISA. Low expression repeat numbers were present in 83% of 179 MenB isolates with NadA-2/3 or Nad-1 peptide variants and 68% of 480 MenW ST-11 complex isolates with Nad-2/3 peptide variants. For isolates with vaccine-compatible NadA variants, 93% of MATS negative isolates were associated with low expression repeat numbers whereas 63% of isolates with MATS RP scores above the 95% confidence interval for the positive bactericidal threshold had high expression repeat numbers. Analysis of the 5'TAAA repeat number has potential as a rapid, high throughput method for assessing strain coverage for the NadA-component of 4C-MenB. A key application will be assessing coverage in meningococcal disease cases where confirmation is by PCR only and MATS cannot be applied. Copyright © 2018 Green et al.
Diroma, Maria Angela; Lubisco, Paolo; Attimonelli, Marcella
2016-11-08
The abundance of biological data characterizing the genomics era is contributing to a comprehensive understanding of human mitochondrial genetics. Nevertheless, many aspects are still unclear, specifically about the variability of the 22 human mitochondrial transfer RNA (tRNA) genes and their involvement in diseases. The complex enrichment and isolation of tRNAs in vitro leads to an incomplete knowledge of their post-transcriptional modifications and three-dimensional folding, essential for correct tRNA functioning. An accurate annotation of mitochondrial tRNA variants would be definitely useful and appreciated by mitochondrial researchers and clinicians since the most of bioinformatics tools for variant annotation and prioritization available so far cannot shed light on the functional role of tRNA variations. To this aim, we updated our MToolBox pipeline for mitochondrial DNA analysis of high throughput and Sanger sequencing data by integrating tRNA variant annotations in order to identify and characterize relevant variants not only in protein coding regions, but also in tRNA genes. The annotation step in the pipeline now provides detailed information for variants mapping onto the 22 mitochondrial tRNAs. For each mt-tRNA position along the entire genome, the relative tRNA numbering, tRNA type, cloverleaf secondary domains (loops and stems), mature nucleotide and interactions in the three-dimensional folding were reported. Moreover, pathogenicity predictions for tRNA and rRNA variants were retrieved from the literature and integrated within the annotations provided by MToolBox, both in the stand-alone version and web-based tool at the Mitochondrial Disease Sequence Data Resource (MSeqDR) website. All the information available in the annotation step of MToolBox were exploited to generate custom tracks which can be displayed in the GBrowse instance at MSeqDR website. To the best of our knowledge, specific data regarding mitochondrial variants in tRNA genes were introduced for the first time in a tool for mitochondrial genome analysis, supporting the interpretation of genetic variants in specific genomic contexts.
USDA-ARS?s Scientific Manuscript database
Individualized copy number variation (CNV) maps have highlighted the need for population surveys of cattle to detect rare and common variants. While SNP and comparative genomic hybridization (CGH) arrays have provided preliminary data, next-generation sequence (NGS) data analysis offers an increased...
Lorenzetti, Mario Alejandro; Gantuz, Magdalena; Altcheh, Jaime; De Matteo, Elena; Chabay, Paola Andrea; Preciado, María Victoria
2012-03-01
The ubiquitous Epstein-Barr virus (EBV) is related to the development of lymphoma and is also the etiological agent for infectious mononucleosis (IM). Sequence variations in the gene encoding LMP1 have been deeply studied in different pathologies and geographic regions. Controversial results propose the existence of tumor-related variants, while others argued in favor of a geographical distribution of these variants. Reports assessing EBV variants in IM were performed in adult patients who displayed multiple variant infections. In the present study, LMP1 variants in 15 pediatric patients with IM and 20 pediatric patients with EBV-associated lymphomas from Argentina were analyzed as representatives of benign and malignant infections in children, respectively. A 3-month follow-up study of LMP1 variants in peripheral blood cells and in oral secretions of patients with IM was performed. Moreover, an integrated linkage analysis was performed with variants of EBNA1 and the promoter region of BZLF1. Similar sequence polymorphisms were detected in both pathological conditions, IM and lymphoma, but these differ from those previously described in healthy donors from Argentina and Brazil. The results suggest that certain LMP1 polymorphisms, namely, the 30-bp deletion and high copy number of the 33-bp repeats, are associated with EBV-related pathologies, either benign or malignant, instead of just being tumor related. Additionally, this is the first study to describe the Alaskan variant in EBV-related lymphomas that previously was restricted to nasopharyngeal carcinomas from North America.
Gantuz, Magdalena; Altcheh, Jaime; De Matteo, Elena; Chabay, Paola Andrea; Preciado, María Victoria
2012-01-01
The ubiquitous Epstein-Barr virus (EBV) is related to the development of lymphoma and is also the etiological agent for infectious mononucleosis (IM). Sequence variations in the gene encoding LMP1 have been deeply studied in different pathologies and geographic regions. Controversial results propose the existence of tumor-related variants, while others argued in favor of a geographical distribution of these variants. Reports assessing EBV variants in IM were performed in adult patients who displayed multiple variant infections. In the present study, LMP1 variants in 15 pediatric patients with IM and 20 pediatric patients with EBV-associated lymphomas from Argentina were analyzed as representatives of benign and malignant infections in children, respectively. A 3-month follow-up study of LMP1 variants in peripheral blood cells and in oral secretions of patients with IM was performed. Moreover, an integrated linkage analysis was performed with variants of EBNA1 and the promoter region of BZLF1. Similar sequence polymorphisms were detected in both pathological conditions, IM and lymphoma, but these differ from those previously described in healthy donors from Argentina and Brazil. The results suggest that certain LMP1 polymorphisms, namely, the 30-bp deletion and high copy number of the 33-bp repeats, are associated with EBV-related pathologies, either benign or malignant, instead of just being tumor related. Additionally, this is the first study to describe the Alaskan variant in EBV-related lymphomas that previously was restricted to nasopharyngeal carcinomas from North America. PMID:22205789
Exome analysis of a family with Wolff-Parkinson-White syndrome identifies a novel disease locus.
Bowles, Neil E; Jou, Chuanchau J; Arrington, Cammon B; Kennedy, Brett J; Earl, Aubree; Matsunami, Norisada; Meyers, Lindsay L; Etheridge, Susan P; Saarel, Elizabeth V; Bleyl, Steven B; Yost, H Joseph; Yandell, Mark; Leppert, Mark F; Tristani-Firouzi, Martin; Gruber, Peter J
2015-12-01
Wolff-Parkinson-White (WPW) syndrome is a common cause of supraventricular tachycardia that carries a risk of sudden cardiac death. To date, mutations in only one gene, PRKAG2, which encodes the 5'-AMP-activated protein kinase subunit γ-2, have been identified as causative for WPW. DNA samples from five members of a family with WPW were analyzed by exome sequencing. We applied recently designed prioritization strategies (VAAST/pedigree VAAST) coupled with an ontology-based algorithm (Phevor) that reduced the number of potentially damaging variants to 10: a variant in KCNE2 previously associated with Long QT syndrome was also identified. Of these 11 variants, only MYH6 p.E1885K segregated with the WPW phenotype in all affected individuals and was absent in 10 unaffected family members. This variant was predicted to be damaging by in silico methods and is not present in the 1,000 genome and NHLBI exome sequencing project databases. Screening of a replication cohort of 47 unrelated WPW patients did not identify other likely causative variants in PRKAG2 or MYH6. MYH6 variants have been identified in patients with atrial septal defects, cardiomyopathies, and sick sinus syndrome. Our data highlight the pleiotropic nature of phenotypes associated with defects in this gene. © 2015 Wiley Periodicals, Inc.
Exome Analysis of a Family with Wolff–Parkinson–White Syndrome Identifies a Novel Disease Locus
Bowles, Neil E.; Jou, Chuanchau J.; Arrington, Cammon B.; Kennedy, Brett J.; Earl, Aubree; Matsunami, Norisada; Meyers, Lindsay L.; Etheridge, Susan P.; Saarel, Elizabeth V.; Bleyl, Steven B.; Yost, H. Joseph; Yandell, Mark; Leppert, Mark F.; Tristani-Firouzi, Martin; Gruber, Peter J.
2016-01-01
Wolff–Parkinson–White (WPW) syndrome is a common cause of supraventricular tachycardia that carries a risk of sudden cardiac death. To date, mutations in only one gene, PRKAG2, which encodes the 5’ -AMP-activated protein kinase subunit γ-2, have been identified as causative for WPW. DNA samples from five members of a family with WPW were analyzed by exome sequencing. We applied recently designed prioritization strategies (VAAST/pedigree VAAST) coupled with an ontology-based algorithm (Phevor) that reduced the number of potentially damaging variants to 10: a variant in KCNE2 previously associated with Long QT syndrome was also identified. Of these 11 variants, only MYH6 p.E1885K segregated with the WPW phenotype in all affected individuals and was absent in 10 unaffected family members. This variant was predicted to be damaging by in silico methods and is not present in the 1,000 genome and NHLBI exome sequencing project databases. Screening of a replication cohort of 47 unrelated WPW patients did not identify other likely causative variants in PRKAG2 or MYH6. MYH6 variants have been identified in patients with atrial septal defects, cardiomyopathies, and sick sinus syndrome. Our data highlight the pleiotropic nature of phenotypes associated with defects in this gene. PMID:26284702
CYP21A2 mutation update: Comprehensive analysis of databases and published genetic variants.
Simonetti, Leandro; Bruque, Carlos D; Fernández, Cecilia S; Benavides-Mori, Belén; Delea, Marisol; Kolomenski, Jorge E; Espeche, Lucía D; Buzzalino, Noemí D; Nadra, Alejandro D; Dain, Liliana
2018-01-01
Congenital adrenal hyperplasia (CAH) is a group of autosomal recessive disorders of adrenal steroidogenesis. Disorders in steroid 21-hydroxylation account for over 95% of patients with CAH. Clinically, the 21-hydroxylase deficiency has been classified in a broad spectrum of clinical forms, ranging from severe or classical, to mild late onset or non-classical. Known allelic variants in the disease causing CYP21A2 gene are spread among different sources. Until recently, most variants reported have been identified in the clinical setting, which presumably bias described variants to pathogenic ones, as those found in the CYPAlleles database. Nevertheless, a large number of variants are being described in massive genome projects, many of which are found in dbSNP, but lack functional implications and/or their phenotypic effect. In this work, we gathered a total of 1,340 GVs in the CYP21A2 gene, from which 899 variants were unique and 230 have an effect on human health, and compiled all this information in an integrated database. We also connected CYP21A2 sequence information to phenotypic effects for all available mutations, including double mutants in cis. Data compiled in the present work could help physicians in the genetic counseling of families affected with 21-hydroxylase deficiency. © 2017 Wiley Periodicals, Inc.
2012-01-01
Background Through the wealth of information contained within them, genome-wide association studies (GWAS) have the potential to provide researchers with a systematic means of associating genetic variants with a wide variety of disease phenotypes. Due to the limitations of approaches that have analyzed single variants one at a time, it has been proposed that the genetic basis of these disorders could be determined through detailed analysis of the genetic variants themselves and in conjunction with one another. The construction of models that account for these subsets of variants requires methodologies that generate predictions based on the total risk of a particular group of polymorphisms. However, due to the excessive number of variants, constructing these types of models has so far been computationally infeasible. Results We have implemented an algorithm, known as greedy RLS, that we use to perform the first known wrapper-based feature selection on the genome-wide level. The running time of greedy RLS grows linearly in the number of training examples, the number of features in the original data set, and the number of selected features. This speed is achieved through computational short-cuts based on matrix calculus. Since the memory consumption in present-day computers can form an even tighter bottleneck than running time, we also developed a space efficient variation of greedy RLS which trades running time for memory. These approaches are then compared to traditional wrapper-based feature selection implementations based on support vector machines (SVM) to reveal the relative speed-up and to assess the feasibility of the new algorithm. As a proof of concept, we apply greedy RLS to the Hypertension – UK National Blood Service WTCCC dataset and select the most predictive variants using 3-fold external cross-validation in less than 26 minutes on a high-end desktop. On this dataset, we also show that greedy RLS has a better classification performance on independent test data than a classifier trained using features selected by a statistical p-value-based filter, which is currently the most popular approach for constructing predictive models in GWAS. Conclusions Greedy RLS is the first known implementation of a machine learning based method with the capability to conduct a wrapper-based feature selection on an entire GWAS containing several thousand examples and over 400,000 variants. In our experiments, greedy RLS selected a highly predictive subset of genetic variants in a fraction of the time spent by wrapper-based selection methods used together with SVM classifiers. The proposed algorithms are freely available as part of the RLScore software library at http://users.utu.fi/aatapa/RLScore/. PMID:22551170
NASA Astrophysics Data System (ADS)
Kim, Edward; Baloch, Zubair; Kim, Caroline
2015-03-01
The number of new cases of thyroid cancer are dramatically increasing as incidences of this cancer have more than doubled since the early 1970s. Tall cell variant (TCV-PTC) papillary thyroid carcinoma is one type of thyroid cancer that is more aggressive and usually associated with higher local recurrence and distant metastasis. This variant can be identified through visual characteristics of cells in histological images. Thus, we created a fully automatic algorithm that is able to segment cells using a multi-stage approach. Our method learns the statistical characteristics of nuclei and cells during the segmentation process and utilizes this information for a more accurate result. Furthermore, we are able to analyze the detected regions and extract characteristic cell data that can be used to assist in clinical diagnosis.
Burden of Common Complex Disease Variants in the Exomes of Two Healthy Centenarian Brothers.
Tindale, Lauren C; Zeng, Andy; Bretherick, Karla L; Leach, Stephen; Thiessen, Nina; Brooks-Wilson, Angela R
2015-01-01
It is not understood whether long-term good health is promoted by the absence of disease risk variants, the presence of protective variants, or both. We characterized the exomes of two exceptionally healthy centenarian brothers aged 106 and 109 years who had never been diagnosed with cancer, cardiovascular disease, diabetes, Alzheimer's disease, or major pulmonary disease. The aim of this study was to gain insight into whether exceptional health and longevity are a result of carrying fewer disease-associated variants than typical individuals. We compared the number of disease-associated alleles, and the proportion of alleles predicted to be functionally damaging, between the centenarian brothers and published population data. Mitochondrial sequence reads were extracted from the exome data in order to analyze mitochondrial variants. The brothers carry a similar number of common disease-associated variants and predicted damaging variants compared to reference groups. They did not carry any high-penetrance clinically actionable variants. They carry mitochondrial haplogroup T, and one brother has a single heteroplasmic variant. Although our small sample size does not allow for definitive conclusions, a healthy aging and longevity phenotype is not necessarily due to a decreased burden of common disease-associated variants. Instead, it may be rare 'positive' variants that play a role in this desirable phenotype. © 2015 S. Karger AG, Basel.
Kroncke, Brett M; Glazer, Andrew M; Smith, Derek K; Blume, Jeffrey D; Roden, Dan M
2018-05-01
Accurately predicting the impact of rare nonsynonymous variants on disease risk is an important goal in precision medicine. Variants in the cardiac sodium channel SCN5A (protein Na V 1.5; voltage-dependent cardiac Na+ channel) are associated with multiple arrhythmia disorders, including Brugada syndrome and long QT syndrome. Rare SCN5A variants also occur in ≈1% of unaffected individuals. We hypothesized that in vitro electrophysiological functional parameters explain a statistically significant portion of the variability in disease penetrance. From a comprehensive literature review, we quantified the number of carriers presenting with and without disease for 1712 reported SCN5A variants. For 356 variants, data were also available for 5 Na V 1.5 electrophysiological parameters: peak current, late/persistent current, steady-state V1/2 of activation and inactivation, and recovery from inactivation. We found that peak and late current significantly associate with Brugada syndrome ( P <0.001; ρ=-0.44; Spearman rank test) and long QT syndrome disease penetrance ( P <0.001; ρ=0.37). Steady-state V1/2 activation and recovery from inactivation associate significantly with Brugada syndrome and long QT syndrome penetrance, respectively. Continuous estimates of disease penetrance align with the current American College of Medical Genetics classification paradigm. Na V 1.5 in vitro electrophysiological parameters are correlated with Brugada syndrome and long QT syndrome disease risk. Our data emphasize the value of in vitro electrophysiological characterization and incorporating counts of affected and unaffected carriers to aid variant classification. This quantitative analysis of the electrophysiological literature should aid the interpretation of Na V 1.5 variant electrophysiological abnormalities and help improve Na V 1.5 variant classification. © 2018 American Heart Association, Inc.
Histone H3 Variants in Trichomonas vaginalis
Zubáčová, Zuzana; Hostomská, Jitka
2012-01-01
The parabasalid protist Trichomonas vaginalis is a widespread parasite that affects humans, frequently causing vaginitis in infected women. Trichomonad mitosis is marked by the persistence of the nuclear membrane and the presence of an asymmetric extranuclear spindle with no obvious direct connection to the chromosomes. No centromeric markers have been described in T. vaginalis, which has prevented a detailed analysis of mitotic events in this organism. In other eukaryotes, nucleosomes of centromeric chromatin contain the histone H3 variant CenH3. The principal aim of this work was to identify a CenH3 homolog in T. vaginalis. We performed a screen of the T. vaginalis genome to retrieve sequences of canonical and variant H3 histones. Three variant histone H3 proteins were identified, and the subcellular localization of their epitope-tagged variants was determined. The localization of the variant TVAG_185390 could not be distinguished from that of the canonical H3 histone. The sequence of the variant TVAG_087830 closely resembled that of histone H3. The tagged protein colocalized with sites of active transcription, indicating that the variant TVAG_087830 represented H3.3 in T. vaginalis. The third H3 variant (TVAG_224460) was localized to 6 or 12 distinct spots at the periphery of the nucleus, corresponding to the number of chromosomes in G1 phase and G2 phase, respectively. We propose that this variant represents the centromeric marker CenH3 and thus can be employed as a tool to study mitosis in T. vaginalis. Furthermore, we suggest that the peripheral distribution of CenH3 within the nucleus results from the association of centromeres with the nuclear envelope throughout the cell cycle. PMID:22408228
NASA Astrophysics Data System (ADS)
Yusufaly, Tahir I.; Boedicker, James Q.
2017-08-01
Microbial communities frequently communicate via quorum sensing (QS), where cells produce, secrete, and respond to a threshold level of an autoinducer (AI) molecule, thereby modulating gene expression. However, the biology of QS remains incompletely understood in heterogeneous communities, where variant bacterial strains possess distinct QS systems that produce chemically unique AIs. AI molecules bind to ‘cognate’ receptors, but also to ‘non-cognate’ receptors found in other strains, resulting in inter-strain crosstalk. Understanding these interactions is a prerequisite for deciphering the consequences of crosstalk in real ecosystems, where multiple AIs are regularly present in the same environment. As a step towards this goal, we map crosstalk in a heterogeneous community of variant QS strains onto an artificial neural network model. This formulation allows us to systematically analyze how crosstalk regulates the community’s capacity for flexible decision making, as quantified by the Boltzmann entropy of all QS gene expression states of the system. In a mean-field limit of complete cross-inhibition between variant strains, the model is exactly solvable, allowing for an analytical formula for the number of variants that maximize capacity as a function of signal kinetics and activation parameters. An analysis of previous experimental results on the Staphylococcus aureus two-component Agr system indicates that the observed combination of variant numbers, gene expression rates and threshold concentrations lies near this critical regime of parameter space where capacity peaks. The results are suggestive of a potential evolutionary driving force for diversification in certain QS systems.
Hinney, Anke; Hebebrand, Johannes
2008-01-01
The molecular genetic analysis of obesity has led to the identification of a limited number of confirmed major genes. While such major genes have a clear influence on the development of the phenotype, the underlying mutations are however (extremely) infrequent and thus of minor clinical importance only. The genetic predisposition to obesity must thus be polygenic; a number of such variants should be found in most obese subjects; however, these variants predisposing to obesity are also found in normal weight and even lean individuals. Therefore, a polygene can only be identified and validated by statistical analyses: the appropriate gene variant (allele) occurs more frequently in obese than in non-obese subjects. Each single polygene makes only a small contribution to the development of obesity. The 103Ile allele of the Val103Ile single nucleotide polymorphism (SNP) of the melanocortin-4 receptor gene (MC4R) was the first confirmed polygenetic variant with an influence on the body mass index (BMI); the more common Val103 allele is more frequent in obese individuals. As determined in a recent, large-scaled meta-analysis the effect size of this allele on mean BMI was approximately -0.5 kg/m(2). The first genome-wide association study (GWA) for obesity, based on approximately 100,000 SNPs analyzed in families of the Framingham study, revealed that a SNP in the proximity of the insulin-induced gene 2 (INSIG2) was associated with obesity. The positive result was replicated in independent samples; however, some other study groups detected no association. Currently, a meta-analysis is ongoing; its result will contribute to the evaluation of the importance of the INSIG2 polymorphism in body weight regulation. SNP alleles in intron 1 of the fat mass and obesity associated gene (FTO) confer the most relevant polygenic effect on obesity. In the first GWA for extreme early onset obesity we substantiated that variation in FTO strongly contributes to early onset obesity. Copyright 2008 S. Karger AG, Basel.
Women's experiences receiving abnormal prenatal chromosomal microarray testing results.
Bernhardt, Barbara A; Soucier, Danielle; Hanson, Karen; Savage, Melissa S; Jackson, Laird; Wapner, Ronald J
2013-02-01
Genomic microarrays can detect copy-number variants not detectable by conventional cytogenetics. This technology is diffusing rapidly into prenatal settings even though the clinical implications of many copy-number variants are currently unknown. We conducted a qualitative pilot study to explore the experiences of women receiving abnormal results from prenatal microarray testing performed in a research setting. Participants were a subset of women participating in a multicenter prospective study "Prenatal Cytogenetic Diagnosis by Array-based Copy Number Analysis." Telephone interviews were conducted with 23 women receiving abnormal prenatal microarray results. We found that five key elements dominated the experiences of women who had received abnormal prenatal microarray results: an offer too good to pass up, blindsided by the results, uncertainty and unquantifiable risks, need for support, and toxic knowledge. As prenatal microarray testing is increasingly used, uncertain findings will be common, resulting in greater need for careful pre- and posttest counseling, and more education of and resources for providers so they can adequately support the women who are undergoing testing.
Lehmann, Kjong-Van; Kahles, André; Kandoth, Cyriac; Lee, William; Schultz, Nikolaus; Stegle, Oliver; Rätsch, Gunnar
2015-01-01
We present a genome-wide analysis of splicing patterns of 282 kidney renal clear cell carcinoma patients in which we integrate data from whole-exome sequencing of tumor and normal samples, RNA-seq and copy number variation. We proposed a scoring mechanism to compare splicing patterns in tumor samples to normal samples in order to rank and detect tumor-specific isoforms that have a potential for new biomarkers. We identified a subset of genes that show introns only observable in tumor but not in normal samples, ENCODE and GEUVADIS samples. In order to improve our understanding of the underlying genetic mechanisms of splicing variation we performed a large-scale association analysis to find links between somatic or germline variants with alternative splicing events. We identified 915 cis- and trans-splicing quantitative trait loci (sQTL) associated with changes in splicing patterns. Some of these sQTL have previously been associated with being susceptibility loci for cancer and other diseases. Our analysis also allowed us to identify the function of several COSMIC variants showing significant association with changes in alternative splicing. This demonstrates the potential significance of variants affecting alternative splicing events and yields insights into the mechanisms related to an array of disease phenotypes.
Chen, Zhijian; Craiu, Radu V; Bull, Shelley B
2014-11-01
In focused studies designed to follow up associations detected in a genome-wide association study (GWAS), investigators can proceed to fine-map a genomic region by targeted sequencing or dense genotyping of all variants in the region, aiming to identify a functional sequence variant. For the analysis of a quantitative trait, we consider a Bayesian approach to fine-mapping study design that incorporates stratification according to a promising GWAS tag SNP in the same region. Improved cost-efficiency can be achieved when the fine-mapping phase incorporates a two-stage design, with identification of a smaller set of more promising variants in a subsample taken in stage 1, followed by their evaluation in an independent stage 2 subsample. To avoid the potential negative impact of genetic model misspecification on inference we incorporate genetic model selection based on posterior probabilities for each competing model. Our simulation study shows that, compared to simple random sampling that ignores genetic information from GWAS, tag-SNP-based stratified sample allocation methods reduce the number of variants continuing to stage 2 and are more likely to promote the functional sequence variant into confirmation studies. © 2014 WILEY PERIODICALS, INC.
Savige, Judy; Dagher, Hayat; Povey, Sue
2014-07-01
This study examined whether gene-specific DNA variant databases for inherited diseases of the kidney fulfilled the Human Variome Project recommendations of being complete, accurate, clinically relevant and freely available. A recent review identified 60 inherited renal diseases caused by mutations in 132 genes. The disease name, MIM number, gene name, together with "mutation" or "database," were used to identify web-based databases. Fifty-nine diseases (98%) due to mutations in 128 genes had a variant database. Altogether there were 349 databases (a median of 3 per gene, range 0-6), but no gene had two databases with the same number of variants, and 165 (50%) databases included fewer than 10 variants. About half the databases (180, 54%) had been updated in the previous year. Few (77, 23%) were curated by "experts" but these included nine of the 11 with the most variants. Even fewer databases (41, 12%) included clinical features apart from the name of the associated disease. Most (223, 67%) could be accessed without charge, including those for 50 genes (40%) with the maximum number of variants. Future efforts should focus on encouraging experts to collaborate on a single database for each gene affected in inherited renal disease, including both unpublished variants, and clinical phenotypes. © 2014 WILEY PERIODICALS, INC.
A Novel BHLHE41 Variant is Associated with Short Sleep and Resistance to Sleep Deprivation in Humans
Pellegrino, Renata; Kavakli, Ibrahim Halil; Goel, Namni; Cardinale, Christopher J.; Dinges, David F.; Kuna, Samuel T.; Maislin, Greg; Van Dongen, Hans P.A.; Tufik, Sergio; Hogenesch, John B.; Hakonarson, Hakon; Pack, Allan I.
2014-01-01
Study Objectives: Earlier work described a mutation in DEC2 also known as BHLHE41 (basic helix-loophelix family member e41) as causal in a family of short sleepers, who needed just 6 h sleep per night. We evaluated whether there were other variants of this gene in two well-phenotyped cohorts. Design: Sequencing of the BHLHE41 gene, electroencephalographic data, and delta power analysis and functional studies using cell-based luciferase. Results: We identified new variants of the BHLHE41 gene in two cohorts who had either acute sleep deprivation (n = 200) or chronic partial sleep deprivation (n = 217). One variant, Y362H, at another location in the same exon occurred in one twin in a dizygotic twin pair and was associated with reduced sleep duration, less recovery sleep following sleep deprivation, and fewer performance lapses during sleep deprivation than the homozygous twin. Both twins had almost identical amounts of non rapid eye movement (NREM) sleep. This variant reduced the ability of BHLHE41 to suppress CLOCK/BMAL1 and NPAS2/BMAL1 transactivation in vitro. Another variant in the same exome had no effect on sleep or response to sleep deprivation and no effect on CLOCK/BMAL1 transactivation. Random mutagenesis identified a number of other variants of BHLHE41 that affect its function. Conclusions: There are a number of mutations of BHLHE41. Mutations reduce total sleep while maintaining NREM sleep and provide resistance to the effects of sleep loss. Mutations that affect sleep also modify the normal inhibition of BHLHE41 of CLOCK/BMAL1 transactivation. Thus, clock mechanisms are likely involved in setting sleep length and the magnitude of sleep homeostasis. Citation: Pellegrino R, Kavakli IH, Goel N, Cardinale CJ, Dinges DF, Kuna ST, Maislin G, Van Dongen HP, Tufik S, Hogenesch JB, Hakonarson H, Pack AI. A novel BHLHE41 variant is associated with short sleep and resistance to sleep deprivation in humans. SLEEP 2014;37(8):1327-1336. PMID:25083013
Gorin, Michael B.
2012-01-01
Age-related macular degeneration (AMD) is a common condition among the elderly population that leads to the progressive central vision loss and serious compromise of quality of life for its sufferers. It is also one of the few disorders for whom the investigation of its genetics has yielded rich insights into its diversity and causality and holds the promise of enabling clinicians to provide better risk assessments for individuals as well as to develop and selectively deploy new therapeutics to either prevent or slow the development of disease and lessen the threat of vision loss. The genetics of AMD began initially with the appreciation of familial aggregation and increase risk and expanded with the initial association of APOE variants with the disease. The first major breakthroughs came with family-based linkage studies of affected (and discordant) sibs, which identified a number of genetic loci and led to the targeted search of the 1q31 and 10q26 loci for associated variants. Three of the initial four reports for the CFH variant, Y402H, were based on regional candidate searches, as were the two initial reports of the ARMS2/HTRA1 locus variants. Case-control association studies initially also played a role in discovering the major genetic variants for AMD, and the success of those early studies have been used to fuel enthusiasm for the methodology for a number of diseases. Until 2010, all of the subsequent genetic variants associated with AMD came from candidate gene testing based on the complement factor pathway. In 2010, several large-scale genome-wide association studies (GWAS) identified genes that had not been previously identified. Much of this historical information is available in a number of recent reviews.(Chen et al., 2010b; Deangelis et al., 2011; Fafowora and Gorin, 2012b; Francis and Klein, 2011; Kokotas et al., 2011) Large meta analysis of AMD GWAS has added new loci and variants to this collection.(Chen et al., 2010a; Kopplin et al., 2010; Yu et al., 2011) This paper will focus on the ongoing controversies that are confronting AMD genetics at this time, rather than attempting to summarize this field, which has exploded in the past 5 years. PMID:22561651
Small Deletion Variants Have Stable Breakpoints Commonly Associated with Alu Elements
Coin, Lachlan J. M.; Steinfeld, Israel; Yakhini, Zohar; Sladek, Rob; Froguel, Philippe; Blakemore, Alexandra I. F.
2008-01-01
Copy number variants (CNVs) contribute significantly to human genomic variation, with over 5000 loci reported, covering more than 18% of the euchromatic human genome. Little is known, however, about the origin and stability of variants of different size and complexity. We investigated the breakpoints of 20 small, common deletions, representing a subset of those originally identified by array CGH, using Agilent microarrays, in 50 healthy French Caucasian subjects. By sequencing PCR products amplified using primers designed to span the deleted regions, we determined the exact size and genomic position of the deletions in all affected samples. For each deletion studied, all individuals carrying the deletion share identical upstream and downstream breakpoints at the sequence level, suggesting that the deletion event occurred just once and later became common in the population. This is supported by linkage disequilibrium (LD) analysis, which has revealed that most of the deletions studied are in moderate to strong LD with surrounding SNPs, and have conserved long-range haplotypes. Analysis of the sequences flanking the deletion breakpoints revealed an enrichment of microhomology at the breakpoint junctions. More significantly, we found an enrichment of Alu repeat elements, the overwhelming majority of which intersected deletion breakpoints at their poly-A tails. We found no enrichment of LINE elements or segmental duplications, in contrast to other reports. Sequence analysis revealed enrichment of a conserved motif in the sequences surrounding the deletion breakpoints, although whether this motif has any mechanistic role in the formation of some deletions has yet to be determined. Considered together with existing information on more complex inherited variant regions, and reports of de novo variants associated with autism, these data support the presence of different subgroups of CNV in the genome which may have originated through different mechanisms. PMID:18769679
Describing the genetic architecture of epilepsy through heritability analysis.
Speed, Doug; O'Brien, Terence J; Palotie, Aarno; Shkura, Kirill; Marson, Anthony G; Balding, David J; Johnson, Michael R
2014-10-01
Epilepsy is a disease with substantial missing heritability; despite its high genetic component, genetic association studies have had limited success detecting common variants which influence susceptibility. In this paper, we reassess the role of common variants on epilepsy using extensions of heritability analysis. Our data set consists of 1258 UK patients with epilepsy, of which 958 have focal epilepsy, and 5129 population control subjects, with genotypes recorded for over 4 million common single nucleotide polymorphisms. Firstly, we show that on the liability scale, common variants collectively explain at least 26% (standard deviation 5%) of phenotypic variation for all epilepsy and 27% (standard deviation 5%) for focal epilepsy. Secondly we provide a new method for estimating the number of causal variants for complex traits; when applied to epilepsy, our most optimistic estimate suggests that at least 400 variants influence disease susceptibility, with potentially many thousands. Thirdly, we use bivariate analysis to assess how similar the genetic architecture of focal epilepsy is to that of non-focal epilepsy; we demonstrate both significant differences (P = 0.004) and significant similarities (P = 0.01) between the two subtypes, indicating that although the clinical definition of focal epilepsy does identify a genetically distinct epilepsy subtype, there is also scope to improve the classification of epilepsy by incorporating genotypic information. Lastly, we investigate the potential value in using genetic data to diagnose epilepsy following a single epileptic seizure; we find that a prediction model explaining 10% of phenotypic variation could have clinical utility for deciding which single-seizure individuals are likely to benefit from immediate anti-epileptic drug therapy. © The Author (2014). Published by Oxford University Press on behalf of the Guarantors of Brain.
Clinical evaluation incorporating a personal genome
Ashley, Euan A.; Butte, Atul J.; Wheeler, Matthew T.; Chen, Rong; Klein, Teri E.; Dewey, Frederick E.; Dudley, Joel T.; Ormond, Kelly E.; Pavlovic, Aleksandra; Hudgins, Louanne; Gong, Li; Hodges, Laura M.; Berlin, Dorit S.; Thorn, Caroline F.; Sangkuhl, Katrin; Hebert, Joan M.; Woon, Mark; Sagreiya, Hersh; Whaley, Ryan; Morgan, Alexander A.; Pushkarev, Dmitry; Neff, Norma F; Knowles, Joshua W.; Chou, Mike; Thakuria, Joseph; Rosenbaum, Abraham; Zaranek, Alexander Wait; Church, George; Greely, Henry T.; Quake, Stephen R.; Altman, Russ B.
2010-01-01
Background The cost of genomic information has fallen steeply but the path to clinical translation of risk estimates for common variants found in genome wide association studies remains unclear. Since the speed and cost of sequencing complete genomes is rapidly declining, more comprehensive means of analyzing these data in concert with rare variants for genetic risk assessment and individualisation of therapy are required. Here, we present the first integrated analysis of a complete human genome in a clinical context. Methods An individual with a family history of vascular disease and early sudden death was evaluated. Clinical assessment included risk prediction for coronary artery disease, screening for causes of sudden cardiac death, and genetic counselling. Genetic analysis included the development of novel methods for the integration of whole genome sequence data including 2.6 million single nucleotide polymorphisms and 752 copy number variations. The algorithm focused on predicting genetic risk of genes associated with known Mendelian disease, recognised drug responses, and pathogenicity for novel variants. In addition, since integration of risk ratios derived from case control studies is challenging, we estimated posterior probabilities from age and sex appropriate prior probability and likelihood ratios derived for each genotype. In addition, we developed a visualisation approach to account for gene-environment interactions and conditionally dependent risks. Findings We found increased genetic risk for myocardial infarction, type II diabetes and certain cancers. Rare variants in LPA are consistent with the family history of coronary artery disease. Pharmacogenomic analysis suggested a positive response to lipid lowering therapy, likely clopidogrel resistance, and a low initial dosing requirement for warfarin. Many variants of uncertain significance were reported. Interpretation Although challenges remain, our results suggest that whole genome sequencing can yield useful and clinically relevant information for individual patients, especially for those with a strong family history of significant disease. PMID:20435227
Li, Lin; Zhou, Xueya; Wang, Xi; Wang, Jing; Zhang, Wei; Wang, Binbin; Cao, Yunxia; Kee, Kehkooi
2016-09-01
Does a heterozygous mutation in AMHR2, identified in whole-exome sequencings (WES) of patients with primary ovarian insufficiency (POI), cause a defect in anti-Müllerian hormone (AMH) signaling? The I209N mutation at the adenosine triphosphate binding domain of AMHR2 exerts dominant negative defects in the AMH signaling pathway. Previous studies have demonstrated the associations of several sequence variants in AMH or AMHR2 with POI, but no functional assay has been performed to verify whether there was any defect on AMH signaling. Ninety-six unrelated female Chinese Han patients were diagnosed with idiopathic POI and subjected to WES. In silico analysis was done for the sequence variants followed by molecular assays to examine the functional effects of the sequence variants in human granulosa cells. In silico analysis, immunostaining, Western analysis, genome-wide expression analysis, quantitatively polymerase chain reaction were applied to the characterization of the sequence variants. We identified one novel heterozygous missense variant, p.Ala17Glu (A17E), in AMHR2. Subsequently, A17E and two independently reported missense variants, p.Ile209Asn (I209N) and p.Leu354Phe (L354F), were evaluated for effects on the AMH signaling pathway. In silico analysis predicted that all three variants may be deleterious. However, only one variant, I209N, showed severe defects in transducing the AMH signal as well as impaired SMAD1/5/8 phosphorylation. Furthermore, using genome-wide gene expression analysis, we identified genes whose expression was affected by the mutation, these included genes previously reported to participate in AMH signaling as well as newly identified genes. They are EMILIN2, FAM155A, GATA2, HES5, ID1, ID2, RLTPR, SMAD7, CBL, MALAT1 and SMARCA2. None. Although the in vitro assays demonstrated the causative effect of I209N on AMH signaling, further studies need to validate its long-term effects on folliculogenesis and POI. These results will aid both researchers and clinicians in understanding the molecular pathology of AMH signaling and POI to develop diagnostic assays or therapeutics approaches. Research funding is provided by the Ministry of Science and Technology of China [2012CB944704; 2012CB966702], and the National Natural Science Foundation of China [Grant number: 31171429]. The authors declare no conflict of interest. © The Author 2016. Published by Oxford University Press on behalf of the European Society of Human Reproduction and Embryology. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Iacocca, Michael A.; Wang, Jian; Dron, Jacqueline S.; Robinson, John F.; McIntyre, Adam D.; Cao, Henian
2017-01-01
Familial hypercholesterolemia (FH) is a heritable condition of severely elevated LDL cholesterol, caused predominantly by autosomal codominant mutations in the LDL receptor gene (LDLR). In providing a molecular diagnosis for FH, the current procedure often includes targeted next-generation sequencing (NGS) panels for the detection of small-scale DNA variants, followed by multiplex ligation-dependent probe amplification (MLPA) in LDLR for the detection of whole-exon copy number variants (CNVs). The latter is essential because ∼10% of FH cases are attributed to CNVs in LDLR; accounting for them decreases false negative findings. Here, we determined the potential of replacing MLPA with bioinformatic analysis applied to NGS data, which uses depth-of-coverage analysis as its principal method to identify whole-exon CNV events. In analysis of 388 FH patient samples, there was 100% concordance in LDLR CNV detection between these two methods: 38 reported CNVs identified by MLPA were also successfully detected by our NGS method, while 350 samples negative for CNVs by MLPA were also negative by NGS. This result suggests that MLPA can be removed from the routine diagnostic screening for FH, significantly reducing associated costs, resources, and analysis time, while promoting more widespread assessment of this important class of mutations across diagnostic laboratories. PMID:28874442
Chromosomal abnormalities and copy number variations in fetal left-sided congenital heart defects.
Jansen, Fenna A R; Hoffer, Mariette J V; van Velzen, Christine L; Plati, Stephani Klingeman; Rijlaarsdam, Marry E B; Clur, Sally-Ann B; Blom, Nico A; Pajkrt, Eva; Bhola, Shama L; Knegt, Alida C; de Boer, Marion A; Haak, Monique C
2016-02-01
To demonstrate the spectrum of copy number variants (CNVs) in fetuses with isolated left-sided congenital heart defects (CHDs), and analyse genetic content. Between 2003 and 2012, 200 fetuses were identified with left-sided CHD. Exclusion criteria were chromosomal rearrangements, 22q11.2 microdeletion and/or extra-cardiac malformations (n = 64). We included cases with additional minor anomalies (n = 39), such as single umbilical artery. In 54 of 136 eligible cases, stored material was available for array analysis. CNVs were categorized as either (likely) benign, (likely) pathogenic or of unknown significance. In 18 of the 54 isolated left-sided CHDs we found 28 rare CNVs (prevalence 33%, average 1.6 CNV per person, size 10.6 kb-2.2 Mb). Our interpretation yielded clinically significant CNVs in two of 54 cases (4%) and variants of unknown significance in three other cases (6%). In left-sided CHDs that appear isolated, with normal chromosome analysis and 22q11.2 FISH analysis, array analysis detects clinically significant CNVs. When counselling parents of a fetus with a left-sided CHD it must be taken into consideration that aside from the cardiac characteristics, the presence of extra-cardiac malformations and chromosomal abnormalities influence the treatment plan and prognosis. © 2015 John Wiley & Sons, Ltd.
Keppens, Cleo; Palma, John F; Das, Partha M; Scudder, Sidney; Wen, Wei; Normanno, Nicola; Van Krieken, J Han; Sacco, Alessandra; Fenizia, Francesca; de Castro, David Gonzalez; Hönigschnabl, Selma; Kern, Izidor; Lopez-Rios, Fernando; Lozano, Maria D; Marchetti, Antonio; Halfon, Philippe; Schuuring, Ed; Setinek, Ulrike; Sorensen, Boe; Taniere, Phillipe; Tiemann, Markus; Vosmikova, Hana; Dequeker, Elisabeth M C
2018-04-25
Molecular testing of EGFR is required to predict the response likelihood to targeted therapy in non-small-cell lung cancer. Analysis of circulating tumor DNA in plasma may complement limitations of tumor tissue. This study evaluated the interlaboratory performance and reproducibility of the cobas EGFR Mutation Test v2 to detect EGFR variants in plasma. Fourteen laboratories received two identical panels of 27 single-blinded plasma samples. Samples were wild-type or spiked with plasmid DNA to contain seven common EGFR variants at six predefined concentrations from 50 to 5000 copies per mL. The circulating tumor DNA was extracted by the cobas cfDNA Sample Preparation kit, followed by duplicate analysis with the EGFRv2 kit (Roche Molecular Systems, Pleasanton, CA). Lowest sensitivities were obtained for the c.2156G>C p.(Gly719Ala) and c.2573T>G p.(Leu858Arg) variants for the lowest target copies. For all other variants, sensitivities varied between 96.3% and 100.0%. Specificities were all 98.8% to 100.0%. Coefficients of variation indicated good intra and interlaboratory repeatability and reproducibility, but increased for decreasing concentrations. Prediction models revealed a significant correlation for all variants between the pre-defined copy number and the observed semiquantitative index values which reflects the samples' plasma mutation load. This study demonstrates an overall robust performance of the EGFRv2 kit in plasma. Prediction models may be applied to estimate the plasma mutation load for diagnostic or research purposes. Copyright © 2018 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.
Sequential Bottlenecks Drive Viral Evolution in Early Acute Hepatitis C Virus Infection
McElroy, Kerensa; Gaudieri, Silvana; Pham, Son T.; Chopra, Abha; Cameron, Barbara; Maher, Lisa; Dore, Gregory J.; White, Peter A.; Lloyd, Andrew R.
2011-01-01
Hepatitis C is a pandemic human RNA virus, which commonly causes chronic infection and liver disease. The characterization of viral populations that successfully initiate infection, and also those that drive progression to chronicity is instrumental for understanding pathogenesis and vaccine design. A comprehensive and longitudinal analysis of the viral population was conducted in four subjects followed from very early acute infection to resolution of disease outcome. By means of next generation sequencing (NGS) and standard cloning/Sanger sequencing, genetic diversity and viral variants were quantified over the course of the infection at frequencies as low as 0.1%. Phylogenetic analysis of reassembled viral variants revealed acute infection was dominated by two sequential bottleneck events, irrespective of subsequent chronicity or clearance. The first bottleneck was associated with transmission, with one to two viral variants successfully establishing infection. The second occurred approximately 100 days post-infection, and was characterized by a decline in viral diversity. In the two subjects who developed chronic infection, this second bottleneck was followed by the emergence of a new viral population, which evolved from the founder variants via a selective sweep with fixation in a small number of mutated sites. The diversity at sites with non-synonymous mutation was higher in predicted cytotoxic T cell epitopes, suggesting immune-driven evolution. These results provide the first detailed analysis of early within-host evolution of HCV, indicating strong selective forces limit viral evolution in the acute phase of infection. PMID:21912520
Yu, W. F.; Tung, C. S.; Wang, H.; Tasayco, M. L.
2000-01-01
Inspection of high resolution three-dimensional (3D) structures from the protein database reveals an increasing number of cis-Xaa-Pro and cis-Xaa-Yaa peptide bonds. However, we are still far from being able to predict whether these bonds will remain cis upon single-site substitution of Pro or Yaa and/or cleavage of a peptide bond close to it in the sequence. We have chosen oxidized Escherichia coli thioredoxin (Trx), a member of the Trx superfamily with a single alpha/beta domain and cis P76 to determine the effect of single-site substitution and/or cleavage on this isomer. Standard two-dimensional (2D) NMR analysis were performed on cleaved Trx (1-73/74-108) and its P76A variant. Analysis of the NOE connectivities indicates remarkable similarity between the secondary and supersecondary structure of the noncovalent complexes and Trx. Analysis of the 2D version of the HCCH-TOCSY and HMQC-NOESY-HMQC and 13C-filtered HMQC-NOESY spectra of cleaved Trx with uniformly 13C-labeled 175 and P76 shows surprising conservation of both cis P76 and packing of 175 against W31. A similar NMR analysis of its P76A variant provides no evidence for cis A76 and shows only subtle local changes in both the packing of 175 and the interstrand connectivities between its most protected hydrophobic strands (beta2 and beta4). Indeed, a molecular simulation model for the trans P76A variant of Trx shows only subtle local changes around the substitution site. In conclusion, cleavage of R73 is insufficient to provoke cis/trans isomerization of P76, but cleavage and single-site substitution (P76A) favors the trans isomer. PMID:10739243
The PHF21B gene is associated with major depression and modulates the stress response.
Wong, M-L; Arcos-Burgos, M; Liu, S; Vélez, J I; Yu, C; Baune, B T; Jawahar, M C; Arolt, V; Dannlowski, U; Chuah, A; Huttley, G A; Fogarty, R; Lewis, M D; Bornstein, S R; Licinio, J
2017-07-01
Major depressive disorder (MDD) affects around 350 million people worldwide; however, the underlying genetic basis remains largely unknown. In this study, we took into account that MDD is a gene-environment disorder, in which stress is a critical component, and used whole-genome screening of functional variants to investigate the 'missing heritability' in MDD. Genome-wide association studies (GWAS) using single- and multi-locus linear mixed-effect models were performed in a Los Angeles Mexican-American cohort (196 controls, 203 MDD) and in a replication European-ancestry cohort (499 controls, 473 MDD). Our analyses took into consideration the stress levels in the control populations. The Mexican-American controls, comprised primarily of recent immigrants, had high levels of stress due to acculturation issues and the European-ancestry controls with high stress levels were given higher weights in our analysis. We identified 44 common and rare functional variants associated with mild to moderate MDD in the Mexican-American cohort (genome-wide false discovery rate, FDR, <0.05), and their pathway analysis revealed that the three top overrepresented Gene Ontology (GO) processes were innate immune response, glutamate receptor signaling and detection of chemical stimulus in smell sensory perception. Rare variant analysis replicated the association of the PHF21B gene in the ethnically unrelated European-ancestry cohort. The TRPM2 gene, previously implicated in mood disorders, may also be considered replicated by our analyses. Whole-genome sequencing analyses of a subset of the cohorts revealed that European-ancestry individuals have a significantly reduced (50%) number of single nucleotide variants compared with Mexican-American individuals, and for this reason the role of rare variants may vary across populations. PHF21b variants contribute significantly to differences in the levels of expression of this gene in several brain areas, including the hippocampus. Furthermore, using an animal model of stress, we found that Phf21b hippocampal gene expression is significantly decreased in animals resilient to chronic restraint stress when compared with non-chronically stressed animals. Together, our results reveal that including stress level data enables the identification of novel rare functional variants associated with MDD.
USDA-ARS?s Scientific Manuscript database
High-throughput sequencing of reduced representation genomic libraries has ushered in an era of genotyping-by-sequencing (GBS), where genome-wide genotype data can be obtained for nearly any species. However, there remains a need for imputation-free GBS methods for genotyping large samples taken fr...
Xu, Yuejuan; Li, Tingting; Pu, Tian; Cao, Ruixue; Long, Fei; Chen, Sun; Sun, Kun; Xu, Rang
2017-12-01
Congenital heart disease (CHD) is one of the most common birth defects. More than 200 susceptibility loci have been identified for CHDs, yet a large part of the genetic risk factors remain unexplained. Monozygotic (MZ) twins are thought to be completely genetically identical; however, discordant phenotypes have been found in MZ twins. Recent studies have demonstrated genetic differences between MZ twins. We aimed to test whether copy number variants (CNVs) and/or genetic mutation differences play a role in the etiology of CHDs by using single nucleotide polymorphism (SNP) genotyping arrays and whole exome sequencing of twin pairs discordant for CHDs. Our goal was to identify mutations present only in the affected twins, which could identify novel candidates for CHD susceptibility loci. We present a comprehensive analysis for the CNVs and genetic mutation results of the selected individuals but detected no consistent differences within the twin pairs. Our study confirms that chromosomal structure or genetic mutation differences do not seem to play a role in the MZ twins discordant for CHD.
Rare variants and cardiovascular disease.
Wain, Louise V
2014-09-01
Cardiovascular disease (CVD) is a leading cause of mortality and morbidity in the Western world. Large genome-wide association studies (GWASs) of coronary artery disease, myocardial infarction, stroke and dilated cardiomyopathy have identified a number of common genetic variants with modest effects on disease risk. Similarly, studies of important modifiable risk factors of CVD have identified a large number of predominantly common variant associations, for example, with blood pressure and blood lipid levels. In each case, despite the often large numbers of loci identified, only a small proportion of the phenotypic variance is explained. It has been hypothesised that rare variants with large effects may account for some of the missing variance but large-scale studies of rare variation are in their infancy for cardiovascular traits and have yet to produce fruitful results. Studies of monogenic CVDs, inherited disorders believed to be entirely driven by individual rare mutations, have highlighted genes that play a key role in disease aetiology. In this review, we discuss how findings from studies of rare variants in monogenic disease and GWAS of predominantly common variants are converging to provide further insight into biological disease mechanisms. © The Author 2014. Published by Oxford University Press. All rights reserved. For permissions, please email: journals.permissions@oup.com.
Ribeiro, Antonio; Golicz, Agnieszka; Hackett, Christine Anne; Milne, Iain; Stephen, Gordon; Marshall, David; Flavell, Andrew J; Bayer, Micha
2015-11-11
Single Nucleotide Polymorphisms (SNPs) are widely used molecular markers, and their use has increased massively since the inception of Next Generation Sequencing (NGS) technologies, which allow detection of large numbers of SNPs at low cost. However, both NGS data and their analysis are error-prone, which can lead to the generation of false positive (FP) SNPs. We explored the relationship between FP SNPs and seven factors involved in mapping-based variant calling - quality of the reference sequence, read length, choice of mapper and variant caller, mapping stringency and filtering of SNPs by read mapping quality and read depth. This resulted in 576 possible factor level combinations. We used error- and variant-free simulated reads to ensure that every SNP found was indeed a false positive. The variation in the number of FP SNPs generated ranged from 0 to 36,621 for the 120 million base pairs (Mbp) genome. All of the experimental factors tested had statistically significant effects on the number of FP SNPs generated and there was a considerable amount of interaction between the different factors. Using a fragmented reference sequence led to a dramatic increase in the number of FP SNPs generated, as did relaxed read mapping and a lack of SNP filtering. The choice of reference assembler, mapper and variant caller also significantly affected the outcome. The effect of read length was more complex and suggests a possible interaction between mapping specificity and the potential for contributing more false positives as read length increases. The choice of tools and parameters involved in variant calling can have a dramatic effect on the number of FP SNPs produced, with particularly poor combinations of software and/or parameter settings yielding tens of thousands in this experiment. Between-factor interactions make simple recommendations difficult for a SNP discovery pipeline but the quality of the reference sequence is clearly of paramount importance. Our findings are also a stark reminder that it can be unwise to use the relaxed mismatch settings provided as defaults by some read mappers when reads are being mapped to a relatively unfinished reference sequence from e.g. a non-model organism in its early stages of genomic exploration.
Di Gregorio, E; Riberi, E; Belligni, E F; Biamino, E; Spielmann, M; Ala, U; Calcia, A; Bagnasco, I; Carli, D; Gai, G; Giordano, M; Guala, A; Keller, R; Mandrile, G; Arduino, C; Maffè, A; Naretto, V G; Sirchia, F; Sorasio, L; Ungari, S; Zonta, A; Zacchetti, G; Talarico, F; Pappi, P; Cavalieri, S; Giorgio, E; Mancini, C; Ferrero, M; Brussino, A; Savin, E; Gandione, M; Pelle, A; Giachino, D F; De Marchi, M; Restagno, G; Provero, P; Cirillo Silengo, M; Grosso, E; Buxbaum, J D; Pasini, B; De Rubeis, S; Brusco, A; Ferrero, G B
2017-10-01
Array-comparative genomic hybridization (array-CGH) is a widely used technique to detect copy number variants (CNVs) associated with developmental delay/intellectual disability (DD/ID). Identification of genomic disorders in DD/ID. We performed a comprehensive array-CGH investigation of 1,015 consecutive cases with DD/ID and combined literature mining, genetic evidence, evolutionary constraint scores, and functional information in order to assess the pathogenicity of the CNVs. We identified non-benign CNVs in 29% of patients. Amongst the pathogenic variants (11%), detected with a yield consistent with the literature, we found rare genomic disorders and CNVs spanning known disease genes. We further identified and discussed 51 cases with likely pathogenic CNVs spanning novel candidate genes, including genes encoding synaptic components and/or proteins involved in corticogenesis. Additionally, we identified two deletions spanning potential Topological Associated Domain (TAD) boundaries probably affecting the regulatory landscape. We show how phenotypic and genetic analyses of array-CGH data allow unraveling complex cases, identifying rare disease genes, and revealing unexpected position effects. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Jang, Su; Lee, Yunjoo; Lee, Gileung; Seo, Jeonghwan; Lee, Dongryung; Yu, Yoye; Chin, Joong Hyoun; Koh, Hee-Jong
2018-01-15
Balancing panicle-related traits such as panicle length and the numbers of primary and secondary branches per panicle, is key to improving the number of spikelets per panicle in rice. Identifying genetic information contributes to a broader understanding of the roles of gene and provides candidate alleles for use as DNA markers. Discovering relations between panicle-related traits and sequence variants allows opportunity for molecular application in rice breeding to improve the number of spikelets per panicle. In total, 142 polymorphic sites, which constructed 58 haplotypes, were detected in coding regions of ten panicle development gene and 35 sequence variants in six genes were significantly associated with panicle-related traits. Rice cultivars were clustered according to their sequence variant profiles. One of the four resultant clusters, which contained only indica and tong-il varieties, exhibited the largest average number of favorable alleles and highest average number of spikelets per panicle, suggesting that the favorable allele combination found in this cluster was beneficial in increasing the number of spikelets per panicle. Favorable alleles identified in this study can be used to develop functional markers for rice breeding programs. Furthermore, stacking several favorable alleles has the potential to substantially improve the number of spikelets per panicle in rice.
Saba, Luca; Sanfilippo, Roberto; Porcu, Michele; Lucatelli, Pierleone; Montisci, Roberto; Zaccagna, Fulvio; Suri, Jasjit S; Anzidei, Michele; Wintermark, Max
2017-04-01
We aimed to assess if there is a difference of distribution and volume of white matter hyperintensities (WMH) in the brain according to the Circle of Willis (CoW) configuration in patients with carotid artery pathology. One-hundred consecutive patients (79 males, 21 females; mean age 70 years; age range 46-84 years) that underwent brain MRI before carotid endarterectomy (CEA) were included. FLAIR-WMH lesion volume was performed using a semi-automated segmentation technique and the status of the circle of Willis was assessed by two neuroradiologists in consensus. We found a prevalence of 55% of variants in the CoW configuration; 22 cases had one variants (40%); 25 cases had two variants (45.45%) and 8 cases showed 3 variants (14.55%). The configuration that was associated with the biggest WMH volume and number of lesions was the A1+PcoA+PcoA. The PcoA variants were the most prevalent and there was no statistically significant difference in number of lesions and WMH for each vascular territory assessed and the same results were found for AcoA and A1 variants. Results of our study suggest that the more common CoW variants are not associated with the presence of an increased WMH or number of lesions whereas uncommon configurations, in particular when 2 or more segment are missing increase the WMH volume and number of lesions. The WHM volume of the MCA territory seems to be more affected by the CoW configuration. Copyright © 2017 Elsevier B.V. All rights reserved.
Tziastoudi, Maria; Hadjigeorgiou, Georgios M.; Stravodimos, Konstantinos; Zintzaras, Elias
2017-01-01
Abstract Background: Despite the certain contribution of metabolic and haemodynamic factors in diabetic nephropathy (DN), many lines of evidence highlight the role of immunologic and inflammatory mechanisms. To elucidate the contribution of the immune system in the development of DN, we explored the contribution of gene variants (polymorphisms) in relevant pathophysiologic pathways. Methods: We selected six major pathways related to immune response from the Kyoto Encyclopaedia of Genes and Genomes database and thereafter we traced all available genetic association studies (GASs) involving gene variants in these pathways from PubMed and HuGE Navigator. Finally, we used meta-analytic methods for synthesizing the results of the GASs. Results: One hundred three GASs were retrieved that included 443 variants from 75 genes. Of those variants, 138 were meta-analysed and 61 produced significant results; seven variants were investigated in single GASs and showed significant association. Variants in CCL2, CCR5, IL6, IL8, EPO, IL1A, IL1B, IL100, IL1RN, GHRL, MMP9, TGFB1, VEGFA, MMP3, MMP12, IL12RB1, PRKCE, TNF and TNFRSF19 genes were associated with an increased risk of DN. Conclusions: There is evidence that variants related with immunologic response affect the course of DN. However, the present results should be interpreted with caution since the current number of available GASs is limited. PMID:28616206
The genomic and transcriptomic architecture of 2,000 breast tumours reveals novel subgroups
Curtis, Christina; Shah, Sohrab P.; Chin, Suet-Feung; Turashvili, Gulisa; Rueda, Oscar M.; Dunning, Mark J.; Speed, Doug; Lynch, Andy G.; Samarajiwa, Shamith; Yuan, Yinyin; Gräf, Stefan; Ha, Gavin; Haffari, Gholamreza; Bashashati, Ali; Russell, Roslin; McKinney, Steven; Langerød, Anita; Green, Andrew; Provenzano, Elena; Wishart, Gordon; Pinder, Sarah; Watson, Peter; Markowetz, Florian; Murphy, Leigh; Ellis, Ian; Purushotham, Arnie; Børresen-Dale, Anne-Lise; Brenton, James D.; Tavaré, Simon; Caldas, Carlos; Aparicio, Samuel
2012-01-01
The elucidation of breast cancer subgroups and their molecular drivers requires integrated views of the genome and transcriptome from representative numbers of patients. We present an integrated analysis of copy number and gene expression in a discovery and validation set of 997 and 995 primary breast tumours, respectively, with long-term clinical follow-up. Inherited variants (copy number variants and single nucleotide polymorphisms) and acquired somatic copy number aberrations (CNAs) were associated with expression in ~40% of genes, with the landscape dominated by cis- and trans-acting CNAs. By delineating expression outlier genes driven in cis by CNAs, we identified putative cancer genes, including deletions in PPP2R2A, MTAP and MAP2K4. Unsupervised analysis of paired DNA–RNA profiles revealed novel subgroups with distinct clinical outcomes, which reproduced in the validation cohort. These include a high-risk, oestrogen-receptor-positive 11q13/14 cis-acting subgroup and a favourable prognosis subgroup devoid of CNAs. Trans-acting aberration hotspots were found to modulate subgroup-specific gene networks, including a TCR deletion-mediated adaptive immune response in the ‘CNA-devoid’ subgroup and a basal-specific chromosome 5 deletion-associated mitotic network. Our results provide a novel molecular stratification of the breast cancer population, derived from the impact of somatic CNAs on the transcriptome. PMID:22522925
Patwary, Nurmohammed; Preza, Chrysanthe
2015-01-01
A depth-variant (DV) image restoration algorithm for wide field fluorescence microscopy, using an orthonormal basis decomposition of DV point-spread functions (PSFs), is investigated in this study. The efficient PSF representation is based on a previously developed principal component analysis (PCA), which is computationally intensive. We present an approach developed to reduce the number of DV PSFs required for the PCA computation, thereby making the PCA-based approach computationally tractable for thick samples. Restoration results from both synthetic and experimental images show consistency and that the proposed algorithm addresses efficiently depth-induced aberration using a small number of principal components. Comparison of the PCA-based algorithm with a previously-developed strata-based DV restoration algorithm demonstrates that the proposed method improves performance by 50% in terms of accuracy and simultaneously reduces the processing time by 64% using comparable computational resources. PMID:26504634
Low α-defensin gene copy number increases the risk for IgA nephropathy and renal dysfunction.
Ai, Zhen; Li, Ming; Liu, Wenting; Foo, Jia-Nee; Mansouri, Omniah; Yin, Peiran; Zhou, Qian; Tang, Xueqing; Dong, Xiuqing; Feng, Shaozhen; Xu, Ricong; Zhong, Zhong; Chen, Jian; Wan, Jianxin; Lou, Tanqi; Yu, Jianwen; Zhou, Qin; Fan, Jinjin; Mao, Haiping; Gale, Daniel; Barratt, Jonathan; Armour, John A L; Liu, Jianjun; Yu, Xueqing
2016-06-29
Although a major source of genetic variation, copy number variations (CNVs) and their involvement in disease development have not been well studied. Immunoglobulin A nephropathy (IgAN) is the most common primary glomerulonephritis worldwide. We performed association analysis of the DEFA1A3 CNV locus in two independent IgAN cohorts of southern Chinese Han (total of 1189 cases and 1187 controls). We discovered three independent copy number associations within the locus: DEFA1A3 [P = 3.99 × 10(-9); odds ratio (OR), 0.88], DEFA3 (P = 6.55 × 10(-5); OR, 0.82), and a noncoding deletion variant (211bp) (P = 3.50 × 10(-16); OR, 0.75) (OR per copy, fixed-effects meta-analysis). While showing strong association with an increased risk for IgAN (P = 9.56 × 10(-20)), low total copy numbers of the three variants also showed significant association with renal dysfunction in patients with IgAN (P = 0.03; hazards ratio, 3.69; after controlling for the effects of known prognostic factors) and also with increased serum IgA1 (P = 0.02) and galactose-deficient IgA1 (P = 0.03). For replication, we confirmed the associations of DEFA1A3 (P = 4.42 × 10(-4); OR, 0.82) and DEFA3 copy numbers (P = 4.30 × 10(-3); OR, 0.74) with IgAN in a Caucasian cohort (531 cases and 198 controls) and found the 211bp variant to be much rarer in Caucasians. We also observed an association of the 211bp copy number with membranous nephropathy (P = 1.11 × 10(-7); OR, 0.74; in 493 Chinese cases and 500 matched controls), but not with diabetic kidney disease (in 806 Chinese cases and 786 matched controls). By explaining 4.96% of disease risk and influencing renal dysfunction in patients with IgAN, the DEFA1A3 CNV locus may be a potential therapeutic target for developing treatments for this disease. Copyright © 2016, American Association for the Advancement of Science.
Nance, D; Campbell, R A; Rowley, J W; Downie, J M; Jorde, L B; Kahr, W H; Mereby, S A; Tolley, N D; Zimmerman, G A; Weyrich, A S; Rondina, M T
2016-11-01
Essentials Co-existent damaging variants are likely to cause more severe bleeding and may go undiagnosed. We determined pathogenic variants in a three-generational pedigree with excessive bleeding. Bleeding occurred with concurrent variants in prostaglandin synthase-1 (PTGS-1) and factor VIII. The PTGS-1 variant was associated with functional defects in the arachidonic acid pathway. Background Inherited human variants that concurrently cause disorders of primary hemostasis and coagulation are uncommon. Nevertheless, rare cases of co-existent damaging variants are likely to cause more severe bleeding and may go undiagnosed. Objective We prospectively sought to determine pathogenic variants in a three-generational pedigree with excessive bleeding. Patients/methods Platelet number, size and light transmission aggregometry to multiple agonists were evaluated in pedigree members. Transmission electron microscopy determined platelet morphology and granule content. Thromboxane release studies and light transmission aggregometry in the presence or absence of prostaglandin G 2 assessed specific functional defects in the arachidonic acid pathway. Whole exome sequencing (WES) and targeted nucleotide sequence analysis identified potentially deleterious variants. Results Pedigree members with excessive bleeding had impaired platelet aggregation with arachidonic acid, epinephrine and low-dose ADP, as well as reduced platelet thromboxane B 2 release. Impaired platelet aggregation in response to 2MesADP was rescued with prostaglandin G 2 , a prostaglandin intermediate downstream of prostaglandin synthase-1 (PTGS-1) that aids in the production of thromboxane. WES identified a non-synonymous variant in the signal peptide of PTGS-1 (rs3842787; c.50C>T; p.Pro17Leu) that completely co-segregated with disease phenotype. A variant in the F8 gene causing hemophilia A (rs28935203; c.5096A>T; p.Y1699F) was also identified. Individuals with both variants had more severe bleeding manifestations than characteristic of mild hemophilia A alone. Conclusion We provide the first report of co-existing variants in both F8 and PTGS-1 genes in a three-generation pedigree. The PTGS-1 variant was associated with specific functional defects in the arachidonic acid pathway and more severe hemorrhage. © 2016 International Society on Thrombosis and Haemostasis.
Wright, Caroline F; Fitzgerald, Tomas W; Jones, Wendy D; Clayton, Stephen; McRae, Jeremy F; van Kogelenberg, Margriet; King, Daniel A; Ambridge, Kirsty; Barrett, Daniel M; Bayzetinova, Tanya; Bevan, A Paul; Bragin, Eugene; Chatzimichali, Eleni A; Gribble, Susan; Jones, Philip; Krishnappa, Netravathi; Mason, Laura E; Miller, Ray; Morley, Katherine I; Parthiban, Vijaya; Prigmore, Elena; Rajan, Diana; Sifrim, Alejandro; Swaminathan, G Jawahar; Tivey, Adrian R; Middleton, Anna; Parker, Michael; Carter, Nigel P; Barrett, Jeffrey C; Hurles, Matthew E; FitzPatrick, David R; Firth, Helen V
2015-04-04
Human genome sequencing has transformed our understanding of genomic variation and its relevance to health and disease, and is now starting to enter clinical practice for the diagnosis of rare diseases. The question of whether and how some categories of genomic findings should be shared with individual research participants is currently a topic of international debate, and development of robust analytical workflows to identify and communicate clinically relevant variants is paramount. The Deciphering Developmental Disorders (DDD) study has developed a UK-wide patient recruitment network involving over 180 clinicians across all 24 regional genetics services, and has performed genome-wide microarray and whole exome sequencing on children with undiagnosed developmental disorders and their parents. After data analysis, pertinent genomic variants were returned to individual research participants via their local clinical genetics team. Around 80,000 genomic variants were identified from exome sequencing and microarray analysis in each individual, of which on average 400 were rare and predicted to be protein altering. By focusing only on de novo and segregating variants in known developmental disorder genes, we achieved a diagnostic yield of 27% among 1133 previously investigated yet undiagnosed children with developmental disorders, whilst minimising incidental findings. In families with developmentally normal parents, whole exome sequencing of the child and both parents resulted in a 10-fold reduction in the number of potential causal variants that needed clinical evaluation compared to sequencing only the child. Most diagnostic variants identified in known genes were novel and not present in current databases of known disease variation. Implementation of a robust translational genomics workflow is achievable within a large-scale rare disease research study to allow feedback of potentially diagnostic findings to clinicians and research participants. Systematic recording of relevant clinical data, curation of a gene-phenotype knowledge base, and development of clinical decision support software are needed in addition to automated exclusion of almost all variants, which is crucial for scalable prioritisation and review of possible diagnostic variants. However, the resource requirements of development and maintenance of a clinical reporting system within a research setting are substantial. Health Innovation Challenge Fund, a parallel funding partnership between the Wellcome Trust and the UK Department of Health. Copyright © 2015 Wright et al. Open Access article distributed under the terms of CC BY. Published by Elsevier Ltd. All rights reserved.
Wright, Caroline F; Fitzgerald, Tomas W; Jones, Wendy D; Clayton, Stephen; McRae, Jeremy F; van Kogelenberg, Margriet; King, Daniel A; Ambridge, Kirsty; Barrett, Daniel M; Bayzetinova, Tanya; Bevan, A Paul; Bragin, Eugene; Chatzimichali, Eleni A; Gribble, Susan; Jones, Philip; Krishnappa, Netravathi; Mason, Laura E; Miller, Ray; Morley, Katherine I; Parthiban, Vijaya; Prigmore, Elena; Rajan, Diana; Sifrim, Alejandro; Swaminathan, G Jawahar; Tivey, Adrian R; Middleton, Anna; Parker, Michael; Carter, Nigel P; Barrett, Jeffrey C; Hurles, Matthew E; FitzPatrick, David R; Firth, Helen V
2015-01-01
Summary Background Human genome sequencing has transformed our understanding of genomic variation and its relevance to health and disease, and is now starting to enter clinical practice for the diagnosis of rare diseases. The question of whether and how some categories of genomic findings should be shared with individual research participants is currently a topic of international debate, and development of robust analytical workflows to identify and communicate clinically relevant variants is paramount. Methods The Deciphering Developmental Disorders (DDD) study has developed a UK-wide patient recruitment network involving over 180 clinicians across all 24 regional genetics services, and has performed genome-wide microarray and whole exome sequencing on children with undiagnosed developmental disorders and their parents. After data analysis, pertinent genomic variants were returned to individual research participants via their local clinical genetics team. Findings Around 80 000 genomic variants were identified from exome sequencing and microarray analysis in each individual, of which on average 400 were rare and predicted to be protein altering. By focusing only on de novo and segregating variants in known developmental disorder genes, we achieved a diagnostic yield of 27% among 1133 previously investigated yet undiagnosed children with developmental disorders, whilst minimising incidental findings. In families with developmentally normal parents, whole exome sequencing of the child and both parents resulted in a 10-fold reduction in the number of potential causal variants that needed clinical evaluation compared to sequencing only the child. Most diagnostic variants identified in known genes were novel and not present in current databases of known disease variation. Interpretation Implementation of a robust translational genomics workflow is achievable within a large-scale rare disease research study to allow feedback of potentially diagnostic findings to clinicians and research participants. Systematic recording of relevant clinical data, curation of a gene–phenotype knowledge base, and development of clinical decision support software are needed in addition to automated exclusion of almost all variants, which is crucial for scalable prioritisation and review of possible diagnostic variants. However, the resource requirements of development and maintenance of a clinical reporting system within a research setting are substantial. Funding Health Innovation Challenge Fund, a parallel funding partnership between the Wellcome Trust and the UK Department of Health. PMID:25529582
Genetic Relationships Between Schizophrenia, Bipolar Disorder, and Schizoaffective Disorder
Cardno, Alastair G.
2014-01-01
There is substantial evidence for partial overlap of genetic influences on schizophrenia and bipolar disorder, with family, twin, and adoption studies showing a genetic correlation between the disorders of around 0.6. Results of genome-wide association studies are consistent with commonly occurring genetic risk variants, contributing to both the shared and nonshared aspects, while studies of large, rare chromosomal structural variants, particularly copy number variants, show a stronger influence on schizophrenia than bipolar disorder to date. Schizoaffective disorder has been less investigated but shows substantial familial overlap with both schizophrenia and bipolar disorder. A twin analysis is consistent with genetic influences on schizoaffective episodes being entirely shared with genetic influences on schizophrenic and manic episodes, while association studies suggest the possibility of some relatively specific genetic influences on broadly defined schizoaffective disorder, bipolar subtype. Further insights into genetic relationships between these disorders are expected as studies continue to increase in sample size and in technical and analytical sophistication, information on phenotypes beyond clinical diagnoses are increasingly incorporated, and approaches such as next-generation sequencing identify additional types of genetic risk variant. PMID:24567502
Tindale, Lauren C; Leach, Stephen; Spinelli, John J; Brooks-Wilson, Angela R
2017-03-28
Several studies have found that long-lived individuals do not appear to carry lower numbers of common disease-associated variants than ordinary people; it has been hypothesized that they may instead carry protective variants. An intriguing type of protective variant is buffering variants that protect against variants that have deleterious effects. We genotyped 18 variants in 15 genes related to longevity or healthy aging that had been previously reported as having a gene-gene interaction or buffering effect. We compared a group of 446 healthy oldest-old 'Super-Seniors' (individuals 85 or older who have never been diagnosed with cancer, cardiovascular disease, dementia, diabetes or major pulmonary disease) to 421 random population-based midlife controls. Cases and controls were of European ancestry. Association tests of individual SNPs showed that Super-Seniors were less likely than controls to carry an APOEε4 allele or a haptoglobin HP2 allele. Interactions between APOE/FOXO3, APOE/CRYL1, and LPA/CRYL1 did not remain significant after multiple testing correction. In a network analysis of the candidate genes, lipid and cholesterol metabolism was a common theme. APOE, HP, and CRYL1 have all been associated with Alzheimer's Disease, the pathology of which involves lipid and cholesterol pathways. Age-related changes in lipid and cholesterol maintenance, particularly in the brain, may be central to healthy aging and longevity.
Jang, Hyo Geun; Choi, Youngsok; Kim, Jung Oh; Jeon, Young Joo; Rah, HyungChul; Cho, Sung Hwan; Kim, Ji Hyang; Lee, Woo Sik; Kim, Nam Keun
2016-06-01
Polymorphisms in TNF-a have been reported as genetic risk factors for recurrent spontaneous abortion and TNF-α may be immunologically important. We therefore examined the contribution of several TNF-a mutations to this phenomenon. The study participants consisted of 388 patients with idiopathic recurrent pregnancy loss (RPL), which was diagnosed on the basis of at least two consecutive spontaneous abortions; control subjects were 224 healthy women with a history of successful pregnancies. Polymerase chain reaction-restriction fragment length polymorphism analysis was performed to determine the TNF-α -863C>A, -857C>T, and +488G>A genotypes. The TNF-α -863C>A variants correlated with increased risk of RPL (CA+AA; adjusted odds ratio [AOR], 2.142; 95% confidence interval [CI], 1.493-3.074). These data did not differ in a stratified analysis according to number of consecutive spontaneous abortions. In haplotype analysis, there were similar trends of data for combination analysis, but in patients with 3+ pregnancy losses, a stratified analysis revealed that this correlation did not increase directly with the number of pregnancy losses. The TNF-α -863C>A variant is a possible genetic risk factor for idiopathic RPL in Korean women. Copyright © 2016 American Society for Histocompatibility and Immunogenetics. Published by Elsevier Inc. All rights reserved.
Impact of constitutional copy number variants on biological pathway evolution.
Poptsova, Maria; Banerjee, Samprit; Gokcumen, Omer; Rubin, Mark A; Demichelis, Francesca
2013-01-23
Inherited Copy Number Variants (CNVs) can modulate the expression levels of individual genes. However, little is known about how CNVs alter biological pathways and how this varies across different populations. To trace potential evolutionary changes of well-described biological pathways, we jointly queried the genomes and the transcriptomes of a collection of individuals with Caucasian, Asian or Yoruban descent combining high-resolution array and sequencing data. We implemented an enrichment analysis of pathways accounting for CNVs and genes sizes and detected significant enrichment not only in signal transduction and extracellular biological processes, but also in metabolism pathways. Upon the estimation of CNV population differentiation (CNVs with different polymorphism frequencies across populations), we evaluated that 22% of the pathways contain at least one gene that is proximal to a CNV (CNV-gene pair) that shows significant population differentiation. The majority of these CNV-gene pairs belong to signal transduction pathways and 6% of the CNV-gene pairs show statistical association between the copy number states and the transcript levels. The analysis suggested possible examples of positive selection within individual populations including NF-kB, MAPK signaling pathways, and Alu/L1 retrotransposition factors. Altogether, our results suggest that constitutional CNVs may modulate subtle pathway changes through specific pathway enzymes, which may become fixed in some populations.
Impact of constitutional copy number variants on biological pathway evolution
2013-01-01
Background Inherited Copy Number Variants (CNVs) can modulate the expression levels of individual genes. However, little is known about how CNVs alter biological pathways and how this varies across different populations. To trace potential evolutionary changes of well-described biological pathways, we jointly queried the genomes and the transcriptomes of a collection of individuals with Caucasian, Asian or Yoruban descent combining high-resolution array and sequencing data. Results We implemented an enrichment analysis of pathways accounting for CNVs and genes sizes and detected significant enrichment not only in signal transduction and extracellular biological processes, but also in metabolism pathways. Upon the estimation of CNV population differentiation (CNVs with different polymorphism frequencies across populations), we evaluated that 22% of the pathways contain at least one gene that is proximal to a CNV (CNV-gene pair) that shows significant population differentiation. The majority of these CNV-gene pairs belong to signal transduction pathways and 6% of the CNV-gene pairs show statistical association between the copy number states and the transcript levels. Conclusions The analysis suggested possible examples of positive selection within individual populations including NF-kB, MAPK signaling pathways, and Alu/L1 retrotransposition factors. Altogether, our results suggest that constitutional CNVs may modulate subtle pathway changes through specific pathway enzymes, which may become fixed in some populations. PMID:23342974
Shwiff, Stephanie A; Kirkpatrick, Katy N; Sterner, Ray T
2008-12-01
To conduct a benefit-cost analysis of the results of the domestic dog and coyote (DDC) oral rabies vaccine (ORV) program in Texas from 1995 through 2006 by use of fiscal records and relevant public health data. Retrospective benefit-cost analysis. Procedures-Pertinent economic data were collected in 20 counties of south Texas affected by a DDC-variant rabies epizootic. The costs and benefits afforded by a DDC ORV program were then calculated. Costs were the total expenditures of the ORV program. Benefits were the savings associated with the number of potentially prevented human postexposure prophylaxis (PEP) treatments and animal rabies tests for the DDC-variant rabies virus in the epizootic area and an area of potential disease expansion. Total estimated benefits of the program approximately ranged from $89 million to $346 million, with total program costs of $26,358,221 for the study period. The estimated savings (ie, damages avoided) from extrapolated numbers of PEP treatments and animal rabies tests yielded benefit-cost ratios that ranged from 3.38 to 13.12 for various frequen-cies of PEP and animal testing. In Texas, the use of ORV stopped the northward spread and led to the progressive elimination of the DDC variant of rabies in coyotes (Canis latrans). The decision to implement an ORV program was cost-efficient, although many unknowns were involved in the original decision, and key economic variables were identified for consideration in future planning of ORV programs.
X-exome sequencing of 405 unresolved families identifies seven novel intellectual disability genes.
Hu, H; Haas, S A; Chelly, J; Van Esch, H; Raynaud, M; de Brouwer, A P M; Weinert, S; Froyen, G; Frints, S G M; Laumonnier, F; Zemojtel, T; Love, M I; Richard, H; Emde, A-K; Bienek, M; Jensen, C; Hambrock, M; Fischer, U; Langnick, C; Feldkamp, M; Wissink-Lindhout, W; Lebrun, N; Castelnau, L; Rucci, J; Montjean, R; Dorseuil, O; Billuart, P; Stuhlmann, T; Shaw, M; Corbett, M A; Gardner, A; Willis-Owen, S; Tan, C; Friend, K L; Belet, S; van Roozendaal, K E P; Jimenez-Pocquet, M; Moizard, M-P; Ronce, N; Sun, R; O'Keeffe, S; Chenna, R; van Bömmel, A; Göke, J; Hackett, A; Field, M; Christie, L; Boyle, J; Haan, E; Nelson, J; Turner, G; Baynam, G; Gillessen-Kaesbach, G; Müller, U; Steinberger, D; Budny, B; Badura-Stronka, M; Latos-Bieleńska, A; Ousager, L B; Wieacker, P; Rodríguez Criado, G; Bondeson, M-L; Annerén, G; Dufke, A; Cohen, M; Van Maldergem, L; Vincent-Delorme, C; Echenne, B; Simon-Bouy, B; Kleefstra, T; Willemsen, M; Fryns, J-P; Devriendt, K; Ullmann, R; Vingron, M; Wrogemann, K; Wienker, T F; Tzschach, A; van Bokhoven, H; Gecz, J; Jentsch, T J; Chen, W; Ropers, H-H; Kalscheuer, V M
2016-01-01
X-linked intellectual disability (XLID) is a clinically and genetically heterogeneous disorder. During the past two decades in excess of 100 X-chromosome ID genes have been identified. Yet, a large number of families mapping to the X-chromosome remained unresolved suggesting that more XLID genes or loci are yet to be identified. Here, we have investigated 405 unresolved families with XLID. We employed massively parallel sequencing of all X-chromosome exons in the index males. The majority of these males were previously tested negative for copy number variations and for mutations in a subset of known XLID genes by Sanger sequencing. In total, 745 X-chromosomal genes were screened. After stringent filtering, a total of 1297 non-recurrent exonic variants remained for prioritization. Co-segregation analysis of potential clinically relevant changes revealed that 80 families (20%) carried pathogenic variants in established XLID genes. In 19 families, we detected likely causative protein truncating and missense variants in 7 novel and validated XLID genes (CLCN4, CNKSR2, FRMPD4, KLHL15, LAS1L, RLIM and USP27X) and potentially deleterious variants in 2 novel candidate XLID genes (CDK16 and TAF1). We show that the CLCN4 and CNKSR2 variants impair protein functions as indicated by electrophysiological studies and altered differentiation of cultured primary neurons from Clcn4(-/-) mice or after mRNA knock-down. The newly identified and candidate XLID proteins belong to pathways and networks with established roles in cognitive function and intellectual disability in particular. We suggest that systematic sequencing of all X-chromosomal genes in a cohort of patients with genetic evidence for X-chromosome locus involvement may resolve up to 58% of Fragile X-negative cases.
Evaluation of three read-depth based CNV detection tools using whole-exome sequencing data.
Yao, Ruen; Zhang, Cheng; Yu, Tingting; Li, Niu; Hu, Xuyun; Wang, Xiumin; Wang, Jian; Shen, Yiping
2017-01-01
Whole exome sequencing (WES) has been widely accepted as a robust and cost-effective approach for clinical genetic testing of small sequence variants. Detection of copy number variants (CNV) within WES data have become possible through the development of various algorithms and software programs that utilize read-depth as the main information. The aim of this study was to evaluate three commonly used, WES read-depth based CNV detection programs using high-resolution chromosomal microarray analysis (CMA) as a standard. Paired CMA and WES data were acquired for 45 samples. A total of 219 CNVs (size ranged from 2.3 kb - 35 mb) identified on three CMA platforms (Affymetrix, Agilent and Illumina) were used as standards. CNVs were called from WES data using XHMM, CoNIFER, and CNVnator with modified settings. All three software packages detected an elevated proportion of small variants (< 20 kb) compared to CMA. XHMM and CoNIFER had poor detection sensitivity (22.2 and 14.6%), which correlated with the number of capturing probes involved. CNVnator detected most variants and had better sensitivity (87.7%); however, suffered from an overwhelming detection of small CNVs below 20 kb, which required further confirmation. Size estimation of variants was exaggerated by CNVnator and understated by XHMM and CoNIFER. Low concordances of CNV, detected by three different read-depth based programs, indicate the immature status of WES-based CNV detection. Low sensitivity and uncertain specificity of WES-based CNV detection in comparison with CMA based CNV detection suggests that CMA will continue to play an important role in detecting clinical grade CNV in the NGS era, which is largely based on WES.
X-exome sequencing of 405 unresolved families identifies seven novel intellectual disability genes
Hu, H; Haas, S A; Chelly, J; Van Esch, H; Raynaud, M; de Brouwer, A P M; Weinert, S; Froyen, G; Frints, S G M; Laumonnier, F; Zemojtel, T; Love, M I; Richard, H; Emde, A-K; Bienek, M; Jensen, C; Hambrock, M; Fischer, U; Langnick, C; Feldkamp, M; Wissink-Lindhout, W; Lebrun, N; Castelnau, L; Rucci, J; Montjean, R; Dorseuil, O; Billuart, P; Stuhlmann, T; Shaw, M; Corbett, M A; Gardner, A; Willis-Owen, S; Tan, C; Friend, K L; Belet, S; van Roozendaal, K E P; Jimenez-Pocquet, M; Moizard, M-P; Ronce, N; Sun, R; O'Keeffe, S; Chenna, R; van Bömmel, A; Göke, J; Hackett, A; Field, M; Christie, L; Boyle, J; Haan, E; Nelson, J; Turner, G; Baynam, G; Gillessen-Kaesbach, G; Müller, U; Steinberger, D; Budny, B; Badura-Stronka, M; Latos-Bieleńska, A; Ousager, L B; Wieacker, P; Rodríguez Criado, G; Bondeson, M-L; Annerén, G; Dufke, A; Cohen, M; Van Maldergem, L; Vincent-Delorme, C; Echenne, B; Simon-Bouy, B; Kleefstra, T; Willemsen, M; Fryns, J-P; Devriendt, K; Ullmann, R; Vingron, M; Wrogemann, K; Wienker, T F; Tzschach, A; van Bokhoven, H; Gecz, J; Jentsch, T J; Chen, W; Ropers, H-H; Kalscheuer, V M
2016-01-01
X-linked intellectual disability (XLID) is a clinically and genetically heterogeneous disorder. During the past two decades in excess of 100 X-chromosome ID genes have been identified. Yet, a large number of families mapping to the X-chromosome remained unresolved suggesting that more XLID genes or loci are yet to be identified. Here, we have investigated 405 unresolved families with XLID. We employed massively parallel sequencing of all X-chromosome exons in the index males. The majority of these males were previously tested negative for copy number variations and for mutations in a subset of known XLID genes by Sanger sequencing. In total, 745 X-chromosomal genes were screened. After stringent filtering, a total of 1297 non-recurrent exonic variants remained for prioritization. Co-segregation analysis of potential clinically relevant changes revealed that 80 families (20%) carried pathogenic variants in established XLID genes. In 19 families, we detected likely causative protein truncating and missense variants in 7 novel and validated XLID genes (CLCN4, CNKSR2, FRMPD4, KLHL15, LAS1L, RLIM and USP27X) and potentially deleterious variants in 2 novel candidate XLID genes (CDK16 and TAF1). We show that the CLCN4 and CNKSR2 variants impair protein functions as indicated by electrophysiological studies and altered differentiation of cultured primary neurons from Clcn4−/− mice or after mRNA knock-down. The newly identified and candidate XLID proteins belong to pathways and networks with established roles in cognitive function and intellectual disability in particular. We suggest that systematic sequencing of all X-chromosomal genes in a cohort of patients with genetic evidence for X-chromosome locus involvement may resolve up to 58% of Fragile X-negative cases. PMID:25644381
A map of human microRNA variation uncovers unexpectedly high levels of variability
2012-01-01
Background MicroRNAs (miRNAs) are key components of the gene regulatory network in many species. During the past few years, these regulatory elements have been shown to be involved in an increasing number and range of diseases. Consequently, the compilation of a comprehensive map of natural variability in a healthy population seems an obvious requirement for future research on miRNA-related pathologies. Methods Data on 14 populations from the 1000 Genomes Project were analyzed, along with new data extracted from 60 exomes of healthy individuals from a population from southern Spain, sequenced in the context of the Medical Genome Project, to derive an accurate map of miRNA variability. Results Despite the common belief that miRNAs are highly conserved elements, analysis of the sequences of the 1,152 individuals indicated that the observed level of variability is double what was expected. A total of 527 variants were found. Among these, 45 variants affected the recognition region of the corresponding miRNA and were found in 43 different miRNAs, 26 of which are known to be involved in 57 diseases. Different parts of the mature structure of the miRNA were affected to different degrees by variants, which suggests the existence of a selective pressure related to the relative functional impact of the change. Moreover, 41 variants showed a significant deviation from the Hardy-Weinberg equilibrium, which supports the existence of a selective process against some alleles. The average number of variants per individual in miRNAs was 28. Conclusions Despite an expectation that miRNAs would be highly conserved genomic elements, our study reports a level of variability comparable to that observed for coding genes. PMID:22906193
Meta-analysis of gene-level associations for rare variants based on single-variant statistics.
Hu, Yi-Juan; Berndt, Sonja I; Gustafsson, Stefan; Ganna, Andrea; Hirschhorn, Joel; North, Kari E; Ingelsson, Erik; Lin, Dan-Yu
2013-08-08
Meta-analysis of genome-wide association studies (GWASs) has led to the discoveries of many common variants associated with complex human diseases. There is a growing recognition that identifying "causal" rare variants also requires large-scale meta-analysis. The fact that association tests with rare variants are performed at the gene level rather than at the variant level poses unprecedented challenges in the meta-analysis. First, different studies may adopt different gene-level tests, so the results are not compatible. Second, gene-level tests require multivariate statistics (i.e., components of the test statistic and their covariance matrix), which are difficult to obtain. To overcome these challenges, we propose to perform gene-level tests for rare variants by combining the results of single-variant analysis (i.e., p values of association tests and effect estimates) from participating studies. This simple strategy is possible because of an insight that multivariate statistics can be recovered from single-variant statistics, together with the correlation matrix of the single-variant test statistics, which can be estimated from one of the participating studies or from a publicly available database. We show both theoretically and numerically that the proposed meta-analysis approach provides accurate control of the type I error and is as powerful as joint analysis of individual participant data. This approach accommodates any disease phenotype and any study design and produces all commonly used gene-level tests. An application to the GWAS summary results of the Genetic Investigation of ANthropometric Traits (GIANT) consortium reveals rare and low-frequency variants associated with human height. The relevant software is freely available. Copyright © 2013 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
Wu, Yunfei; Dong, Xiaofeng; Kadowaki, Tatsuhiko
2017-01-01
Recent honey bee colony losses, particularly during the winter, have been shown to be associated with the presence of both ectoparasitic mites and Deformed Wing Virus (DWV). Whilst the role of Varroa destructor mites as a viral vector is well established, the role of Tropilaelaps mercedesae mites in viral transmission has not been fully investigated. In this study, we tested the effects that V. destructor and T. mercedesae infestation have on fluctuation of the DWV copy number and alteration of the virus variants in honey bees by characterizing individual pupae and their infesting mites. We observed that both mite species were associated with increased viral copy number in honey bee pupae. We found a positive correlation between DWV copy number in pupae and copy number in infesting mites, and the same DWV type A variant was present in either low or high copy number in both honey bee pupae and infesting V. destructor . These data also suggest that variant diversity is similar between honey bee pupae and the mites that infest them. These results support a previously proposed hypothesis that DWV suppresses the honey bee immune system when virus copy number reaches a specific threshold, promoting greater replication.
Wu, Yunfei; Dong, Xiaofeng; Kadowaki, Tatsuhiko
2017-01-01
Recent honey bee colony losses, particularly during the winter, have been shown to be associated with the presence of both ectoparasitic mites and Deformed Wing Virus (DWV). Whilst the role of Varroa destructor mites as a viral vector is well established, the role of Tropilaelaps mercedesae mites in viral transmission has not been fully investigated. In this study, we tested the effects that V. destructor and T. mercedesae infestation have on fluctuation of the DWV copy number and alteration of the virus variants in honey bees by characterizing individual pupae and their infesting mites. We observed that both mite species were associated with increased viral copy number in honey bee pupae. We found a positive correlation between DWV copy number in pupae and copy number in infesting mites, and the same DWV type A variant was present in either low or high copy number in both honey bee pupae and infesting V. destructor. These data also suggest that variant diversity is similar between honey bee pupae and the mites that infest them. These results support a previously proposed hypothesis that DWV suppresses the honey bee immune system when virus copy number reaches a specific threshold, promoting greater replication. PMID:28878743
Mabuchi, Fumihiko; Sakurada, Yoichi; Kashiwagi, Kenji; Yamagata, Zentaro; Iijima, Hiroyuki; Tsukahara, Shigeo
2015-03-01
To investigate the associations between the non-intraocular pressure (IOP)-related genetic variants (genetic variants associated with vulnerability of the optic nerve independent of IOP) and primary open-angle glaucoma (POAG), including normal-tension glaucoma (NTG) and high-tension glaucoma (HTG), and between the non-IOP-related genetic variants and a family history of glaucoma. Case-control study. Japanese patients with NTG (n = 213) and HTG (n = 212) and 191 control subjects were genotyped for 5 non-IOP-related genetic variants predisposing to POAG near the SRBD1, ELOVL5, CDKN2B/CDKN2B-AS1, SIX1/SIX6, and ATOH7 genes. The load of these genetic variants was compared between the control subjects and patients with NTG or HTG and between the POAG patients with and without a family history of glaucoma. The total number of POAG risk alleles and the product of the odds ratios (POAG risk) of these genetic variants were significantly larger (P < .0025) in patients with both NTG and HTG than in the control subjects, and were significantly larger (P = .0042 and P = .023, respectively) in POAG patients with a family history of glaucoma than in those without. As the number of relatives with glaucoma increased, the total number of risk alleles and the product of the odds ratios increased (P = .012 and P = .047, respectively). Non-IOP-related genetic variants contribute to the pathogenesis of HTG as well as NTG. A positive family history of glaucoma in cases of POAG is thought to reflect the influence of genetic variants predisposing to POAG. Copyright © 2015 Elsevier Inc. All rights reserved.
van den Broek, M; Bolat, I; Nijkamp, J F; Ramos, E; Luttik, M A H; Koopman, F; Geertman, J M; de Ridder, D; Pronk, J T; Daran, J-M
2015-09-01
Lager brewing strains of Saccharomyces pastorianus are natural interspecific hybrids originating from the spontaneous hybridization of Saccharomyces cerevisiae and Saccharomyces eubayanus. Over the past 500 years, S. pastorianus has been domesticated to become one of the most important industrial microorganisms. Production of lager-type beers requires a set of essential phenotypes, including the ability to ferment maltose and maltotriose at low temperature, the production of flavors and aromas, and the ability to flocculate. Understanding of the molecular basis of complex brewing-related phenotypic traits is a prerequisite for rational strain improvement. While genome sequences have been reported, the variability and dynamics of S. pastorianus genomes have not been investigated in detail. Here, using deep sequencing and chromosome copy number analysis, we showed that S. pastorianus strain CBS1483 exhibited extensive aneuploidy. This was confirmed by quantitative PCR and by flow cytometry. As a direct consequence of this aneuploidy, a massive number of sequence variants was identified, leading to at least 1,800 additional protein variants in S. pastorianus CBS1483. Analysis of eight additional S. pastorianus strains revealed that the previously defined group I strains showed comparable karyotypes, while group II strains showed large interstrain karyotypic variability. Comparison of three strains with nearly identical genome sequences revealed substantial chromosome copy number variation, which may contribute to strain-specific phenotypic traits. The observed variability of lager yeast genomes demonstrates that systematic linking of genotype to phenotype requires a three-dimensional genome analysis encompassing physical chromosomal structures, the copy number of individual chromosomes or chromosomal regions, and the allelic variation of copies of individual genes. Copyright © 2015, van den Broek et al.
van den Broek, M.; Bolat, I.; Nijkamp, J. F.; Ramos, E.; Luttik, M. A. H.; Koopman, F.; Geertman, J. M.; de Ridder, D.; Pronk, J. T.
2015-01-01
Lager brewing strains of Saccharomyces pastorianus are natural interspecific hybrids originating from the spontaneous hybridization of Saccharomyces cerevisiae and Saccharomyces eubayanus. Over the past 500 years, S. pastorianus has been domesticated to become one of the most important industrial microorganisms. Production of lager-type beers requires a set of essential phenotypes, including the ability to ferment maltose and maltotriose at low temperature, the production of flavors and aromas, and the ability to flocculate. Understanding of the molecular basis of complex brewing-related phenotypic traits is a prerequisite for rational strain improvement. While genome sequences have been reported, the variability and dynamics of S. pastorianus genomes have not been investigated in detail. Here, using deep sequencing and chromosome copy number analysis, we showed that S. pastorianus strain CBS1483 exhibited extensive aneuploidy. This was confirmed by quantitative PCR and by flow cytometry. As a direct consequence of this aneuploidy, a massive number of sequence variants was identified, leading to at least 1,800 additional protein variants in S. pastorianus CBS1483. Analysis of eight additional S. pastorianus strains revealed that the previously defined group I strains showed comparable karyotypes, while group II strains showed large interstrain karyotypic variability. Comparison of three strains with nearly identical genome sequences revealed substantial chromosome copy number variation, which may contribute to strain-specific phenotypic traits. The observed variability of lager yeast genomes demonstrates that systematic linking of genotype to phenotype requires a three-dimensional genome analysis encompassing physical chromosomal structures, the copy number of individual chromosomes or chromosomal regions, and the allelic variation of copies of individual genes. PMID:26150454
Benchmarking distributed data warehouse solutions for storing genomic variant information
Wiewiórka, Marek S.; Wysakowicz, Dawid P.; Okoniewski, Michał J.
2017-01-01
Abstract Genomic-based personalized medicine encompasses storing, analysing and interpreting genomic variants as its central issues. At a time when thousands of patientss sequenced exomes and genomes are becoming available, there is a growing need for efficient database storage and querying. The answer could be the application of modern distributed storage systems and query engines. However, the application of large genomic variant databases to this problem has not been sufficiently far explored so far in the literature. To investigate the effectiveness of modern columnar storage [column-oriented Database Management System (DBMS)] and query engines, we have developed a prototypic genomic variant data warehouse, populated with large generated content of genomic variants and phenotypic data. Next, we have benchmarked performance of a number of combinations of distributed storages and query engines on a set of SQL queries that address biological questions essential for both research and medical applications. In addition, a non-distributed, analytical database (MonetDB) has been used as a baseline. Comparison of query execution times confirms that distributed data warehousing solutions outperform classic relational DBMSs. Moreover, pre-aggregation and further denormalization of data, which reduce the number of distributed join operations, significantly improve query performance by several orders of magnitude. Most of distributed back-ends offer a good performance for complex analytical queries, while the Optimized Row Columnar (ORC) format paired with Presto and Parquet with Spark 2 query engines provide, on average, the lowest execution times. Apache Kudu on the other hand, is the only solution that guarantees a sub-second performance for simple genome range queries returning a small subset of data, where low-latency response is expected, while still offering decent performance for running analytical queries. In summary, research and clinical applications that require the storage and analysis of variants from thousands of samples can benefit from the scalability and performance of distributed data warehouse solutions. Database URL: https://github.com/ZSI-Bio/variantsdwh PMID:29220442
Moore, Michael T; Brown, Timothy A
2012-09-01
A number of researchers have proposed adding an increasing number of subthreshold variants of major depressive disorder (MDD) as new mood disorder. However, this research has suffered from a number of theoretical and methodological flaws that the current investigation has attempted to address. Individuals with MDD (n = 470) were compared with individuals with subthreshold MDD (n = 57). Individuals with MDD reported consistently more severe symptoms, albeit of small magnitude, as well as differences in comorbidity with only two disorders. Results also indicated that diagnosis did not significantly predict rate of symptom change when MDD was compared with its subthreshold variant. Taken together, the aforementioned evidence suggests that small differences exist between MDD and its subthreshold variant. In addition, the extent to which the latter serves as useful analogs for the former may depend upon the variables under study.
Zhao, Wei; Niu, Guannan; Shen, Botao; Zheng, Yang; Gong, Fangchao; Wang, Xianfu; Lee, Jiyun; Mulvihill, John J; Chen, Xiaohui; Li, Shibo
2013-12-01
As patients with congenital heart disease (CHD) increasingly survive to childbearing age, it becomes important to understand the genetic origins of CHD. In children, CHD is frequently caused by chromosomal imbalances. We searched for submicroscopic imbalances in adults with CHD focusing on simple-to-moderate phenotypes, without associated dysmorphic features, a group not previously examined. A total of 100 Han Chinese adults with a diverse range of isolated CHD and 65 ethnically matched controls were screened using whole-genome array comparative genomic hybridization. Forty-five large (>100 kb) rare copy number variants (CNVs) were identified in 36/100 patients. These variants were not listed in the Database of Genomic Variants nor found in controls. In three of these genomic imbalances (22q11.2, 18q23, 3q21.3), genes that play an important role in cardiac development were implicated, including CRKL, NFATC1, PLXNA1, the latter has not been associated with human CHD before. This study detected a 0.7 Mb 22q11.2 deletion, which marginally overlapped the common 3 Mb 22q11.2 deletion, in one patient with a perimembranous ventricular septal defect without any extracardiac manifestation. Furthermore, we detected a novel inherited aberration dup (16q23.1). Although a causal relationship with CHD remains to be established, this CNVs profile provides a spectrum of genomic imbalances in this condition, and improves the CNV-phenotype correlations. © 2013 Wiley Periodicals, Inc.
Chen, Wenan; McDonnell, Shannon K; Thibodeau, Stephen N; Tillmans, Lori S; Schaid, Daniel J
2016-11-01
Functional annotations have been shown to improve both the discovery power and fine-mapping accuracy in genome-wide association studies. However, the optimal strategy to incorporate the large number of existing annotations is still not clear. In this study, we propose a Bayesian framework to incorporate functional annotations in a systematic manner. We compute the maximum a posteriori solution and use cross validation to find the optimal penalty parameters. By extending our previous fine-mapping method CAVIARBF into this framework, we require only summary statistics as input. We also derived an exact calculation of Bayes factors using summary statistics for quantitative traits, which is necessary when a large proportion of trait variance is explained by the variants of interest, such as in fine mapping expression quantitative trait loci (eQTL). We compared the proposed method with PAINTOR using different strategies to combine annotations. Simulation results show that the proposed method achieves the best accuracy in identifying causal variants among the different strategies and methods compared. We also find that for annotations with moderate effects from a large annotation pool, screening annotations individually and then combining the top annotations can produce overly optimistic results. We applied these methods on two real data sets: a meta-analysis result of lipid traits and a cis-eQTL study of normal prostate tissues. For the eQTL data, incorporating annotations significantly increased the number of potential causal variants with high probabilities. Copyright © 2016 by the Genetics Society of America.
Tang, Jinsong; Fan, Yu; Li, Hong; Xiang, Qun; Zhang, Deng-Feng; Li, Zongchang; He, Ying; Liao, Yanhui; Wang, Ya; He, Fan; Zhang, Fengyu; Shugart, Yin Yao; Liu, Chunyu; Tang, Yanqing; Chan, Raymond C K; Wang, Chuan-Yue; Yao, Yong-Gang; Chen, Xiaogang
2017-06-20
Schizophrenia is a common disorder with a high heritability, but its genetic architecture is still elusive. We implemented whole-genome sequencing (WGS) analysis of 8 families with monozygotic (MZ) twin pairs discordant for schizophrenia to assess potential association of de novo mutations (DNMs) or inherited variants with susceptibility to schizophrenia. Eight non-synonymous DNMs (including one splicing site) were identified and shared by twins, which were either located in previously reported schizophrenia risk genes (p.V24689I mutation in TTN, p.S2506T mutation in GCN1L1, IVS3+1G > T in DOCK1) or had a benign to damaging effect according to in silico prediction analysis. By searching the inherited rare damaging or loss-of-function (LOF) variants and common susceptible alleles from three classes of schizophrenia candidate genes, we were able to distill genetic alterations in several schizophrenia risk genes, including GAD1, PLXNA2, RELN and FEZ1. Four inherited copy number variations (CNVs; including a large deletion at 16p13.11) implicated for schizophrenia were identified in four families, respectively. Most of families carried both missense DNMs and inherited risk variants, which might suggest that DNMs, inherited rare damaging variants and common risk alleles together conferred to schizophrenia susceptibility. Our results support that schizophrenia is caused by a combination of multiple genetic factors, with each DNM/variant showing a relatively small effect size. Copyright © 2017 Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, and Genetics Society of China. All rights reserved.
Stossi, Fabio; Dandekar, Radhika D; Bolt, Michael J; Newberg, Justin Y; Mancini, Maureen G; Kaushik, Akash K; Putluri, Vasanta; Sreekumar, Arun; Mancini, Michael A
2016-03-29
Prostate cancer remains a deadly disease especially when patients become resistant to drugs that target the Androgen Receptor (AR) ligand binding domain. At this stage, patients develop recurring castrate-resistant prostate cancers (CRPCs). Interestingly, CRPC tumors maintain dependency on AR for growth; moreover, in CRPCs, constitutively active AR splice variants (e.g., AR-V7) begin to be expressed at higher levels. These splice variants lack the ligand binding domain and are rendered insensitive to current endocrine therapies. Thus, it is of paramount importance to understand what regulates the expression of AR and its splice variants to identify new therapeutic strategies in CRPCs. Here, we used high throughput microscopy and quantitative image analysis to evaluate effects of selected endocrine disruptors on AR levels in multiple breast and prostate cancer cell lines. Bisphenol AP (BPAP), which is used in chemical and medical industries, was identified as a down-regulator of both full length AR and the AR-V7 splice variant. We validated its activity by performing time-course, dose-response, Western blot and qPCR analyses. BPAP also reduced the percent of cells in S phase, which was accompanied by a ~60% loss in cell numbers and colony formation in anchorage-independent growth assays. Moreover, it affected mitochondria size and cell metabolism. In conclusion, our high content analysis-based screening platform was used to classify the effect of compounds on endogenous ARs, and identified BPAP as being capable of causing AR (both full-length and variants) down-regulation, cell cycle arrest and metabolic alterations in CRPC cell lines.
General Framework for Meta-analysis of Rare Variants in Sequencing Association Studies
Lee, Seunggeun; Teslovich, Tanya M.; Boehnke, Michael; Lin, Xihong
2013-01-01
We propose a general statistical framework for meta-analysis of gene- or region-based multimarker rare variant association tests in sequencing association studies. In genome-wide association studies, single-marker meta-analysis has been widely used to increase statistical power by combining results via regression coefficients and standard errors from different studies. In analysis of rare variants in sequencing studies, region-based multimarker tests are often used to increase power. We propose meta-analysis methods for commonly used gene- or region-based rare variants tests, such as burden tests and variance component tests. Because estimation of regression coefficients of individual rare variants is often unstable or not feasible, the proposed method avoids this difficulty by calculating score statistics instead that only require fitting the null model for each study and then aggregating these score statistics across studies. Our proposed meta-analysis rare variant association tests are conducted based on study-specific summary statistics, specifically score statistics for each variant and between-variant covariance-type (linkage disequilibrium) relationship statistics for each gene or region. The proposed methods are able to incorporate different levels of heterogeneity of genetic effects across studies and are applicable to meta-analysis of multiple ancestry groups. We show that the proposed methods are essentially as powerful as joint analysis by directly pooling individual level genotype data. We conduct extensive simulations to evaluate the performance of our methods by varying levels of heterogeneity across studies, and we apply the proposed methods to meta-analysis of rare variant effects in a multicohort study of the genetics of blood lipid levels. PMID:23768515
Meta-analysis of gene-level tests for rare variant association.
Liu, Dajiang J; Peloso, Gina M; Zhan, Xiaowei; Holmen, Oddgeir L; Zawistowski, Matthew; Feng, Shuang; Nikpay, Majid; Auer, Paul L; Goel, Anuj; Zhang, He; Peters, Ulrike; Farrall, Martin; Orho-Melander, Marju; Kooperberg, Charles; McPherson, Ruth; Watkins, Hugh; Willer, Cristen J; Hveem, Kristian; Melander, Olle; Kathiresan, Sekar; Abecasis, Gonçalo R
2014-02-01
The majority of reported complex disease associations for common genetic variants have been identified through meta-analysis, a powerful approach that enables the use of large sample sizes while protecting against common artifacts due to population structure and repeated small-sample analyses sharing individual-level data. As the focus of genetic association studies shifts to rare variants, genes and other functional units are becoming the focus of analysis. Here we propose and evaluate new approaches for performing meta-analysis of rare variant association tests, including burden tests, weighted burden tests, variable-threshold tests and tests that allow variants with opposite effects to be grouped together. We show that our approach retains useful features from single-variant meta-analysis approaches and demonstrate its use in a study of blood lipid levels in ∼18,500 individuals genotyped with exome arrays.
Feliubadaló, Lídia; Lopez-Doriga, Adriana; Castellsagué, Ester; del Valle, Jesús; Menéndez, Mireia; Tornero, Eva; Montes, Eva; Cuesta, Raquel; Gómez, Carolina; Campos, Olga; Pineda, Marta; González, Sara; Moreno, Victor; Brunet, Joan; Blanco, Ignacio; Serra, Eduard; Capellá, Gabriel; Lázaro, Conxi
2013-01-01
Next-generation sequencing (NGS) is changing genetic diagnosis due to its huge sequencing capacity and cost-effectiveness. The aim of this study was to develop an NGS-based workflow for routine diagnostics for hereditary breast and ovarian cancer syndrome (HBOCS), to improve genetic testing for BRCA1 and BRCA2. A NGS-based workflow was designed using BRCA MASTR kit amplicon libraries followed by GS Junior pyrosequencing. Data analysis combined Variant Identification Pipeline freely available software and ad hoc R scripts, including a cascade of filters to generate coverage and variant calling reports. A BRCA homopolymer assay was performed in parallel. A research scheme was designed in two parts. A Training Set of 28 DNA samples containing 23 unique pathogenic mutations and 213 other variants (33 unique) was used. The workflow was validated in a set of 14 samples from HBOCS families in parallel with the current diagnostic workflow (Validation Set). The NGS-based workflow developed permitted the identification of all pathogenic mutations and genetic variants, including those located in or close to homopolymers. The use of NGS for detecting copy-number alterations was also investigated. The workflow meets the sensitivity and specificity requirements for the genetic diagnosis of HBOCS and improves on the cost-effectiveness of current approaches. PMID:23249957
Single Color Multiplexed ddPCR Copy Number Measurements and Single Nucleotide Variant Genotyping.
Wood-Bouwens, Christina M; Ji, Hanlee P
2018-01-01
Droplet digital PCR (ddPCR) allows for accurate quantification of genetic events such as copy number variation and single nucleotide variants. Probe-based assays represent the current "gold-standard" for detection and quantification of these genetic events. Here, we introduce a cost-effective single color ddPCR assay that allows for single genome resolution quantification of copy number and single nucleotide variation.
Linkage disequilibrium among commonly genotyped SNP and variants detected from bull sequence
USDA-ARS?s Scientific Manuscript database
Genomic prediction utilizing causal variants could increase selection accuracy above that achieved with SNP genotyped by commercial assays. A number of variants detected from sequencing influential sires are likely to be causal, but noticable improvements in prediction accuracy using imputed sequen...
NGS testing for cardiomyopathy: Utility of adding RASopathy-associated genes.
Ceyhan-Birsoy, Ozge; Miatkowski, Maya M; Hynes, Elizabeth; Funke, Birgit H; Mason-Suares, Heather
2018-04-25
RASopathies include a group of syndromes caused by pathogenic germline variants in RAS-MAPK pathway genes and typically present with facial dysmorphology, cardiovascular disease, and musculoskeletal anomalies. Recently, variants in RASopathy-associated genes have been reported in individuals with apparently nonsyndromic cardiomyopathy, suggesting that subtle features may be overlooked. To determine the utility and burden of adding RASopathy-associated genes to cardiomyopathy panels, we tested 11 RASopathy-associated genes by next-generation sequencing (NGS), including NGS-based copy number variant assessment, in 1,111 individuals referred for genetic testing for hypertrophic cardiomyopathy (HCM) or dilated cardiomyopathy (DCM). Disease-causing variants were identified in 0.6% (four of 692) of individuals with HCM, including three missense variants in the PTPN11, SOS1, and BRAF genes. Overall, 36 variants of uncertain significance (VUSs) were identified, averaging ∼3VUSs/100 cases. This study demonstrates that adding a subset of the RASopathy-associated genes to cardiomyopathy panels will increase clinical diagnoses without significantly increasing the number of VUSs/case. © 2018 Wiley Periodicals, Inc.
López-Romero, Ricardo; Iglesias-Chiesa, Candela; Alatorre, Brenda; Vázquez, Karla; Piña-Sánchez, Patricia; Alvarado, Isabel; Lazos, Minerva; Peralta, Raúl; González-Yebra, Beatriz; Romero, Anae; Salcedo, Mauricio
2013-01-01
The role of human papillomavirus (HPV) infection in penile carcinoma (PeC) is currently reported and about half of the PeC is associated with HPV16 and 18. We used a PCR-based strategy by using HPV general primers to analyze 86 penile carcinomas paraffin-embedded tissues. Some clinical data, the histological subtype, growth pattern, and differentiation degree were also collected. The amplified fragments were then sequenced to confirm the HPV type and for HPV16/18 variants. DNA samples were also subjected to relative real time PCR for hTERC gene copy number. Some clinical data were also collected. Global HPV frequency was 77.9%. Relative contributions was for HPV16 (85%), 31 (4.4%), 11 (4.4%), 58, 33, 18, and 59 (1.4% each one). Sequence analysis of HPV16 identified European variants and Asian-American (AAb-c) variants in 92% and in 8% of the samples, respectively. Furthermore hTERC gene amplification was observed in only 17% of the cases. Our results suggest that some members of HPV A9 group (represented by HPV16, 58, and 31) are the most frequent among PeC patients studied with an important contribution from HPV16 European variant. The hTERC gene amplification could be poorly related to penile epithelial tissue.
López-Romero, Ricardo; Iglesias-Chiesa, Candela; Alatorre, Brenda; Vázquez, Karla; Piña-Sánchez, Patricia; Alvarado, Isabel; Lazos, Minerva; Peralta, Raúl; González-Yebra, Beatriz; Romero, AnaE; Salcedo, Mauricio
2013-01-01
The role of human papillomavirus (HPV) infection in penile carcinoma (PeC) is currently reported and about half of the PeC is associated with HPV16 and 18. We used a PCR-based strategy by using HPV general primers to analyze 86 penile carcinomas paraffin-embedded tissues. Some clinical data, the histological subtype, growth pattern, and differentiation degree were also collected. The amplified fragments were then sequenced to confirm the HPV type and for HPV16/18 variants. DNA samples were also subjected to relative real time PCR for hTERC gene copy number. Some clinical data were also collected. Global HPV frequency was 77.9%. Relative contributions was for HPV16 (85%), 31 (4.4%), 11 (4.4%), 58, 33, 18, and 59 (1.4% each one). Sequence analysis of HPV16 identified European variants and Asian-American (AAb-c) variants in 92% and in 8% of the samples, respectively. Furthermore hTERC gene amplification was observed in only 17% of the cases. Our results suggest that some members of HPV A9 group (represented by HPV16, 58, and 31) are the most frequent among PeC patients studied with an important contribution from HPV16 European variant. The hTERC gene amplification could be poorly related to penile epithelial tissue. PMID:23826423
Ma, Chengying; Cao, Junxi; Li, Jianke; Zhou, Bo; Tang, Jinchi; Miao, Aiqing
2016-01-01
Leaf colour variation is observed in several plants. We obtained two types of branches with yellow and variegated leaves from Camellia sinensis. To reveal the mechanisms that underlie the leaf colour variations, combined morphological, histological, ionomic and proteomic analyses were performed using leaves from abnormal branches (variants) and normal branches (CKs). The measurement of the CIE-Lab coordinates showed that the brightness and yellowness of the variants were more intense than the CKs. When chloroplast profiles were analysed, HY1 (branch with yellow leaves) and HY2 (branch with variegated leaves) displayed abnormal chloroplast structures and a reduced number and size compared with the CKs, indicating that the abnormal chloroplast development might be tightly linked to the leaf colour variations. Moreover, the concentration of elemental minerals was different between the variants and the CKs. Furthermore, DEPs (differentially expressed proteins) were identified in the variants and the CKs by a quantitative proteomics analysis using the label-free approach. The DEPs were significantly involved in photosynthesis and included PSI, PSII, cytochrome b6/f complex, photosynthetic electron transport, LHC and F-type ATPase. Our results suggested that a decrease in the abundance of photosynthetic proteins might be associated with the changes of leaf colours in tea plants. PMID:27633059
Carr, Ian M; Morgan, Joanne; Watson, Christopher; Melnik, Svitlana; Diggle, Christine P; Logan, Clare V; Harrison, Sally M; Taylor, Graham R; Pena, Sergio D J; Markham, Alexander F; Alkuraya, Fowzan S; Black, Graeme C M; Ali, Manir; Bonthron, David T
2013-07-01
Massively parallel ("next generation") DNA sequencing (NGS) has quickly become the method of choice for seeking pathogenic mutations in rare uncharacterized monogenic diseases. Typically, before DNA sequencing, protein-coding regions are enriched from patient genomic DNA, representing either the entire genome ("exome sequencing") or selected mapped candidate loci. Sequence variants, identified as differences between the patient's and the human genome reference sequences, are then filtered according to various quality parameters. Changes are screened against datasets of known polymorphisms, such as dbSNP and the 1000 Genomes Project, in the effort to narrow the list of candidate causative variants. An increasing number of commercial services now offer to both generate and align NGS data to a reference genome. This potentially allows small groups with limited computing infrastructure and informatics skills to utilize this technology. However, the capability to effectively filter and assess sequence variants is still an important bottleneck in the identification of deleterious sequence variants in both research and diagnostic settings. We have developed an approach to this problem comprising a user-friendly suite of programs that can interactively analyze, filter and screen data from enrichment-capture NGS data. These programs ("Agile Suite") are particularly suitable for small-scale gene discovery or for diagnostic analysis. © 2013 WILEY PERIODICALS, INC.
Murray, Anita; Dunlop, Rebecca A; Noad, Michael J; Goldizen, Anne W
2018-02-01
Male humpback whales produce a mating display called "song." Behavioral studies indicate song has inter- and/or intra-sexual functionality, suggesting song may be a multi-message display. Multi-message displays often include stereotypic components that convey group membership for mate attraction and/or male-male interactions, and complex components that convey individual quality for courtship. Humpback whale song contains sounds ("units") arranged into sequences ("phrases"). Repetitions of a specific phrase create a "theme." Within a theme, imperfect phrase repetitions ("phrase variants") create variability among phrases of the same type ("phrase type"). The hypothesis that song contains stereotypic and complex phrase types, structural characteristics consistent with a multi-message display, is investigated using recordings of 17 east Australian males (8:2004, 9:2011). Phrase types are categorized as stereotypic or complex using number of unit types, number of phrase variants, and the proportion of phrases that is unique to an individual versus shared amongst males. Unit types are determined using self-organizing maps. Phrase variants are determined by Levenshtein distances between phrases. Stereotypic phrase types have smaller numbers of unit types and shared phrase variants. Complex phrase types have larger numbers of unit types and unique phrase variants. This study supports the hypothesis that song could be a multi-message display.
Common NOTCH3 Variants and Cerebral Small-Vessel Disease.
Rutten-Jacobs, Loes C A; Traylor, Matthew; Adib-Samii, Poneh; Thijs, Vincent; Sudlow, Cathie; Rothwell, Peter M; Boncoraglio, Giorgio; Dichgans, Martin; Bevan, Steve; Meschia, James; Levi, Christopher; Rost, Natalia S; Rosand, Jonathan; Hassan, Ahamad; Markus, Hugh S
2015-06-01
The most common monogenic cause of cerebral small-vessel disease is cerebral autosomal dominant arteriopathy with subcortical infarcts and leukoencephalopathy, caused by NOTCH3 gene mutations. It has been hypothesized that more common variants in NOTCH3 may also contribute to the risk of sporadic small-vessel disease. Previously, 4 common variants (rs10404382, rs1043994, rs10423702, and rs1043997) were found to be associated with the presence of white matter hyperintensity in hypertensive community-dwelling elderly. We investigated the association of common single nucleotide polymorphisms (SNPs) in NOTCH3 in 1350 patients with MRI-confirmed lacunar stroke and 7397 controls, by meta-analysis of genome-wide association study data sets. In addition, we investigated the association of common SNPs in NOTCH3 with MRI white matter hyperintensity volumes in 3670 white patients with ischemic stroke. In each analysis, we considered all SNPs within the NOTCH3 gene, and within 50-kb upstream and downstream of the coding region. A total of 381 SNPs from the 1000 genome population with a mean allele frequency>0.01 were included in the analysis. A significance level of P<0.0015 was used, adjusted for the effective number of independent SNPs in the region using the Galwey method. We found no association of any common variants in NOTCH3 (including rs10404382, rs1043994, rs10423702, and rs1043997) with lacunar stroke or white matter hyperintensity volume. We repeated our analysis stratified for hypertension but again found no association. Our study does not support a role for common NOTCH3 variation in the risk of sporadic small-vessel disease. © 2015 The Authors.
Rabies surveillance in the United States during 2006.
Blanton, Jesse D; Hanlon, Cathleen A; Rupprecht, Charles E
2007-08-15
During 2006, 49 states and Puerto Rico reported 6,940 cases of rabies in animals and 3 cases in humans to the CDC, representing an 8.2% increase from the 6,417 cases in animals and 1 case in a human reported in 2005. Approximately 92% of the cases were in wildlife, and 8% were in domestic animals. Relative contributions by the major animal groups were as follows: 2,615 raccoons (37.7%), 1,692 bats (24.4%), 1,494 skunks (21.5%), 427 foxes (6.2%), 318 cats (4.6%), 82 cattle (1.2%), and 79 dogs (1.1%). Compared with numbers of reported cases in 2005, cases in 2006 increased among all groups except cattle. Increases in numbers of rabid raccoons during 2006 were reported by 11 of the 20 eastern states where raccoon rabies was enzootic, and reported cases increased by 3.2% overall, compared with 2005. On a national level, the number of rabies cases in skunks during 2006 increased by 6.1% from the number reported in 2005. Once again, Texas reported the greatest number (n = 351) of rabid skunks and the greatest overall state total of animal rabies cases (889). No cases of rabies associated with the dog/coyote rabies virus variant were reported. The last identified case of this canine rabies virus variant was identified in March 2004, along the US/Mexico border. With 2006 marking the second year of no apparent transmission of the dog/coyote variant, these findings from surveillance data support the contention that the canine rabies virus variant is no longer in circulation in the United States. Total number of cases of rabies reported nationally in foxes increased 13.6%, compared with 2005. Increases in the number of reported rabid foxes were attributable to greater numbers of foxes reported with the Arctic fox rabies virus variant in Alaska, the Texas gray fox rabies virus variant in Texas, and the raccoon rabies virus variant in Virginia. The 1,692 cases of rabies reported in bats represented a 14.5% increase, compared with numbers reported in 2005, making bats the second most reported rabid animal behind raccoons. Cases of rabies in cats, dogs, horses and mules, and sheep and goats increased 18.2%, 3.9%, 12.8%, and 22.2%, respectively, whereas cases reported in cattle decreased 11.8%. In Puerto Rico, reported cases of rabies in mongooses increased 9.2%, and rabies in domestic animals, presumably attributable to spillover infection from mongooses, increased 20%. Three cases of human rabies were reported from Texas, Indiana, and California during 2006. The cases in Indiana and Texas were attributed to bat rabies virus variants, whereas the case in California was attributed to an exposure to a dog in the Philippines.
Rozman, Vita; Kunej, Tanja
2018-05-10
Harnessing the genomics big data requires innovation in how we extract and interpret biologically relevant variants. Currently, there is no established catalog of prioritized missense variants associated with deleterious protein function phenotypes. We report in this study, to the best of our knowledge, the first genome-wide prioritization of sequence variants with the most deleterious effect on protein function (potentially deleterious variants [pDelVars]) in nine vertebrate species: human, cattle, horse, sheep, pig, dog, rat, mouse, and zebrafish. The analysis was conducted using the Ensembl/BioMart tool. Genes comprising pDelVars in the highest number of examined species were identified using a Python script. Multiple genomic alignments of the selected genes were built to identify interspecies orthologous potentially deleterious variants, which we defined as the "ortho-pDelVars." Genome-wide prioritization revealed that in humans, 0.12% of the known variants are predicted to be deleterious. In seven out of nine examined vertebrate species, the genes encoding the multiple PDZ domain crumbs cell polarity complex component (MPDZ) and the transforming acidic coiled-coil containing protein 2 (TACC2) comprise pDelVars. Five interspecies ortho-pDelVars were identified in three genes. These findings offer new ways to harness genomics big data by facilitating the identification of functional polymorphisms in humans and animal models and thus provide a future basis for optimization of protocols for whole genome prioritization of pDelVars and screening of orthologous sequence variants. The approach presented here can inform various postgenomic applications such as personalized medicine and multiomics study of health interventions (iatromics).
Genetic study of intracranial aneurysms.
Yan, Junxia; Hitomi, Toshiaki; Takenaka, Katsunobu; Kato, Masayasu; Kobayashi, Hatasu; Okuda, Hiroko; Harada, Kouji H; Koizumi, Akio
2015-03-01
Rupture of intracranial aneurysms (IAs) causes subarachnoid hemorrhage, leading to immediate death or severe disability. Identification of the genetic factors involved is critical for disease prevention and treatment. We aimed to identify the susceptibility genes for IAs. Exome sequencing was performed in 12 families with histories of multiple cases of IA (number of cases per family ≥3), with a total of 42 cases. Various filtering strategies were used to select the candidate variants. Replicate association studies of several candidate variants were performed in probands of 24 additional IA families and 426 sporadic IA cases. Functional analysis for the mutations was conducted. After sequencing and filtering, 78 variants were selected for the following reasons: allele frequencies of variants in 42 patients was significantly (P<0.05) larger than expected; variants were completely shared by all patients with IA within ≥1 family; variants predicted damage to the structure or function of the protein by PolyPhen-2 (Polymorphism Phenotyping V2) and SIFT (Sorting Intolerance From Tolerant). We selected 10 variants from 9 genes (GPR63, ADAMST15, MLL2, IL10RA, PAFAH2, THBD, IL11RA, FILIP1L, and ZNF222) to form 78 candidate variants by considering commonness in families, known disease genes, or ontology association with angiogenesis. Replicate association studies revealed that only p.E133Q in ADAMTS15 was aggregated in the familial IA cases (odds ratio, 5.96; 95% confidence interval, 2.40-14.82; P=0.0001; significant after the Bonferroni correction [P=0.05/78=0.0006]). Silencing ADAMTS15 and overexpression of ADAMTS15 p.E133Q accelerated endothelial cell migration, suggesting that ADAMTS15 may have antiangiogenic activity. ADAMTS15 is a candidate gene for IAs. © 2015 American Heart Association, Inc.
Positional bias in variant calls against draft reference assemblies.
Briskine, Roman V; Shimizu, Kentaro K
2017-03-28
Whole genome resequencing projects may implement variant calling using draft reference genomes assembled de novo from short-read libraries. Despite lower quality of such assemblies, they allowed researchers to extend a wide range of population genetic and genome-wide association analyses to non-model species. As the variant calling pipelines are complex and involve many software packages, it is important to understand inherent biases and limitations at each step of the analysis. In this article, we report a positional bias present in variant calling performed against draft reference assemblies constructed from de Bruijn or string overlap graphs. We assessed how frequently variants appeared at each position counted from ends of a contig or scaffold sequence, and discovered unexpectedly high number of variants at the positions related to the length of either k-mers or reads used for the assembly. We detected the bias in both publicly available draft assemblies from Assemblathon 2 competition as well as in the assemblies we generated from our simulated short-read data. Simulations confirmed that the bias causing variants are predominantly false positives induced by reads from spatially distant repeated sequences. The bias is particularly strong in contig assemblies. Scaffolding does not eliminate the bias but tends to mitigate it because of the changes in variants' relative positions and alterations in read alignments. The bias can be effectively reduced by filtering out the variants that reside in repetitive elements. Draft genome sequences generated by several popular assemblers appear to be susceptible to the positional bias potentially affecting many resequencing projects in non-model species. The bias is inherent to the assembly algorithms and arises from their particular handling of repeated sequences. It is recommended to reduce the bias by filtering especially if higher-quality genome assembly cannot be achieved. Our findings can help other researchers to improve the quality of their variant data sets and reduce artefactual findings in downstream analyses.
Wallace, Ryan M.; Gilbert, Amy; Slate, Dennis; Chipman, Richard; Singh, Amber; Cassie Wedd; Blanton, Jesse D.
2014-01-01
Introduction In the continental US, four terrestrial mammalian species are reservoirs for seven antigenic rabies virus variants. Cross species transmission (CST) occurs when a rabies virus variant causes disease in non-reservoir species. Methods This study analyzed national surveillance data for rabies in terrestrial mammals. The CST rate was defined as: number of rabid non-reservoir animals/number of rabid reservoir animals. CST rates were analyzed for trend. Clusters of high CST rate counties were evaluated using space-time scanning statistics. Results The number of counties reporting a raccoon variant CST rate >1.0 increased from 75 in 1992 to 187 in 2011; counties with skunk variant CST rates >1.0 remained unchanged during the same period. As of 2011, for every rabid raccoon reported within the raccoon variant region, there were 0.73 cases of this variant reported in non-reservoir animals. Skunks were the most common non-reservoir animal reported with the raccoon rabies variant. Domestic animals were the most common non-reservoir animal diagnosed with a skunk rabies virus variant (n = 1,601). Cross species transmission rates increased fastest among domestic animals. Conclusions Cross species transmission of rabies virus variants into non-reservoir animals increases the risk of human exposures and threatens current advances toward rabies control. Cross species transmission in raccoon rabies enzootic regions increased dramatically during the study period. Pet owners should vaccinate their dogs and cats to ensure against CST, particularly in regions with active foci of rabies circulation. Clusters of high CST activity represent areas for further study to better understand interspecies disease transmission dynamics. Each CST event has the potential to result in a rabies virus adapted for sustained transmission in a new species; therefore further understanding of the dynamics of CST may help in early detection or prevention of the emergence of new terrestrial rabies virus variants. PMID:25295750
Wallace, Ryan M; Gilbert, Amy; Slate, Dennis; Chipman, Richard; Singh, Amber; Cassie Wedd; Blanton, Jesse D
2014-01-01
In the continental US, four terrestrial mammalian species are reservoirs for seven antigenic rabies virus variants. Cross species transmission (CST) occurs when a rabies virus variant causes disease in non-reservoir species. This study analyzed national surveillance data for rabies in terrestrial mammals. The CST rate was defined as: number of rabid non-reservoir animals/number of rabid reservoir animals. CST rates were analyzed for trend. Clusters of high CST rate counties were evaluated using space-time scanning statistics. The number of counties reporting a raccoon variant CST rate >1.0 increased from 75 in 1992 to 187 in 2011; counties with skunk variant CST rates >1.0 remained unchanged during the same period. As of 2011, for every rabid raccoon reported within the raccoon variant region, there were 0.73 cases of this variant reported in non-reservoir animals. Skunks were the most common non-reservoir animal reported with the raccoon rabies variant. Domestic animals were the most common non-reservoir animal diagnosed with a skunk rabies virus variant (n = 1,601). Cross species transmission rates increased fastest among domestic animals. Cross species transmission of rabies virus variants into non-reservoir animals increases the risk of human exposures and threatens current advances toward rabies control. Cross species transmission in raccoon rabies enzootic regions increased dramatically during the study period. Pet owners should vaccinate their dogs and cats to ensure against CST, particularly in regions with active foci of rabies circulation. Clusters of high CST activity represent areas for further study to better understand interspecies disease transmission dynamics. Each CST event has the potential to result in a rabies virus adapted for sustained transmission in a new species; therefore further understanding of the dynamics of CST may help in early detection or prevention of the emergence of new terrestrial rabies virus variants.
Goossens, Dirk; Moens, Lotte N; Nelis, Eva; Lenaerts, An-Sofie; Glassee, Wim; Kalbe, Andreas; Frey, Bruno; Kopal, Guido; De Jonghe, Peter; De Rijk, Peter; Del-Favero, Jurgen
2009-03-01
We evaluated multiplex PCR amplification as a front-end for high-throughput sequencing, to widen the applicability of massive parallel sequencers for the detailed analysis of complex genomes. Using multiplex PCR reactions, we sequenced the complete coding regions of seven genes implicated in peripheral neuropathies in 40 individuals on a GS-FLX genome sequencer (Roche). The resulting dataset showed highly specific and uniform amplification. Comparison of the GS-FLX sequencing data with the dataset generated by Sanger sequencing confirmed the detection of all variants present and proved the sensitivity of the method for mutation detection. In addition, we showed that we could exploit the multiplexed PCR amplicons to determine individual copy number variation (CNV), increasing the spectrum of detected variations to both genetic and genomic variants. We conclude that our straightforward procedure substantially expands the applicability of the massive parallel sequencers for sequencing projects of a moderate number of amplicons (50-500) with typical applications in resequencing exons in positional or functional candidate regions and molecular genetic diagnostics. 2008 Wiley-Liss, Inc.
de Vries, Tamar I; Monroe, Glen R; van Belzen, Martine J; van der Lans, Christian A; Savelberg, Sanne Mc; Newman, William G; van Haaften, Gijs; Nievelstein, Rutger A; van Haelst, Mieke M
2016-08-01
Rubinstein-Taybi syndrome (RTS, OMIM 180849) and Filippi syndrome (FLPIS, OMIM 272440) are both rare syndromes, with multiple congenital anomalies and intellectual deficit (MCA/ID). We present a patient with intellectual deficit, short stature, bilateral syndactyly of hands and feet, broad thumbs, ocular abnormalities, and dysmorphic facial features. These clinical features suggest both RTS and FLPIS. Initial DNA analysis of DNA isolated from blood did not identify variants to confirm either of these syndrome diagnoses. Whole-exome sequencing identified a homozygous variant in C9orf173, which was novel at the time of analysis. Further Sanger sequencing analysis of FLPIS cases tested negative for CKAP2L variants did not, however, reveal any further variants. Subsequent analysis using DNA isolated from buccal mucosa revealed a mosaic variant in CREBBP. This report highlights the importance of excluding mosaic variants in patients with a strong but atypical clinical presentation of a MCA/ID syndrome if no disease-causing variants can be detected in DNA isolated from blood samples. As the striking syndactyly observed in the present case is typical for FLPIS, we suggest CREBBP analysis in saliva samples for FLPIS syndrome cases in which no causal CKAP2L variant is detected.
Leapfrog variants of iterative methods for linear algebra equations
NASA Technical Reports Server (NTRS)
Saylor, Paul E.
1988-01-01
Two iterative methods are considered, Richardson's method and a general second order method. For both methods, a variant of the method is derived for which only even numbered iterates are computed. The variant is called a leapfrog method. Comparisons between the conventional form of the methods and the leapfrog form are made under the assumption that the number of unknowns is large. In the case of Richardson's method, it is possible to express the final iterate in terms of only the initial approximation, a variant of the iteration called the grand-leap method. In the case of the grand-leap variant, a set of parameters is required. An algorithm is presented to compute these parameters that is related to algorithms to compute the weights and abscissas for Gaussian quadrature. General algorithms to implement the leapfrog and grand-leap methods are presented. Algorithms for the important special case of the Chebyshev method are also given.
Stafuzza, Nedenia Bonvino; Zerlotini, Adhemar; Lobo, Francisco Pereira; Yamagishi, Michel Eduardo Beleza; Chud, Tatiane Cristina Seleguim; Caetano, Alexandre Rodrigues; Munari, Danísio Prado; Garrick, Dorian J; Machado, Marco Antonio; Martins, Marta Fonseca; Carvalho, Maria Raquel; Cole, John Bruce; Barbosa da Silva, Marcos Vinicius Gualberto
2017-01-01
Whole-genome re-sequencing, alignment and annotation analyses were undertaken for 12 sires representing four important cattle breeds in Brazil: Guzerat (multi-purpose), Gyr, Girolando and Holstein (dairy production). A total of approximately 4.3 billion reads from an Illumina HiSeq 2000 sequencer generated for each animal 10.7 to 16.4-fold genome coverage. A total of 27,441,279 single nucleotide variations (SNVs) and 3,828,041 insertions/deletions (InDels) were detected in the samples, of which 2,557,670 SNVs and 883,219 InDels were novel. The submission of these genetic variants to the dbSNP database significantly increased the number of known variants, particularly for the indicine genome. The concordance rate between genotypes obtained using the Bovine HD BeadChip array and the same variants identified by sequencing was about 99.05%. The annotation of variants identified numerous non-synonymous SNVs and frameshift InDels which could affect phenotypic variation. Functional enrichment analysis was performed and revealed that variants in the olfactory transduction pathway was over represented in all four cattle breeds, while the ECM-receptor interaction pathway was over represented in Girolando and Guzerat breeds, the ABC transporters pathway was over represented only in Holstein breed, and the metabolic pathways was over represented only in Gyr breed. The genetic variants discovered here provide a rich resource to help identify potential genomic markers and their associated molecular mechanisms that impact economically important traits for Gyr, Girolando, Guzerat and Holstein breeding programs.
Lobo, Francisco Pereira; Yamagishi, Michel Eduardo Beleza; Chud, Tatiane Cristina Seleguim; Caetano, Alexandre Rodrigues; Munari, Danísio Prado; Garrick, Dorian J.; Machado, Marco Antonio; Martins, Marta Fonseca; Carvalho, Maria Raquel; Cole, John Bruce; Barbosa da Silva, Marcos Vinicius Gualberto
2017-01-01
Whole-genome re-sequencing, alignment and annotation analyses were undertaken for 12 sires representing four important cattle breeds in Brazil: Guzerat (multi-purpose), Gyr, Girolando and Holstein (dairy production). A total of approximately 4.3 billion reads from an Illumina HiSeq 2000 sequencer generated for each animal 10.7 to 16.4-fold genome coverage. A total of 27,441,279 single nucleotide variations (SNVs) and 3,828,041 insertions/deletions (InDels) were detected in the samples, of which 2,557,670 SNVs and 883,219 InDels were novel. The submission of these genetic variants to the dbSNP database significantly increased the number of known variants, particularly for the indicine genome. The concordance rate between genotypes obtained using the Bovine HD BeadChip array and the same variants identified by sequencing was about 99.05%. The annotation of variants identified numerous non-synonymous SNVs and frameshift InDels which could affect phenotypic variation. Functional enrichment analysis was performed and revealed that variants in the olfactory transduction pathway was over represented in all four cattle breeds, while the ECM-receptor interaction pathway was over represented in Girolando and Guzerat breeds, the ABC transporters pathway was over represented only in Holstein breed, and the metabolic pathways was over represented only in Gyr breed. The genetic variants discovered here provide a rich resource to help identify potential genomic markers and their associated molecular mechanisms that impact economically important traits for Gyr, Girolando, Guzerat and Holstein breeding programs. PMID:28323836
Li, Wenhua; Yang, Bin; Zhou, Dongmei; Xu, Jun; Ke, Zhi; Suen, Wen-Chen
2016-07-01
Liquid chromatography mass spectrometry (LC-MS) is the most commonly used technique for the characterization of antibody variants. MAb-X and mAb-Y are two approved IgG1 subtype monoclonal antibody drugs recombinantly produced in Chinese hamster ovary (CHO) cells. We report here that two unexpected and rare antibody variants have been discovered during cell culture process development of biosimilars for these two approved drugs through intact mass analysis. We then used comprehensive mass spectrometry-based comparative analysis including reduced light, heavy chains, and domain-specific mass as well as peptide mapping analysis to fully characterize the observed antibody variants. The "middle-up" mass comparative analysis demonstrated that the antibody variant from mAb-X biosimilar candidate was caused by mass variation of antibody crystalline fragment (Fc), whereas a different variant with mass variation in antibody antigen-binding fragment (Fab) from mAb-Y biosimilar candidate was identified. Endoproteinase Lys-C digested peptide mapping and tandem mass spectrometry analysis further revealed that a leucine to glutamine change in N-terminal 402 site of heavy chain was responsible for the generation of mAb-X antibody variant. Lys-C and trypsin coupled non-reduced and reduced peptide mapping comparative analysis showed that the formation of the light-heavy interchain trisulfide bond resulted in the mAb-Y antibody variant. These two cases confirmed that mass spectrometry-based comparative analysis plays a critical role for the characterization of monoclonal antibody variants, and biosimilar developers should start with a comprehensive structural assessment and comparative analysis to decrease the risk of the process development for biosimilars. Copyright © 2016 Elsevier B.V. All rights reserved.
Association analysis of multiple traits by an approach of combining P values.
Chen, Lili; Wang, Yong; Zhou, Yajing
2018-03-01
Increasing evidence shows that one variant can affect multiple traits, which is a widespread phenomenon in complex diseases. Joint analysis of multiple traits can increase statistical power of association analysis and uncover the underlying genetic mechanism. Although there are many statistical methods to analyse multiple traits, most of these methods are usually suitable for detecting common variants associated with multiple traits. However, because of low minor allele frequency of rare variant, these methods are not optimal for rare variant association analysis. In this paper, we extend an adaptive combination of P values method (termed ADA) for single trait to test association between multiple traits and rare variants in the given region. For a given region, we use reverse regression model to test each rare variant associated with multiple traits and obtain the P value of single-variant test. Further, we take the weighted combination of these P values as the test statistic. Extensive simulation studies show that our approach is more powerful than several other comparison methods in most cases and is robust to the inclusion of a high proportion of neutral variants and the different directions of effects of causal variants.
NASA Technical Reports Server (NTRS)
Kole, James A.; Schneider, Vivian I.; Healy, Alice F.; Barshi, Immanuel
2017-01-01
Subjects trained in a standard data entry task, which involved typing numbers (e.g., 5421) using their right hands. At test (6 months post-training), subjects completed the standard task, followed by a left-hand variant (typing with their left hands) that involved the same perceptual, but different motoric, processes as the standard task. At a second test (8 months post-training), subjects completed the standard task, followed by a code variant (translating letters into digits, then typing the digits with their right hands) that involved different perceptual, but the same motoric, processes as the standard task. For each of the three tasks, half the trials were trained numbers (old) and half were new. Repetition priming (faster response times to old than new numbers) was found for each task. Repetition priming for the standard task reflects retention of trained numbers; for the left-hand variant reflects transfer of perceptual processes; and for the code variant reflects transfer of motoric processes. There was thus evidence for both specificity and generalizability of training data entry perceptual and motoric processes over very long retention intervals.
Plant growth promotion rhizobacteria in onion production.
Colo, Josip; Hajnal-Jafari, Timea I; Durić, Simonida; Stamenov, Dragana; Hamidović, Saud
2014-01-01
The aim of the research was to examine the effect of rhizospheric bacteria Azotobacter chroococcum, Pseudomonas fluorescens (strains 1 and 2) and Bacillus subtilis on the growth and yield of onion and on the microorganisms in the rhizosphere of onion. The ability of microorganisms to produce indole-acetic acid (IAA), siderophores and to solubilize tricalcium phosphate (TCP) was also assessed. The experiment was conducted in field conditions, in chernozem type of soil. Bacillus subtilis was the best producer of IAA, whereas Pseudomonas fluorescens strains were better at producing siderophores and solubilizing phosphates. The longest seedling was observed with the application of Azotobacter chroococcum. The height of the plants sixty days after sowing was greater in all the inoculated variants than in the control. The highest onion yield was observed in Bacillus subtilis and Azotobacter chroococcum variants. The total number of bacteria and the number of Azotobacter chroococcum were larger in all the inoculated variants then in the control. The number of fungi decreased in most of the inoculated variants, whereas the number of actinomycetes decreased or remained the same.
Iacocca, Michael A; Wang, Jian; Dron, Jacqueline S; Robinson, John F; McIntyre, Adam D; Cao, Henian; Hegele, Robert A
2017-11-01
Familial hypercholesterolemia (FH) is a heritable condition of severely elevated LDL cholesterol, caused predominantly by autosomal codominant mutations in the LDL receptor gene ( LDLR ). In providing a molecular diagnosis for FH, the current procedure often includes targeted next-generation sequencing (NGS) panels for the detection of small-scale DNA variants, followed by multiplex ligation-dependent probe amplification (MLPA) in LDLR for the detection of whole-exon copy number variants (CNVs). The latter is essential because ∼10% of FH cases are attributed to CNVs in LDLR ; accounting for them decreases false negative findings. Here, we determined the potential of replacing MLPA with bioinformatic analysis applied to NGS data, which uses depth-of-coverage analysis as its principal method to identify whole-exon CNV events. In analysis of 388 FH patient samples, there was 100% concordance in LDLR CNV detection between these two methods: 38 reported CNVs identified by MLPA were also successfully detected by our NGS method, while 350 samples negative for CNVs by MLPA were also negative by NGS. This result suggests that MLPA can be removed from the routine diagnostic screening for FH, significantly reducing associated costs, resources, and analysis time, while promoting more widespread assessment of this important class of mutations across diagnostic laboratories. Copyright © 2017 by the American Society for Biochemistry and Molecular Biology, Inc.
Kale, S P; Cary, J W; Bhatnagar, D; Bennett, J W
1996-01-01
Six previously isolated, nonaflatoxigenic variants of Aspergillus parasiticus, designated sec mutants, were characterized morphologically by electron microscopy, biochemically by biotransformation studies with an aflatoxin precursor, and genetically by Northern (RNA) hybridization analysis of aflatoxin biosynthetic gene transcripts. Scanning electron micrographs clearly demonstrated that compared with the parental sec+ forms, the variant sec forms had an abundance of vegetative mycelia, orders of magnitude reduced number of conidiophores and conidia, and abnormal metulae. Conidiospores were detected in sec cultures only at higher magnifications (x 500), in contrast to the sec+ (wild-type) strain, in which abundant conidiospores (masking the vegetative mycelia) were observed at even lower magnifications (x 300). All sec+ forms, but none of the sec forms, showed bioconversion of sterigmatocystin to aflatoxins. Northern blots probed with pathway genes demonstrated lack of expression of both the aflatoxin biosynthetic pathway structural (nor-1 and omtA) and regulatory (aflR) genes in the sec forms; PCR and Southern hybridization analysis confirmed the presence of the genes in the sec genomes. Thus, the loss of aflatoxigenic capabilities in the sec form is correlated with alterations in the conidial morphology of the fungus, suggesting that the regulation of aflatoxin synthesis and conidiogenesis may be interlinked. PMID:8795232
Identification of missing variants by combining multiple analytic pipelines.
Ren, Yingxue; Reddy, Joseph S; Pottier, Cyril; Sarangi, Vivekananda; Tian, Shulan; Sinnwell, Jason P; McDonnell, Shannon K; Biernacka, Joanna M; Carrasquillo, Minerva M; Ross, Owen A; Ertekin-Taner, Nilüfer; Rademakers, Rosa; Hudson, Matthew; Mainzer, Liudmila Sergeevna; Asmann, Yan W
2018-04-16
After decades of identifying risk factors using array-based genome-wide association studies (GWAS), genetic research of complex diseases has shifted to sequencing-based rare variants discovery. This requires large sample sizes for statistical power and has brought up questions about whether the current variant calling practices are adequate for large cohorts. It is well-known that there are discrepancies between variants called by different pipelines, and that using a single pipeline always misses true variants exclusively identifiable by other pipelines. Nonetheless, it is common practice today to call variants by one pipeline due to computational cost and assume that false negative calls are a small percent of total. We analyzed 10,000 exomes from the Alzheimer's Disease Sequencing Project (ADSP) using multiple analytic pipelines consisting of different read aligners and variant calling strategies. We compared variants identified by using two aligners in 50,100, 200, 500, 1000, and 1952 samples; and compared variants identified by adding single-sample genotyping to the default multi-sample joint genotyping in 50,100, 500, 2000, 5000 and 10,000 samples. We found that using a single pipeline missed increasing numbers of high-quality variants correlated with sample sizes. By combining two read aligners and two variant calling strategies, we rescued 30% of pass-QC variants at sample size of 2000, and 56% at 10,000 samples. The rescued variants had higher proportions of low frequency (minor allele frequency [MAF] 1-5%) and rare (MAF < 1%) variants, which are the very type of variants of interest. In 660 Alzheimer's disease cases with earlier onset ages of ≤65, 4 out of 13 (31%) previously-published rare pathogenic and protective mutations in APP, PSEN1, and PSEN2 genes were undetected by the default one-pipeline approach but recovered by the multi-pipeline approach. Identification of the complete variant set from sequencing data is the prerequisite of genetic association analyses. The current analytic practice of calling genetic variants from sequencing data using a single bioinformatics pipeline is no longer adequate with the increasingly large projects. The number and percentage of quality variants that passed quality filters but are missed by the one-pipeline approach rapidly increased with sample size.
Monico, Carla G; Weinstein, Adam; Jiang, Zhirong; Rohlinger, Audrey L; Cogal, Andrea G; Bjornson, Beth B; Olson, Julie B; Bergstralh, Eric J; Milliner, Dawn S; Aronson, Peter S
2008-12-01
Urinary oxalate is a major risk factor for calcium oxalate stones. Marked hyperoxaluria arises from mutations in 2 separate loci, AGXT and GRHPR, the causes of primary hyperoxaluria (PH) types 1 (PH1) and 2 (PH2), respectively. Studies of null Slc26a6(-/-) mice have shown a phenotype of hyperoxaluria, hyperoxalemia, and calcium oxalate urolithiasis, leading to the hypothesis that SLC26A6 mutations may cause or modify hyperoxaluria in humans. Cross-sectional case-control. Cases were recruited from the International Primary Hyperoxaluria Registry. Control DNA samples were from a pool of adult subjects who identified themselves as being in good health. PH1, PH2, and non-PH1/PH2 genotypes in cases. Homozygosity or compound heterozygosity for SLC26A6 variants. Functional expression of oxalate transport in Xenopus laevis oocytes. 80 PH1, 6 PH2, 8 non-PH1/PH2, and 96 control samples were available for SLC26A6 screening. A rare variant, c.487C-->T (p.Pro163Ser), was detected solely in 1 non-PH1/PH2 pedigree, but this variant failed to segregate with hyperoxaluria, and functional studies of oxalate transport in Xenopus oocytes showed no transport defect. No other rare variant was identified specifically in non-PH1/PH2. Six additional missense variants were detected in controls and cases. Of these, c.616G-->A (p.Val206Met) was most common (11%) and showed a 30% reduction in oxalate transport. To test p.Val206Met as a potential modifier of hyperoxaluria, we extended screening to PH1 and PH2. Heterozygosity for this variant did not affect plasma or urine oxalate levels in this population. We did not have a sufficient number of cases to determine whether homozygosity for p.Val206Met might significantly affect urine oxalate. SLC26A6 was effectively ruled out as the disease gene in this non-PH1/PH2 cohort. Taken together, our studies are the first to identify and characterize SLC26A6 variants in patients with hyperoxaluria. Phenotypic and functional analysis excluded a significant effect of identified variants on oxalate excretion.
Monico, Carla G.; Weinstein, Adam; Jiang, Zhirong; Rohlinger, Audrey L.; Cogal, Andrea G.; Bjornson, Beth B.; Olson, Julie B.; Bergstralh, Eric J.; Milliner, Dawn S.; Aronson, Peter S.
2008-01-01
Background Urinary oxalate is a major risk factor for calcium oxalate stones. Marked hyperoxaluria arises from mutations in two separate loci, AGXT and GRHPR, the causes of primary hyperoxaluria (PH) types 1 and 2, respectively. Studies of null Slc26a6 (−/−) mice have revealed a phenotype of hyperoxaluria, hyperoxalemia and calcium oxalate urolithiasis, leading to the hypothesis that SLC26A6 mutations may cause or modify hyperoxaluria in humans. Study Design Cross-sectional, case-control. Setting & Participants Cases were recruited from the International Primary Hyperoxaluria Registry. Control DNA samples were from a pool of adult subjects who identified themselves as being in good health. Predictor PH1, PH2, non-PH1/PH2 genotypes in cases. Outcomes & Measures Homozygosity or compound heterozygosity for SLC26A6 variants. Functional expression of oxalate transport in Xenopus oocytes. Results A total of 80 PH1, 6 PH2, 8 non-PH1/PH2 and 96 control samples were available for SLC26A6 screening. A rare variant, c.487C>T (p.Pro163Ser) was detected solely in one non-PH1/PH2 pedigree but this variant failed to segregate with hyperoxaluria, and functional studies of oxalate transport in Xenopus oocytes revealed no transport defect. No other rare variant was identified specifically in non-PH1/PH2. Six additional missense variants were detected in controls and in cases. Of these, c.616G>A (p.Val206Met) was most common (11%), and showed a 30% reduction in oxalate transport. To test p.Val206Met as a potential modifier of hyperoxaluria, we extended screening to PH1 and PH2. Heterozygosity for this variant did not affect plasma or urine oxalate in this population. Limitations We did not have a sufficient number of cases to determine whether homozygosity for p.Val206Met might significantly affect urine oxalate. Conclusions SLC26A6 was effectively ruled out as the disease gene in this non-PH1/PH2 cohort. Taken together, our studies are the first to identify and characterize SLC26A6 variants in hyperoxaluria. Phenotypic and functional analysis excluded a significant effect of identified variants on oxalate excretion. PMID:18951670
Chromosomal microarray findings in pregnancies with an isolated pelvic kidney.
Sagi-Dain, Lena; Singer, Amihood; Frumkin, Ayala; Shalata, Adel; Koifman, Arie; Segel, Reeval; Benyamini, Lilach; Rienstein, Shlomit; Kahyat, Morad; Sharony, Reuven; Maya, Idit; Ben Shachar, Shay
2018-05-29
To examine the risk for abnormal chromosomal microarray analysis (CMA) results among fetuses with an apparently isolated pelvic kidney. Data from all CMA analyses performed due to an isolated pelvic kidney reported to the Israeli Ministry of Health between January 2013 and September 2016 were retrospectively obtained. Risk estimation was performed comparing the rate of abnormal observed CMA findings to the general population risk, based on a systematic review encompassing 9272 cases and on local data of 5541 cases. Of 120 pregnancies with an isolated pelvic kidney, two gain-of-copy number variants suggesting microduplication syndromes were demonstrated (1.67%). In addition, three variants of unknown significance were detected (2.5%). The risk for clinically significant CMA findings among pregnancies with an isolated single pelvic kidney was not significantly different compared to both control populations. The results of our study question the practice of routine CMA analysis in fetuses with an isolated pelvic kidney.
NASA Astrophysics Data System (ADS)
Szafranko, Elżbieta
2017-10-01
Assessment of variant solutions developed for a building investment project needs to be made at the stage of planning. While considering alternative solutions, the investor defines various criteria, but a direct evaluation of the degree of their fulfilment by developed variant solutions can be very difficult. In practice, there are different methods which enable the user to include a large number of parameters into an analysis, but their implementation can be challenging. Some methods require advanced mathematical computations, preceded by complicating input data processing, and the generated results may not lend themselves easily to interpretation. Hence, during her research, the author has developed a systemic approach, which involves several methods and whose goal is to compare their outcome. The final stage of the proposed method consists of graphic interpretation of results. The method has been tested on a variety of building and development projects.
Molecular and geographic analyses of vampire bat-transmitted cattle rabies in central Brazil
Kobayashi, Yuki; Sato, Go; Mochizuki, Nobuyuki; Hirano, Shinji; Itou, Takuya; Carvalho, Adolorata AB; Albas, Avelino; Santos, Hamilton P; Ito, Fumio H; Sakai, Takeo
2008-01-01
Background Vampire bats are important rabies virus vectors, causing critical problems in both the livestock industry and public health sector in Latin America. In order to assess the epidemiological characteristics of vampire bat-transmitted rabies, the authors conducted phylogenetic and geographical analyses using sequence data of a large number of cattle rabies isolates collected from a wide geographical area in Brazil. Methods Partial nucleoprotein genes of rabies viruses isolated from 666 cattle and 18 vampire bats between 1987 and 2006 were sequenced and used for phylogenetic analysis. The genetic variants were plotted on topographical maps of Brazil. Results In this study, 593 samples consisting of 24 genetic variants were analyzed. Regional localization of variants was observed, with the distribution of several variants found to be delimited by mountain ranges which served as geographic boundaries. The geographical distributions of vampire-bat and cattle isolates that were classified as the identical phylogenetic group were found to overlap with high certainty. Most of the samples analyzed in this study were isolated from adjacent areas linked by rivers. Conclusion This study revealed the existence of several dozen regional variants associated with vampire bats in Brazil, with the distribution patterns of these variants found to be affected by mountain ranges and rivers. These results suggest that epidemiological characteristics of vampire bat-related rabies appear to be associated with the topographical and geographical characteristics of areas where cattle are maintained, and the factors affecting vampire bat ecology. PMID:18983685
Ridge, Perry G; Maxwell, Taylor J; Foutz, Spencer J; Bailey, Matthew H; Corcoran, Christopher D; Tschanz, JoAnn T; Norton, Maria C; Munger, Ronald G; O'Brien, Elizabeth; Kerber, Richard A; Cawthon, Richard M; Kauwe, John S K
2014-01-01
The mitochondria are essential organelles and are the location of cellular respiration, which is responsible for the majority of ATP production. Each cell contains multiple mitochondria, and each mitochondrion contains multiple copies of its own circular genome. The ratio of mitochondrial genomes to nuclear genomes is referred to as mitochondrial copy number. Decreases in mitochondrial copy number are known to occur in many tissues as people age, and in certain diseases. The regulation of mitochondrial copy number by nuclear genes has been studied extensively. While mitochondrial variation has been associated with longevity and some of the diseases known to have reduced mitochondrial copy number, the role that the mitochondrial genome itself has in regulating mitochondrial copy number remains poorly understood. We analyzed the complete mitochondrial genomes from 1007 individuals randomly selected from the Cache County Study on Memory Health and Aging utilizing the inferred evolutionary history of the mitochondrial haplotypes present in our dataset to identify sequence variation and mitochondrial haplotypes associated with changes in mitochondrial copy number. Three variants belonging to mitochondrial haplogroups U5A1 and T2 were significantly associated with higher mitochondrial copy number in our dataset. We identified three variants associated with higher mitochondrial copy number and suggest several hypotheses for how these variants influence mitochondrial copy number by interacting with known regulators of mitochondrial copy number. Our results are the first to report sequence variation in the mitochondrial genome that causes changes in mitochondrial copy number. The identification of these variants that increase mtDNA copy number has important implications in understanding the pathological processes that underlie these phenotypes.
Identifying Mendelian disease genes with the Variant Effect Scoring Tool
2013-01-01
Background Whole exome sequencing studies identify hundreds to thousands of rare protein coding variants of ambiguous significance for human health. Computational tools are needed to accelerate the identification of specific variants and genes that contribute to human disease. Results We have developed the Variant Effect Scoring Tool (VEST), a supervised machine learning-based classifier, to prioritize rare missense variants with likely involvement in human disease. The VEST classifier training set comprised ~ 45,000 disease mutations from the latest Human Gene Mutation Database release and another ~45,000 high frequency (allele frequency >1%) putatively neutral missense variants from the Exome Sequencing Project. VEST outperforms some of the most popular methods for prioritizing missense variants in carefully designed holdout benchmarking experiments (VEST ROC AUC = 0.91, PolyPhen2 ROC AUC = 0.86, SIFT4.0 ROC AUC = 0.84). VEST estimates variant score p-values against a null distribution of VEST scores for neutral variants not included in the VEST training set. These p-values can be aggregated at the gene level across multiple disease exomes to rank genes for probable disease involvement. We tested the ability of an aggregate VEST gene score to identify candidate Mendelian disease genes, based on whole-exome sequencing of a small number of disease cases. We used whole-exome data for two Mendelian disorders for which the causal gene is known. Considering only genes that contained variants in all cases, the VEST gene score ranked dihydroorotate dehydrogenase (DHODH) number 2 of 2253 genes in four cases of Miller syndrome, and myosin-3 (MYH3) number 2 of 2313 genes in three cases of Freeman Sheldon syndrome. Conclusions Our results demonstrate the potential power gain of aggregating bioinformatics variant scores into gene-level scores and the general utility of bioinformatics in assisting the search for disease genes in large-scale exome sequencing studies. VEST is available as a stand-alone software package at http://wiki.chasmsoftware.org and is hosted by the CRAVAT web server at http://www.cravat.us PMID:23819870
Lyu, S; Arends, D; Nassar, M K; Brockmann, G A
2017-06-01
In our previous research, QTL analysis in an F 2 cross between the inbred New Hampshire (NHI) and White Leghorn (WL77) lines revealed a growth QTL in the distal part of chromosome 4. To physically reduce the chromosomal interval and the number of potential candidate genes, we performed fine mapping using individuals of generations F 10 , F 11 and F 12 in an advanced intercross line that had been established from the initial F 2 mapping population. Using nine single nucleotide polymorphism (SNP) markers within the QTL region for an association analysis with several growth traits from hatch to 20 weeks and body composition traits at 20 weeks, we could reduce the confidence interval from 26.9 to 3.4 Mb. Within the fine mapped region, markers rs14490774, rs314961352 and rs318175270 were in full linkage disequilibrium (D' = 1.0) and showed the strongest effect on growth and muscle mass (LOD ≥ 4.00). This reduced region contains 30 genes, compared to 292 genes in the original region. Chicken 60 K and 600 K SNP chips combined with DNA sequencing of the parental lines were used to call mutations in the reduced region. In the narrowed-down region, 489 sequence variants were detected between NHI and WL77. The most deleterious variants are a missense variant in ADGRA3 (SIFT = 0.02) and a frameshift deletion in the functional unknown gene ENSGALG00000014401 in NHI chicken. In addition, five synonymous variants were discovered in genes PPARGC1A, ADGRA3, PACRGL, SLIT2 and FAM184B. In our study, the confidence interval and the number of potential genes could be reduced 8- and 10- fold respectively. Further research will focus on functional effects of mutant genes. © 2017 Stichting International Foundation for Animal Genetics.
DMET-Miner: Efficient discovery of association rules from pharmacogenomic data.
Agapito, Giuseppe; Guzzi, Pietro H; Cannataro, Mario
2015-08-01
Microarray platforms enable the investigation of allelic variants that may be correlated to phenotypes. Among those, the Affymetrix DMET (Drug Metabolism Enzymes and Transporters) platform enables the simultaneous investigation of all the genes that are related to drug absorption, distribution, metabolism and excretion (ADME). Although recent studies demonstrated the effectiveness of the use of DMET data for studying drug response or toxicity in clinical studies, there is a lack of tools for the automatic analysis of DMET data. In a previous work we developed DMET-Analyzer, a methodology and a supporting platform able to automatize the statistical study of allelic variants, that has been validated in several clinical studies. Although DMET-Analyzer is able to correlate a single variant for each probe (related to a portion of a gene) through the use of the Fisher test, it is unable to discover multiple associations among allelic variants, due to its underlying statistic analysis strategy that focuses on a single variant for each time. To overcome those limitations, here we propose a new analysis methodology for DMET data based on Association Rules mining, and an efficient implementation of this methodology, named DMET-Miner. DMET-Miner extends the DMET-Analyzer tool with data mining capabilities and correlates the presence of a set of allelic variants with the conditions of patient's samples by exploiting association rules. To face the high number of frequent itemsets generated when considering large clinical studies based on DMET data, DMET-Miner uses an efficient data structure and implements an optimized search strategy that reduces the search space and the execution time. Preliminary experiments on synthetic DMET datasets, show how DMET-Miner outperforms off-the-shelf data mining suites such as the FP-Growth algorithms available in Weka and RapidMiner. To demonstrate the biological relevance of the extracted association rules and the effectiveness of the proposed approach from a medical point of view, some preliminary studies on a real clinical dataset are currently under medical investigation. Copyright © 2015 Elsevier Inc. All rights reserved.
Corominas, Jordi; Colijn, Johanna M; Geerlings, Maartje J; Pauper, Marc; Bakker, Bjorn; Amin, Najaf; Lores Motta, Laura; Kersten, Eveline; Garanto, Alejandro; Verlouw, Joost A M; van Rooij, Jeroen G J; Kraaij, Robert; de Jong, Paulus T V M; Hofman, Albert; Vingerling, Johannes R; Schick, Tina; Fauser, Sascha; de Jong, Eiko K; van Duijn, Cornelia M; Hoyng, Carel B; Klaver, Caroline C W; den Hollander, Anneke I
2018-04-26
Genome-wide association studies and targeted sequencing studies of candidate genes have identified common and rare variants that are associated with age-related macular degeneration (AMD). Whole-exome sequencing (WES) studies allow a more comprehensive analysis of rare coding variants across all genes of the genome and will contribute to a better understanding of the underlying disease mechanisms. To date, the number of WES studies in AMD case-control cohorts remains scarce and sample sizes are limited. To scrutinize the role of rare protein-altering variants in AMD cause, we performed the largest WES study in AMD to date in a large European cohort consisting of 1125 AMD patients and 1361 control participants. Genome-wide case-control association study of WES data. One thousand one hundred twenty-five AMD patients and 1361 control participants. A single variant association test of WES data was performed to detect variants that are associated individually with AMD. The cumulative effect of multiple rare variants with 1 gene was analyzed using a gene-based CMC burden test. Immunohistochemistry was performed to determine the localization of the Col8a1 protein in mouse eyes. Genetic variants associated with AMD. We detected significantly more rare protein-altering variants in the COL8A1 gene in patients (22/2250 alleles [1.0%]) than in control participants (11/2722 alleles [0.4%]; P = 7.07×10 -5 ). The association of rare variants in the COL8A1 gene is independent of the common intergenic variant (rs140647181) near the COL8A1 gene previously associated with AMD. We demonstrated that the Col8a1 protein localizes at Bruch's membrane. This study supported a role for protein-altering variants in the COL8A1 gene in AMD pathogenesis. We demonstrated the presence of Col8a1 in Bruch's membrane, further supporting the role of COL8A1 variants in AMD pathogenesis. Protein-altering variants in COL8A1 may alter the integrity of Bruch's membrane, contributing to the accumulation of drusen and the development of AMD. Copyright © 2018 American Academy of Ophthalmology. Published by Elsevier Inc. All rights reserved.
Drögemüller, Cord; Jagannathan, Vidhya; Keller, Irene; Wüthrich, Daniel; Bruggmann, Rémy; Schütz, Ekkehard; Demmel, Steffi; Moser, Simon; Signer-Hasler, Heidi; Pieńkowska-Schelling, Aldona; Schelling, Claude; Sande, Marcos; Rongen, Ronald
2017-01-01
Belted cattle have a circular belt of unpigmented hair and skin around their midsection. The belt is inherited as a monogenic autosomal dominant trait. We mapped the causative variant to a 37 kb segment on bovine chromosome 3. Whole genome sequence data of 2 belted and 130 control cattle yielded only one private genetic variant in the critical interval in the two belted animals. The belt-associated variant was a copy number variant (CNV) involving the quadruplication of a 6 kb non-coding sequence located approximately 16 kb upstream of the TWIST2 gene. Increased copy numbers at this CNV were strongly associated with the belt phenotype in a cohort of 333 cases and 1322 controls. We hypothesized that the CNV causes aberrant expression of TWIST2 during neural crest development, which might negatively affect melanoblasts. Functional studies showed that ectopic expression of bovine TWIST2 in neural crest in transgenic zebrafish led to a decrease in melanocyte numbers. Our results thus implicate an unsuspected involvement of TWIST2 in regulating pigmentation and reveal a non-coding CNV underlying a captivating Mendelian character. PMID:28658273
Yoneyama, Sachiko; Yao, Jie; Guo, Xiuqing; Fernandez-Rhodes, Lindsay; Lim, Unhee; Boston, Jonathan; Buzková, Petra; Carlson, Christopher S.; Cheng, Iona; Cochran, Barbara; Cooper, Richard; Ehret, Georg; Fornage, Myriam; Gong, Jian; Gross, Myron; Gu, C. Charles; Haessler, Jeff; Haiman, Christopher A.; Henderson, Brian; Hindorff, Lucia A.; Houston, Denise; Irvin, Marguerite R.; Jackson, Rebecca; Kuller, Lew; Leppert, Mark; Lewis, Cora E.; Li, Rongling; Le Marchand, Loic; Matise, Tara C.; Nguyen, Khanh-Dung H.; Chakravarti, Aravinda; Pankow, James S.; Pankratz, Nathan; Pooler, Loreall; Ritchie, Marylyn D.; Bien, Stephanie A.; Wassel, Christina L.; Chen, Yii-Der I.; Taylor, Kent D.; Allison, Matthew; Rotter, Jerome I.; Schreiner, Pamela J.; Schumacher, Fredrick; Wilkens, Lynne; Boerwinkle, Eric; Kooperberg, Charles; Peters, Ulrike; Buyske, Steven; Graff, Mariaelisa; North, Kari E.
2016-01-01
Background/Objectives Central adiposity measures such as waist circumference (WC) and waist-to-hip ratio (WHR) are associated with cardiometabolic disorders independently of BMI and are gaining clinically utility. Several studies report genetic variants associated with central adiposity, but most utilize only European ancestry populations. Understanding whether the genetic associations discovered among mainly European descendants are shared with African ancestry populations will help elucidate the biological underpinnings of abdominal fat deposition. Subjects/Methods To identify the underlying functional genetic determinants of body fat distribution, we conducted an array-wide association meta-analysis among persons of African ancestry across seven studies/consortia participating in the Population Architecture using Genomics and Epidemiology (PAGE) consortium. We used the Metabochip array, designed for fine mapping cardiovascular associated loci, to explore novel array-wide associations with WC and WHR among 15 945 African descendants using all and sex-stratified groups. We further interrogated 17 known WHR regions for African ancestry-specific variants. Results Of the 17 WHR loci, eight SNPs located in four loci were replicated in the sex-combined or sex-stratified meta-analyses. Two of these eight independently associated with WHR after conditioning on the known variant in European descendants (rs12096179 in TBX15-WARS2 and rs2059092 in ADAMTS9). In the fine mapping assessment, the putative functional region was reduced across all four loci but to varying degrees (average 40% drop in number of putative SNPs and 20% drop in genomic region). Similar to previous studies, the significant SNPs in the female stratified analysis were stronger than the significant SNPs from the sex-combined analysis. No novel associations were detected in the array-wide analyses. Conclusions Of 17 previously identified loci, four loci replicated in the African ancestry populations of this study. Utilizing different linkage disequilibrium patterns observed between European and African ancestries, we narrowed the suggestive region containing causative variants for all four loci. PMID:27867202
ERIC Educational Resources Information Center
Pescosolido, Matthew F.; Gamsiz, Ece D.; Nagpal, Shailender; Morrow, Eric M.
2013-01-01
Objective: The purpose of the present study was to discover the extent to which distinct "DSM" disorders share large, highly recurrent copy number variants (CNVs) as susceptibility factors. We also sought to identify gene mechanisms common to groups of diagnoses and/or specific to a given diagnosis based on associations with CNVs. Method:…
ERIC Educational Resources Information Center
Eissen, Marco; Strudthoff, Merle; Backhaus, Solveig; Eismann, Carolin; Oetken, Gesa; Kaling, Soren; Lenoir, Dieter
2011-01-01
Oxidation-state and donor-acceptor concepts are important areas in the chemical education. Student worksheets containing problems that emphasize oxidation numbers, redox reactions of organic compounds, and stoichiometric reaction equations are presented. All of the examples are incorporated under one unifying topic: the production of vicinal…
USDA-ARS?s Scientific Manuscript database
Copy number variants (CNV) are large scale duplications or deletions of genomic sequence that are caused by a diverse set of molecular phenomena that are distinct from single nucleotide polymorphism (SNP) formation. Due to their different mechanisms of formation, CNVs are often difficult to track us...
ERIC Educational Resources Information Center
Bornmann, Lutz
2012-01-01
Ruscio, Seaman, D'Oriano, Stremlo, and Mahalchik (this issue) evaluate 22 bibliometric indicators, including conventional measures, like the number of publications, the "h" index, and many "h" index variants. To assess the quality of the indicators, their well-justified criteria encompass conceptual, empirical, and practical…
de Vries, Tamar I; R Monroe, Glen; van Belzen, Martine J; van der Lans, Christian A; Savelberg, Sanne MC; Newman, William G; van Haaften, Gijs; Nievelstein, Rutger A; van Haelst, Mieke M
2016-01-01
Rubinstein–Taybi syndrome (RTS, OMIM 180849) and Filippi syndrome (FLPIS, OMIM 272440) are both rare syndromes, with multiple congenital anomalies and intellectual deficit (MCA/ID). We present a patient with intellectual deficit, short stature, bilateral syndactyly of hands and feet, broad thumbs, ocular abnormalities, and dysmorphic facial features. These clinical features suggest both RTS and FLPIS. Initial DNA analysis of DNA isolated from blood did not identify variants to confirm either of these syndrome diagnoses. Whole-exome sequencing identified a homozygous variant in C9orf173, which was novel at the time of analysis. Further Sanger sequencing analysis of FLPIS cases tested negative for CKAP2L variants did not, however, reveal any further variants. Subsequent analysis using DNA isolated from buccal mucosa revealed a mosaic variant in CREBBP. This report highlights the importance of excluding mosaic variants in patients with a strong but atypical clinical presentation of a MCA/ID syndrome if no disease-causing variants can be detected in DNA isolated from blood samples. As the striking syndactyly observed in the present case is typical for FLPIS, we suggest CREBBP analysis in saliva samples for FLPIS syndrome cases in which no causal CKAP2L variant is detected. PMID:26956253
Phonetic Spelling Filter for Keyword Selection in Drug Mention Mining from Social Media
Pimpalkhute, Pranoti; Patki, Apurv; Nikfarjam, Azadeh; Gonzalez, Graciela
2014-01-01
Social media postings are rich in information that often remain hidden and inaccessible for automatic extraction due to inherent limitations of the site’s APIs, which mostly limit access via specific keyword-based searches (and limit both the number of keywords and the number of postings that are returned). When mining social media for drug mentions, one of the first problems to solve is how to derive a list of variants of the drug name (common misspellings) that can capture a sufficient number of postings. We present here an approach that filters the potential variants based on the intuition that, faced with the task of writing an unfamiliar, complex word (the drug name), users will tend to revert to phonetic spelling, and we thus give preference to variants that reflect the phonemes of the correct spelling. The algorithm allowed us to capture 50.4 – 56.0 % of the user comments using only about 18% of the variants. PMID:25717407
Phonetic spelling filter for keyword selection in drug mention mining from social media.
Pimpalkhute, Pranoti; Patki, Apurv; Nikfarjam, Azadeh; Gonzalez, Graciela
2014-01-01
Social media postings are rich in information that often remain hidden and inaccessible for automatic extraction due to inherent limitations of the site's APIs, which mostly limit access via specific keyword-based searches (and limit both the number of keywords and the number of postings that are returned). When mining social media for drug mentions, one of the first problems to solve is how to derive a list of variants of the drug name (common misspellings) that can capture a sufficient number of postings. We present here an approach that filters the potential variants based on the intuition that, faced with the task of writing an unfamiliar, complex word (the drug name), users will tend to revert to phonetic spelling, and we thus give preference to variants that reflect the phonemes of the correct spelling. The algorithm allowed us to capture 50.4 - 56.0 % of the user comments using only about 18% of the variants.
Eisenberger, Tobias; Neuhaus, Christine; Khan, Arif O.; Decker, Christian; Preising, Markus N.; Friedburg, Christoph; Bieg, Anika; Gliem, Martin; Issa, Peter Charbel; Holz, Frank G.; Baig, Shahid M.; Hellenbroich, Yorck; Galvez, Alberto; Platzer, Konrad; Wollnik, Bernd; Laddach, Nadja; Ghaffari, Saeed Reza; Rafati, Maryam; Botzenhart, Elke; Tinschert, Sigrid; Börger, Doris; Bohring, Axel; Schreml, Julia; Körtge-Jung, Stefani; Schell-Apacik, Chayim; Bakur, Khadijah; Al-Aama, Jumana Y.; Neuhann, Teresa; Herkenrath, Peter; Nürnberg, Gudrun; Nürnberg, Peter; Davis, John S.; Gal, Andreas; Bergmann, Carsten; Lorenz, Birgit; Bolz, Hanno J.
2013-01-01
Retinitis pigmentosa (RP) and Leber congenital amaurosis (LCA) are major causes of blindness. They result from mutations in many genes which has long hampered comprehensive genetic analysis. Recently, targeted next-generation sequencing (NGS) has proven useful to overcome this limitation. To uncover “hidden mutations” such as copy number variations (CNVs) and mutations in non-coding regions, we extended the use of NGS data by quantitative readout for the exons of 55 RP and LCA genes in 126 patients, and by including non-coding 5′ exons. We detected several causative CNVs which were key to the diagnosis in hitherto unsolved constellations, e.g. hemizygous point mutations in consanguineous families, and CNVs complemented apparently monoallelic recessive alleles. Mutations of non-coding exon 1 of EYS revealed its contribution to disease. In view of the high carrier frequency for retinal disease gene mutations in the general population, we considered the overall variant load in each patient to assess if a mutation was causative or reflected accidental carriership in patients with mutations in several genes or with single recessive alleles. For example, truncating mutations in RP1, a gene implicated in both recessive and dominant RP, were causative in biallelic constellations, unrelated to disease when heterozygous on a biallelic mutation background of another gene, or even non-pathogenic if close to the C-terminus. Patients with mutations in several loci were common, but without evidence for di- or oligogenic inheritance. Although the number of targeted genes was low compared to previous studies, the mutation detection rate was highest (70%) which likely results from completeness and depth of coverage, and quantitative data analysis. CNV analysis should routinely be applied in targeted NGS, and mutations in non-coding exons give reason to systematically include 5′-UTRs in disease gene or exome panels. Consideration of all variants is indispensable because even truncating mutations may be misleading. PMID:24265693
Eisenberger, Tobias; Neuhaus, Christine; Khan, Arif O; Decker, Christian; Preising, Markus N; Friedburg, Christoph; Bieg, Anika; Gliem, Martin; Charbel Issa, Peter; Holz, Frank G; Baig, Shahid M; Hellenbroich, Yorck; Galvez, Alberto; Platzer, Konrad; Wollnik, Bernd; Laddach, Nadja; Ghaffari, Saeed Reza; Rafati, Maryam; Botzenhart, Elke; Tinschert, Sigrid; Börger, Doris; Bohring, Axel; Schreml, Julia; Körtge-Jung, Stefani; Schell-Apacik, Chayim; Bakur, Khadijah; Al-Aama, Jumana Y; Neuhann, Teresa; Herkenrath, Peter; Nürnberg, Gudrun; Nürnberg, Peter; Davis, John S; Gal, Andreas; Bergmann, Carsten; Lorenz, Birgit; Bolz, Hanno J
2013-01-01
Retinitis pigmentosa (RP) and Leber congenital amaurosis (LCA) are major causes of blindness. They result from mutations in many genes which has long hampered comprehensive genetic analysis. Recently, targeted next-generation sequencing (NGS) has proven useful to overcome this limitation. To uncover "hidden mutations" such as copy number variations (CNVs) and mutations in non-coding regions, we extended the use of NGS data by quantitative readout for the exons of 55 RP and LCA genes in 126 patients, and by including non-coding 5' exons. We detected several causative CNVs which were key to the diagnosis in hitherto unsolved constellations, e.g. hemizygous point mutations in consanguineous families, and CNVs complemented apparently monoallelic recessive alleles. Mutations of non-coding exon 1 of EYS revealed its contribution to disease. In view of the high carrier frequency for retinal disease gene mutations in the general population, we considered the overall variant load in each patient to assess if a mutation was causative or reflected accidental carriership in patients with mutations in several genes or with single recessive alleles. For example, truncating mutations in RP1, a gene implicated in both recessive and dominant RP, were causative in biallelic constellations, unrelated to disease when heterozygous on a biallelic mutation background of another gene, or even non-pathogenic if close to the C-terminus. Patients with mutations in several loci were common, but without evidence for di- or oligogenic inheritance. Although the number of targeted genes was low compared to previous studies, the mutation detection rate was highest (70%) which likely results from completeness and depth of coverage, and quantitative data analysis. CNV analysis should routinely be applied in targeted NGS, and mutations in non-coding exons give reason to systematically include 5'-UTRs in disease gene or exome panels. Consideration of all variants is indispensable because even truncating mutations may be misleading.
Genome-wide Polygenic Burden of Rare Deleterious Variants in Sudden Unexpected Death in Epilepsy.
Leu, Costin; Balestrini, Simona; Maher, Bridget; Hernández-Hernández, Laura; Gormley, Padhraig; Hämäläinen, Eija; Heggeli, Kristin; Schoeler, Natasha; Novy, Jan; Willis, Joseph; Plagnol, Vincent; Ellis, Rachael; Reavey, Eleanor; O'Regan, Mary; Pickrell, William O; Thomas, Rhys H; Chung, Seo-Kyung; Delanty, Norman; McMahon, Jacinta M; Malone, Stephen; Sadleir, Lynette G; Berkovic, Samuel F; Nashef, Lina; Zuberi, Sameer M; Rees, Mark I; Cavalleri, Gianpiero L; Sander, Josemir W; Hughes, Elaine; Helen Cross, J; Scheffer, Ingrid E; Palotie, Aarno; Sisodiya, Sanjay M
2015-09-01
Sudden unexpected death in epilepsy (SUDEP) represents the most severe degree of the spectrum of epilepsy severity and is the commonest cause of epilepsy-related premature mortality. The precise pathophysiology and the genetic architecture of SUDEP remain elusive. Aiming to elucidate the genetic basis of SUDEP, we analysed rare, protein-changing variants from whole-exome sequences of 18 people who died of SUDEP, 87 living people with epilepsy and 1479 non-epilepsy disease controls. Association analysis revealed a significantly increased genome-wide polygenic burden per individual in the SUDEP cohort when compared to epilepsy (P = 5.7 × 10(- 3)) and non-epilepsy disease controls (P = 1.2 × 10(- 3)). The polygenic burden was driven both by the number of variants per individual, and over-representation of variants likely to be deleterious in the SUDEP cohort. As determined by this study, more than a thousand genes contribute to the observed polygenic burden within the framework of this study. Subsequent gene-based association analysis revealed five possible candidate genes significantly associated with SUDEP or epilepsy, but no one single gene emerges as common to the SUDEP cases. Our findings provide further evidence for a genetic susceptibility to SUDEP, and suggest an extensive polygenic contribution to SUDEP causation. Thus, an overall increased burden of deleterious variants in a highly polygenic background might be important in rendering a given individual more susceptible to SUDEP. Our findings suggest that exome sequencing in people with epilepsy might eventually contribute to generating SUDEP risk estimates, promoting stratified medicine in epilepsy, with the eventual aim of reducing an individual patient's risk of SUDEP.
Genome-wide Polygenic Burden of Rare Deleterious Variants in Sudden Unexpected Death in Epilepsy
Leu, Costin; Balestrini, Simona; Maher, Bridget; Hernández-Hernández, Laura; Gormley, Padhraig; Hämäläinen, Eija; Heggeli, Kristin; Schoeler, Natasha; Novy, Jan; Willis, Joseph; Plagnol, Vincent; Ellis, Rachael; Reavey, Eleanor; O'Regan, Mary; Pickrell, William O.; Thomas, Rhys H.; Chung, Seo-Kyung; Delanty, Norman; McMahon, Jacinta M.; Malone, Stephen; Sadleir, Lynette G.; Berkovic, Samuel F.; Nashef, Lina; Zuberi, Sameer M.; Rees, Mark I.; Cavalleri, Gianpiero L.; Sander, Josemir W.; Hughes, Elaine; Helen Cross, J.; Scheffer, Ingrid E.; Palotie, Aarno; Sisodiya, Sanjay M.
2015-01-01
Sudden unexpected death in epilepsy (SUDEP) represents the most severe degree of the spectrum of epilepsy severity and is the commonest cause of epilepsy-related premature mortality. The precise pathophysiology and the genetic architecture of SUDEP remain elusive. Aiming to elucidate the genetic basis of SUDEP, we analysed rare, protein-changing variants from whole-exome sequences of 18 people who died of SUDEP, 87 living people with epilepsy and 1479 non-epilepsy disease controls. Association analysis revealed a significantly increased genome-wide polygenic burden per individual in the SUDEP cohort when compared to epilepsy (P = 5.7 × 10− 3) and non-epilepsy disease controls (P = 1.2 × 10− 3). The polygenic burden was driven both by the number of variants per individual, and over-representation of variants likely to be deleterious in the SUDEP cohort. As determined by this study, more than a thousand genes contribute to the observed polygenic burden within the framework of this study. Subsequent gene-based association analysis revealed five possible candidate genes significantly associated with SUDEP or epilepsy, but no one single gene emerges as common to the SUDEP cases. Our findings provide further evidence for a genetic susceptibility to SUDEP, and suggest an extensive polygenic contribution to SUDEP causation. Thus, an overall increased burden of deleterious variants in a highly polygenic background might be important in rendering a given individual more susceptible to SUDEP. Our findings suggest that exome sequencing in people with epilepsy might eventually contribute to generating SUDEP risk estimates, promoting stratified medicine in epilepsy, with the eventual aim of reducing an individual patient's risk of SUDEP. PMID:26501104
Association of genetic variants of GRIN2B with autism.
Pan, Yongcheng; Chen, Jingjing; Guo, Hui; Ou, Jianjun; Peng, Yu; Liu, Qiong; Shen, Yidong; Shi, Lijuan; Liu, Yalan; Xiong, Zhimin; Zhu, Tengfei; Luo, Sanchuan; Hu, Zhengmao; Zhao, Jingping; Xia, Kun
2015-02-06
Autism (MIM 209850) is a complex neurodevelopmental disorder characterized by social communication impairments and restricted repetitive behaviors. It has a high heritability, although much remains unclear. To evaluate genetic variants of GRIN2B in autism etiology, we performed a system association study of common and rare variants of GRIN2B and autism in cohorts from a Chinese population, involving a total sample of 1,945 subjects. Meta-analysis of a triad family cohort and a case-control cohort identified significant associations of multiple common variants and autism risk (Pmin = 1.73 × 10(-4)). Significantly, the haplotype involved with the top common variants also showed significant association (P = 1.78 × 10(-6)). Sanger sequencing of 275 probands from a triad cohort identified several variants in coding regions, including four common variants and seven rare variants. Two of the common coding variants were located in the autism-related linkage disequilibrium (LD) block, and both were significantly associated with autism (P < 9 × 10(-3)) using an independent control cohort. Burden analysis and case-only analysis of rare coding variants identified by Sanger sequencing did not find this association. Our study for the first time reveals that common variants and related haplotypes of GRIN2B are associated with autism risk.
Held, Elizabeth; Cape, Joshua; Tintle, Nathan
2016-01-01
Machine learning methods continue to show promise in the analysis of data from genetic association studies because of the high number of variables relative to the number of observations. However, few best practices exist for the application of these methods. We extend a recently proposed supervised machine learning approach for predicting disease risk by genotypes to be able to incorporate gene expression data and rare variants. We then apply 2 different versions of the approach (radial and linear support vector machines) to simulated data from Genetic Analysis Workshop 19 and compare performance to logistic regression. Method performance was not radically different across the 3 methods, although the linear support vector machine tended to show small gains in predictive ability relative to a radial support vector machine and logistic regression. Importantly, as the number of genes in the models was increased, even when those genes contained causal rare variants, model predictive ability showed a statistically significant decrease in performance for both the radial support vector machine and logistic regression. The linear support vector machine showed more robust performance to the inclusion of additional genes. Further work is needed to evaluate machine learning approaches on larger samples and to evaluate the relative improvement in model prediction from the incorporation of gene expression data.
Hwang, Sang Mee; Lee, Ki Chan; Lee, Min Seob; Park, Kyoung Un
2018-01-01
Transition to next generation sequencing (NGS) for BRCA1 / BRCA2 analysis in clinical laboratories is ongoing but different platforms and/or data analysis pipelines give different results resulting in difficulties in implementation. We have evaluated the Ion Personal Genome Machine (PGM) Platforms (Ion PGM, Ion PGM Dx, Thermo Fisher Scientific) for the analysis of BRCA1 /2. The results of Ion PGM with OTG-snpcaller, a pipeline based on Torrent mapping alignment program and Genome Analysis Toolkit, from 75 clinical samples and 14 reference DNA samples were compared with Sanger sequencing for BRCA1 / BRCA2 . Ten clinical samples and 14 reference DNA samples were additionally sequenced by Ion PGM Dx with Torrent Suite. Fifty types of variants including 18 pathogenic or variants of unknown significance were identified from 75 clinical samples and known variants of the reference samples were confirmed by Sanger sequencing and/or NGS. One false-negative results were present for Ion PGM/OTG-snpcaller for an indel variant misidentified as a single nucleotide variant. However, eight discordant results were present for Ion PGM Dx/Torrent Suite with both false-positive and -negative results. A 40-bp deletion, a 4-bp deletion and a 1-bp deletion variant was not called and a false-positive deletion was identified. Four other variants were misidentified as another variant. Ion PGM/OTG-snpcaller showed acceptable performance with good concordance with Sanger sequencing. However, Ion PGM Dx/Torrent Suite showed many discrepant results not suitable for use in a clinical laboratory, requiring further optimization of the data analysis for calling variants.
O'Dwyer, James P; Kandler, Anne
2017-12-05
Neutral evolution assumes that there are no selective forces distinguishing different variants in a population. Despite this striking assumption, many recent studies have sought to assess whether neutrality can provide a good description of different episodes of cultural change. One approach has been to test whether neutral predictions are consistent with observed progeny distributions, recording the number of variants that have produced a given number of new instances within a specified time interval: a classic example is the distribution of baby names. Using an overlapping generations model, we show that these distributions consist of two phases: a power-law phase with a constant exponent of [Formula: see text], followed by an exponential cut-off for variants with very large numbers of progeny. Maximum-likelihood estimations of the model parameters provide a direct way to establish whether observed empirical patterns are consistent with neutral evolution. We apply our approach to a complete dataset of baby names from Australia. Crucially, we show that analyses based on only the most popular variants, as is often the case in studies of cultural evolution, can provide misleading evidence for underlying transmission hypotheses. While neutrality provides a plausible description of progeny distributions of abundant variants, rare variants deviate from neutrality. Further, we develop a simulation framework that allows the detection of alternative cultural transmission processes. We show that anti-novelty bias is able to replicate the complete progeny distribution of the Australian dataset.This article is part of the themed issue 'Process and pattern in innovations from cells to societies'. © 2017 The Author(s).
Out, Astrid A; van Minderhout, Ivonne J H M; van der Stoep, Nienke; van Bommel, Lysette S R; Kluijt, Irma; Aalfs, Cora; Voorendt, Marsha; Vossen, Rolf H A M; Nielsen, Maartje; Vasen, Hans F A; Morreau, Hans; Devilee, Peter; Tops, Carli M J; Hes, Frederik J
2015-06-01
Familial adenomatous polyposis is most frequently caused by pathogenic variants in either the APC gene or the MUTYH gene. The detection rate of pathogenic variants depends on the severity of the phenotype and sensitivity of the screening method, including sensitivity for mosaic variants. For 171 patients with multiple colorectal polyps without previously detectable pathogenic variant, APC was reanalyzed in leukocyte DNA by one uniform technique: high-resolution melting (HRM) analysis. Serial dilution of heterozygous DNA resulted in a lowest detectable allelic fraction of 6% for the majority of variants. HRM analysis and subsequent sequencing detected pathogenic fully heterozygous APC variants in 10 (6%) of the patients and pathogenic mosaic variants in 2 (1%). All these variants were previously missed by various conventional scanning methods. In parallel, HRM APC scanning was applied to DNA isolated from polyp tissue of two additional patients with apparently sporadic polyposis and without detectable pathogenic APC variant in leukocyte DNA. In both patients a pathogenic mosaic APC variant was present in multiple polyps. The detection of pathogenic APC variants in 7% of the patients, including mosaics, illustrates the usefulness of a complete APC gene reanalysis of previously tested patients, by a supplementary scanning method. HRM is a sensitive and fast pre-screening method for reliable detection of heterozygous and mosaic variants, which can be applied to leukocyte and polyp derived DNA.
1976-07-16
Influence of Range 10 5 Range Performance Penalty Function II 6 Influence of Closing Velocity 12 7 Energy Influence Function 14 8 Comparison of the...flELtSHAlL, ..E^) RANGE RANGE Figure 7 Energy Influence Function 14 TM 76-1 SA ! PERFORMANCE INDEX COMPARATIVE ANALYSIS Maneuver Conversion Model...hnergy Integral ■’> E s K Energy Influence Function K* Proportionality Constant MT Target Mach Number N Normal Acceleration (load factor) z
Hildebrandt, Michelle A T; Roth, Jack A; Vaporciyan, Ara A; Pu, Xia; Ye, Yuanqing; Correa, Arlene M; Kim, Jae Y; Swisher, Stephen G; Wu, Xifeng
2015-07-13
Post-operative pulmonary complications are the most common morbidity associated with lung resection in non-small cell lung cancer (NSCLC) patients. The TNF/TRAF2/ASK1/p38 kinase pathway is activated by stress stimuli and inflammatory signals. We hypothesized that genetic polymorphisms within this pathway may contribute to risk of complications. In this case-only study, we genotyped 173 germline genetic variants in a discovery population of 264 NSCLC patients who underwent a lobectomy followed by genotyping of the top variants in a replication population of 264 patients. Complications data was obtained from a prospective database at MD Anderson. MAP2K4:rs12452497 was significantly associated with a decreased risk in both phases, resulting in a 40% reduction in the pooled population (95% CI:0.43-0.83, P = 0.0018). In total, seven variants were significant for risk in the pooled analysis. Gene-based analysis supported the involvement of TRAF2, MAP2K4, and MAP3K5 as mediating complications risk and a highly significant trend was identified between the number of risk genotypes and complications risk (P = 1.63 × 10(-8)). An inverse relationship was observed between association with clinical outcomes and complications for two variants. These results implicate the TNF/TRAF2/ASK1/p38 kinase pathway in modulating risk of pulmonary complications following lobectomy and may be useful biomarkers to identify patients at high risk.
Stormer, R S; Falkinham, J O
1989-01-01
Unpigmented colonial variants were isolated from pigmented Mycobacterium avium isolates recovered from patients with acquired immunodeficiency syndrome and the environment. The variants were interconvertible: the rate of transition from unpigmented to pigmented type was 4.0 x 10(-5) variants per cell per generation. The unpigmented variants were more tolerant to antibiotics, especially beta-lactams, and Cd2+ and Cu2+ salts than were their pigmented parents. Both pigmented and unpigmented variants of the strains produced beta-lactamase, although beta-lactamase did not appear to be a determinant of beta-lactam susceptibility. Pigmented variants grew more rapidly in a number of commonly used mycobacterial media, were more hydrophobic, and had higher carotenoid contents than their unpigmented segregants. PMID:2808669
Bedard, Tanya; Lowry, R Brian; Sibbald, Barbara; Thomas, Mary Ann; Innes, A Micheil
2016-01-01
The use of array-based comparative genomic hybridization to assess DNA copy number is increasing in many jurisdictions. Such technology identifies more genetic causes of congenital anomalies; however, the clinical significance of some results may be challenging to interpret. A coding strategy to address cases with copy number variants has recently been implemented by the Alberta Congenital Anomalies Surveillance System and is described.
Rietschel, Marcella; Mattheisen, Manuel; Breuer, René; Schulze, Thomas G.; Nöthen, Markus M.; Levinson, Douglas; Shi, Jianxin; Gejman, Pablo V.; Cichon, Sven; Ophoff, Roel A.
2012-01-01
Recent studies suggest that variation in complex disorders (e.g., schizophrenia) is explained by a large number of genetic variants with small effect size (Odds Ratio∼1.05–1.1). The statistical power to detect these genetic variants in Genome Wide Association (GWA) studies with large numbers of cases and controls (∼15,000) is still low. As it will be difficult to further increase sample size, we decided to explore an alternative method for analyzing GWA data in a study of schizophrenia, dramatically reducing the number of statistical tests. The underlying hypothesis was that at least some of the genetic variants related to a common outcome are collocated in segments of chromosomes at a wider scale than single genes. Our approach was therefore to study the association between relatively large segments of DNA and disease status. An association test was performed for each SNP and the number of nominally significant tests in a segment was counted. We then performed a permutation-based binomial test to determine whether this region contained significantly more nominally significant SNPs than expected under the null hypothesis of no association, taking linkage into account. Genome Wide Association data of three independent schizophrenia case/control cohorts with European ancestry (Dutch, German, and US) using segments of DNA with variable length (2 to 32 Mbp) was analyzed. Using this approach we identified a region at chromosome 5q23.3-q31.3 (128–160 Mbp) that was significantly enriched with nominally associated SNPs in three independent case-control samples. We conclude that considering relatively wide segments of chromosomes may reveal reliable relationships between the genome and schizophrenia, suggesting novel methodological possibilities as well as raising theoretical questions. PMID:22723893
Hormone escape is associated with genomic instability in a human prostate cancer model.
Legrier, Marie-Emmanuelle; Guyader, Charlotte; Céraline, Jocelyn; Dutrillaux, Bernard; Oudard, Stéphane; Poupon, Marie-France; Auger, Nathalie
2009-03-01
Lack of hormone dependency in prostate cancers is an irreversible event that occurs through generation of genomic instability induced by androgen deprivation. Indeed, the cytogenetic profile of hormone-dependent (HD) prostate cancer remains stable as long as it received a hormone supply, whereas the profile of hormone-independent (HID) variants acquired new and various alterations. This is demonstrated here using a HD xenografted model of a human prostate cancer, PAC120, transplanted for 11 years into male nude mice and 4 HID variants obtained by surgical castration. Cytogenetic analysis, done by karyotype, FISH, CGH and array-CGH, shows that PAC120 at early passage presents numerous chromosomal alterations. Very few additional alterations were found between the 5th and 47th passages, indicating the stability of the parental tumor. HID variants largely maintained the core of chromosomal alterations of PAC120 - losses at 6q, 7p, 12q, 15q and 17q sites. However, each HID variant displayed a number of new alterations, almost all being specific to each variant and very few shared by all. None of the HID had androgen receptor mutations. Our study indicates that hormone castration is responsible for genomic instability generating new cytogenetic abnormalities susceptible to alter the properties of cancer cell associated with tumor progression, such as increased cell survival and ability to metastasize.
A genetic study of Wilson’s disease in the United Kingdom
Coffey, Alison J.; Durkie, Miranda; Hague, Stephen; McLay, Kirsten; Emmerson, Jennifer; Lo, Christine; Klaffke, Stefanie; Joyce, Christopher J.; Dhawan, Anil; Hadzic, Nedim; Mieli-Vergani, Giorgina; Kirk, Richard; Elizabeth Allen, K.; Nicholl, David; Wong, Siew; Griffiths, William; Smithson, Sarah; Giffin, Nicola; Taha, Ali; Connolly, Sally; Gillett, Godfrey T.; Tanner, Stuart; Bonham, Jim; Sharrack, Basil; Palotie, Aarno; Rattray, Magnus; Dalton, Ann
2013-01-01
Previous studies have failed to identify mutations in the Wilson’s disease gene ATP7B in a significant number of clinically diagnosed cases. This has led to concerns about genetic heterogeneity for this condition but also suggested the presence of unusual mutational mechanisms. We now present our findings in 181 patients from the United Kingdom with clinically and biochemically confirmed Wilson’s disease. A total of 116 different ATP7B mutations were detected, 32 of which are novel. The overall mutation detection frequency was 98%. The likelihood of mutations in genes other than ATP7B causing a Wilson’s disease phenotype is therefore very low. We report the first cases with Wilson’s disease due to segmental uniparental isodisomy as well as three patients with three ATP7B mutations and three families with Wilson’s disease in two consecutive generations. We determined the genetic prevalence of Wilson’s disease in the United Kingdom by sequencing the entire coding region and adjacent splice sites of ATP7B in 1000 control subjects. The frequency of all single nucleotide variants with in silico evidence of pathogenicity (Class 1 variant) was 0.056 or 0.040 if only those single nucleotide variants that had previously been reported as mutations in patients with Wilson’s disease were included in the analysis (Class 2 variant). The frequency of heterozygote, putative or definite disease-associated ATP7B mutations was therefore considerably higher than the previously reported occurrence of 1:90 (or 0.011) for heterozygote ATP7B mutation carriers in the general population (P < 2.2 × 10-16 for Class 1 variants or P < 5 × 10-11 for Class 2 variants only). Subsequent exclusion of four Class 2 variants without additional in silico evidence of pathogenicity led to a further reduction of the mutation frequency to 0.024. Using this most conservative approach, the calculated frequency of individuals predicted to carry two mutant pathogenic ATP7B alleles is 1:7026 and thus still considerably higher than the typically reported prevalence of Wilson’s disease of 1:30 000 (P = 0.00093). Our study provides strong evidence for monogenic inheritance of Wilson’s disease. It also has major implications for ATP7B analysis in clinical practice, namely the need to consider unusual genetic mechanisms such as uniparental disomy or the possible presence of three ATP7B mutations. The marked discrepancy between the genetic prevalence and the number of clinically diagnosed cases of Wilson’s disease may be due to both reduced penetrance of ATP7B mutations and failure to diagnose patients with this eminently treatable disorder. PMID:23518715
U1 small nuclear RNA variants differentially form ribonucleoprotein particles in vitro.
Somarelli, Jason A; Mesa, Annia; Rodriguez, Carol E; Sharma, Shalini; Herrera, Rene J
2014-04-25
The U1 small nuclear (sn)RNA participates in splicing of pre-mRNAs by recognizing and binding to 5' splice sites at exon/intron boundaries. U1 snRNAs associate with 5' splice sites in the form of ribonucleoprotein particles (snRNPs) that are comprised of the U1 snRNA and 10 core components, including U1A, U1-70K, U1C and the 'Smith antigen', or Sm, heptamer. The U1 snRNA is highly conserved across a wide range of taxa; however, a number of reports have identified the presence of expressed U1-like snRNAs in multiple species, including humans. While numerous U1-like molecules have been shown to be expressed, it is unclear whether these variant snRNAs have the capacity to form snRNPs and participate in splicing. The purpose of the present study was to further characterize biochemically the ability of previously identified human U1-like variants to form snRNPs and bind to U1 snRNP proteins. A bioinformatics analysis provided support for the existence of multiple expressed variants. In vitro gel shift assays, competition assays, and immunoprecipitations (IPs) revealed that the variants formed high molecular weight assemblies to varying degrees and associated with core U1 snRNP proteins to a lesser extent than the canonical U1 snRNA. Together, these data suggest that the human U1 snRNA variants analyzed here are unable to efficiently bind U1 snRNP proteins. The current work provides additional biochemical insights into the ability of the variants to assemble into snRNPs. Copyright © 2014 Elsevier B.V. All rights reserved.
High depth, whole-genome sequencing of cholera isolates from Haiti and the Dominican Republic.
Sealfon, Rachel; Gire, Stephen; Ellis, Crystal; Calderwood, Stephen; Qadri, Firdausi; Hensley, Lisa; Kellis, Manolis; Ryan, Edward T; LaRocque, Regina C; Harris, Jason B; Sabeti, Pardis C
2012-09-11
Whole-genome sequencing is an important tool for understanding microbial evolution and identifying the emergence of functionally important variants over the course of epidemics. In October 2010, a severe cholera epidemic began in Haiti, with additional cases identified in the neighboring Dominican Republic. We used whole-genome approaches to sequence four Vibrio cholerae isolates from Haiti and the Dominican Republic and three additional V. cholerae isolates to a high depth of coverage (>2000x); four of the seven isolates were previously sequenced. Using these sequence data, we examined the effect of depth of coverage and sequencing platform on genome assembly and identification of sequence variants. We found that 50x coverage is sufficient to construct a whole-genome assembly and to accurately call most variants from 100 base pair paired-end sequencing reads. Phylogenetic analysis between the newly sequenced and thirty-three previously sequenced V. cholerae isolates indicates that the Haitian and Dominican Republic isolates are closest to strains from South Asia. The Haitian and Dominican Republic isolates form a tight cluster, with only four variants unique to individual isolates. These variants are located in the CTX region, the SXT region, and the core genome. Of the 126 mutations identified that separate the Haiti-Dominican Republic cluster from the V. cholerae reference strain (N16961), 73 are non-synonymous changes, and a number of these changes cluster in specific genes and pathways. Sequence variant analyses of V. cholerae isolates, including multiple isolates from the Haitian outbreak, identify coverage-specific and technology-specific effects on variant detection, and provide insight into genomic change and functional evolution during an epidemic.
VCFR: A package to manipulate and visualize variant call format data in R
USDA-ARS?s Scientific Manuscript database
Software to call single nucleotide polymorphisms or related genetic variants has converged on the variant call format (vcf) as their output format of choice. This has created a need for tools to work with vcf files. While an increasing number of software exists to read vcf data, many of them only ex...
Variants of cellobiohydrolases
Bott, Richard R.; Foukaraki, Maria; Hommes, Ronaldus Wilhelmus; Kaper, Thijs; Kelemen, Bradley R.; Kralj, Slavko; Nikolaev, Igor; Sandgren, Mats; Van Lieshout, Johannes Franciscus Thomas; Van Stigt Thans, Sander
2018-04-10
Disclosed are a number of homologs and variants of Hypocrea jecorina Ce17A (formerly Trichoderma reesei cellobiohydrolase I or CBH1), nucleic acids encoding the same and methods for producing the same. The homologs and variant cellulases have the amino acid sequence of a glycosyl hydrolase of family 7A wherein one or more amino acid residues are substituted and/or deleted.
Würschum, Tobias; Boeven, Philipp H G; Langer, Simon M; Longin, C Friedrich H; Leiser, Willmar L
2015-07-29
Copy number variation was found to be a frequent type of DNA polymorphism in the human genome often associated with diseases but its importance in crops and the effects on agronomic traits are still largely unknown. Here, we employed a large worldwide panel of 1110 winter wheat varieties to assess the frequency and the geographic distribution of copy number variants at the Photoperiod-B1 (Ppd-B1) and the Vernalization-A1 (Vrn-A1) loci as well as their effects on flowering time under field conditions. We identified a novel four copy variant of Vrn-A1 and based on the phylogenetic relationships among the lines show that the higher copy variants at both loci are likely to have arisen independently multiple times. In addition, we found that the frequency of the different copy number variants at both loci reflects the environmental conditions in the varieties' region of origin and based on multi-location field trials show that Ppd-B1 copy number has a substantial effect on the fine-tuning of flowering time. In conclusion, our results show the importance of copy number variation at Ppd-B1 and Vrn-A1 for the global adaptation of wheat making it a key factor for wheat success in a broad range of environments and in a wider context substantiate the significant role of copy number variation in crops.
Molecular detection and characterization of noroviruses in river water in Thailand.
Inoue, K; Motomura, K; Boonchan, M; Takeda, N; Ruchusatsawa, K; Guntapong, R; Tacharoenmuang, R; Sangkitporn, S; Chantaroj, S
2016-03-01
Norovirus (NoV) generally exists as a mixture of multiple genotype variants in nature. However, there has been no published report monitoring NoV in natural settings in Thailand. To obtain information on mixed presence of the NoV RNA genome, we conducted viral genome analysis of 15 water specimens collected from five sites in a river near Bangkok between August 2013 and August 2014. The number of viral RNA copies per specimen declined progressively from the most upstream to the most downstream site. Following direct nucleotide sequencing of the PCR products, we obtained three partial genome sequences of the NoV GI strain and 13 partial genome sequences of the NoV GII strains. Phylogenetic analysis indicated the presence of four GII.4 variant groups pro-circulated after the Den Haag_2006b, New Orleans_2009 and Sydney_2012 outbreaks. On the other hand, only GI.4 was observed from the specimens collected on April, 2014. These results indicated that multiple genogroups and genotypes of noroviruses are present and are circulating in the natural environment in Thailand as in other countries. Our study provides comprehensive information on the occurrence of new variants. Our study is the first paper that multiple genogroups and genotypes of norovirus exist, and are circulating in the river water near Bangkok, Thailand. Phylogenetic analysis indicated the presence of four GII.4 variant groups pro-circulated after the Den Haag_2006b, New Orleans_2009 and Sydney_2012 that caused outbreaks in the world. Continued research will be essential for understanding the natural history of NoV and the control of future outbreaks. © 2015 The Society for Applied Microbiology.
SeqHBase: a big data toolset for family based sequencing data analysis.
He, Min; Person, Thomas N; Hebbring, Scott J; Heinzen, Ethan; Ye, Zhan; Schrodi, Steven J; McPherson, Elizabeth W; Lin, Simon M; Peissig, Peggy L; Brilliant, Murray H; O'Rawe, Jason; Robison, Reid J; Lyon, Gholson J; Wang, Kai
2015-04-01
Whole-genome sequencing (WGS) and whole-exome sequencing (WES) technologies are increasingly used to identify disease-contributing mutations in human genomic studies. It can be a significant challenge to process such data, especially when a large family or cohort is sequenced. Our objective was to develop a big data toolset to efficiently manipulate genome-wide variants, functional annotations and coverage, together with conducting family based sequencing data analysis. Hadoop is a framework for reliable, scalable, distributed processing of large data sets using MapReduce programming models. Based on Hadoop and HBase, we developed SeqHBase, a big data-based toolset for analysing family based sequencing data to detect de novo, inherited homozygous, or compound heterozygous mutations that may contribute to disease manifestations. SeqHBase takes as input BAM files (for coverage at every site), variant call format (VCF) files (for variant calls) and functional annotations (for variant prioritisation). We applied SeqHBase to a 5-member nuclear family and a 10-member 3-generation family with WGS data, as well as a 4-member nuclear family with WES data. Analysis times were almost linearly scalable with number of data nodes. With 20 data nodes, SeqHBase took about 5 secs to analyse WES familial data and approximately 1 min to analyse WGS familial data. These results demonstrate SeqHBase's high efficiency and scalability, which is necessary as WGS and WES are rapidly becoming standard methods to study the genetics of familial disorders. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
Wood, Andrew R; Esko, Tonu; Yang, Jian; Vedantam, Sailaja; Pers, Tune H; Gustafsson, Stefan; Chu, Audrey Y; Estrada, Karol; Luan, Jian'an; Kutalik, Zoltán; Amin, Najaf; Buchkovich, Martin L; Croteau-Chonka, Damien C; Day, Felix R; Duan, Yanan; Fall, Tove; Fehrmann, Rudolf; Ferreira, Teresa; Jackson, Anne U; Karjalainen, Juha; Lo, Ken Sin; Locke, Adam E; Mägi, Reedik; Mihailov, Evelin; Porcu, Eleonora; Randall, Joshua C; Scherag, André; Vinkhuyzen, Anna A E; Westra, Harm-Jan; Winkler, Thomas W; Workalemahu, Tsegaselassie; Zhao, Jing Hua; Absher, Devin; Albrecht, Eva; Anderson, Denise; Baron, Jeffrey; Beekman, Marian; Demirkan, Ayse; Ehret, Georg B; Feenstra, Bjarke; Feitosa, Mary F; Fischer, Krista; Fraser, Ross M; Goel, Anuj; Gong, Jian; Justice, Anne E; Kanoni, Stavroula; Kleber, Marcus E; Kristiansson, Kati; Lim, Unhee; Lotay, Vaneet; Lui, Julian C; Mangino, Massimo; Mateo Leach, Irene; Medina-Gomez, Carolina; Nalls, Michael A; Nyholt, Dale R; Palmer, Cameron D; Pasko, Dorota; Pechlivanis, Sonali; Prokopenko, Inga; Ried, Janina S; Ripke, Stephan; Shungin, Dmitry; Stancáková, Alena; Strawbridge, Rona J; Sung, Yun Ju; Tanaka, Toshiko; Teumer, Alexander; Trompet, Stella; van der Laan, Sander W; van Setten, Jessica; Van Vliet-Ostaptchouk, Jana V; Wang, Zhaoming; Yengo, Loïc; Zhang, Weihua; Afzal, Uzma; Arnlöv, Johan; Arscott, Gillian M; Bandinelli, Stefania; Barrett, Amy; Bellis, Claire; Bennett, Amanda J; Berne, Christian; Blüher, Matthias; Bolton, Jennifer L; Böttcher, Yvonne; Boyd, Heather A; Bruinenberg, Marcel; Buckley, Brendan M; Buyske, Steven; Caspersen, Ida H; Chines, Peter S; Clarke, Robert; Claudi-Boehm, Simone; Cooper, Matthew; Daw, E Warwick; De Jong, Pim A; Deelen, Joris; Delgado, Graciela; Denny, Josh C; Dhonukshe-Rutten, Rosalie; Dimitriou, Maria; Doney, Alex S F; Dörr, Marcus; Eklund, Niina; Eury, Elodie; Folkersen, Lasse; Garcia, Melissa E; Geller, Frank; Giedraitis, Vilmantas; Go, Alan S; Grallert, Harald; Grammer, Tanja B; Gräßler, Jürgen; Grönberg, Henrik; de Groot, Lisette C P G M; Groves, Christopher J; Haessler, Jeffrey; Hall, Per; Haller, Toomas; Hallmans, Goran; Hannemann, Anke; Hartman, Catharina A; Hassinen, Maija; Hayward, Caroline; Heard-Costa, Nancy L; Helmer, Quinta; Hemani, Gibran; Henders, Anjali K; Hillege, Hans L; Hlatky, Mark A; Hoffmann, Wolfgang; Hoffmann, Per; Holmen, Oddgeir; Houwing-Duistermaat, Jeanine J; Illig, Thomas; Isaacs, Aaron; James, Alan L; Jeff, Janina; Johansen, Berit; Johansson, Åsa; Jolley, Jennifer; Juliusdottir, Thorhildur; Junttila, Juhani; Kho, Abel N; Kinnunen, Leena; Klopp, Norman; Kocher, Thomas; Kratzer, Wolfgang; Lichtner, Peter; Lind, Lars; Lindström, Jaana; Lobbens, Stéphane; Lorentzon, Mattias; Lu, Yingchang; Lyssenko, Valeriya; Magnusson, Patrik K E; Mahajan, Anubha; Maillard, Marc; McArdle, Wendy L; McKenzie, Colin A; McLachlan, Stela; McLaren, Paul J; Menni, Cristina; Merger, Sigrun; Milani, Lili; Moayyeri, Alireza; Monda, Keri L; Morken, Mario A; Müller, Gabriele; Müller-Nurasyid, Martina; Musk, Arthur W; Narisu, Narisu; Nauck, Matthias; Nolte, Ilja M; Nöthen, Markus M; Oozageer, Laticia; Pilz, Stefan; Rayner, Nigel W; Renstrom, Frida; Robertson, Neil R; Rose, Lynda M; Roussel, Ronan; Sanna, Serena; Scharnagl, Hubert; Scholtens, Salome; Schumacher, Fredrick R; Schunkert, Heribert; Scott, Robert A; Sehmi, Joban; Seufferlein, Thomas; Shi, Jianxin; Silventoinen, Karri; Smit, Johannes H; Smith, Albert Vernon; Smolonska, Joanna; Stanton, Alice V; Stirrups, Kathleen; Stott, David J; Stringham, Heather M; Sundström, Johan; Swertz, Morris A; Syvänen, Ann-Christine; Tayo, Bamidele O; Thorleifsson, Gudmar; Tyrer, Jonathan P; van Dijk, Suzanne; van Schoor, Natasja M; van der Velde, Nathalie; van Heemst, Diana; van Oort, Floor V A; Vermeulen, Sita H; Verweij, Niek; Vonk, Judith M; Waite, Lindsay L; Waldenberger, Melanie; Wennauer, Roman; Wilkens, Lynne R; Willenborg, Christina; Wilsgaard, Tom; Wojczynski, Mary K; Wong, Andrew; Wright, Alan F; Zhang, Qunyuan; Arveiler, Dominique; Bakker, Stephan J L; Beilby, John; Bergman, Richard N; Bergmann, Sven; Biffar, Reiner; Blangero, John; Boomsma, Dorret I; Bornstein, Stefan R; Bovet, Pascal; Brambilla, Paolo; Brown, Morris J; Campbell, Harry; Caulfield, Mark J; Chakravarti, Aravinda; Collins, Rory; Collins, Francis S; Crawford, Dana C; Cupples, L Adrienne; Danesh, John; de Faire, Ulf; den Ruijter, Hester M; Erbel, Raimund; Erdmann, Jeanette; Eriksson, Johan G; Farrall, Martin; Ferrannini, Ele; Ferrières, Jean; Ford, Ian; Forouhi, Nita G; Forrester, Terrence; Gansevoort, Ron T; Gejman, Pablo V; Gieger, Christian; Golay, Alain; Gottesman, Omri; Gudnason, Vilmundur; Gyllensten, Ulf; Haas, David W; Hall, Alistair S; Harris, Tamara B; Hattersley, Andrew T; Heath, Andrew C; Hengstenberg, Christian; Hicks, Andrew A; Hindorff, Lucia A; Hingorani, Aroon D; Hofman, Albert; Hovingh, G Kees; Humphries, Steve E; Hunt, Steven C; Hypponen, Elina; Jacobs, Kevin B; Jarvelin, Marjo-Riitta; Jousilahti, Pekka; Jula, Antti M; Kaprio, Jaakko; Kastelein, John J P; Kayser, Manfred; Kee, Frank; Keinanen-Kiukaanniemi, Sirkka M; Kiemeney, Lambertus A; Kooner, Jaspal S; Kooperberg, Charles; Koskinen, Seppo; Kovacs, Peter; Kraja, Aldi T; Kumari, Meena; Kuusisto, Johanna; Lakka, Timo A; Langenberg, Claudia; Le Marchand, Loic; Lehtimäki, Terho; Lupoli, Sara; Madden, Pamela A F; Männistö, Satu; Manunta, Paolo; Marette, André; Matise, Tara C; McKnight, Barbara; Meitinger, Thomas; Moll, Frans L; Montgomery, Grant W; Morris, Andrew D; Morris, Andrew P; Murray, Jeffrey C; Nelis, Mari; Ohlsson, Claes; Oldehinkel, Albertine J; Ong, Ken K; Ouwehand, Willem H; Pasterkamp, Gerard; Peters, Annette; Pramstaller, Peter P; Price, Jackie F; Qi, Lu; Raitakari, Olli T; Rankinen, Tuomo; Rao, D C; Rice, Treva K; Ritchie, Marylyn; Rudan, Igor; Salomaa, Veikko; Samani, Nilesh J; Saramies, Jouko; Sarzynski, Mark A; Schwarz, Peter E H; Sebert, Sylvain; Sever, Peter; Shuldiner, Alan R; Sinisalo, Juha; Steinthorsdottir, Valgerdur; Stolk, Ronald P; Tardif, Jean-Claude; Tönjes, Anke; Tremblay, Angelo; Tremoli, Elena; Virtamo, Jarmo; Vohl, Marie-Claude; Amouyel, Philippe; Asselbergs, Folkert W; Assimes, Themistocles L; Bochud, Murielle; Boehm, Bernhard O; Boerwinkle, Eric; Bottinger, Erwin P; Bouchard, Claude; Cauchi, Stéphane; Chambers, John C; Chanock, Stephen J; Cooper, Richard S; de Bakker, Paul I W; Dedoussis, George; Ferrucci, Luigi; Franks, Paul W; Froguel, Philippe; Groop, Leif C; Haiman, Christopher A; Hamsten, Anders; Hayes, M Geoffrey; Hui, Jennie; Hunter, David J; Hveem, Kristian; Jukema, J Wouter; Kaplan, Robert C; Kivimaki, Mika; Kuh, Diana; Laakso, Markku; Liu, Yongmei; Martin, Nicholas G; März, Winfried; Melbye, Mads; Moebus, Susanne; Munroe, Patricia B; Njølstad, Inger; Oostra, Ben A; Palmer, Colin N A; Pedersen, Nancy L; Perola, Markus; Pérusse, Louis; Peters, Ulrike; Powell, Joseph E; Power, Chris; Quertermous, Thomas; Rauramaa, Rainer; Reinmaa, Eva; Ridker, Paul M; Rivadeneira, Fernando; Rotter, Jerome I; Saaristo, Timo E; Saleheen, Danish; Schlessinger, David; Slagboom, P Eline; Snieder, Harold; Spector, Tim D; Strauch, Konstantin; Stumvoll, Michael; Tuomilehto, Jaakko; Uusitupa, Matti; van der Harst, Pim; Völzke, Henry; Walker, Mark; Wareham, Nicholas J; Watkins, Hugh; Wichmann, H-Erich; Wilson, James F; Zanen, Pieter; Deloukas, Panos; Heid, Iris M; Lindgren, Cecilia M; Mohlke, Karen L; Speliotes, Elizabeth K; Thorsteinsdottir, Unnur; Barroso, Inês; Fox, Caroline S; North, Kari E; Strachan, David P; Beckmann, Jacques S; Berndt, Sonja I; Boehnke, Michael; Borecki, Ingrid B; McCarthy, Mark I; Metspalu, Andres; Stefansson, Kari; Uitterlinden, André G; van Duijn, Cornelia M; Franke, Lude; Willer, Cristen J; Price, Alkes L; Lettre, Guillaume; Loos, Ruth J F; Weedon, Michael N; Ingelsson, Erik; O'Connell, Jeffrey R; Abecasis, Goncalo R; Chasman, Daniel I; Goddard, Michael E; Visscher, Peter M; Hirschhorn, Joel N; Frayling, Timothy M
2014-11-01
Using genome-wide data from 253,288 individuals, we identified 697 variants at genome-wide significance that together explained one-fifth of the heritability for adult height. By testing different numbers of variants in independent studies, we show that the most strongly associated ∼2,000, ∼3,700 and ∼9,500 SNPs explained ∼21%, ∼24% and ∼29% of phenotypic variance. Furthermore, all common variants together captured 60% of heritability. The 697 variants clustered in 423 loci were enriched for genes, pathways and tissue types known to be involved in growth and together implicated genes and pathways not highlighted in earlier efforts, such as signaling by fibroblast growth factors, WNT/β-catenin and chondroitin sulfate-related genes. We identified several genes and pathways not previously connected with human skeletal growth, including mTOR, osteoglycin and binding of hyaluronic acid. Our results indicate a genetic architecture for human height that is characterized by a very large but finite number (thousands) of causal variants.
Chu, Audrey Y; Estrada, Karol; Luan, Jian’an; Kutalik, Zoltán; Amin, Najaf; Buchkovich, Martin L; Croteau-Chonka, Damien C; Day, Felix R; Duan, Yanan; Fall, Tove; Fehrmann, Rudolf; Ferreira, Teresa; Jackson, Anne U; Karjalainen, Juha; Lo, Ken Sin; Locke, Adam E; Mägi, Reedik; Mihailov, Evelin; Porcu, Eleonora; Randall, Joshua C; Scherag, André; Vinkhuyzen, Anna AE; Westra, Harm-Jan; Winkler, Thomas W; Workalemahu, Tsegaselassie; Zhao, Jing Hua; Absher, Devin; Albrecht, Eva; Anderson, Denise; Baron, Jeffrey; Beekman, Marian; Demirkan, Ayse; Ehret, Georg B; Feenstra, Bjarke; Feitosa, Mary F; Fischer, Krista; Fraser, Ross M; Goel, Anuj; Gong, Jian; Justice, Anne E; Kanoni, Stavroula; Kleber, Marcus E; Kristiansson, Kati; Lim, Unhee; Lotay, Vaneet; Lui, Julian C; Mangino, Massimo; Leach, Irene Mateo; Medina-Gomez, Carolina; Nalls, Michael A; Nyholt, Dale R; Palmer, Cameron D; Pasko, Dorota; Pechlivanis, Sonali; Prokopenko, Inga; Ried, Janina S; Ripke, Stephan; Shungin, Dmitry; Stancáková, Alena; Strawbridge, Rona J; Sung, Yun Ju; Tanaka, Toshiko; Teumer, Alexander; Trompet, Stella; van der Laan, Sander W; van Setten, Jessica; Van Vliet-Ostaptchouk, Jana V; Wang, Zhaoming; Yengo, Loïc; Zhang, Weihua; Afzal, Uzma; Ärnlöv, Johan; Arscott, Gillian M; Bandinelli, Stefania; Barrett, Amy; Bellis, Claire; Bennett, Amanda J; Berne, Christian; Blüher, Matthias; Bolton, Jennifer L; Böttcher, Yvonne; Boyd, Heather A; Bruinenberg, Marcel; Buckley, Brendan M; Buyske, Steven; Caspersen, Ida H; Chines, Peter S; Clarke, Robert; Claudi-Boehm, Simone; Cooper, Matthew; Daw, E Warwick; De Jong, Pim A; Deelen, Joris; Delgado, Graciela; Denny, Josh C; Dhonukshe-Rutten, Rosalie; Dimitriou, Maria; Doney, Alex SF; Dörr, Marcus; Eklund, Niina; Eury, Elodie; Folkersen, Lasse; Garcia, Melissa E; Geller, Frank; Giedraitis, Vilmantas; Go, Alan S; Grallert, Harald; Grammer, Tanja B; Gräßler, Jürgen; Grönberg, Henrik; de Groot, Lisette C.P.G.M.; Groves, Christopher J; Haessler, Jeffrey; Hall, Per; Haller, Toomas; Hallmans, Goran; Hannemann, Anke; Hartman, Catharina A; Hassinen, Maija; Hayward, Caroline; Heard-Costa, Nancy L; Helmer, Quinta; Hemani, Gibran; Henders, Anjali K; Hillege, Hans L; Hlatky, Mark A; Hoffmann, Wolfgang; Hoffmann, Per; Holmen, Oddgeir; Houwing-Duistermaat, Jeanine J; Illig, Thomas; Isaacs, Aaron; James, Alan L; Jeff, Janina; Johansen, Berit; Johansson, Åsa; Jolley, Jennifer; Juliusdottir, Thorhildur; Junttila, Juhani; Kho, Abel N; Kinnunen, Leena; Klopp, Norman; Kocher, Thomas; Kratzer, Wolfgang; Lichtner, Peter; Lind, Lars; Lindström, Jaana; Lobbens, Stéphane; Lorentzon, Mattias; Lu, Yingchang; Lyssenko, Valeriya; Magnusson, Patrik KE; Mahajan, Anubha; Maillard, Marc; McArdle, Wendy L; McKenzie, Colin A; McLachlan, Stela; McLaren, Paul J; Menni, Cristina; Merger, Sigrun; Milani, Lili; Moayyeri, Alireza; Monda, Keri L; Morken, Mario A; Müller, Gabriele; Müller-Nurasyid, Martina; Musk, Arthur W; Narisu, Narisu; Nauck, Matthias; Nolte, Ilja M; Nöthen, Markus M; Oozageer, Laticia; Pilz, Stefan; Rayner, Nigel W; Renstrom, Frida; Robertson, Neil R; Rose, Lynda M; Roussel, Ronan; Sanna, Serena; Scharnagl, Hubert; Scholtens, Salome; Schumacher, Fredrick R; Schunkert, Heribert; Scott, Robert A; Sehmi, Joban; Seufferlein, Thomas; Shi, Jianxin; Silventoinen, Karri; Smit, Johannes H; Smith, Albert Vernon; Smolonska, Joanna; Stanton, Alice V; Stirrups, Kathleen; Stott, David J; Stringham, Heather M; Sundström, Johan; Swertz, Morris A; Syvänen, Ann-Christine; Tayo, Bamidele O; Thorleifsson, Gudmar; Tyrer, Jonathan P; van Dijk, Suzanne; van Schoor, Natasja M; van der Velde, Nathalie; van Heemst, Diana; van Oort, Floor VA; Vermeulen, Sita H; Verweij, Niek; Vonk, Judith M; Waite, Lindsay L; Waldenberger, Melanie; Wennauer, Roman; Wilkens, Lynne R; Willenborg, Christina; Wilsgaard, Tom; Wojczynski, Mary K; Wong, Andrew; Wright, Alan F; Zhang, Qunyuan; Arveiler, Dominique; Bakker, Stephan JL; Beilby, John; Bergman, Richard N; Bergmann, Sven; Biffar, Reiner; Blangero, John; Boomsma, Dorret I; Bornstein, Stefan R; Bovet, Pascal; Brambilla, Paolo; Brown, Morris J; Campbell, Harry; Caulfield, Mark J; Chakravarti, Aravinda; Collins, Rory; Collins, Francis S; Crawford, Dana C; Cupples, L Adrienne; Danesh, John; de Faire, Ulf; den Ruijter, Hester M; Erbel, Raimund; Erdmann, Jeanette; Eriksson, Johan G; Farrall, Martin; Ferrannini, Ele; Ferrières, Jean; Ford, Ian; Forouhi, Nita G; Forrester, Terrence; Gansevoort, Ron T; Gejman, Pablo V; Gieger, Christian; Golay, Alain; Gottesman, Omri; Gudnason, Vilmundur; Gyllensten, Ulf; Haas, David W; Hall, Alistair S; Harris, Tamara B; Hattersley, Andrew T; Heath, Andrew C; Hengstenberg, Christian; Hicks, Andrew A; Hindorff, Lucia A; Hingorani, Aroon D; Hofman, Albert; Hovingh, G Kees; Humphries, Steve E; Hunt, Steven C; Hypponen, Elina; Jacobs, Kevin B; Jarvelin, Marjo-Riitta; Jousilahti, Pekka; Jula, Antti M; Kaprio, Jaakko; Kastelein, John JP; Kayser, Manfred; Kee, Frank; Keinanen-Kiukaanniemi, Sirkka M; Kiemeney, Lambertus A; Kooner, Jaspal S; Kooperberg, Charles; Koskinen, Seppo; Kovacs, Peter; Kraja, Aldi T; Kumari, Meena; Kuusisto, Johanna; Lakka, Timo A; Langenberg, Claudia; Le Marchand, Loic; Lehtimäki, Terho; Lupoli, Sara; Madden, Pamela AF; Männistö, Satu; Manunta, Paolo; Marette, André; Matise, Tara C; McKnight, Barbara; Meitinger, Thomas; Moll, Frans L; Montgomery, Grant W; Morris, Andrew D; Morris, Andrew P; Murray, Jeffrey C; Nelis, Mari; Ohlsson, Claes; Oldehinkel, Albertine J; Ong, Ken K; Ouwehand, Willem H; Pasterkamp, Gerard; Peters, Annette; Pramstaller, Peter P; Price, Jackie F; Qi, Lu; Raitakari, Olli T; Rankinen, Tuomo; Rao, DC; Rice, Treva K; Ritchie, Marylyn; Rudan, Igor; Salomaa, Veikko; Samani, Nilesh J; Saramies, Jouko; Sarzynski, Mark A; Schwarz, Peter EH; Sebert, Sylvain; Sever, Peter; Shuldiner, Alan R; Sinisalo, Juha; Steinthorsdottir, Valgerdur; Stolk, Ronald P; Tardif, Jean-Claude; Tönjes, Anke; Tremblay, Angelo; Tremoli, Elena; Virtamo, Jarmo; Vohl, Marie-Claude; Amouyel, Philippe; Asselbergs, Folkert W; Assimes, Themistocles L; Bochud, Murielle; Boehm, Bernhard O; Boerwinkle, Eric; Bottinger, Erwin P; Bouchard, Claude; Cauchi, Stéphane; Chambers, John C; Chanock, Stephen J; Cooper, Richard S; de Bakker, Paul IW; Dedoussis, George; Ferrucci, Luigi; Franks, Paul W; Froguel, Philippe; Groop, Leif C; Haiman, Christopher A; Hamsten, Anders; Hayes, M Geoffrey; Hui, Jennie; Hunter, David J.; Hveem, Kristian; Jukema, J Wouter; Kaplan, Robert C; Kivimaki, Mika; Kuh, Diana; Laakso, Markku; Liu, Yongmei; Martin, Nicholas G; März, Winfried; Melbye, Mads; Moebus, Susanne; Munroe, Patricia B; Njølstad, Inger; Oostra, Ben A; Palmer, Colin NA; Pedersen, Nancy L; Perola, Markus; Pérusse, Louis; Peters, Ulrike; Powell, Joseph E; Power, Chris; Quertermous, Thomas; Rauramaa, Rainer; Reinmaa, Eva; Ridker, Paul M; Rivadeneira, Fernando; Rotter, Jerome I; Saaristo, Timo E; Saleheen, Danish; Schlessinger, David; Slagboom, P Eline; Snieder, Harold; Spector, Tim D; Strauch, Konstantin; Stumvoll, Michael; Tuomilehto, Jaakko; Uusitupa, Matti; van der Harst, Pim; Völzke, Henry; Walker, Mark; Wareham, Nicholas J; Watkins, Hugh; Wichmann, H-Erich; Wilson, James F; Zanen, Pieter; Deloukas, Panos; Heid, Iris M; Lindgren, Cecilia M; Mohlke, Karen L; Speliotes, Elizabeth K; Thorsteinsdottir, Unnur; Barroso, Inês; Fox, Caroline S; North, Kari E; Strachan, David P; Beckmann, Jacques S.; Berndt, Sonja I; Boehnke, Michael; Borecki, Ingrid B; McCarthy, Mark I; Metspalu, Andres; Stefansson, Kari; Uitterlinden, André G; van Duijn, Cornelia M; Franke, Lude; Willer, Cristen J; Price, Alkes L.; Lettre, Guillaume; Loos, Ruth JF; Weedon, Michael N; Ingelsson, Erik; O’Connell, Jeffrey R; Abecasis, Goncalo R; Chasman, Daniel I; Goddard, Michael E
2014-01-01
Using genome-wide data from 253,288 individuals, we identified 697 variants at genome-wide significance that together explain one-fifth of heritability for adult height. By testing different numbers of variants in independent studies, we show that the most strongly associated ~2,000, ~3,700 and ~9,500 SNPs explained ~21%, ~24% and ~29% of phenotypic variance. Furthermore, all common variants together captured the majority (60%) of heritability. The 697 variants clustered in 423 loci enriched for genes, pathways, and tissue-types known to be involved in growth and together implicated genes and pathways not highlighted in earlier efforts, such as signaling by fibroblast growth factors, WNT/beta-catenin, and chondroitin sulfate-related genes. We identified several genes and pathways not previously connected with human skeletal growth, including mTOR, osteoglycin and binding of hyaluronic acid. Our results indicate a genetic architecture for human height that is characterized by a very large but finite number (thousands) of causal variants. PMID:25282103
Value of genetic profiling for the prediction of coronary heart disease.
van der Net, Jeroen B; Janssens, A Cecile J W; Sijbrands, Eric J G; Steyerberg, Ewout W
2009-07-01
Advances in high-throughput genomics facilitate the identification of novel genetic susceptibility variants for coronary heart disease (CHD). This may improve CHD risk prediction. The aim of the present simulation study was to investigate to what degree CHD risk can be predicted by testing multiple genetic variants (genetic profiling). We simulated genetic profiles for a population of 100,000 individuals with a 10-year CHD incidence of 10%. For each combination of model parameters (number of variants, genotype frequency and odds ratio [OR]), we calculated the area under the receiver operating characteristic curve (AUC) to indicate the discrimination between individuals who will and will not develop CHD. The AUC of genetic profiles could rise to 0.90 when 100 hypothetical variants with ORs of 1.5 and genotype frequencies of 50% were simulated. The AUC of a genetic profile consisting of 10 established variants, with ORs ranging from 1.13 to 1.42, was 0.59. When 2, 5, and 10 times as many identical variants would be identified, the AUCs were 0.63, 0.69, and 0.76. To obtain AUCs similar to those of conventional CHD risk predictors, a considerable number of additional common genetic variants need to be identified with preferably strong effects.
Wang, Qingzhong; Shelton, Richard C; Dwivedi, Yogesh
2018-01-01
Gene-environment interaction contributes to the risks of psychiatric disorders. Interactions between FKBP5 gene variants and early-life stress may enhance the risk not only for mood disorder, but also for a number of other behavioral phenotypes. The aim of the present study was to review and conduct a meta-analysis on the results from published studies examining interaction between FKBP5 gene variants and early-life stress and their associations with stress-related disorders such as major depression and PTSD. A literature search was conducted using PsychINFO and PubMed databases until May 2017. A total of 14 studies with a pooled total of 15109 participants met the inclusion criteria, the results of which were combined and a meta-analysis was performed using the differences in correlations as the effect measure. Based on literature, rs1360780, rs3800373, and rs9470080 SNPs were selected within the FKBP5 gene and systematic review was conducted. Based on the Comprehensive Meta-Analysis software, no publication bias was detected. Sensitivity analysis and credibility of meta-analysis results also indicated that the analyses were stable. The meta-analysis showed that individuals who carry T allele of rs1360780, C-allele of rs3800373 or T-allele of rs9470080 exposed to early-life trauma had higher risks for depression or PTSD. The effects of ethnicity, age, sex, and different stress measures were not examined due to limited sample size. These results provide strong evidence of interactions between FKBP5 genotypes and early-life stress, which could pose a significant risk factor for stress-associated disorders such as major depression and PTSD. Copyright © 2017 Elsevier B.V. All rights reserved.
Efficient population-scale variant analysis and prioritization with VAPr.
Birmingham, Amanda; Mark, Adam M; Mazzaferro, Carlo; Xu, Guorong; Fisch, Kathleen M
2018-04-06
With the growing availability of population-scale whole-exome and whole-genome sequencing, demand for reproducible, scalable variant analysis has spread within genomic research communities. To address this need, we introduce the Python package VAPr (Variant Analysis and Prioritization). VAPr leverages existing annotation tools ANNOVAR and MyVariant.info with MongoDB-based flexible storage and filtering functionality. It offers biologists and bioinformatics generalists easy-to-use and scalable analysis and prioritization of genomic variants from large cohort studies. VAPr is developed in Python and is available for free use and extension under the MIT License. An install package is available on PyPi at https://pypi.python.org/pypi/VAPr, while source code and extensive documentation are on GitHub at https://github.com/ucsd-ccbb/VAPr. kfisch@ucsd.edu.
Evaluating Reported Candidate Gene Associations with Polycystic Ovary Syndrome
Pau, Cindy; Saxena, Richa; Welt, Corrine Kolka
2013-01-01
Objective To replicate variants in candidate genes associated with PCOS in a population of European PCOS and control subjects. Design Case-control association analysis and meta-analysis. Setting Major academic hospital Patients Women of European ancestry with PCOS (n=525) and controls (n=472), aged 18 to 45 years. Intervention Variants previously associated with PCOS in candidate gene studies were genotyped (n=39). Metabolic, reproductive and anthropomorphic parameters were examined as a function of the candidate variants. All genetic association analyses were adjusted for age, BMI and ancestry and were reported after correction for multiple testing. Main Outcome Measure Association of candidate gene variants with PCOS. Results Three variants, rs3797179 (SRD5A1), rs12473543 (POMC), and rs1501299 (ADIPOQ), were nominally associated with PCOS. However, they did not remain significant after correction for multiple testing and none of the variants replicated in a sufficiently powered meta-analysis. Variants in the FBN3 gene (rs17202517 and rs73503752) were associated with smaller waist circumferences and variant rs727428 in the SHBG gene was associated with lower SHBG levels. Conclusion Previously identified variants in candidate genes do not appear to be associated with PCOS risk. PMID:23375202
Nho, Kwangsik; Horgusluoglu, Emrin; Kim, Sungeun; Risacher, Shannon L; Kim, Dokyoon; Foroud, Tatiana; Aisen, Paul S; Petersen, Ronald C; Jack, Clifford R; Shaw, Leslie M; Trojanowski, John Q; Weiner, Michael W; Green, Robert C; Toga, Arthur W; Saykin, Andrew J
2016-08-12
Pathogenic mutations in PSEN1 are known to cause familial early-onset Alzheimer's disease (EOAD) but common variants in PSEN1 have not been found to strongly influence late-onset AD (LOAD). The association of rare variants in PSEN1 with LOAD-related endophenotypes has received little attention. In this study, we performed a rare variant association analysis of PSEN1 with quantitative biomarkers of LOAD using whole genome sequencing (WGS) by integrating bioinformatics and imaging informatics. A WGS data set (N = 815) from the Alzheimer's Disease Neuroimaging Initiative (ADNI) cohort was used in this analysis. 757 non-Hispanic Caucasian participants underwent WGS from a blood sample and high resolution T1-weighted structural MRI at baseline. An automated MRI analysis technique (FreeSurfer) was used to measure cortical thickness and volume of neuroanatomical structures. We assessed imaging and cerebrospinal fluid (CSF) biomarkers as LOAD-related quantitative endophenotypes. Single variant analyses were performed using PLINK and gene-based analyses of rare variants were performed using the optimal Sequence Kernel Association Test (SKAT-O). A total of 839 rare variants (MAF < 1/√(2 N) = 0.0257) were found within a region of ±10 kb from PSEN1. Among them, six exonic (three non-synonymous) variants were observed. A single variant association analysis showed that the PSEN1 p. E318G variant increases the risk of LOAD only in participants carrying APOE ε4 allele where individuals carrying the minor allele of this PSEN1 risk variant have lower CSF Aβ1-42 and higher CSF tau. A gene-based analysis resulted in a significant association of rare but not common (MAF ≥ 0.0257) PSEN1 variants with bilateral entorhinal cortical thickness. This is the first study to show that PSEN1 rare variants collectively show a significant association with the brain atrophy in regions preferentially affected by LOAD, providing further support for a role of PSEN1 in LOAD. The PSEN1 p. E318G variant increases the risk of LOAD only in APOE ε4 carriers. Integrating bioinformatics with imaging informatics for identification of rare variants could help explain the missing heritability in LOAD.
DistributedFBA.jl: High-level, high-performance flux balance analysis in Julia
DOE Office of Scientific and Technical Information (OSTI.GOV)
Heirendt, Laurent; Thiele, Ines; Fleming, Ronan M. T.
Flux balance analysis and its variants are widely used methods for predicting steady-state reaction rates in biochemical reaction networks. The exploration of high dimensional networks with such methods is currently hampered by software performance limitations. DistributedFBA.jl is a high-level, high-performance, open-source implementation of flux balance analysis in Julia. It is tailored to solve multiple flux balance analyses on a subset or all the reactions of large and huge-scale networks, on any number of threads or nodes. DistributedFBA.jl is a high-level, high-performance, open-source implementation of flux balance analysis in Julia. It is tailored to solve multiple flux balance analyses on amore » subset or all the reactions of large and huge-scale networks, on any number of threads or nodes.« less
DistributedFBA.jl: High-level, high-performance flux balance analysis in Julia
Heirendt, Laurent; Thiele, Ines; Fleming, Ronan M. T.
2017-01-16
Flux balance analysis and its variants are widely used methods for predicting steady-state reaction rates in biochemical reaction networks. The exploration of high dimensional networks with such methods is currently hampered by software performance limitations. DistributedFBA.jl is a high-level, high-performance, open-source implementation of flux balance analysis in Julia. It is tailored to solve multiple flux balance analyses on a subset or all the reactions of large and huge-scale networks, on any number of threads or nodes. DistributedFBA.jl is a high-level, high-performance, open-source implementation of flux balance analysis in Julia. It is tailored to solve multiple flux balance analyses on amore » subset or all the reactions of large and huge-scale networks, on any number of threads or nodes.« less
Cheung, Chloe Y Y; Tang, Clara S; Xu, Aimin; Lee, Chi-Ho; Au, Ka-Wing; Xu, Lin; Fong, Carol H Y; Kwok, Kelvin H M; Chow, Wing-Sun; Woo, Yu-Cho; Yuen, Michele M A; Hai, JoJo S H; Jin, Ya-Li; Cheung, Bernard M Y; Tan, Kathryn C B; Cherny, Stacey S; Zhu, Feng; Zhu, Tong; Thomas, G Neil; Cheng, Kar-Keung; Jiang, Chao-Qiang; Lam, Tai-Hing; Tse, Hung-Fat; Sham, Pak-Chung; Lam, Karen S L
2017-01-01
Genome-wide association studies (GWASs) have identified many common type 2 diabetes-associated variants, mostly at the intronic or intergenic regions. Recent advancements of exome-array genotyping platforms have opened up a novel means for detecting the associations of low-frequency or rare coding variants with type 2 diabetes. We conducted an exomechip association analysis to identify additional type 2 diabetes susceptibility variants in the Chinese population. An exome-chip association study was conducted by genotyping 5640 Chinese individuals from Hong Kong, using a custom designed exome array, the Asian Exomechip. Single variant association analysis was conducted on 77,468 single nucleotide polymorphisms (SNPs). Fifteen SNPs were subsequently genotyped for replication analysis in an independent Chinese cohort comprising 12,362 individuals from Guangzhou. A combined analysis involving 7189 cases and 10,813 controls was performed. In the discovery stage, an Asian-specific coding variant rs2233580 (p.Arg192His) in PAX4, and two variants at the known loci, CDKN2B-AS1 and KCNQ1, were significantly associated with type 2 diabetes with exome-wide significance (p discovery < 6.45 × 10 -7 ). The risk allele (T) of PAX4 rs2233580 was associated with a younger age at diabetes diagnosis. This variant was replicated in an independent cohort and demonstrated a stronger association that reached genome-wide significance (p meta-analysis [p meta ] = 3.74 × 10 -15 ) in the combined analysis. We identified the association of a PAX4 Asian-specific missense variant rs2233580 with type 2 diabetes in an exome-chip association analysis, supporting the involvement of PAX4 in the pathogenesis of type 2 diabetes. Our findings suggest PAX4 is a possible effector gene of the 7q32 locus, previously identified from GWAS in Asians.
Identification of copy number variants in horses.
Doan, Ryan; Cohen, Noah; Harrington, Jessica; Veazey, Kylee; Veazy, Kylee; Juras, Rytis; Cothran, Gus; McCue, Molly E; Skow, Loren; Dindot, Scott V
2012-05-01
Copy number variants (CNVs) represent a substantial source of genetic variation in mammals. However, the occurrence of CNVs in horses and their subsequent impact on phenotypic variation is unknown. We performed a study to identify CNVs in 16 horses representing 15 distinct breeds (Equus caballus) and an individual gray donkey (Equus asinus) using a whole-exome tiling array and the array comparative genomic hybridization methodology. We identified 2368 CNVs ranging in size from 197 bp to 3.5 Mb. Merging identical CNVs from each animal yielded 775 CNV regions (CNVRs), involving 1707 protein- and RNA-coding genes. The number of CNVs per animal ranged from 55 to 347, with median and mean sizes of CNVs of 5.3 kb and 99.4 kb, respectively. Approximately 6% of the genes investigated were affected by a CNV. Biological process enrichment analysis indicated CNVs primarily affected genes involved in sensory perception, signal transduction, and metabolism. CNVs also were identified in genes regulating blood group antigens, coat color, fecundity, lactation, keratin formation, neuronal homeostasis, and height in other species. Collectively, these data are the first report of copy number variation in horses and suggest that CNVs are common in the horse genome and may modulate biological processes underlying different traits observed among horses and horse breeds.
Koko, Mahmoud; Abdallah, Mohammed O E; Amin, Mutaz; Ibrahim, Muntaser
2018-01-15
The conventional variant calling of pathogenic alleles in exome and genome sequencing requires the presence of the non-pathogenic alleles as genome references. This hinders the correct identification of variants with minor and/or pathogenic reference alleles warranting additional approaches for variant calling. More than 26,000 Exome Aggregation Consortium (ExAC) variants have a minor reference allele including variants with known ClinVar disease alleles. For instance, in a number of variants related to clotting disorders, the phenotype-associated allele is a human genome reference allele (rs6025, rs6003, rs1799983, and rs2227564 using the assembly hg19). We highlighted how the current variant calling standards miss homozygous reference disease variants in these sites and provided a bioinformatic panel that can be used to screen these variants using commonly available variant callers. We present exome sequencing results from an individual with venous thrombosis to emphasize how pathogenic alleles in clinically relevant variants escape variant calling while non-pathogenic alleles are detected. This article highlights the importance of specialized variant calling strategies in clinical variants with minor reference alleles especially in the context of personal genomes and exomes. We provide here a simple strategy to screen potential disease-causing variants when present in homozygous reference state.
DangerTrack: A scoring system to detect difficult-to-assess regions.
Dolgalev, Igor; Sedlazeck, Fritz; Busby, Ben
2017-01-01
Over recent years, multiple groups have shown that a large number of structural variants, repeats, or problems with the underlying genome assembly have dramatic effects on the mapping, calling, and overall reliability of single nucleotide polymorphism calls. This project endeavored to develop an easy-to-use track for looking at structural variant and repeat regions. This track, DangerTrack, can be displayed alongside the existing Genome Reference Consortium assembly tracks to warn clinicians and biologists when variants of interest may be incorrectly called, of dubious quality, or on an insertion or copy number expansion. While mapping and variant calling can be automated, it is our opinion that when these regions are of interest to a particular clinical or research group, they warrant a careful examination, potentially involving localized reassembly. DangerTrack is available at https://github.com/DCGenomics/DangerTrack.
Crovella, S; Moura, R R; Cappellani, S; Celsi, F; Trevisan, E; Schneider, M; Brollo, A; Nicastro, E M; Vita, F; Finotto, L; Zabucchi, G; Borelli, V
2018-01-01
The presence of asbestos bodies (ABs) in lung parenchyma is considered a histopathologic hallmark of past exposure to asbestos fibers, of which there was a population of longer fibers. The mechanisms underlying AB formation are complex, involving inflammatory responses and iron (Fe) metabolism. Thus, the responsiveness to AB formation is variable, with some individuals appearing to be poor AB formers. The aim of this study was to disclose the possible role of genetic variants of genes encoding inflammasome and iron metabolism proteins in the ability to form ABs in a population of 81 individuals from North East Italy, who died after having developed malignant pleural mesothelioma (MPM). This study included 86 genetic variants distributed in 10 genes involved in Fe metabolism and 7 genetic variants in two genes encoding for inflammasome molecules. Genotypes/haplotypes were compared according to the number of lung ABs. Data showed that the NLRP1 rs12150220 missense variant (H155L) was significantly correlated with numbers of ABs in MPM patients. Specifically, a low number of ABs was detected in individuals carrying the NLRP1 rs12150220 A/T genotype. Our findings suggest that the NLRP1 inflammasome might contribute in the development of lung ABs. It is postulated that the NLRP1 missense variant may be considered as one of the possible host genetic factors contributing to individual variability in coating efficiency, which needs to be taken when assessing occupational exposure to asbestos.
Jimeno Yepes, Antonio; Verspoor, Karin
2014-01-01
As the cost of genomic sequencing continues to fall, the amount of data being collected and studied for the purpose of understanding the genetic basis of disease is increasing dramatically. Much of the source information relevant to such efforts is available only from unstructured sources such as the scientific literature, and significant resources are expended in manually curating and structuring the information in the literature. As such, there have been a number of systems developed to target automatic extraction of mutations and other genetic variation from the literature using text mining tools. We have performed a broad survey of the existing publicly available tools for extraction of genetic variants from the scientific literature. We consider not just one tool but a number of different tools, individually and in combination, and apply the tools in two scenarios. First, they are compared in an intrinsic evaluation context, where the tools are tested for their ability to identify specific mentions of genetic variants in a corpus of manually annotated papers, the Variome corpus. Second, they are compared in an extrinsic evaluation context based on our previous study of text mining support for curation of the COSMIC and InSiGHT databases. Our results demonstrate that no single tool covers the full range of genetic variants mentioned in the literature. Rather, several tools have complementary coverage and can be used together effectively. In the intrinsic evaluation on the Variome corpus, the combined performance is above 0.95 in F-measure, while in the extrinsic evaluation the combined recall performance is above 0.71 for COSMIC and above 0.62 for InSiGHT, a substantial improvement over the performance of any individual tool. Based on the analysis of these results, we suggest several directions for the improvement of text mining tools for genetic variant extraction from the literature. PMID:25285203
NASA Astrophysics Data System (ADS)
Edwards, Rebecca L.; Griffiths, Paul; Bunch, Josephine; Cooper, Helen J.
2012-11-01
We have previously shown that liquid microjunction surface sampling of dried blood spots coupled with high resolution top-down mass spectrometry may be used for screening of common hemoglobin variants HbS, HbC, and HbD. In order to test the robustness of the approach, we have applied the approach to unknown hemoglobin variants. Six neonatal dried blood spot samples that had been identified as variants, but which could not be diagnosed by current screening methods, were analyzed by direct surface sampling top-down mass spectrometry. Both collision-induced dissociation and electron transfer dissociation mass spectrometry were employed. Four of the samples were identified as β-chain variants: two were heterozygous Hb D-Iran, one was heterozygous Hb Headington, and one was heterozygous Hb J-Baltimore. The fifth sample was identified as the α-chain variant heterozygous Hb Phnom Penh. Analysis of the sixth sample suggested that it did not in fact contain a variant. Adoption of the approach in the clinic would require speed in both data collection and interpretation. To address that issue, we have compared manual data analysis with freely available data analysis software (ProsightPTM). The results demonstrate the power of top-down proteomics for hemoglobin variant analysis in newborn samples.
Parker, Margaret M.; Chen, Han; Lao, Taotao; Hardin, Megan; Qiao, Dandi; Hawrylkiewicz, Iwona; Sliwinski, Pawel; Yim, Jae-Joon; Kim, Woo Jin; Kim, Deog Kyeom; Castaldi, Peter J.; Hersh, Craig P.; Morrow, Jarrett; Celli, Bartolome R.; Pinto-Plata, Victor M.; Criner, Gerald J.; Marchetti, Nathaniel; Bueno, Raphael; Agustí, Alvar; Make, Barry J.; Crapo, James D.; Calverley, Peter M.; Donner, Claudio F.; Lomas, David A.; Wouters, Emiel F. M.; Vestbo, Jorgen; Paré, Peter D.; Levy, Robert D.; Rennard, Stephen I.; Zhou, Xiaobo; Laird, Nan M.; Lin, Xihong; Beaty, Terri H.; Silverman, Edwin K.
2016-01-01
Rationale: Chronic obstructive pulmonary disease (COPD) susceptibility is in part related to genetic variants. Most genetic studies have been focused on genome-wide common variants without a specific focus on coding variants, but common and rare coding variants may also affect COPD susceptibility. Objectives: To identify coding variants associated with COPD. Methods: We tested nonsynonymous, splice, and stop variants derived from the Illumina HumanExome array for association with COPD in five study populations enriched for COPD. We evaluated single variants with a minor allele frequency greater than 0.5% using logistic regression. Results were combined using a fixed effects meta-analysis. We replicated novel single-variant associations in three additional COPD cohorts. Measurements and Main Results: We included 6,004 control subjects and 6,161 COPD cases across five cohorts for analysis. Our top result was rs16969968 (P = 1.7 × 10−14) in CHRNA5, a locus previously associated with COPD susceptibility and nicotine dependence. Additional top results were found in AGER, MMP3, and SERPINA1. A nonsynonymous variant, rs181206, in IL27 (P = 4.7 × 10−6) was just below the level of exome-wide significance but attained exome-wide significance (P = 5.7 × 10−8) when combined with results from other cohorts. Gene expression datasets revealed an association of rs181206 and the surrounding locus with expression of multiple genes; several were differentially expressed in COPD lung tissue, including TUFM. Conclusions: In an exome array analysis of COPD, we identified nonsynonymous variants at previously described loci and a novel exome-wide significant variant in IL27. This variant is at a locus previously described in genome-wide associations with diabetes, inflammatory bowel disease, and obesity and appears to affect genes potentially related to COPD pathogenesis. PMID:26771213
Comprehensive splicing functional analysis of DNA variants of the BRCA2 gene by hybrid minigenes
2012-01-01
Introduction The underlying pathogenic mechanism of a large fraction of DNA variants of disease-causing genes is the disruption of the splicing process. We aimed to investigate the effect on splicing of the BRCA2 variants c.8488-1G > A (exon 20) and c.9026_9030del (exon 23), as well as 41 BRCA2 variants reported in the Breast Cancer Information Core (BIC) mutation database. Methods DNA variants were analyzed with the splicing prediction programs NNSPLICE and Human Splicing Finder. Functional analyses of candidate variants were performed by lymphocyte RT-PCR and/or hybrid minigene assays. Forty-one BIC variants of exons 19, 20, 23 and 24 were bioinformatically selected and generated by PCR-mutagenesis of the wild type minigenes. Results Lymphocyte RT-PCR of c.8488-1G > A showed intron 19 retention and a 12-nucleotide deletion in exon 20, whereas c.9026_9030del did not show any splicing anomaly. Minigene analysis of c.8488-1G > A displayed the aforementioned aberrant isoforms but also exon 20 skipping. We further evaluated the splicing outcomes of 41 variants of four BRCA2 exons by minigene analysis. Eighteen variants presented splicing aberrations. Most variants (78.9%) disrupted the natural splice sites, whereas four altered putative enhancers/silencers and had a weak effect. Fluorescent RT-PCR of minigenes accurately detected 14 RNA isoforms generated by cryptic site usage, exon skipping and intron retention events. Fourteen variants showed total splicing disruptions and were predicted to truncate or eliminate essential domains of BRCA2. Conclusions A relevant proportion of BRCA2 variants are correlated with splicing disruptions, indicating that RNA analysis is a valuable tool to assess the pathogenicity of a particular DNA change. The minigene system is a straightforward and robust approach to detect variants with an impact on splicing and contributes to a better knowledge of this gene expression step. PMID:22632462
CBH1 homologs and variant CBH1 cellulases
Goedegebuur, Frits [Rozenlaan, NL; Gualfetti, Peter [San Francisco, CA; Mitchinson, Colin [Half Moon Bay, CA; Neefe, Paulien [Zoetermeer, NL
2011-05-31
Disclosed are a number of homologs and variants of Hypocrea jecorina Cel7A (formerly Trichoderma reesei cellobiohydrolase I or CBH1), nucleic acids encoding the same and methods for producing the same. The homologs and variant cellulases have the amino acid sequence of a glycosyl hydrolase of family 7A wherein one or more amino acid residues are substituted and/or deleted.
Novel GREM1 Variations in Sub-Saharan African Patients With Cleft Lip and/or Cleft Palate.
Gowans, Lord Jephthah Joojo; Oseni, Ganiyu; Mossey, Peter A; Adeyemo, Wasiu Lanre; Eshete, Mekonen A; Busch, Tamara D; Donkor, Peter; Obiri-Yeboah, Solomon; Plange-Rhule, Gyikua; Oti, Alexander A; Owais, Arwa; Olaitan, Peter B; Aregbesola, Babatunde S; Oginni, Fadekemi O; Bello, Seidu A; Audu, Rosemary; Onwuamah, Chika; Agbenorku, Pius; Ogunlewe, Mobolanle O; Abdur-Rahman, Lukman O; Marazita, Mary L; Adeyemo, A A; Murray, Jeffrey C; Butali, Azeez
2018-05-01
Cleft lip and/or cleft palate (CL/P) are congenital anomalies of the face and have multifactorial etiology, with both environmental and genetic risk factors playing crucial roles. Though at least 40 loci have attained genomewide significant association with nonsyndromic CL/P, these loci largely reside in noncoding regions of the human genome, and subsequent resequencing studies of neighboring candidate genes have revealed only a limited number of etiologic coding variants. The present study was conducted to identify etiologic coding variants in GREM1, a locus that has been shown to be largely associated with cleft of both lip and soft palate. We resequenced DNA from 397 sub-Saharan Africans with CL/P and 192 controls using Sanger sequencing. Following analyses of the sequence data, we observed 2 novel coding variants in GREM1. These variants were not found in the 192 African controls and have never been previously reported in any public genetic variant database that includes more than 5000 combined African and African American controls or from the CL/P literature. The novel variants include p.Pro164Ser in an individual with soft palate cleft only and p.Gly61Asp in an individual with bilateral cleft lip and palate. The proband with the p.Gly61Asp GREM1 variant is a van der Woude (VWS) case who also has an etiologic variant in IRF6 gene. Our study demonstrated that there is low number of etiologic coding variants in GREM1, confirming earlier suggestions that variants in regulatory elements may largely account for the association between this locus and CL/P.
The Australian experience following plain packaging: the impact on tobacco branding.
Greenland, Steven J
2016-12-01
Brands are critical to tobacco marketing. Industry stakeholders predicted that plain packaging, by removing key tangible branding dimensions, would restrict new products and brand differentiation. However, manufacturers respond innovatively to limit regulatory impact. This study investigates brand strategy following plain packaging's introduction to Australia. Brand portfolios were determined using 2006-15 tobacco ingredient reports. These detail the brand and variant names sold and are provided annually as part of a voluntary agreement between the Australian Government and leading manufacturers. Post-plain packaging brand ranges were verified using retail price lists and a supermarket retail audit using a method used previously to verify a period of pre-plain packaging data. The verification process identified some data inaccuracies from one manufacturer which resulted in the issuing of corrected data. After plain packaging the leading manufacturers continued with extensive brand ranges differentiated by price. All launched new products. While total brand numbers fell from 29 to 24, the mean number of variants for the leading 12 brands grew from 8.9 to 9.7. Substantial variant name modifications occurred with 50 new or modified names in 2012-13. Among leading brands, the incidence of variant colour names increased from 49.5 to 79.3%. New brands and variants were not inhibited by the introduction of plain packaging in Australia. After plain packaging, leading brand variant numbers expanded by 9 to 116 and colour variant names increased by 73.6% and became the norm-lighter colours (blue, gold and silver) dominated, perpetuating notions of less harmful cigarettes. [Correction added on 09 September 2016, after first online publication: The figures in the last sentence of the Abstract are now corrected from 'expanded by 116' to 'expanded by 9 to 116'.]. © 2016 Society for the Study of Addiction.
Proposed variations of the stepped-wedge design can be used to accommodate multiple interventions.
Lyons, Vivian H; Li, Lingyu; Hughes, James P; Rowhani-Rahbar, Ali
2017-06-01
Stepped-wedge design (SWD) cluster-randomized trials have traditionally been used for evaluating a single intervention. We aimed to explore design variants suitable for evaluating multiple interventions in an SWD trial. We identified four specific variants of the traditional SWD that would allow two interventions to be conducted within a single cluster-randomized trial: concurrent, replacement, supplementation, and factorial SWDs. These variants were chosen to flexibly accommodate study characteristics that limit a one-size-fits-all approach for multiple interventions. In the concurrent SWD, each cluster receives only one intervention, unlike the other variants. The replacement SWD supports two interventions that will not or cannot be used at the same time. The supplementation SWD is appropriate when the second intervention requires the presence of the first intervention, and the factorial SWD supports the evaluation of intervention interactions. The precision for estimating intervention effects varies across the four variants. Selection of the appropriate design variant should be driven by the research question while considering the trade-off between the number of steps, number of clusters, restrictions for concurrent implementation of the interventions, lingering effects of each intervention, and precision of the intervention effect estimates. Copyright © 2017 Elsevier Inc. All rights reserved.
Chang, Vivian Y.; Federman, Noah; Martinez-Agosto, Julian; Tatishchev, Sergei F.; Nelson, Stanley F.
2014-01-01
Background Gastric adenocarcinoma is a rare diagnosis in childhood. A 14-year old male patient presented with metastatic gastric adenocarcinoma, and a strong family history of colon cancer. Clinical sequencing of CDH1 and APC were negative. Whole exome sequencing was therefore applied to capture the majority of protein-coding regions for the identification of single-nucleotide variants, small insertion/deletions, and copy number abnormalities in the patient’s germline as well as primary tumor. Materials and Methods DNA was extracted from the patient’s blood, primary tumor, and the unaffected mother’s blood. DNA libraries were constructed and sequenced on Illumina HiSeq2000. Data were post-processed using Picard and Samtools, then analyzed with the Genome Analysis Toolkit. Variants were annotated using an in-house Ensembl-based program. Copy number was assessed using ExomeCNV. Results Each sample was sequenced to a mean depth of coverage of greater than 120×. A rare non-synonymous coding SNV in TP53 was identified in the germline. There were 10 somatic cancer protein-damaging variants that were not observed in the unaffected mother genome. ExomeCNV comparing tumor to the patient’s germline, identified abnormal copy number, spanning 6,946 genes. Conclusion We present an unusual case of Li-Fraumeni detected by whole exome sequencing. There were also likely driver somatic mutations in the gastric adenocarcinoma. These results highlight the need for more thorough and broad scale germline and cancer analyses to accurately inform patients of inherited risk to cancer and to identify somatic mutations. PMID:23015295
van Riet, Job; Krol, Niels M G; Atmodimedjo, Peggy N; Brosens, Erwin; van IJcken, Wilfred F J; Jansen, Maurice P H M; Martens, John W M; Looijenga, Leendert H; Jenster, Guido; Dubbink, Hendrikus J; Dinjens, Winand N M; van de Werken, Harmen J G
2018-03-01
Exploration and visualization of next-generation sequencing data are crucial for clinical diagnostics. Software allowing simultaneous visualization of multiple regions of interest coupled with dynamic heuristic filtering of genetic aberrations is, however, lacking. Therefore, the authors developed the web application SNPitty that allows interactive visualization and interrogation of variant call format files by using B-allele frequencies of single-nucleotide polymorphisms and single-nucleotide variants, coverage metrics, and copy numbers analysis results. SNPitty displays variant alleles and allelic imbalances with a focus on loss of heterozygosity and copy number variation using genome-wide heterozygous markers and somatic mutations. In addition, SNPitty is capable of generating predefined reports that summarize and highlight disease-specific targets of interest. SNPitty was validated for diagnostic interpretation of somatic events by showcasing a serial dilution series of glioma tissue. Additionally, SNPitty is demonstrated in four cancer-related scenarios encountered in daily clinical practice and on whole-exome sequencing data of peripheral blood from a Down syndrome patient. SNPitty allows detection of loss of heterozygosity, chromosomal and gene amplifications, homozygous or heterozygous deletions, somatic mutations, or any combination thereof in regions or genes of interest. Furthermore, SNPitty can be used to distinguish molecular relationships between multiple tumors from a single patient. On the basis of these data, the authors demonstrate that SNPitty is robust and user friendly in a wide range of diagnostic scenarios. Copyright © 2018 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.
Lovelock, Paul K; Spurdle, Amanda B; Mok, Myth T S; Farrugia, Daniel J; Lakhani, Sunil R; Healey, Sue; Arnold, Stephen; Buchanan, Daniel; Couch, Fergus J; Henderson, Beric R; Goldgar, David E; Tavtigian, Sean V; Chenevix-Trench, Georgia; Brown, Melissa A
2007-01-01
Many of the DNA sequence variants identified in the breast cancer susceptibility gene BRCA1 remain unclassified in terms of their potential pathogenicity. Both multifactorial likelihood analysis and functional approaches have been proposed as a means to elucidate likely clinical significance of such variants, but analysis of the comparative value of these methods for classifying all sequence variants has been limited. We have compared the results from multifactorial likelihood analysis with those from several functional analyses for the four BRCA1 sequence variants A1708E, G1738R, R1699Q, and A1708V. Our results show that multifactorial likelihood analysis, which incorporates sequence conservation, co-inheritance, segregation, and tumour immunohistochemical analysis, may improve classification of variants. For A1708E, previously shown to be functionally compromised, analysis of oestrogen receptor, cytokeratin 5/6, and cytokeratin 14 tumour expression data significantly strengthened the prediction of pathogenicity, giving a posterior probability of pathogenicity of 99%. For G1738R, shown to be functionally defective in this study, immunohistochemistry analysis confirmed previous findings of inconsistent 'BRCA1-like' phenotypes for the two tumours studied, and the posterior probability for this variant was 96%. The posterior probabilities of R1699Q and A1708V were 54% and 69%, respectively, only moderately suggestive of increased risk. Interestingly, results from functional analyses suggest that both of these variants have only partial functional activity. R1699Q was defective in foci formation in response to DNA damage and displayed intermediate transcriptional transactivation activity but showed no evidence for centrosome amplification. In contrast, A1708V displayed an intermediate transcriptional transactivation activity and a normal foci formation response in response to DNA damage but induced centrosome amplification. These data highlight the need for a range of functional studies to be performed in order to identify variants with partially compromised function. The results also raise the possibility that A1708V and R1699Q may be associated with a low or moderate risk of cancer. While data pooling strategies may provide more information for multifactorial analysis to improve the interpretation of the clinical significance of these variants, it is likely that the development of current multifactorial likelihood approaches and the consideration of alternative statistical approaches will be needed to determine whether these individually rare variants do confer a low or moderate risk of breast cancer.
Pettigrew, Christopher; Wayte, Nicola; Lovelock, Paul K; Tavtigian, Sean V; Chenevix-Trench, Georgia; Spurdle, Amanda B; Brown, Melissa A
2005-01-01
Introduction Aberrant pre-mRNA splicing can be more detrimental to the function of a gene than changes in the length or nature of the encoded amino acid sequence. Although predicting the effects of changes in consensus 5' and 3' splice sites near intron:exon boundaries is relatively straightforward, predicting the possible effects of changes in exonic splicing enhancers (ESEs) remains a challenge. Methods As an initial step toward determining which ESEs predicted by the web-based tool ESEfinder in the breast cancer susceptibility gene BRCA1 are likely to be functional, we have determined their evolutionary conservation and compared their location with known BRCA1 sequence variants. Results Using the default settings of ESEfinder, we initially detected 669 potential ESEs in the coding region of the BRCA1 gene. Increasing the threshold score reduced the total number to 464, while taking into consideration the proximity to splice donor and acceptor sites reduced the number to 211. Approximately 11% of these ESEs (23/211) either are identical at the nucleotide level in human, primates, mouse, cow, dog and opossum Brca1 (conserved) or are detectable by ESEfinder in the same position in the Brca1 sequence (shared). The frequency of conserved and shared predicted ESEs between human and mouse is higher in BRCA1 exons (2.8 per 100 nucleotides) than in introns (0.6 per 100 nucleotides). Of conserved or shared putative ESEs, 61% (14/23) were predicted to be affected by sequence variants reported in the Breast Cancer Information Core database. Applying the filters described above increased the colocalization of predicted ESEs with missense changes, in-frame deletions and unclassified variants predicted to be deleterious to protein function, whereas they decreased the colocalization with known polymorphisms or unclassified variants predicted to be neutral. Conclusion In this report we show that evolutionary conservation analysis may be used to improve the specificity of an ESE prediction tool. This is the first report on the prediction of the frequency and distribution of ESEs in the BRCA1 gene, and it is the first reported attempt to predict which ESEs are most likely to be functional and therefore which sequence variants in ESEs are most likely to be pathogenic. PMID:16280041
The humankind genome: from genetic diversity to the origin of human diseases.
Belizário, Jose E
2013-12-01
Genome-wide association studies have failed to establish common variant risk for the majority of common human diseases. The underlying reasons for this failure are explained by recent studies of resequencing and comparison of over 1200 human genomes and 10 000 exomes, together with the delineation of DNA methylation patterns (epigenome) and full characterization of coding and noncoding RNAs (transcriptome) being transcribed. These studies have provided the most comprehensive catalogues of functional elements and genetic variants that are now available for global integrative analysis and experimental validation in prospective cohort studies. With these datasets, researchers will have unparalleled opportunities for the alignment, mining, and testing of hypotheses for the roles of specific genetic variants, including copy number variations, single nucleotide polymorphisms, and indels as the cause of specific phenotypes and diseases. Through the use of next-generation sequencing technologies for genotyping and standardized ontological annotation to systematically analyze the effects of genomic variation on humans and model organism phenotypes, we will be able to find candidate genes and new clues for disease's etiology and treatment. This article describes essential concepts in genetics and genomic technologies as well as the emerging computational framework to comprehensively search websites and platforms available for the analysis and interpretation of genomic data.
In silico study of breast cancer associated gene 3 using LION Target Engine and other tools.
León, Darryl A; Cànaves, Jaume M
2003-12-01
Sequence analysis of individual targets is an important step in annotation and validation. As a test case, we investigated human breast cancer associated gene 3 (BCA3) with LION Target Engine and with other bioinformatics tools. LION Target Engine confirmed that the BCA3 gene is located on 11p15.4 and that the two most likely splice variants (lacking exon 3 and exons 3 and 5, respectively) exist. Based on our manual curation of sequence data, it is proposed that an additional variant (missing only exon 5) published in a public sequence repository, is a prediction artifact. A significant number of new orthologs were also identified, and these were the basis for a high-quality protein secondary structure prediction. Moreover, our research confirmed several distinct functional domains as described in earlier reports. Sequence conservation from multiple sequence alignments, splice variant identification, secondary structure predictions, and predicted phosphorylation sites suggest that the removal of interaction sites through alternative splicing might play a modulatory role in BCA3. This in silico approach shows the depth and relevance of an analysis that can be accomplished by including a variety of publicly available tools with an integrated and customizable life science informatics platform.
Exploiting induced variation to dissect quantitative traits in barley.
Druka, Arnis; Franckowiak, Jerome; Lundqvist, Udda; Bonar, Nicola; Alexander, Jill; Guzy-Wrobelska, Justyna; Ramsay, Luke; Druka, Ilze; Grant, Iain; Macaulay, Malcolm; Vendramin, Vera; Shahinnia, Fahimeh; Radovic, Slobodanka; Houston, Kelly; Harrap, David; Cardle, Linda; Marshall, David; Morgante, Michele; Stein, Nils; Waugh, Robbie
2010-04-01
The identification of genes underlying complex quantitative traits such as grain yield by means of conventional genetic analysis (positional cloning) requires the development of several large mapping populations. However, it is possible that phenotypically related, but more extreme, allelic variants generated by mutational studies could provide a means for more efficient cloning of QTLs (quantitative trait loci). In barley (Hordeum vulgare), with the development of high-throughput genome analysis tools, efficient genome-wide identification of genetic loci harbouring mutant alleles has recently become possible. Genotypic data from NILs (near-isogenic lines) that carry induced or natural variants of genes that control aspects of plant development can be compared with the location of QTLs to potentially identify candidate genes for development--related traits such as grain yield. As yield itself can be divided into a number of allometric component traits such as tillers per plant, kernels per spike and kernel size, mutant alleles that both affect these traits and are located within the confidence intervals for major yield QTLs may represent extreme variants of the underlying genes. In addition, the development of detailed comparative genomic models based on the alignment of a high-density barley gene map with the rice and sorghum physical maps, has enabled an informed prioritization of 'known function' genes as candidates for both QTLs and induced mutant genes.
Gim, Jungsoo; Kim, Wonji; Kwak, Soo Heon; Choi, Hosik; Park, Changyi; Park, Kyong Soo; Kwon, Sunghoon; Park, Taesung; Won, Sungho
2017-11-01
Despite the many successes of genome-wide association studies (GWAS), the known susceptibility variants identified by GWAS have modest effect sizes, leading to notable skepticism about the effectiveness of building a risk prediction model from large-scale genetic data. However, in contrast to genetic variants, the family history of diseases has been largely accepted as an important risk factor in clinical diagnosis and risk prediction. Nevertheless, the complicated structures of the family history of diseases have limited their application in clinical practice. Here, we developed a new method that enables incorporation of the general family history of diseases with a liability threshold model, and propose a new analysis strategy for risk prediction with penalized regression analysis that incorporates both large numbers of genetic variants and clinical risk factors. Application of our model to type 2 diabetes in the Korean population (1846 cases and 1846 controls) demonstrated that single-nucleotide polymorphisms accounted for 32.5% of the variation explained by the predicted risk scores in the test data set, and incorporation of family history led to an additional 6.3% improvement in prediction. Our results illustrate that family medical history provides valuable information on the variation of complex diseases and improves prediction performance. Copyright © 2017 by the Genetics Society of America.
Evaluation of non-coding variation in GLUT1 deficiency.
Liu, Yu-Chi; Lee, Jia Wei Audrey; Bellows, Susannah T; Damiano, John A; Mullen, Saul A; Berkovic, Samuel F; Bahlo, Melanie; Scheffer, Ingrid E; Hildebrand, Michael S
2016-12-01
Loss-of-function mutations in SLC2A1, encoding glucose transporter-1 (GLUT-1), lead to dysfunction of glucose transport across the blood-brain barrier. Ten percent of cases with hypoglycorrhachia (fasting cerebrospinal fluid [CSF] glucose <2.2mmol/L) do not have mutations. We hypothesized that GLUT1 deficiency could be due to non-coding SLC2A1 variants. We performed whole exome sequencing of one proband with a GLUT1 phenotype and hypoglycorrhachia negative for SLC2A1 sequencing and copy number variants. We studied a further 55 patients with different epilepsies and low CSF glucose who did not have exonic mutations or copy number variants. We sequenced non-coding promoter and intronic regions. We performed mRNA studies for the recurrent intronic variant. The proband had a de novo splice site mutation five base pairs from the intron-exon boundary. Three of 55 patients had deep intronic SLC2A1 variants, including a recurrent variant in two. The recurrent variant produced less SLC2A1 mRNA transcript. Fasting CSF glucose levels show an age-dependent correlation, which makes the definition of hypoglycorrhachia challenging. Low CSF glucose levels may be associated with pathogenic SLC2A1 mutations including deep intronic SLC2A1 variants. Extending genetic screening to non-coding regions will enable diagnosis of more patients with GLUT1 deficiency, allowing implementation of the ketogenic diet to improve outcomes. © 2016 Mac Keith Press.
Paganoni, C.A.; Chang, K.C.; Robblee, M.B.
2006-01-01
A significant data quality challenge for highly variant systems surrounds the limited ability to quantify operationally reasonable limits on the data elements being collected and provide reasonable threshold predictions. In many instances, the number of influences that drive a resulting value or operational range is too large to enable physical sampling for each influencer, or is too complicated to accurately model in an explicit simulation. An alternative method to determine reasonable observation thresholds is to employ an automation algorithm that would emulate a human analyst visually inspecting data for limits. Using the visualization technique of self-organizing maps (SOM) on data having poorly understood relationships, a methodology for determining threshold limits was developed. To illustrate this approach, analysis of environmental influences that drive the abundance of a target indicator species (the pink shrimp, Farfantepenaeus duorarum) provided a real example of applicability. The relationship between salinity and temperature and abundance of F. duorarum is well documented, but the effect of changes in water quality upstream on pink shrimp abundance is not well understood. The highly variant nature surrounding catch of a specific number of organisms in the wild, and the data available from up-stream hydrology measures for salinity and temperature, made this an ideal candidate for the approach to provide a determination about the influence of changes in hydrology on populations of organisms.
NASA Astrophysics Data System (ADS)
Paganoni, Christopher A.; Chang, K. C.; Robblee, Michael B.
2006-05-01
A significant data quality challenge for highly variant systems surrounds the limited ability to quantify operationally reasonable limits on the data elements being collected and provide reasonable threshold predictions. In many instances, the number of influences that drive a resulting value or operational range is too large to enable physical sampling for each influencer, or is too complicated to accurately model in an explicit simulation. An alternative method to determine reasonable observation thresholds is to employ an automation algorithm that would emulate a human analyst visually inspecting data for limits. Using the visualization technique of self-organizing maps (SOM) on data having poorly understood relationships, a methodology for determining threshold limits was developed. To illustrate this approach, analysis of environmental influences that drive the abundance of a target indicator species (the pink shrimp, Farfantepenaeus duorarum) provided a real example of applicability. The relationship between salinity and temperature and abundance of F. duorarum is well documented, but the effect of changes in water quality upstream on pink shrimp abundance is not well understood. The highly variant nature surrounding catch of a specific number of organisms in the wild, and the data available from up-stream hydrology measures for salinity and temperature, made this an ideal candidate for the approach to provide a determination about the influence of changes in hydrology on populations of organisms.
2013-01-01
Background Obesity, excess fat tissue in the body, can underlie a variety of medical complaints including heart disease, stroke and cancer. The pig is an excellent model organism for the study of various human disorders, including obesity, as well as being the foremost agricultural species. In order to identify genetic variants associated with fatness, we used a selective genomic approach sampling DNA from animals at the extreme ends of the fat and lean spectrum using estimated breeding values derived from a total population size of over 70,000 animals. DNA from 3 breeds (Sire Line Large White, Duroc and a white Pietrain composite line (Titan)) was used to interrogate the Illumina Porcine SNP60 Genotyping Beadchip in order to identify significant associations in terms of single nucleotide polymorphisms (SNPs) and copy number variants (CNVs). Results By sampling animals at each end of the fat/lean EBV (estimate breeding value) spectrum the whole population could be assessed using less than 300 animals, without losing statistical power. Indeed, several significant SNPs (at the 5% genome wide significance level) were discovered, 4 of these linked to genes with ontologies that had previously been correlated with fatness (NTS, FABP6, SST and NR3C2). Quantitative analysis of the data identified putative CNV regions containing genes whose ontology suggested fatness related functions (MCHR1, PPARα, SLC5A1 and SLC5A4). Conclusions Selective genotyping of EBVs at either end of the phenotypic spectrum proved to be a cost effective means of identifying SNPs and CNVs associated with fatness and with estimated major effects in a large population of animals. PMID:24225222
Rabies surveillance in the United States during 2007.
Blanton, Jesse D; Palmer, Dustyn; Christian, Kira A; Rupprecht, Charles E
2008-09-15
During 2007, 49 states and Puerto Rico reported 7,258 cases of rabies in animals and 1 case in a human to the CDC, representing a 4.6% increase from the 6,940 cases in animals and 3 cases in humans reported in 2006. Approximately 93% of the cases were in wildlife, and 7% were in domestic animals. Relative contributions by the major animal groups were as follows: 2,659 raccoons (36.6%), 1,973 bats (27.2%), 1,478 skunks (20.4%), 489 foxes (6.7%), 274 cats (3.8%), 93 dogs (1.3%), and 57 cattle (0.8%). Compared with numbers of reported cases in 2006, cases in 2007 increased among dogs, bats, foxes, and skunks while decreases were reported among cattle, cats, and skunks. Increases in numbers of rabid raccoons during 2007 were reported by 11 of the 20 eastern states where raccoon rabies was enzootic, and reported cases increased by 1.7% overall, compared with 2006. On a national level, the number of rabies cases in skunks during 2007 decreased by 1.1% from the number reported in 2006. Texas reported the greatest number (n = 362) of rabid skunks and the greatest overall state total of animal rabies cases (969). No cases of rabies associated with the dog/coyote rabies virus variant were reported. The United States remains free of dog-to-dog transmission of canine rabies virus variants. The total number of cases of rabies reported nationally in foxes increased 14.5%, compared with 2006. Increases in the number of reported rabid foxes were attributable to greater numbers of foxes reported with the Arctic fox rabies virus variant in Alaska, the Texas gray fox rabies virus variant in Texas, and the raccoon rabies virus variant in Virginia. The 1,973 cases of rabies reported in bats represented a 16.6% increase over numbers reported in 2006. Cases of rabies in dogs and in sheep and goats increased 17.7% and 18.2%, respectively, whereas cases reported in cattle, cats, and horses and mules decreased 30.5%, 13.8%, and 20.8%, respectively. In Puerto Rico, reported cases of rabies in mongooses decreased 51.5%, and rabies in domestic animals, presumably attributable to spillover infection from mongooses, increased 25%. One human rabies case was reported from Minnesota during 2007. Although typing of the rabies virus variant in this case was not possible, an investigation of this case indicated a bat as the most likely source of exposure.
Genetic basis for childhood interstitial lung disease among Japanese infants and children.
Hayasaka, Itaru; Cho, Kazutoshi; Akimoto, Takuma; Ikeda, Masahiko; Uzuki, Yutaka; Yamada, Masafumi; Nakata, Koh; Furuta, Itsuko; Ariga, Tadashi; Minakami, Hisanori
2018-02-01
BackgroundGenetic variants responsible for childhood interstitial lung disease (chILD) have not been studied extensively in Japanese patients.MethodsThe study population consisted of 62 Japanese chILD patients. Twenty-one and four patients had pulmonary hypertension resistant to treatment (PH) and hypothyroidism, respectively. Analyses of genetic variants were performed in all 62 patients for SFTPC and ABCA3, in all 21 PH patients for FOXF1, and in a limited number of patients for NKX2.1.ResultsCausative genetic variants for chILD were identified in 11 (18%) patients: SFTPC variants in six, NKX2.1 variants in three, and FOXF1 variants in two patients. No patients had ABCA3 variants. All three and two patients with NKX2.1 variants had hypothyroidism and developmental delay, respectively. We found six novel variants in this study.ConclusionMutations in SFTPC, NKX2.1, and FOXF1 were identified among Japanese infants and children with chILD, whereas ABCA3 mutations were rare.
Gray, Phillip N.; Vuong, Huy; Tsai, Pei; Lu, Hsaio-Mei; Mu, Wenbo; Hsuan, Vickie; Hoo, Jayne; Shah, Swati; Uyeda, Lisa; Fox, Susanne; Patel, Harshil; Janicek, Mike; Brown, Sandra; Dobrea, Lavinia; Wagman, Lawrence; Plimack, Elizabeth; Mehra, Ranee; Golemis, Erica A.; Bilusic, Marijo; Zibelman, Matthew; Elliott, Aaron
2016-01-01
The development of targeted therapies for both germline and somatic DNA mutations has increased the need for molecular profiling assays to determine the mutational status of specific genes. Moreover, the potential of off-label prescription of targeted therapies favors classifying tumors based on DNA alterations rather than traditional tissue pathology. Here we describe the analytical validation of a custom probe-based NGS tumor panel, TumorNext, which can detect single nucleotide variants, small insertions and deletions in 142 genes that are frequently mutated in somatic and/or germline cancers. TumorNext also detects gene fusions and structural variants, such as tandem duplications and inversions, in 15 frequently disrupted oncogenes and tumor suppressors. The assay uses a matched control and custom bioinformatics pipeline to differentiate between somatic and germline mutations, allowing precise variant classification. We tested 170 previously characterized samples, of which > 95% were formalin-fixed paraffin embedded tissue from 8 different cancer types, and highlight examples where lack of germline status may have led to the inappropriate prescription of therapy. We also describe the validation of the Affymetrix OncoScan platform, an array technology for high resolution copy number variant detection for use in parallel with the NGS panel that can detect single copy amplifications and hemizygous deletions. We analyzed 80 previously characterized formalin-fixed paraffin-embedded specimens and provide examples of hemizygous deletion detection in samples with known pathogenic germline mutations. Thus, the TumorNext combined approach of NGS and OncoScan potentially allows for the identification of the “second hit” in hereditary cancer patients. PMID:27626691
Mefford, Heather C; Cooper, Gregory M; Zerr, Troy; Smith, Joshua D; Baker, Carl; Shafer, Neil; Thorland, Erik C; Skinner, Cindy; Schwartz, Charles E; Nickerson, Deborah A; Eichler, Evan E
2009-09-01
Copy-number variants (CNVs) are substantial contributors to human disease. A central challenge in CNV-disease association studies is to characterize the pathogenicity of rare and possibly incompletely penetrant events, which requires the accurate detection of rare CNVs in large numbers of individuals. Cost and throughput issues limit our ability to perform these studies. We have adapted the Illumina BeadXpress SNP genotyping assay and developed an algorithm, SNP-Conditional OUTlier detection (SCOUT), to rapidly and accurately detect both rare and common CNVs in large cohorts. This approach is customizable, cost effective, highly parallelized, and largely automated. We applied this method to screen 69 loci in 1105 children with unexplained intellectual disability, identifying pathogenic variants in 3.1% of these individuals and potentially pathogenic variants in an additional 2.3%. We identified seven individuals (0.7%) with a deletion of 16p11.2, which has been previously associated with autism. Our results widen the phenotypic spectrum of these deletions to include intellectual disability without autism. We also detected 1.65-3.4 Mbp duplications at 16p13.11 in 1.1% of affected individuals and 350 kbp deletions at 15q11.2, near the Prader-Willi/Angelman syndrome critical region, in 0.8% of affected individuals. Compared to published CNVs in controls they are significantly (P = 4.7 x 10(-5) and 0.003, respectively) enriched in these children, supporting previously published hypotheses that they are neurocognitive disease risk factors. More generally, this approach offers a previously unavailable balance between customization, cost, and throughput for analysis of CNVs and should prove valuable for targeted CNV detection in both research and diagnostic settings.
Reddy, Puli Chandramouli; Ubhe, Suyog; Sirwani, Neha; Lohokare, Rasika; Galande, Sanjeev
2017-08-01
Histones are fundamental components of chromatin in all eukaryotes. Hydra, an emerging model system belonging to the basal metazoan phylum Cnidaria, provides an ideal platform to understand the evolution of core histone components at the base of eumetazoan phyla. Hydra exhibits peculiar properties such as tremendous regenerative capacity, lack of organismal senescence and rarity of malignancy. In light of the role of histone modifications and histone variants in these processes it is important to understand the nature of histones themselves and their variants in hydra. Here, we report identification of the complete repertoire of histone-coding genes in the Hydra magnipapillata genome. Hydra histones were classified based on their copy numbers, gene structure and other characteristic features. Genomic organization of canonical histone genes revealed the presence of H2A-H2B and H3-H4 paired clusters in high frequency and also a cluster with all core histones along with H1. Phylogenetic analysis of identified members of H2A and H2B histones suggested rapid expansion of these groups in Hydrozoa resulting in the appearance of unique subtypes. Amino acid sequence level comparisons of H2A and H2B forms with bilaterian counterparts suggest the possibility of a highly mobile nature of nucleosomes in hydra. Absolute quantitation of transcripts confirmed the high copy number of histones and supported the canonical nature of H2A. Furthermore, functional characterization of H2A.X.1 and a unique variant H2A.X.2 in the gastric region suggest their role in the maintenance of genome integrity and differentiation processes. These findings provide insights into the evolution of histones and their variants in hydra. Copyright © 2017 Elsevier GmbH. All rights reserved.
Multi-variant study of obesity risk genes in African Americans: The Jackson Heart Study.
Liu, Shijian; Wilson, James G; Jiang, Fan; Griswold, Michael; Correa, Adolfo; Mei, Hao
2016-11-30
Genome-wide association study (GWAS) has been successful in identifying obesity risk genes by single-variant association analysis. For this study, we designed steps of analysis strategy and aimed to identify multi-variant effects on obesity risk among candidate genes. Our analyses were focused on 2137 African American participants with body mass index measured in the Jackson Heart Study and 657 common single nucleotide polymorphisms (SNPs) genotyped at 8 GWAS-identified obesity risk genes. Single-variant association test showed that no SNPs reached significance after multiple testing adjustment. The following gene-gene interaction analysis, which was focused on SNPs with unadjusted p-value<0.10, identified 6 significant multi-variant associations. Logistic regression showed that SNPs in these associations did not have significant linear interactions; examination of genetic risk score evidenced that 4 multi-variant associations had significant additive effects of risk SNPs; and haplotype association test presented that all multi-variant associations contained one or several combinations of particular alleles or haplotypes, associated with increased obesity risk. Our study evidenced that obesity risk genes generated multi-variant effects, which can be additive or non-linear interactions, and multi-variant study is an important supplement to existing GWAS for understanding genetic effects of obesity risk genes. Copyright © 2016 Elsevier B.V. All rights reserved.
Ruiz-Pérez, R.; López-Cózar, E. Delgado; Jiménez-Contreras, E.
2002-01-01
Objectives: The study sought to investigate how Spanish names are handled by national and international databases and to identify mistakes that can undermine the usefulness of these databases for locating and retrieving works by Spanish authors. Methods: The authors sampled 172 articles published by authors from the University of Granada Medical School between 1987 and 1996 and analyzed the variations in how each of their names was indexed in Science Citation Index (SCI), MEDLINE, and Índice Médico Español (IME). The number and types of variants that appeared for each author's name were recorded and compared across databases to identify inconsistencies in indexing practices. We analyzed the relationship between variability (number of variants of an author's name) and productivity (number of items the name was associated with as an author), the consequences for retrieval of information, and the most frequent indexing structures used for Spanish names. Results: The proportion of authors who appeared under more then one name was 48.1% in SCI, 50.7% in MEDLINE, and 69.0% in IME. Productivity correlated directly with variability: more than 50% of the authors listed on five to ten items appeared under more than one name in any given database, and close to 100% of the authors listed on more than ten items appeared under two or more variants. Productivity correlated inversely with retrievability: as the number of variants for a name increased, the number of items retrieved under each variant decreased. For the most highly productive authors, the number of items retrieved under each variant tended toward one. The most frequent indexing methods varied between databases. In MEDLINE and IME, names were indexed correctly as “first surname second surname, first name initial middle name initial” (if present) in 41.7% and 49.5% of the records, respectively. However, in SCI, the most frequent method was “first surname, first name initial second name initial” (48.0% of the records) and first surname and second surname run together, first name initial (18.3%). Conclusions: Retrievability on the basis of author's name was poor in all three databases. Each database uses accurate indexing methods, but these methods fail to result in consistency or coherence for specific entries. The likely causes of inconsistency are: (1) use by authors of variants of their names during their publication careers, (2) lack of authority control in all three databases, (3) the use of an inappropriate indexing method for Spanish names in SCI, (4) authors' inconsistent behaviors, and (5) possible editorial interventions by some journals. We offer some suggestions as to how to avert the proliferation of author name variants in the databases. PMID:12398248
Human MHC architecture and evolution: implications for disease association studies
Traherne, J A
2008-01-01
Major histocompatibility complex (MHC) variation is a key determinant of susceptibility and resistance to a large number of infectious, autoimmune and other diseases. Identification of the MHC variants conferring susceptibility to disease is problematic, due to high levels of variation and linkage disequilibrium. Recent cataloguing and analysis of variation over the complete MHC has facilitated localization of susceptibility loci for autoimmune diseases, and provided insight into the MHC's evolution. This review considers how the unusual genetic characteristics of the MHC impact on strategies to identify variants causing, or contributing to, disease phenotypes. It also considers the MHC in relation to novel mechanisms influencing gene function and regulation, such as epistasis, epigenetics and microRNAs. These developments, along with recent technological advances, shed light on genetic association in complex disease. PMID:18397301
Lee y trabaja: Libro de lectura 2, nivel 1 (Read and Work: Reader 2, Level 1).
ERIC Educational Resources Information Center
Martinez, Emiliano; and Others
This reading textbook, the second of a series, is an anthology of stories designed to relate to the natural interest of the elementary school child. On this level the number of words to memorize is increased (on the average, four per unit) while at the same time, the study of word variants is introduced to begin analysis exercises based on the…
A genome-wide assessment of rare copy number variants in colorectal cancer.
Li, Zhenli; Yu, Dan; Gan, Meifu; Shan, Qiaonan; Yin, Xiaoyang; Tang, Shunli; Zhang, Shuai; Shi, Yongyong; Zhu, Yimin; Lai, Maode; Zhang, Dandan
2015-09-22
Colorectal cancer (CRC) is a complex disease with an estimated heritability of approximately 35%. However, known CRC-related common single nucleotide polymorphisms (SNPs) can only explain ~0.65% of the heritability. This "missing heritability" may be explained partially by rare copy number variants (CNVs). In this study, we performed a genome-wide scan using Illumina Human-Omni Express BeadChip, 694 sporadic CRC cases and 1641 controls were eventually included in our analysis after quality control. The global burden analysis revealed a 1.53-fold excess of rare CNVs in CRC cases compared with controls (P < 1 × 10(-6)), and the difference being more pronounced for genic rare CNVs and CNVs overlapped with coding regions (1.65-fold and 1.84-fold, respectively, both P < 1 × 10(-6)). Interestingly, both the cases in the lowest and middle tertile of age carried a higher burden of rare CNVs comparing to the highest tertile. Furthermore, 639 CNV-disrupted genes exclusive to CRC cases were found to be significantly enriched in gene ontology (GO) terms concerning nucleosome assembly and olfactory receptor activity. Our study was the first to evaluate the burden of rare CNVs in sporadic CRC and suggested that rare CNVs contributed to the missing heritability of CRC.
Lacey, Cameron J; Doudney, Kit; Bridgman, Paul G; George, Peter M; Mulder, Roger T; Zarifeh, Julie J; Kimber, Bridget; Cadzow, Murray J; Black, Michael A; Merriman, Tony R; Lehnert, Klaus; Bickley, Vivienne M; Pearson, John F; Cameron, Vicky A; Kennedy, Martin A
2018-05-15
The pathophysiology of stress cardiomyopathy (SCM), also known as takotsubo syndrome, is poorly understood. SCM usually occurs sporadically, often in association with a stressful event, but clusters of cases are reported after major natural disasters. There is some evidence that this is a familial condition. We have examined three possible models for an underlying genetic predisposition to SCM. Our primary study cohort consists of 28 women who suffered SCM as a result of two devastating earthquakes that struck the city of Christchurch, New Zealand, in 2010 and 2011. To seek possible underlying genetic factors we carried out exome analysis, genotyping array analysis, and array comparative genomic hybridization on these subjects. The most striking finding was the observation of a markedly elevated rate of rare, heterogeneous copy number variants (CNV) of uncertain clinical significance (in 12/28 subjects). Several of these CNVs impacted on genes of cardiac relevance including RBFOX1, GPC5, KCNRG, CHODL, and GPBP1L1. There is no physical overlap between the CNVs, and the genes they impact do not appear to be functionally related. The recognition that SCM predisposition may be associated with a high rate of rare CNVs offers a novel perspective on this enigmatic condition.
Genetic Factors of the Disease Course After Sepsis: Rare Deleterious Variants Are Predictive.
Taudien, Stefan; Lausser, Ludwig; Giamarellos-Bourboulis, Evangelos J; Sponholz, Christoph; Schöneweck, Franziska; Felder, Marius; Schirra, Lyn-Rouven; Schmid, Florian; Gogos, Charalambos; Groth, Susann; Petersen, Britt-Sabina; Franke, Andre; Lieb, Wolfgang; Huse, Klaus; Zipfel, Peter F; Kurzai, Oliver; Moepps, Barbara; Gierschik, Peter; Bauer, Michael; Scherag, André; Kestler, Hans A; Platzer, Matthias
2016-10-01
Sepsis is a life-threatening organ dysfunction caused by dysregulated host response to infection. For its clinical course, host genetic factors are important and rare genomic variants are suspected to contribute. We sequenced the exomes of 59 Greek and 15 German patients with bacterial sepsis divided into two groups with extremely different disease courses. Variant analysis was focusing on rare deleterious single nucleotide variants (SNVs). We identified significant differences in the number of rare deleterious SNVs per patient between the ethnic groups. Classification experiments based on the data of the Greek patients allowed discrimination between the disease courses with estimated sensitivity and specificity>75%. By application of the trained model to the German patients we observed comparable discriminatory properties despite lower population-specific rare SNV load. Furthermore, rare SNVs in genes of cell signaling and innate immunity related pathways were identified as classifiers discriminating between the sepsis courses. Sepsis patients with favorable disease course after sepsis, even in the case of unfavorable preconditions, seem to be affected more often by rare deleterious SNVs in cell signaling and innate immunity related pathways, suggesting a protective role of impairments in these processes against a poor disease course. Copyright © 2016 The Authors. Published by Elsevier B.V. All rights reserved.
Efficient inference for genetic association studies with multiple outcomes.
Ruffieux, Helene; Davison, Anthony C; Hager, Jorg; Irincheeva, Irina
2017-10-01
Combined inference for heterogeneous high-dimensional data is critical in modern biology, where clinical and various kinds of molecular data may be available from a single study. Classical genetic association studies regress a single clinical outcome on many genetic variants one by one, but there is an increasing demand for joint analysis of many molecular outcomes and genetic variants in order to unravel functional interactions. Unfortunately, most existing approaches to joint modeling are either too simplistic to be powerful or are impracticable for computational reasons. Inspired by Richardson and others (2010, Bayesian Statistics 9), we consider a sparse multivariate regression model that allows simultaneous selection of predictors and associated responses. As Markov chain Monte Carlo (MCMC) inference on such models can be prohibitively slow when the number of genetic variants exceeds a few thousand, we propose a variational inference approach which produces posterior information very close to that of MCMC inference, at a much reduced computational cost. Extensive numerical experiments show that our approach outperforms popular variable selection methods and tailored Bayesian procedures, dealing within hours with problems involving hundreds of thousands of genetic variants and tens to hundreds of clinical or molecular outcomes. © The Author 2017. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Exome Sequencing Fails to Identify the Genetic Cause of Aicardi Syndrome.
Lund, Caroline; Striano, Pasquale; Sorte, Hanne Sørmo; Parisi, Pasquale; Iacomino, Michele; Sheng, Ying; Vigeland, Magnus D; Øye, Anne-Marte; Møller, Rikke Steensbjerre; Selmer, Kaja K; Zara, Federico
2016-09-01
Aicardi syndrome (AS) is a well-characterized neurodevelopmental disorder with an unknown etiology. In this study, we performed whole-exome sequencing in 11 female patients with the diagnosis of AS, in order to identify the disease-causing gene. In particular, we focused on detecting variants in the X chromosome, including the analysis of variants with a low number of sequencing reads, in case of somatic mosaicism. For 2 of the patients, we also sequenced the exome of the parents to search for de novo mutations. We did not identify any genetic variants likely to be damaging. Only one single missense variant was identified by the de novo analyses of the 2 trios, and this was considered benign. The failure to identify a disease gene in this study may be due to technical limitations of our study design, including the possibility that the genetic aberration leading to AS is situated in a non-exonic region or that the mutation is somatic and not detectable by our approach. Alternatively, it is possible that AS is genetically heterogeneous and that 11 patients are not sufficient to reveal the causative genes. Future studies of AS should consider designs where also non-exonic regions are explored and apply a sequencing depth so that also low-grade somatic mosaicism can be detected.
Role of protein surface charge in monellin sweetness.
Xue, Wei-Feng; Szczepankiewicz, Olga; Thulin, Eva; Linse, Sara; Carey, Jannette
2009-03-01
A small number of proteins have the unusual property of tasting intensely sweet. Despite many studies aimed at identifying their sweet taste determinants, the molecular basis of protein sweetness is not fully understood. Recent mutational studies of monellin have implicated positively charged residues in sweetness. In the present work, the effect of overall net charge was investigated using the complementary approach of negative charge alterations. Multiple substitutions of Asp/Asn and Glu/Gln residues radically altered the surface charge of single-chain monellin by removing six negative charges or adding four negative charges. Biophysical characterization using circular dichroism, fluorescence, and two-dimensional NMR demonstrates that the native fold of monellin is preserved in the variant proteins under physiological solution conditions although their stability toward chemical denaturation is altered. A human taste test was employed to determine the sweetness detection threshold of the variants. Removal of negative charges preserves monellin sweetness, whereas added negative charge has a large negative impact on sweetness. Meta-analysis of published charge variants of monellin and other sweet proteins reveals a general trend toward increasing sweetness with increasing positive net charge. Structural mapping of monellin variants identifies a hydrophobic surface predicted to face the receptor where introduced positive or negative charge reduces sweetness, and a polar surface where charges modulate long-range electrostatic complementarity.
Williams, Nigel M; Franke, Barbara; Mick, Eric; Anney, Richard J L; Freitag, Christine M; Gill, Michael; Thapar, Anita; O'Donovan, Michael C; Owen, Michael J; Holmans, Peter; Kent, Lindsey; Middleton, Frank; Zhang-James, Yanli; Liu, Lu; Meyer, Jobst; Nguyen, Thuy Trang; Romanos, Jasmin; Romanos, Marcel; Seitz, Christiane; Renner, Tobias J; Walitza, Susanne; Warnke, Andreas; Palmason, Haukur; Buitelaar, Jan; Rommelse, Nanda; Vasquez, Alejandro Arias; Hawi, Ziarih; Langley, Kate; Sergeant, Joseph; Steinhausen, Hans-Christoph; Roeyers, Herbert; Biederman, Joseph; Zaharieva, Irina; Hakonarson, Hakon; Elia, Josephine; Lionel, Anath C; Crosbie, Jennifer; Marshall, Christian R; Schachar, Russell; Scherer, Stephen W; Todorov, Alexandre; Smalley, Susan L; Loo, Sandra; Nelson, Stanley; Shtir, Corina; Asherson, Philip; Reif, Andreas; Lesch, Klaus-Peter; Faraone, Stephen V
2012-02-01
Attention deficit hyperactivity disorder (ADHD) is a common, highly heritable psychiatric disorder. Because of its multifactorial etiology, however, identifying the genes involved has been difficult. The authors followed up on recent findings suggesting that rare copy number variants (CNVs) may be important for ADHD etiology. The authors performed a genome-wide analysis of large, rare CNVs (<1% population frequency) in children with ADHD (N=896) and comparison subjects (N=2,455) from the IMAGE II Consortium. The authors observed 1,562 individually rare CNVs >100 kb in size, which segregated into 912 independent loci. Overall, the rate of rare CNVs >100 kb was 1.15 times higher in ADHD case subjects relative to comparison subjects, with duplications spanning known genes showing a 1.2-fold enrichment. In accordance with a previous study, rare CNVs >500 kb showed the greatest enrichment (1.28-fold). CNVs identified in ADHD case subjects were significantly enriched for loci implicated in autism and in schizophrenia. Duplications spanning the CHRNA7 gene at chromosome 15q13.3 were associated with ADHD in single-locus analysis. This finding was consistently replicated in an additional 2,242 ADHD case subjects and 8,552 comparison subjects from four independent cohorts from the United Kingdom, the United States, and Canada. Presence of the duplication at 15q13.3 appeared to be associated with comorbid conduct disorder. These findings support the enrichment of large, rare CNVs in ADHD and implicate duplications at 15q13.3 as a novel risk factor for ADHD. With a frequency of 0.6% in the populations investigated and a relatively large effect size (odds ratio=2.22, 95% confidence interval=1.5–3.6), this locus could be an important contributor to ADHD etiology.
Franke, Barbara; Mick, Eric; Anney, Richard J.L.; Freitag, Christine M.; Gill, Michael; Thapar, Anita; O'Donovan, Michael C.; Owen, Michael J.; Holmans, Peter; Kent, Lindsey; Middleton, Frank; Zhang-James, Yanli; Liu, Lu; Meyer, Jobst; Nguyen, Thuy Trang; Romanos, Jasmin; Romanos, Marcel; Seitz, Christiane; Renner, Tobias J.; Walitza, Susanne; Warnke, Andreas; Palmason, Haukur; Buitelaar, Jan; Rommelse, Nanda; Vasquez, Alejandro Arias; Hawi, Ziarih; Langley, Kate; Sergeant, Joseph; Steinhausen, Hans-Christoph; Roeyers, Herbert; Biederman, Joseph; Zaharieva, Irina; Hakonarson, Hakon; Elia, Josephine; Lionel, Anath C.; Crosbie, Jennifer; Marshall, Christian R.; Schachar, Russell; Scherer, Stephen W.; Todorov, Alexandre; Smalley, Susan L.; Loo, Sandra; Nelson, Stanley; Shtir, Corina; Asherson, Philip; Reif, Andreas; Lesch, Klaus-Peter
2012-01-01
Objective: Attention deficit hyperactivity disorder (ADHD) is a common, highly heritable psychiatric disorder. Because of its multifactorial etiology, however, identifying the genes involved has been difficult. The authors followed up on recent findings suggesting that rare copy number variants (CNVs) may be important for ADHD etiology. Method: The authors performed a genome-wide analysis of large, rare CNVs (<1% population frequency) in children with ADHD (N=896) and comparison subjects (N=2,455) from the IMAGE II Consortium. Results: The authors observed 1,562 individually rare CNVs >100 kb in size, which segregated into 912 independent loci. Overall, the rate of rare CNVs >100 kb was 1.15 times higher in ADHD case subjects relative to comparison subjects, with duplications spanning known genes showing a 1.2-fold enrichment. In accordance with a previous study, rare CNVs >500 kb showed the greatest enrichment (1.28-fold). CNVs identified in ADHD case subjects were significantly enriched for loci implicated in autism and in schizophrenia. Duplications spanning the CHRNA7 gene at chromosome 15q13.3 were associated with ADHD in single-locus analysis. This finding was consistently replicated in an additional 2,242 ADHD case subjects and 8,552 comparison subjects from four independent cohorts from the United Kingdom, the United States, and Canada. Presence of the duplication at 15q13.3 appeared to be associated with comorbid conduct disorder. Conclusions: These findings support the enrichment of large, rare CNVs in ADHD and implicate duplications at 15q13.3 as a novel risk factor for ADHD. With a frequency of 0.6% in the populations investigated and a relatively large effect size (odds ratio=2.22, 95% confidence interval=1.5–3.6), this locus could be an important contributor to ADHD etiology. PMID:22420048
Alteration of gene expression by alcohol exposure at early neurulation.
Zhou, Feng C; Zhao, Qianqian; Liu, Yunlong; Goodlett, Charles R; Liang, Tiebing; McClintick, Jeanette N; Edenberg, Howard J; Li, Lang
2011-02-21
We have previously demonstrated that alcohol exposure at early neurulation induces growth retardation, neural tube abnormalities, and alteration of DNA methylation. To explore the global gene expression changes which may underline these developmental defects, microarray analyses were performed in a whole embryo mouse culture model that allows control over alcohol and embryonic variables. Alcohol caused teratogenesis in brain, heart, forelimb, and optic vesicle; a subset of the embryos also showed cranial neural tube defects. In microarray analysis (accession number GSM9545), adopting hypothesis-driven Gene Set Enrichment Analysis (GSEA) informatics and intersection analysis of two independent experiments, we found that there was a collective reduction in expression of neural specification genes (neurogenin, Sox5, Bhlhe22), neural growth factor genes [Igf1, Efemp1, Klf10 (Tieg), and Edil3], and alteration of genes involved in cell growth, apoptosis, histone variants, eye and heart development. There was also a reduction of retinol binding protein 1 (Rbp1), and de novo expression of aldehyde dehydrogenase 1B1 (Aldh1B1). Remarkably, four key hematopoiesis genes (glycophorin A, adducin 2, beta-2 microglobulin, and ceruloplasmin) were absent after alcohol treatment, and histone variant genes were reduced. The down-regulation of the neurospecification and the neurotrophic genes were further confirmed by quantitative RT-PCR. Furthermore, the gene expression profile demonstrated distinct subgroups which corresponded with two distinct alcohol-related neural tube phenotypes: an open (ALC-NTO) and a closed neural tube (ALC-NTC). Further, the epidermal growth factor signaling pathway and histone variants were specifically altered in ALC-NTO, and a greater number of neurotrophic/growth factor genes were down-regulated in the ALC-NTO than in the ALC-NTC embryos. This study revealed a set of genes vulnerable to alcohol exposure and genes that were associated with neural tube defects during early neurulation.
Reid, Alistair G; Huntly, Brian J P; Grace, Colin; Green, Anthony R; Nacheva, Elisabeth P
2003-05-01
The BCR-ABL fusion in chronic myeloid leukaemia (CML) is generated by the Philadelphia (Ph) translocation t(9;22) or, in 10% of patients, variants thereof (vPh). Deletion encompassing the reciprocal product (ABL-BCR) from the derivative chromosome 9 [der(9)] occurs in 15% of all patients, but with greater frequency in vPh patients. Reports of physical separation of ABL-BCR in non-deleted patients, as well as evolution from classical to variant Ph, introduce further heterogeneity to the vPh subgroup and raise the possibility that such translocations may herald disease progression. Survival analyses, however, have thus far yielded contradictory results. We assessed the frequency of der(9) deletions, ABL-BCR abrogation, cytogenetic evolution and cryptic rearrangement in a large cohort of 54 patients with vPh CML. Deletions encompassing ABL-BCR were detected in 37% of patients, consistent with a model in which a greater number of chromosome breaks increases the risk of genomic loss. The components of ABL-BCR were physically separated in a further 52% of patients while fused in the remaining 11%. Evolution from classical to vPh was demonstrated in three patients. The difference in survival, as indicated by Kaplan-Meier analysis, was marked between classical and vPh patients (105 vs 60 months respectively; P = 0.0002). Importantly, this difference disappeared when patients with deletions were removed from the analysis. Our study showed that, despite the existence of several levels of genomic heterogeneity in variant Ph-positive CML, der(9) deletion status is the key prognostic factor.
Sato, Keisaku; Pollock, Neil; Stowell, Kathryn M
2010-06-01
Malignant hyperthermia is associated with mutations within the gene encoding the skeletal muscle ryanodine receptor, the calcium channel that releases Ca from sarcoplasmic reticulum stores triggering muscle contraction, and other metabolic activities. More than 200 variants have been identified in the ryanodine receptor, but only some of these have been shown to functionally affect the calcium channel. To implement genetic testing for malignant hyperthermia, variants must be shown to alter the function of the channel. A number of different ex vivo methods can be used to demonstrate functionality, as long as cells from human patients can be obtained and cultured from at least two unrelated families. Because malignant hyperthermia is an uncommon disorder and many variants seem to be private, including the newly identified H4833Y mutation, these approaches are limited. The authors cloned the human skeletal muscle ryanodine receptor complementary DNA and expressed both normal and mutated forms in HEK-293 cells and carried out functional analysis using ryanodine binding assays in the presence of a specific agonist, 4-chloro-m-cresol, and the antagonist Mg. Transiently expressed human ryanodine receptor proteins colocalized with an endoplasmic reticulum marker in HEK-293 cells. Ryanodine binding assays confirmed that mutations causing malignant hyperthermia resulted in a hypersensitive channel, while those causing central core disease resulted in a hyposensitive channel. The functional assays validate recombinant human skeletal muscle ryanodine receptor for analysis of variants and add an additional mutation (H4833Y) to the repertoire of mutations that can be used for the genetic diagnosis of malignant hyperthermia.
Evidence for two transferrin loci in the Salmo trutta genome.
Rozman, T; Dovc, P; Marić, S; Kokalj-Vokac, N; Erjavec-Skerget, A; Rab, P; Snoj, A
2008-12-01
To determine the organization of transferrin (TF) locus in the Salmo trutta genome, partial DNA and cDNA sequencing, fluorescent in situ hybridization (FISH) and Salmo salar BAC analysis were performed. TF expression levels and copy number prediction were assessed using real-time PCR. In addition to two previously reported DNA TF variant sequences of S. trutta and Salmo marmoratus (TF1), two novel variant sequences (TF2) were revealed in both species. Variant-specific sequence tags, characterizing two variants for each TF type (TF1 and TF2), were identified in genomic clones from each of the F1 hybrids between S. trutta and S. marmoratus. These clearly documented double heterozygote status at the TF loci. The real-time PCR data showed that each of the two TF types (TF1 and TF2) existed in one copy only and that the transcription of TF2 was considerably lower compared with TF1. Using FISH, hybridization signals were observed on two medium-sized acrocentric chromosomes of S. trutta karyotype. A TF type-specific PCR followed by a restriction analysis revealed the presence of two TF loci in the majority of analysed BAC clones. It was concluded that the TF gene is duplicated in the genome of S. trutta, and that the two TF loci are located adjacent to one another on the same chromosome. The differing transcription levels of TF1 and TF2 appear to depend on the corresponding promoter activity, which at least for TF2 seems to vary between different Salmo congeners.
Whole-genome sequence-based analysis of thyroid function.
Taylor, Peter N; Porcu, Eleonora; Chew, Shelby; Campbell, Purdey J; Traglia, Michela; Brown, Suzanne J; Mullin, Benjamin H; Shihab, Hashem A; Min, Josine; Walter, Klaudia; Memari, Yasin; Huang, Jie; Barnes, Michael R; Beilby, John P; Charoen, Pimphen; Danecek, Petr; Dudbridge, Frank; Forgetta, Vincenzo; Greenwood, Celia; Grundberg, Elin; Johnson, Andrew D; Hui, Jennie; Lim, Ee M; McCarthy, Shane; Muddyman, Dawn; Panicker, Vijay; Perry, John R B; Bell, Jordana T; Yuan, Wei; Relton, Caroline; Gaunt, Tom; Schlessinger, David; Abecasis, Goncalo; Cucca, Francesco; Surdulescu, Gabriela L; Woltersdorf, Wolfram; Zeggini, Eleftheria; Zheng, Hou-Feng; Toniolo, Daniela; Dayan, Colin M; Naitza, Silvia; Walsh, John P; Spector, Tim; Davey Smith, George; Durbin, Richard; Richards, J Brent; Sanna, Serena; Soranzo, Nicole; Timpson, Nicholas J; Wilson, Scott G
2015-03-06
Normal thyroid function is essential for health, but its genetic architecture remains poorly understood. Here, for the heritable thyroid traits thyrotropin (TSH) and free thyroxine (FT4), we analyse whole-genome sequence data from the UK10K project (N=2,287). Using additional whole-genome sequence and deeply imputed data sets, we report meta-analysis results for common variants (MAF≥1%) associated with TSH and FT4 (N=16,335). For TSH, we identify a novel variant in SYN2 (MAF=23.5%, P=6.15 × 10(-9)) and a new independent variant in PDE8B (MAF=10.4%, P=5.94 × 10(-14)). For FT4, we report a low-frequency variant near B4GALT6/SLC25A52 (MAF=3.2%, P=1.27 × 10(-9)) tagging a rare TTR variant (MAF=0.4%, P=2.14 × 10(-11)). All common variants explain ≥20% of the variance in TSH and FT4. Analysis of rare variants (MAF<1%) using sequence kernel association testing reveals a novel association with FT4 in NRG1. Our results demonstrate that increased coverage in whole-genome sequence association studies identifies novel variants associated with thyroid function.
Pranavchand, Rayabarapu; Reddy, Battini Mohan
2017-06-13
Given the characteristic atherogenic dyslipidemia of south Indian population and crucial role of APOA1, APOC3, APOA4 and APOA5 genes clustered in 11q23.3 chromosomal region in regulating lipoprotein metabolism and cholesterol homeostasis, a large number of recently identified variants are to be explored for their role in regulating the serum lipid parameters among south Indians. Using fluidigm SNP genotyping platform, a prioritized set of 96 SNPs of the 11q23.3 chromosomal region were genotyped on 516 individuals from Hyderabad, India, and its vicinity and aged >45 years. The linear regression analysis of the individual lipid traits viz., TC, LDLC, HDLC, VLDL and TG with each of the 78 SNPs that confirm to HWE and with minor allele frequency > 1%, suggests 23 of those to be significantly associated (p ≤ 0.05) with at least one of these quantitative traits. Most importantly, the variant rs632153 is involved in elevating TC, LDLC, TG and VLDLs and probably playing a crucial role in the manifestation of dyslipidemia. Additionally, another three SNPs rs633389, rs2187126 and rs1263163 are found risk conferring to dyslipidemia by elevating LDLC and TC levels in the present population. Further, the ROC (receiver operating curve) analysis for the risk scores and dyslipidemia status yielded a significant area under curve (AUC) = 0.675, suggesting high discriminative power of the risk variants towards the condition. The interaction analysis suggests rs10488699-rs2187126 pair of the BUD13 gene to confer significant risk (Interaction odds ratio = 14.38, P = 7.17 × 10 5 ) towards dyslipidemia by elevating the TC levels (β = 37.13, p = 6.614 × 10 5 ). On the other hand, the interaction between variants of APOA1 gene and BUD13 and/or ZPR1 regulatory genes at this region are associated with elevated TG and VLDL. The variants at 11q23.3 chromosomal region seem to determine the quantitative lipid traits and in turn dyslipidemia in the population of Hyderabad. Particularly, the variants rs632153, rs633389, rs2187126 and rs1263163 might be risk conferring to dyslipidemia by elevating LDLC and TC levels, while the variants of APOC3 and APOA1 genes might be the genetic determinants of elevated triglycerides in the present population.
Whole-Genome Sequencing and Variant Analysis of Human Papillomavirus 16 Infections.
van der Weele, Pascal; Meijer, Chris J L M; King, Audrey J
2017-10-01
Human papillomavirus (HPV) is a strongly conserved DNA virus, high-risk types of which can cause cervical cancer in persistent infections. The most common type found in HPV-attributable cancer is HPV16, which can be subdivided into four lineages (A to D) with different carcinogenic properties. Studies have shown HPV16 sequence diversity in different geographical areas, but only limited information is available regarding HPV16 diversity within a population, especially at the whole-genome level. We analyzed HPV16 major variant diversity and conservation in persistent infections and performed a single nucleotide polymorphism (SNP) comparison between persistent and clearing infections. Materials were obtained in the Netherlands from a cohort study with longitudinal follow-up for up to 3 years. Our analysis shows a remarkably large variant diversity in the population. Whole-genome sequences were obtained for 57 persistent and 59 clearing HPV16 infections, resulting in 109 unique variants. Interestingly, persistent infections were completely conserved through time. One reinfection event was identified where the initial and follow-up samples clustered differently. Non-A1/A2 variants seemed to clear preferentially ( P = 0.02). Our analysis shows that population-wide HPV16 sequence diversity is very large. In persistent infections, the HPV16 sequence was fully conserved. Sequencing can identify HPV16 reinfections, although occurrence is rare. SNP comparison identified no strongly acting effect of the viral genome affecting HPV16 infection clearance or persistence in up to 3 years of follow-up. These findings suggest the progression of an early HPV16 infection could be host related. IMPORTANCE Human papillomavirus 16 (HPV16) is the predominant type found in cervical cancer. Progression of initial infection to cervical cancer has been linked to sequence properties; however, knowledge of variants circulating in European populations, especially with longitudinal follow-up, is limited. By sequencing a number of infections with known follow-up for up to 3 years, we gained initial insights into the genetic diversity of HPV16 and the effects of the viral genome on the persistence of infections. A SNP comparison between sequences obtained from clearing and persistent infections did not identify strongly acting DNA variations responsible for these infection outcomes. In addition, we identified an HPV16 reinfection event where sequencing of initial and follow-up samples showed different HPV16 variants. Based on conventional genotyping, this infection would incorrectly be considered a persistent HPV16 infection. In the context of vaccine efficacy and monitoring studies, such infections could potentially cause reduced reported efficacy or efficiency. Copyright © 2017 van der Weele et al.
Fujinami, Kaoru; Strauss, Rupert W; Chiang, John Pei-Wen; Audo, Isabelle S; Bernstein, Paul S; Birch, David G; Bomotti, Samantha M; Cideciyan, Artur V; Ervin, Ann-Margret; Marino, Meghan J; Sahel, José-Alain; Mohand-Said, Saddek; Sunness, Janet S; Traboulsi, Elias I; West, Sheila; Wojciechowski, Robert; Zrenner, Eberhart; Michaelides, Michel; Scholl, Hendrik P N
2018-06-20
To describe the genetic characteristics of the cohort enrolled in the international multicentre progression of Stargardt disease 1 (STGD1) studies (ProgStar) and to determine geographic differences based on the allele frequency. 345 participants with a clinical diagnosis of STGD1 and harbouring at least one disease-causing ABCA4 variant were enrolled from 9 centres in the USA and Europe. All variants were reviewed and in silico analysis was performed including allele frequency in public databases and pathogenicity predictions. Participants with multiple likely pathogenic variants were classified into four national subgroups (USA, UK, France, Germany), with subsequent comparison analysis of the allele frequency for each prevalent allele. 211 likely pathogenic variants were identified in the total cohort, including missense (63%), splice site alteration (18%), stop (9%) and others. 50 variants were novel. Exclusively missense variants were detected in 139 (50%) of 279 patients with multiple pathogenic variants. The three most prevalent variants of these patients with multiple pathogenic variants were p.G1961E (15%), p.G863A (7%) and c.5461-10 T>C (5%). Subgroup analysis revealed a statistically significant difference between the four recruiting nations in the allele frequency of nine variants. There is a large spectrum of ABCA4 sequence variants, including 50 novel variants, in a well-characterised cohort thereby further adding to the unique allelic heterogeneity in STGD1. Approximately half of the cohort harbours missense variants only, indicating a relatively mild phenotype of the ProgStar cohort. There are significant differences in allele frequencies between nations, although the three most prevalent variants are shared as frequent variants. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2018. All rights reserved. No commercial use is permitted unless otherwise expressly granted.
Stokowy, Tomasz; Garbulowski, Mateusz; Fiskerstrand, Torunn; Holdhus, Rita; Labun, Kornel; Sztromwasser, Pawel; Gilissen, Christian; Hoischen, Alexander; Houge, Gunnar; Petersen, Kjell; Jonassen, Inge; Steen, Vidar M
2016-10-01
The search for causative genetic variants in rare diseases of presumed monogenic inheritance has been boosted by the implementation of whole exome (WES) and whole genome (WGS) sequencing. In many cases, WGS seems to be superior to WES, but the analysis and visualization of the vast amounts of data is demanding. To aid this challenge, we have developed a new tool-RareVariantVis-for analysis of genome sequence data (including non-coding regions) for both germ line and somatic variants. It visualizes variants along their respective chromosomes, providing information about exact chromosomal position, zygosity and frequency, with point-and-click information regarding dbSNP IDs, gene association and variant inheritance. Rare variants as well as de novo variants can be flagged in different colors. We show the performance of the RareVariantVis tool in the Genome in a Bottle WGS data set. https://www.bioconductor.org/packages/3.3/bioc/html/RareVariantVis.html tomasz.stokowy@k2.uib.no Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Kim, Do Gyun; Kim, Hyoung Jin; Kim, Hong-Jin
2016-10-01
Charge variants (acidic and basic) of recombinant monoclonal antibodies (Mabs) have received much attention due to their potential biological effects. C-terminal lysine variants are common in Mabs and their proportion is affected by the manufacturing process. In the present study, changes of trastuzumab charge variants brought about by carboxypeptidase B treatment and subsequent storage at 8 or 37 °C for up to 24 h were monitored by cation-exchange chromatography analysis to investigate the effects of C-terminal lysine cleavage and its subsequent reaction at 8 or 37 °C. C-terminal lysine cleavage at 8 °C reduced the fraction of basic species and had little effect on the fraction of acidic species. Analysis of individual peaks demonstrated that C-terminal lysine cleavage induced both increases and decreases in individual acidic variants, with the result that there was little overall change in the overall proportion of acidic species. It appeared that most of the basic variant Mab molecules but only a fraction of the acidic variant molecules had C-terminal lysines. Increasing the temperature to 37 °C appeared to increase the fraction of acidic species and decrease main species significantly, without a similar change in basic species. These results indicate that length of exposure to elevated temperature is a critical consideration in charge variant analysis.
Hundreds of variants clustered in genomic loci and biological pathways affect human height
Lango Allen, Hana; Estrada, Karol; Lettre, Guillaume; Berndt, Sonja I.; Weedon, Michael N.; Rivadeneira, Fernando; Willer, Cristen J.; Jackson, Anne U.; Vedantam, Sailaja; Raychaudhuri, Soumya; Ferreira, Teresa; Wood, Andrew R.; Weyant, Robert J.; Segrè, Ayellet V.; Speliotes, Elizabeth K.; Wheeler, Eleanor; Soranzo, Nicole; Park, Ju-Hyun; Yang, Jian; Gudbjartsson, Daniel; Heard-Costa, Nancy L.; Randall, Joshua C.; Qi, Lu; Smith, Albert Vernon; Mägi, Reedik; Pastinen, Tomi; Liang, Liming; Heid, Iris M.; Luan, Jian'an; Thorleifsson, Gudmar; Winkler, Thomas W.; Goddard, Michael E.; Lo, Ken Sin; Palmer, Cameron; Workalemahu, Tsegaselassie; Aulchenko, Yurii S.; Johansson, Åsa; Zillikens, M.Carola; Feitosa, Mary F.; Esko, Tõnu; Johnson, Toby; Ketkar, Shamika; Kraft, Peter; Mangino, Massimo; Prokopenko, Inga; Absher, Devin; Albrecht, Eva; Ernst, Florian; Glazer, Nicole L.; Hayward, Caroline; Hottenga, Jouke-Jan; Jacobs, Kevin B.; Knowles, Joshua W.; Kutalik, Zoltán; Monda, Keri L.; Polasek, Ozren; Preuss, Michael; Rayner, Nigel W.; Robertson, Neil R.; Steinthorsdottir, Valgerdur; Tyrer, Jonathan P.; Voight, Benjamin F.; Wiklund, Fredrik; Xu, Jianfeng; Zhao, Jing Hua; Nyholt, Dale R.; Pellikka, Niina; Perola, Markus; Perry, John R.B.; Surakka, Ida; Tammesoo, Mari-Liis; Altmaier, Elizabeth L.; Amin, Najaf; Aspelund, Thor; Bhangale, Tushar; Boucher, Gabrielle; Chasman, Daniel I.; Chen, Constance; Coin, Lachlan; Cooper, Matthew N.; Dixon, Anna L.; Gibson, Quince; Grundberg, Elin; Hao, Ke; Junttila, M. Juhani; Kaplan, Lee M.; Kettunen, Johannes; König, Inke R.; Kwan, Tony; Lawrence, Robert W.; Levinson, Douglas F.; Lorentzon, Mattias; McKnight, Barbara; Morris, Andrew P.; Müller, Martina; Ngwa, Julius Suh; Purcell, Shaun; Rafelt, Suzanne; Salem, Rany M.; Salvi, Erika; Sanna, Serena; Shi, Jianxin; Sovio, Ulla; Thompson, John R.; Turchin, Michael C.; Vandenput, Liesbeth; Verlaan, Dominique J.; Vitart, Veronique; White, Charles C.; Ziegler, Andreas; Almgren, Peter; Balmforth, Anthony J.; Campbell, Harry; Citterio, Lorena; De Grandi, Alessandro; Dominiczak, Anna; Duan, Jubao; Elliott, Paul; Elosua, Roberto; Eriksson, Johan G.; Freimer, Nelson B.; Geus, Eco J.C.; Glorioso, Nicola; Haiqing, Shen; Hartikainen, Anna-Liisa; Havulinna, Aki S.; Hicks, Andrew A.; Hui, Jennie; Igl, Wilmar; Illig, Thomas; Jula, Antti; Kajantie, Eero; Kilpeläinen, Tuomas O.; Koiranen, Markku; Kolcic, Ivana; Koskinen, Seppo; Kovacs, Peter; Laitinen, Jaana; Liu, Jianjun; Lokki, Marja-Liisa; Marusic, Ana; Maschio, Andrea; Meitinger, Thomas; Mulas, Antonella; Paré, Guillaume; Parker, Alex N.; Peden, John F.; Petersmann, Astrid; Pichler, Irene; Pietiläinen, Kirsi H.; Pouta, Anneli; Ridderstråle, Martin; Rotter, Jerome I.; Sambrook, Jennifer G.; Sanders, Alan R.; Schmidt, Carsten Oliver; Sinisalo, Juha; Smit, Jan H.; Stringham, Heather M.; Walters, G.Bragi; Widen, Elisabeth; Wild, Sarah H.; Willemsen, Gonneke; Zagato, Laura; Zgaga, Lina; Zitting, Paavo; Alavere, Helene; Farrall, Martin; McArdle, Wendy L.; Nelis, Mari; Peters, Marjolein J.; Ripatti, Samuli; van Meurs, Joyce B.J.; Aben, Katja K.; Ardlie, Kristin G; Beckmann, Jacques S.; Beilby, John P.; Bergman, Richard N.; Bergmann, Sven; Collins, Francis S.; Cusi, Daniele; den Heijer, Martin; Eiriksdottir, Gudny; Gejman, Pablo V.; Hall, Alistair S.; Hamsten, Anders; Huikuri, Heikki V.; Iribarren, Carlos; Kähönen, Mika; Kaprio, Jaakko; Kathiresan, Sekar; Kiemeney, Lambertus; Kocher, Thomas; Launer, Lenore J.; Lehtimäki, Terho; Melander, Olle; Mosley, Tom H.; Musk, Arthur W.; Nieminen, Markku S.; O'Donnell, Christopher J.; Ohlsson, Claes; Oostra, Ben; Palmer, Lyle J.; Raitakari, Olli; Ridker, Paul M.; Rioux, John D.; Rissanen, Aila; Rivolta, Carlo; Schunkert, Heribert; Shuldiner, Alan R.; Siscovick, David S.; Stumvoll, Michael; Tönjes, Anke; Tuomilehto, Jaakko; van Ommen, Gert-Jan; Viikari, Jorma; Heath, Andrew C.; Martin, Nicholas G.; Montgomery, Grant W.; Province, Michael A.; Kayser, Manfred; Arnold, Alice M.; Atwood, Larry D.; Boerwinkle, Eric; Chanock, Stephen J.; Deloukas, Panos; Gieger, Christian; Grönberg, Henrik; Hall, Per; Hattersley, Andrew T.; Hengstenberg, Christian; Hoffman, Wolfgang; Lathrop, G.Mark; Salomaa, Veikko; Schreiber, Stefan; Uda, Manuela; Waterworth, Dawn; Wright, Alan F.; Assimes, Themistocles L.; Barroso, Inês; Hofman, Albert; Mohlke, Karen L.; Boomsma, Dorret I.; Caulfield, Mark J.; Cupples, L.Adrienne; Erdmann, Jeanette; Fox, Caroline S.; Gudnason, Vilmundur; Gyllensten, Ulf; Harris, Tamara B.; Hayes, Richard B.; Jarvelin, Marjo-Riitta; Mooser, Vincent; Munroe, Patricia B.; Ouwehand, Willem H.; Penninx, Brenda W.; Pramstaller, Peter P.; Quertermous, Thomas; Rudan, Igor; Samani, Nilesh J.; Spector, Timothy D.; Völzke, Henry; Watkins, Hugh; Wilson, James F.; Groop, Leif C.; Haritunians, Talin; Hu, Frank B.; Kaplan, Robert C.; Metspalu, Andres; North, Kari E.; Schlessinger, David; Wareham, Nicholas J.; Hunter, David J.; O'Connell, Jeffrey R.; Strachan, David P.; Wichmann, H.-Erich; Borecki, Ingrid B.; van Duijn, Cornelia M.; Schadt, Eric E.; Thorsteinsdottir, Unnur; Peltonen, Leena; Uitterlinden, André; Visscher, Peter M.; Chatterjee, Nilanjan; Loos, Ruth J.F.; Boehnke, Michael; McCarthy, Mark I.; Ingelsson, Erik; Lindgren, Cecilia M.; Abecasis, Gonçalo R.; Stefansson, Kari; Frayling, Timothy M.; Hirschhorn, Joel N
2010-01-01
Most common human traits and diseases have a polygenic pattern of inheritance: DNA sequence variants at many genetic loci influence phenotype. Genome-wide association (GWA) studies have identified >600 variants associated with human traits1, but these typically explain small fractions of phenotypic variation, raising questions about the utility of further studies. Here, using 183,727 individuals, we show that hundreds of genetic variants, in at least 180 loci, influence adult height, a highly heritable and classic polygenic trait2,3. The large number of loci reveals patterns with important implications for genetic studies of common human diseases and traits. First, the 180 loci are not random, but instead are enriched for genes that are connected in biological pathways (P=0.016), and that underlie skeletal growth defects (P<0.001). Second, the likely causal gene is often located near the most strongly associated variant: in 13 of 21 loci containing a known skeletal growth gene, that gene was closest to the associated variant. Third, at least 19 loci have multiple independently associated variants, suggesting that allelic heterogeneity is a frequent feature of polygenic traits, that comprehensive explorations of already-discovered loci should discover additional variants, and that an appreciable fraction of associated loci may have been identified. Fourth, associated variants are enriched for likely functional effects on genes, being over-represented amongst variants that alter amino acid structure of proteins and expression levels of nearby genes. Our data explain ∼10% of the phenotypic variation in height, and we estimate that unidentified common variants of similar effect sizes would increase this figure to ∼16% of phenotypic variation (∼20% of heritable variation). Although additional approaches are needed to fully dissect the genetic architecture of polygenic human traits, our findings indicate that GWA studies can identify large numbers of loci that implicate biologically relevant genes and pathways. PMID:20881960
A patient with PMP22-related hereditary neuropathy and DBH-gene-related dysautonomia.
Bartoletti-Stella, Anna; Chiaro, Giacomo; Calandra-Buonaura, Giovanna; Contin, Manuela; Scaglione, Cesa; Barletta, Giorgio; Cecere, Annagrazia; Garagnani, Paolo; Tieri, Paolo; Ferrarini, Alberto; Piras, Silvia; Franceschi, Claudio; Delledonne, Massimo; Cortelli, Pietro; Capellari, Sabina
2015-10-01
Recurrent focal neuropathy with liability to pressure palsies is a relatively frequent autosomal-dominant demyelinating neuropathy linked to peripheral myelin protein 22 (PMP22) gene deletions. The combination of PMP22 gene mutations with other genetic variants is known to cause a more severe phenotype than expected. We present the case of a patient with severe orthostatic hypotension since 12 years of age, who inherited a PMP22 gene deletion from his father. Genetic double trouble was suspected because of selective sympathetic autonomic disturbances. Through exome-sequencing analysis, we identified two novel mutations in the dopamine beta hydroxylase gene. Moreover, with interactome analysis, we excluded a further influence on the origin of the disease by variants in other genes. This case increases the number of unique patients presenting with dopamine-β-hydroxylase deficiency and of cases with genetically proven double trouble. Finding the right, complete diagnosis is crucial to obtain adequate medical care and appropriate genetic counseling.
Atanur, Santosh S; Diaz, Ana Garcia; Maratou, Klio; Sarkis, Allison; Rotival, Maxime; Game, Laurence; Tschannen, Michael R; Kaisaki, Pamela J; Otto, Georg W; Ma, Man Chun John; Keane, Thomas M; Hummel, Oliver; Saar, Kathrin; Chen, Wei; Guryev, Victor; Gopalakrishnan, Kathirvel; Garrett, Michael R; Joe, Bina; Citterio, Lorena; Bianchi, Giuseppe; McBride, Martin; Dominiczak, Anna; Adams, David J; Serikawa, Tadao; Flicek, Paul; Cuppen, Edwin; Hubner, Norbert; Petretto, Enrico; Gauguier, Dominique; Kwitek, Anne; Jacob, Howard; Aitman, Timothy J
2013-08-01
Large numbers of inbred laboratory rat strains have been developed for a range of complex disease phenotypes. To gain insights into the evolutionary pressures underlying selection for these phenotypes, we sequenced the genomes of 27 rat strains, including 11 models of hypertension, diabetes, and insulin resistance, along with their respective control strains. Altogether, we identified more than 13 million single-nucleotide variants, indels, and structural variants across these rat strains. Analysis of strain-specific selective sweeps and gene clusters implicated genes and pathways involved in cation transport, angiotensin production, and regulators of oxidative stress in the development of cardiovascular disease phenotypes in rats. Many of the rat loci that we identified overlap with previously mapped loci for related traits in humans, indicating the presence of shared pathways underlying these phenotypes in rats and humans. These data represent a step change in resources available for evolutionary analysis of complex traits in disease models. Copyright © 2013 The Authors. Published by Elsevier Inc. All rights reserved.
Atanur, Santosh S.; Diaz, Ana Garcia; Maratou, Klio; Sarkis, Allison; Rotival, Maxime; Game, Laurence; Tschannen, Michael R.; Kaisaki, Pamela J.; Otto, Georg W.; Ma, Man Chun John; Keane, Thomas M.; Hummel, Oliver; Saar, Kathrin; Chen, Wei; Guryev, Victor; Gopalakrishnan, Kathirvel; Garrett, Michael R.; Joe, Bina; Citterio, Lorena; Bianchi, Giuseppe; McBride, Martin; Dominiczak, Anna; Adams, David J.; Serikawa, Tadao; Flicek, Paul; Cuppen, Edwin; Hubner, Norbert; Petretto, Enrico; Gauguier, Dominique; Kwitek, Anne; Jacob, Howard; Aitman, Timothy J.
2013-01-01
Summary Large numbers of inbred laboratory rat strains have been developed for a range of complex disease phenotypes. To gain insights into the evolutionary pressures underlying selection for these phenotypes, we sequenced the genomes of 27 rat strains, including 11 models of hypertension, diabetes, and insulin resistance, along with their respective control strains. Altogether, we identified more than 13 million single-nucleotide variants, indels, and structural variants across these rat strains. Analysis of strain-specific selective sweeps and gene clusters implicated genes and pathways involved in cation transport, angiotensin production, and regulators of oxidative stress in the development of cardiovascular disease phenotypes in rats. Many of the rat loci that we identified overlap with previously mapped loci for related traits in humans, indicating the presence of shared pathways underlying these phenotypes in rats and humans. These data represent a step change in resources available for evolutionary analysis of complex traits in disease models. PaperClip PMID:23890820
Kappagantu, Madhu; Villamor, Dan Edward V; Bullock, Jeff M; Eastwell, Kenneth C
2017-07-01
Hop stunt disease caused by Hop stunt viroid (HSVd) is a growing threat to hop cultivation globally. HSVd spreads mainly by use of contaminated planting material and by mechanical means. Thorough testing of hop yards and removal of infected bines are critical components of efforts to control the spread of the disease. Reverse transcription-polymerase chain reaction (RT-PCR) has become the primary technique used for HSVd detection; however, sample handling and analysis are technically challenging. In this study, a robust reverse transcription-recombinase polymerase amplification (RT-RPA) assay was developed to facilitate analysis of multiple samples. The assay was optimized with all major variants of HSVd from other host species in addition to hop variants. Used in conjunction with sample collection cards, RT-RPA accommodates large sample numbers. Greenhouse and farm samples tested with RT-RPA were also tested with RT-PCR and a 100% correlation between the two techniques was found. Copyright © 2017. Published by Elsevier B.V.
The Role of Constitutional Copy Number Variants in Breast Cancer
Walker, Logan C.; Wiggins, George A.R.; Pearson, John F.
2015-01-01
Constitutional copy number variants (CNVs) include inherited and de novo deviations from a diploid state at a defined genomic region. These variants contribute significantly to genetic variation and disease in humans, including breast cancer susceptibility. Identification of genetic risk factors for breast cancer in recent years has been dominated by the use of genome-wide technologies, such as single nucleotide polymorphism (SNP)-arrays, with a significant focus on single nucleotide variants. To date, these large datasets have been underutilised for generating genome-wide CNV profiles despite offering a massive resource for assessing the contribution of these structural variants to breast cancer risk. Technical challenges remain in determining the location and distribution of CNVs across the human genome due to the accuracy of computational prediction algorithms and resolution of the array data. Moreover, better methods are required for interpreting the functional effect of newly discovered CNVs. In this review, we explore current and future application of SNP array technology to assess rare and common CNVs in association with breast cancer risk in humans. PMID:27600231
Antigenic variation of Anaplasma marginale msp2 occurs by combinatorial gene conversion.
Brayton, Kelly A; Palmer, Guy H; Lundgren, Anna; Yi, Jooyoung; Barbet, Anthony F
2002-03-01
The rickettsial pathogen Anaplasma marginale establishes lifelong persistent infection in the mammalian reservoir host, during which time immune escape variants continually arise in part because of variation in the expressed copy of the immunodominant outer membrane protein MSP2. A key question is how the small 1.2 Mb A. marginale genome generates sufficient variants to allow long-term persistence in an immunocompetent reservoir host. The recombination of whole pseudogenes into the single msp2 expression site has been previously identified as one method of generating variants, but is inadequate to generate the number of variants required for persistent infection. In the present study, we demonstrate that recombination of a whole pseudogene is followed by a second level of variation in which small segments of pseudogenes recombine into the expression site by gene conversion. Evidence for four short sequential changes in the hypervariable region of msp2 coupled with the identification of nine pseudogenes from a single strain of A. marginale provides for a combinatorial number of possible expressed MSP2 variants sufficient for lifelong persistence.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ho, Hoan, E-mail: hoan.ho@wdc.com; Department of Materials Science and Engineering, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213; Zhu, Jingxi, E-mail: jingxiz@andrew.cmu.edu
2014-11-21
We present a study on atomic ordering within individual grains in granular L1{sub 0}-FePt thin films using transmission electron microscopy techniques. The film, used as a medium for heat assisted magnetic recording, consists of a single layer of FePt grains separated by non-magnetic grain boundaries and is grown on an MgO underlayer. Using convergent-beam techniques, diffraction patterns of individual grains are obtained for a large number of crystallites. The study found that although the majority of grains are ordered in the perpendicular direction, more than 15% of them are multi-variant, or of in-plane c-axis orientation, or disordered fcc. It wasmore » also found that these multi-variant and in-plane grains have always grown across MgO grain boundaries separating two or more MgO grains of the underlayer. The in-plane ordered portion within a multi-variant L1{sub 0}-FePt grain always lacks atomic coherence with the MgO directly underneath it, whereas, the perpendicularly ordered portion is always coherent with the underlying MgO grain. Since the existence of multi-variant and in-plane ordered grains are severely detrimental to high density data storage capability, the understanding of their formation mechanism obtained here should make a significant impact on the future development of hard disk drive technology.« less
2013-10-01
role of copy number variants in prostate cancer risk and progression using a novel genome-wide screening method. 5a. CONTRACT NUMBER 5b. GRANT ...Prostate; Cancer; Risk; Deletion; Prognosismatter Published by Elsevier Inc. .urolonc.2013.06.004 d in part by DOD grant PC081025, by grant arly...Detection Research Network of the National CTRC at UTHSCSA grant P30CA054174. Data omics Core Shared Resource, which is supported CI P30CA054174 (CTRC of
NASA Astrophysics Data System (ADS)
Martyushev, S. G.; Miroshnichenko, I. V.; Sheremet, M. A.
2015-11-01
We have performed a numerical analysis of the stationary regimes of thermogravitational convection and thermal surface radiation in a closed differentially heated parallelepiped. The mathematical model formulated in dimensionless natural velocity-pressure-temperature variables was realized numerically in the control volume approach. Analysis of the radiative heat exchange was carried out on the basis of the surface radiation approach with the use of the balance method in the Polyak variant. We have obtained three-dimensional temperature and velocity fields, as well as dependences for the mean Nusselt number reflecting the influence of the geometric parameter, the Rayleigh number, and the reduced emissive factor of the walls on the flow structure and the heat transfer.
A generalized least-squares framework for rare-variant analysis in family data.
Li, Dalin; Rotter, Jerome I; Guo, Xiuqing
2014-01-01
Rare variants may, in part, explain some of the hereditability missing in current genome-wide association studies. Many gene-based rare-variant analysis approaches proposed in recent years are aimed at population-based samples, although analysis strategies for family-based samples are clearly warranted since the family-based design has the potential to enhance our ability to enrich for rare causal variants. We have recently developed the generalized least squares, sequence kernel association test, or GLS-SKAT, approach for the rare-variant analyses in family samples, in which the kinship matrix that was computed from the high dimension genetic data was used to decorrelate the family structure. We then applied the SKAT-O approach for gene-/region-based inference in the decorrelated data. In this study, we applied this GLS-SKAT method to the systolic blood pressure data in the simulated family sample distributed by the Genetic Analysis Workshop 18. We compared the GLS-SKAT approach to the rare-variant analysis approach implemented in family-based association test-v1 and demonstrated that the GLS-SKAT approach provides superior power and good control of type I error rate.
Murray, Anna; Bennett, Claire E; Perry, John R B; Weedon, Michael N; Jacobs, Patricia A; Morris, Danielle H; Orr, Nicholas; Schoemaker, Minouk J; Jones, Michael; Ashworth, Alan; Swerdlow, Anthony J
2011-01-01
Women become infertile approximately 10 years before menopause, and as more women delay childbirth into their 30s, the number of women who experience infertility is likely to increase. Tests that predict the timing of menopause would allow women to make informed reproductive decisions. Current predictors are only effective just prior to menopause, and there are no long-range indicators. Age at menopause and early menopause (EM) are highly heritable, suggesting a genetic aetiology. Recent genome-wide scans have identified four loci associated with variation in the age of normal menopause (40-60 years). We aimed to determine whether theses loci are also risk factors for EM. We tested the four menopause-associated genetic variants in a cohort of approximately 2000 women with menopause≤45 years from the Breakthrough Generations Study (BGS). All four variants significantly increased the odds of having EM. Comparing the 4.5% of individuals with the lowest number of risk alleles (two or three) with the 3.0% with the highest number (eight risk alleles), the odds ratio was 4.1 (95% CI 2.4-7.1, P=4.0×10(-7)). In combination, the four variants discriminated EM cases with a receiver operator characteristic area under the curve of 0.6. Four common genetic variants identified by genome-wide association studies, had a significant impact on the odds of having EM in an independent cohort from the BGS. The discriminative power is still limited, but as more variants are discovered they may be useful for predicting reproductive lifespan.
Protein Interaction Networks Reveal Novel Autism Risk Genes within GWAS Statistical Noise
Correia, Catarina; Oliveira, Guiomar; Vicente, Astrid M.
2014-01-01
Genome-wide association studies (GWAS) for Autism Spectrum Disorder (ASD) thus far met limited success in the identification of common risk variants, consistent with the notion that variants with small individual effects cannot be detected individually in single SNP analysis. To further capture disease risk gene information from ASD association studies, we applied a network-based strategy to the Autism Genome Project (AGP) and the Autism Genetics Resource Exchange GWAS datasets, combining family-based association data with Human Protein-Protein interaction (PPI) data. Our analysis showed that autism-associated proteins at higher than conventional levels of significance (P<0.1) directly interact more than random expectation and are involved in a limited number of interconnected biological processes, indicating that they are functionally related. The functionally coherent networks generated by this approach contain ASD-relevant disease biology, as demonstrated by an improved positive predictive value and sensitivity in retrieving known ASD candidate genes relative to the top associated genes from either GWAS, as well as a higher gene overlap between the two ASD datasets. Analysis of the intersection between the networks obtained from the two ASD GWAS and six unrelated disease datasets identified fourteen genes exclusively present in the ASD networks. These are mostly novel genes involved in abnormal nervous system phenotypes in animal models, and in fundamental biological processes previously implicated in ASD, such as axon guidance, cell adhesion or cytoskeleton organization. Overall, our results highlighted novel susceptibility genes previously hidden within GWAS statistical “noise” that warrant further analysis for causal variants. PMID:25409314
Protein interaction networks reveal novel autism risk genes within GWAS statistical noise.
Correia, Catarina; Oliveira, Guiomar; Vicente, Astrid M
2014-01-01
Genome-wide association studies (GWAS) for Autism Spectrum Disorder (ASD) thus far met limited success in the identification of common risk variants, consistent with the notion that variants with small individual effects cannot be detected individually in single SNP analysis. To further capture disease risk gene information from ASD association studies, we applied a network-based strategy to the Autism Genome Project (AGP) and the Autism Genetics Resource Exchange GWAS datasets, combining family-based association data with Human Protein-Protein interaction (PPI) data. Our analysis showed that autism-associated proteins at higher than conventional levels of significance (P<0.1) directly interact more than random expectation and are involved in a limited number of interconnected biological processes, indicating that they are functionally related. The functionally coherent networks generated by this approach contain ASD-relevant disease biology, as demonstrated by an improved positive predictive value and sensitivity in retrieving known ASD candidate genes relative to the top associated genes from either GWAS, as well as a higher gene overlap between the two ASD datasets. Analysis of the intersection between the networks obtained from the two ASD GWAS and six unrelated disease datasets identified fourteen genes exclusively present in the ASD networks. These are mostly novel genes involved in abnormal nervous system phenotypes in animal models, and in fundamental biological processes previously implicated in ASD, such as axon guidance, cell adhesion or cytoskeleton organization. Overall, our results highlighted novel susceptibility genes previously hidden within GWAS statistical "noise" that warrant further analysis for causal variants.
Kugler, Jamie E.; Horsch, Marion; Huang, Di; Furusawa, Takashi; Rochman, Mark; Garrett, Lillian; Becker, Lore; Bohla, Alexander; Hölter, Sabine M.; Prehn, Cornelia; Rathkolb, Birgit; Racz, Ildikó; Aguilar-Pimentel, Juan Antonio; Adler, Thure; Adamski, Jerzy; Beckers, Johannes; Busch, Dirk H.; Eickelberg, Oliver; Klopstock, Thomas; Ollert, Markus; Stöger, Tobias; Wolf, Eckhard; Wurst, Wolfgang; Yildirim, Ali Önder; Zimmer, Andreas; Gailus-Durner, Valérie; Fuchs, Helmut; Hrabě de Angelis, Martin; Garfinkel, Benny; Orly, Joseph; Ovcharenko, Ivan; Bustin, Michael
2013-01-01
The nuclei of most vertebrate cells contain members of the high mobility group N (HMGN) protein family, which bind specifically to nucleosome core particles and affect chromatin structure and function, including transcription. Here, we study the biological role of this protein family by systematic analysis of phenotypes and tissue transcription profiles in mice lacking functional HMGN variants. Phenotypic analysis of Hmgn1tm1/tm1, Hmgn3tm1/tm1, and Hmgn5tm1/tm1 mice and their wild type littermates with a battery of standardized tests uncovered variant-specific abnormalities. Gene expression analysis of four different tissues in each of the Hmgntm1/tm1 lines reveals very little overlap between genes affected by specific variants in different tissues. Pathway analysis reveals that loss of an HMGN variant subtly affects expression of numerous genes in specific biological processes. We conclude that within the biological framework of an entire organism, HMGNs modulate the fidelity of the cellular transcriptional profile in a tissue- and HMGN variant-specific manner. PMID:23620591
Regularized rare variant enrichment analysis for case-control exome sequencing data.
Larson, Nicholas B; Schaid, Daniel J
2014-02-01
Rare variants have recently garnered an immense amount of attention in genetic association analysis. However, unlike methods traditionally used for single marker analysis in GWAS, rare variant analysis often requires some method of aggregation, since single marker approaches are poorly powered for typical sequencing study sample sizes. Advancements in sequencing technologies have rendered next-generation sequencing platforms a realistic alternative to traditional genotyping arrays. Exome sequencing in particular not only provides base-level resolution of genetic coding regions, but also a natural paradigm for aggregation via genes and exons. Here, we propose the use of penalized regression in combination with variant aggregation measures to identify rare variant enrichment in exome sequencing data. In contrast to marginal gene-level testing, we simultaneously evaluate the effects of rare variants in multiple genes, focusing on gene-based least absolute shrinkage and selection operator (LASSO) and exon-based sparse group LASSO models. By using gene membership as a grouping variable, the sparse group LASSO can be used as a gene-centric analysis of rare variants while also providing a penalized approach toward identifying specific regions of interest. We apply extensive simulations to evaluate the performance of these approaches with respect to specificity and sensitivity, comparing these results to multiple competing marginal testing methods. Finally, we discuss our findings and outline future research. © 2013 WILEY PERIODICALS, INC.
Integrated analysis of germline and somatic variants in ovarian cancer.
Kanchi, Krishna L; Johnson, Kimberly J; Lu, Charles; McLellan, Michael D; Leiserson, Mark D M; Wendl, Michael C; Zhang, Qunyuan; Koboldt, Daniel C; Xie, Mingchao; Kandoth, Cyriac; McMichael, Joshua F; Wyczalkowski, Matthew A; Larson, David E; Schmidt, Heather K; Miller, Christopher A; Fulton, Robert S; Spellman, Paul T; Mardis, Elaine R; Druley, Todd E; Graubert, Timothy A; Goodfellow, Paul J; Raphael, Benjamin J; Wilson, Richard K; Ding, Li
2014-01-01
We report the first large-scale exome-wide analysis of the combined germline-somatic landscape in ovarian cancer. Here we analyse germline and somatic alterations in 429 ovarian carcinoma cases and 557 controls. We identify 3,635 high confidence, rare truncation and 22,953 missense variants with predicted functional impact. We find germline truncation variants and large deletions across Fanconi pathway genes in 20% of cases. Enrichment of rare truncations is shown in BRCA1, BRCA2 and PALB2. In addition, we observe germline truncation variants in genes not previously associated with ovarian cancer susceptibility (NF1, MAP3K4, CDKN2B and MLL3). Evidence for loss of heterozygosity was found in 100 and 76% of cases with germline BRCA1 and BRCA2 truncations, respectively. Germline-somatic interaction analysis combined with extensive bioinformatics annotation identifies 222 candidate functional germline truncation and missense variants, including two pathogenic BRCA1 and 1 TP53 deleterious variants. Finally, integrated analyses of germline and somatic variants identify significantly altered pathways, including the Fanconi, MAPK and MLL pathways.
Evolutionary history of African mongoose rabies.
Van Zyl, N; Markotter, W; Nel, L H
2010-06-01
Two biotypes or variants of rabies virus (RABV) occur in southern Africa. These variants are respectively adapted to hosts belonging to the Canidae family (the canid variant) and hosts belonging to the Herpestidae family (the mongoose variant). Due to the distinct host adaptation and differences in epidemiology and pathogenesis, it has been hypothesized that the two variants were introduced into Africa at different times. The objective of this study was to investigate the molecular phylogeny of representative RABV isolates of the mongoose variant towards a better understanding of the origins of this group. The study was based on an analysis of the full nucleoprotein and glycoprotein gene sequences of a panel of 27 viruses. Phylogenetic analysis of this dataset confirmed extended evolutionary adaptation of isolates in specific geographic areas. The evolutionary dynamics of this virus variant was investigated using Bayesian methodology, allowing for rate variation among viral lineages. Molecular clock analysis estimated the age of the African mongoose RABV to be approximately 200 years old, which is in concurrence with literature describing rabies in mongooses since the early 1800 s. (c) 2010 Elsevier B.V. All rights reserved.
Reliable Detection of Herpes Simplex Virus Sequence Variation by High-Throughput Resequencing.
Morse, Alison M; Calabro, Kaitlyn R; Fear, Justin M; Bloom, David C; McIntyre, Lauren M
2017-08-16
High-throughput sequencing (HTS) has resulted in data for a number of herpes simplex virus (HSV) laboratory strains and clinical isolates. The knowledge of these sequences has been critical for investigating viral pathogenicity. However, the assembly of complete herpesviral genomes, including HSV, is complicated due to the existence of large repeat regions and arrays of smaller reiterated sequences that are commonly found in these genomes. In addition, the inherent genetic variation in populations of isolates for viruses and other microorganisms presents an additional challenge to many existing HTS sequence assembly pipelines. Here, we evaluate two approaches for the identification of genetic variants in HSV1 strains using Illumina short read sequencing data. The first, a reference-based approach, identifies variants from reads aligned to a reference sequence and the second, a de novo assembly approach, identifies variants from reads aligned to de novo assembled consensus sequences. Of critical importance for both approaches is the reduction in the number of low complexity regions through the construction of a non-redundant reference genome. We compared variants identified in the two methods. Our results indicate that approximately 85% of variants are identified regardless of the approach. The reference-based approach to variant discovery captures an additional 15% representing variants divergent from the HSV1 reference possibly due to viral passage. Reference-based approaches are significantly less labor-intensive and identify variants across the genome where de novo assembly-based approaches are limited to regions where contigs have been successfully assembled. In addition, regions of poor quality assembly can lead to false variant identification in de novo consensus sequences. For viruses with a well-assembled reference genome, a reference-based approach is recommended.
Musunuru, Kiran; Bernstein, Daniel; Cole, F Sessions; Khokha, Mustafa K; Lee, Frank S; Lin, Shin; McDonald, Thomas V; Moskowitz, Ivan P; Quertermous, Thomas; Sankaran, Vijay G; Schwartz, David A; Silverman, Edwin K; Zhou, Xiaobo; Hasan, Ahmed A K; Luo, Xiao-Zhong James
2018-04-01
The National Institutes of Health have made substantial investments in genomic studies and technologies to identify DNA sequence variants associated with human disease phenotypes. The National Heart, Lung, and Blood Institute has been at the forefront of these commitments to ascertain genetic variation associated with heart, lung, blood, and sleep diseases and related clinical traits. Genome-wide association studies, exome- and genome-sequencing studies, and exome-genotyping studies of the National Heart, Lung, and Blood Institute-funded epidemiological and clinical case-control studies are identifying large numbers of genetic variants associated with heart, lung, blood, and sleep phenotypes. However, investigators face challenges in identification of genomic variants that are functionally disruptive among the myriad of computationally implicated variants. Studies to define mechanisms of genetic disruption encoded by computationally identified genomic variants require reproducible, adaptable, and inexpensive methods to screen candidate variant and gene function. High-throughput strategies will permit a tiered variant discovery and genetic mechanism approach that begins with rapid functional screening of a large number of computationally implicated variants and genes for discovery of those that merit mechanistic investigation. As such, improved variant-to-gene and gene-to-function screens-and adequate support for such studies-are critical to accelerating the translation of genomic findings. In this White Paper, we outline the variety of novel technologies, assays, and model systems that are making such screens faster, cheaper, and more accurate, referencing published work and ongoing work supported by the National Heart, Lung, and Blood Institute's R21/R33 Functional Assays to Screen Genomic Hits program. We discuss priorities that can accelerate the impressive but incomplete progress represented by big data genomic research. © 2018 American Heart Association, Inc.
Yeruva, Laxmi; Bowlin, Anne K; Spencer, Nicole; Maurelli, Anthony T; Rank, Roger G
2015-08-01
An important question in the study of chlamydial genital tract disease is why some women develop severe upper tract disease while others have mild or even "silent" infections with or without pathology. Animal studies suggest that the pathological outcome of an infection is dependent upon both the composition of the infecting chlamydial population and the genotype of the host, along with host physiological effects, such as the cyclical production of reproductive hormones and even the size of the infecting inoculum or the number of repeated infections. In this study, we compared two variants of Chlamydia caviae, contrasting in virulence, with respect to their abilities to ascend the guinea pig genital tract. We then determined the effect of combining the two variants on the course of infection and on the bacterial loads of the two variants in the genital tract. Although the variants individually had similar infection kinetics in the cervix, SP6, the virulent variant, could be isolated from the oviducts more often and in greater numbers than the attenuated variant, AZ2. SP6 also elicited higher levels of interleukin 8 (IL-8) in the lower genital tract and increased leukocyte infiltration in the cervix and uterus compared to AZ2. When the two variants were combined in a mixed infection, SP6 outcompeted AZ2 in the lower genital tract; however, AZ2 was able to ascend the genital tract as readily as SP6. These data suggest that the ability of SP6 to elicit an inflammatory response in the lower genital tract facilitates the spread of both variants to the oviducts. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
Genetic Analyses in Small-for-Gestational-Age Newborns.
Stalman, Susanne E; Solanky, Nita; Ishida, Miho; Alemán-Charlet, Cristina; Abu-Amero, Sayeda; Alders, Marielle; Alvizi, Lucas; Baird, William; Demetriou, Charalambos; Henneman, Peter; James, Chela; Knegt, Lia C; Leon, Lydia J; Mannens, Marcel M A M; Mul, Adi N; Nibbering, Nicole A; Peskett, Emma; Rezwan, Faisal I; Ris-Stalpers, Carrie; van der Post, Joris A M; Kamp, Gerdine A; Plötz, Frans B; Wit, Jan M; Stanier, Philip; Moore, Gudrun E; Hennekam, Raoul C
2018-03-01
Small for gestational age (SGA) can be the result of fetal growth restriction, which is associated with perinatal morbidity and mortality. Mechanisms that control prenatal growth are poorly understood. The aim of the current study was to gain more insight into prenatal growth failure and determine an effective diagnostic approach in SGA newborns. We hypothesized that one or more copy number variations (CNVs) and disturbed methylation and sequence variants may be present in genes associated with fetal growth. A prospective cohort study of subjects with a low birth weight for gestational age. The study was conducted at an academic pediatric research institute. A total of 21 SGA newborns with a mean birth weight below the first centile and a control cohort of 24 appropriate-for-gestational-age newborns were studied. Array comparative genomic hybridization, genome-wide methylation studies, and exome sequencing were performed. The numbers of CNVs, methylation disturbances, and sequence variants. The genetic analyses demonstrated three CNVs, one systematically disturbed methylation pattern, and one sequence variant explaining SGA. Additional methylation disturbances and sequence variants were present in 20 patients. In 19 patients, multiple abnormalities were found. Our results confirm the influence of a large number of mechanisms explaining dysregulation of fetal growth. We concluded that CNVs, methylation disturbances, and sequence variants all contribute to prenatal growth failure. These genetic workups can be an effective diagnostic approach in SGA newborns.
Kim, Wonkuk; Londono, Douglas; Zhou, Lisheng; Xing, Jinchuan; Nato, Alejandro Q; Musolf, Anthony; Matise, Tara C; Finch, Stephen J; Gordon, Derek
2012-01-01
As with any new technology, next-generation sequencing (NGS) has potential advantages and potential challenges. One advantage is the identification of multiple causal variants for disease that might otherwise be missed by SNP-chip technology. One potential challenge is misclassification error (as with any emerging technology) and the issue of power loss due to multiple testing. Here, we develop an extension of the linear trend test for association that incorporates differential misclassification error and may be applied to any number of SNPs. We call the statistic the linear trend test allowing for error, applied to NGS, or LTTae,NGS. This statistic allows for differential misclassification. The observed data are phenotypes for unrelated cases and controls, coverage, and the number of putative causal variants for every individual at all SNPs. We simulate data considering multiple factors (disease mode of inheritance, genotype relative risk, causal variant frequency, sequence error rate in cases, sequence error rate in controls, number of loci, and others) and evaluate type I error rate and power for each vector of factor settings. We compare our results with two recently published NGS statistics. Also, we create a fictitious disease model based on downloaded 1000 Genomes data for 5 SNPs and 388 individuals, and apply our statistic to those data. We find that the LTTae,NGS maintains the correct type I error rate in all simulations (differential and non-differential error), while the other statistics show large inflation in type I error for lower coverage. Power for all three methods is approximately the same for all three statistics in the presence of non-differential error. Application of our statistic to the 1000 Genomes data suggests that, for the data downloaded, there is a 1.5% sequence misclassification rate over all SNPs. Finally, application of the multi-variant form of LTTae,NGS shows high power for a number of simulation settings, although it can have lower power than the corresponding single-variant simulation results, most probably due to our specification of multi-variant SNP correlation values. In conclusion, our LTTae,NGS addresses two key challenges with NGS disease studies; first, it allows for differential misclassification when computing the statistic; and second, it addresses the multiple-testing issue in that there is a multi-variant form of the statistic that has only one degree of freedom, and provides a single p value, no matter how many loci. Copyright © 2013 S. Karger AG, Basel.
Kim, Wonkuk; Londono, Douglas; Zhou, Lisheng; Xing, Jinchuan; Nato, Andrew; Musolf, Anthony; Matise, Tara C.; Finch, Stephen J.; Gordon, Derek
2013-01-01
As with any new technology, next generation sequencing (NGS) has potential advantages and potential challenges. One advantage is the identification of multiple causal variants for disease that might otherwise be missed by SNP-chip technology. One potential challenge is misclassification error (as with any emerging technology) and the issue of power loss due to multiple testing. Here, we develop an extension of the linear trend test for association that incorporates differential misclassification error and may be applied to any number of SNPs. We call the statistic the linear trend test allowing for error, applied to NGS, or LTTae,NGS. This statistic allows for differential misclassification. The observed data are phenotypes for unrelated cases and controls, coverage, and the number of putative causal variants for every individual at all SNPs. We simulate data considering multiple factors (disease mode of inheritance, genotype relative risk, causal variant frequency, sequence error rate in cases, sequence error rate in controls, number of loci, and others) and evaluate type I error rate and power for each vector of factor settings. We compare our results with two recently published NGS statistics. Also, we create a fictitious disease model, based on downloaded 1000 Genomes data for 5 SNPs and 388 individuals, and apply our statistic to that data. We find that the LTTae,NGS maintains the correct type I error rate in all simulations (differential and non-differential error), while the other statistics show large inflation in type I error for lower coverage. Power for all three methods is approximately the same for all three statistics in the presence of non-differential error. Application of our statistic to the 1000 Genomes data suggests that, for the data downloaded, there is a 1.5% sequence misclassification rate over all SNPs. Finally, application of the multi-variant form of LTTae,NGS shows high power for a number of simulation settings, although it can have lower power than the corresponding single variant simulation results, most probably due to our specification of multi-variant SNP correlation values. In conclusion, our LTTae,NGS addresses two key challenges with NGS disease studies; first, it allows for differential misclassification when computing the statistic; and second, it addresses the multiple-testing issue in that there is a multi-variant form of the statistic that has only one degree of freedom, and provides a single p-value, no matter how many loci. PMID:23594495
Law of corresponding states for open collaborations
NASA Astrophysics Data System (ADS)
Gherardi, Marco; Bassetti, Federico; Cosentino Lagomarsino, Marco
2016-04-01
We study the relation between number of contributors and product size in Wikipedia and GitHub. In contrast to traditional production, this is strongly probabilistic, but is characterized by two quantitative nonlinear laws: a power-law bound to product size for increasing number of contributors, and the universal collapse of rescaled distributions. A variant of the random-energy model shows that both laws are due to the heterogeneity of contributors, and displays an intriguing finite-size scaling property with no equivalent in standard systems. The analysis uncovers the right intensive densities, enabling the comparison of projects with different numbers of contributors on equal grounds. We use this property to expose the detrimental effects of conflicting interactions in Wikipedia.
Systematic RH genotyping and variant identification in French donors of African origin
Kappler-Gratias, Sandrine; Auxerre, Carine; Dubeaux, Isabelle; Beolet, Marylise; Ripaux, Maryline; Le Pennec, Pierre-Yves; Pham, Bach-Nga
2014-01-01
Background RH molecular analysis has enabled the documentation of numerous variants of RHD and RHCE alleles, especially in individuals of African origin. The aim of the present study was to determine the type and frequency of D and/or RhCE variants among blood donors of African origin in France, by performing a systematic RH molecular analysis, in order to evaluate the implications for blood transfusion of patients of African origin. Materials and methods Samples from 316 African blood donors, whose origin was established by their Fy(a−b−) phenotype, were first analysed using the RHD and RHCE BeadChips Kit (BioArray Solutions, Immucor, Warren, NJ, USA). Sequencing was performed when necessary. Results RHD molecular analysis showed that 26.2% of donors had a variant RHD allele. It allowed the prediction of a partial D in 11% of cases. RHCE molecular analysis showed that 14.2% of donors had a variant RHCE allele or RH [RN or (C)ces] haplotype. A rare Rh phenotype associated with the loss of a high-prevalence antigen or partial RhCE antigens were predicted from RHCE molecular analysis in 1 (0.3%) and 17 (5%) cases, respectively. Discussion Systematic RHD and RHCE molecular analysis performed in blood donors of African origin provides transfusion-relevant information for individuals of African origin because of the frequency of variant RH alleles. RH molecular analysis may improve transfusion therapy of patients by allowing better donor and recipient matching, based not only on phenotypically matched red blood cell units, but also on units that are genetically matched with regards to RhCE variants. PMID:23867180
In-Depth Analysis of HA and NS1 Genes in A(H1N1)pdm09 Infected Patients.
Caglioti, Claudia; Selleri, Marina; Rozera, Gabriella; Giombini, Emanuela; Zaccaro, Paola; Valli, Maria Beatrice; Capobianchi, Maria Rosaria
2016-01-01
In March/April 2009, a new pandemic influenza A virus (A(H1N1)pdm09) emerged and spread rapidly via human-to-human transmission, giving rise to the first pandemic of the 21th century. Influenza virus may be present in the infected host as a mixture of variants, referred to as quasi-species, on which natural and immune-driven selection operates. Since hemagglutinin (HA) and non-structural 1 (NS1) proteins are relevant in respect of adaptive and innate immune responses, the present study was aimed at establishing the intra-host genetic heterogeneity of HA and NS1 genes, applying ultra-deep pyrosequencing (UDPS) to nasopharyngeal swabs (NPS) from patients with confirmed influenza A(H1N1)pdm09 infection. The intra-patient nucleotide diversity of HA was significantly higher than that of NS1 (median (IQR): 37.9 (32.8-42.3) X 10-4 vs 30.6 (27.4-33.6) X 10-4 substitutions/site, p = 0.024); no significant correlation for nucleotide diversity of NS1 and HA was observed (r = 0.319, p = 0.29). Furthermore, a strong inverse correlation between nucleotide diversity of NS1 and viral load was observed (r = - 0.74, p = 0.004). For both HA and NS1, the variants appeared scattered along the genes, thus indicating no privileged mutation site. Known polymorphisms, S203T (HA) and I123V (NS1), were observed as dominant variants (>98%) in almost all patients; three HA and two NS1 further variants were observed at frequency >40%; a number of additional variants were detected at frequency <6% (minority variants), of which three HA and four NS1 variants were novel. In few patients multiple variants were observed at HA residues 203 and 222. According to the FLUSURVER tool, some of these variants may affect immune recognition and host range; however, these inferences are based on H5N1, and their extension to A(H1N1)pdm09 requires caution. More studies are necessary to address the significance of the composite nature of influenza virus quasi-species within infected patients.
Lovelock, Paul K; Spurdle, Amanda B; Mok, Myth TS; Farrugia, Daniel J; Lakhani, Sunil R; Healey, Sue; Arnold, Stephen; Buchanan, Daniel; Investigators, kConFab; Couch, Fergus J; Henderson, Beric R; Goldgar, David E; Tavtigian, Sean V; Chenevix-Trench, Georgia; Brown, Melissa A
2007-01-01
Introduction Many of the DNA sequence variants identified in the breast cancer susceptibility gene BRCA1 remain unclassified in terms of their potential pathogenicity. Both multifactorial likelihood analysis and functional approaches have been proposed as a means to elucidate likely clinical significance of such variants, but analysis of the comparative value of these methods for classifying all sequence variants has been limited. Methods We have compared the results from multifactorial likelihood analysis with those from several functional analyses for the four BRCA1 sequence variants A1708E, G1738R, R1699Q, and A1708V. Results Our results show that multifactorial likelihood analysis, which incorporates sequence conservation, co-inheritance, segregation, and tumour immunohistochemical analysis, may improve classification of variants. For A1708E, previously shown to be functionally compromised, analysis of oestrogen receptor, cytokeratin 5/6, and cytokeratin 14 tumour expression data significantly strengthened the prediction of pathogenicity, giving a posterior probability of pathogenicity of 99%. For G1738R, shown to be functionally defective in this study, immunohistochemistry analysis confirmed previous findings of inconsistent 'BRCA1-like' phenotypes for the two tumours studied, and the posterior probability for this variant was 96%. The posterior probabilities of R1699Q and A1708V were 54% and 69%, respectively, only moderately suggestive of increased risk. Interestingly, results from functional analyses suggest that both of these variants have only partial functional activity. R1699Q was defective in foci formation in response to DNA damage and displayed intermediate transcriptional transactivation activity but showed no evidence for centrosome amplification. In contrast, A1708V displayed an intermediate transcriptional transactivation activity and a normal foci formation response in response to DNA damage but induced centrosome amplification. Conclusion These data highlight the need for a range of functional studies to be performed in order to identify variants with partially compromised function. The results also raise the possibility that A1708V and R1699Q may be associated with a low or moderate risk of cancer. While data pooling strategies may provide more information for multifactorial analysis to improve the interpretation of the clinical significance of these variants, it is likely that the development of current multifactorial likelihood approaches and the consideration of alternative statistical approaches will be needed to determine whether these individually rare variants do confer a low or moderate risk of breast cancer. PMID:18036263
Bukin, Yu S; Dzhioev, Yu P; Tkachev, S E; Kozlova, I V; Paramonov, A I; Ruzek, D; Qu, Z; Zlobin, V I
2017-06-15
This work is dedicated to the study of the variability of the main antigenic envelope protein E among different strains of tick-borne encephalitis virus at the level of physical and chemical properties of the amino acid residues. E protein variants were extracted from then NCBI database. Four amino acid residues properties in the polypeptide sequences were investigated: the average volume of the amino acid residue in the protein tertiary structure, the number of amino acid residue hydrogen bond donors, the charge of amino acid residue lateral radical and the dipole moment of the amino acid residue. These physico-chemical properties are involved in antigen-antibody interactions. As a result, 103 different variants of the antigenic determinants of the tick-borne encephalitis virus E protein were found, significantly different by physical and chemical properties of the amino acid residues in their structure. This means that some strains among the natural variants of tick-borne encephalitis virus can potentially escape the immune response induced by the standard vaccine. Copyright © 2017 Elsevier B.V. All rights reserved.
Whole-exome SNP array identifies 15 new susceptibility loci for psoriasis
Zuo, Xianbo; Sun, Liangdan; Yin, Xianyong; Gao, Jinping; Sheng, Yujun; Xu, Jinhua; Zhang, Jianzhong; He, Chundi; Qiu, Ying; Wen, Guangdong; Tian, Hongqing; Zheng, Xiaodong; Liu, Shengxiu; Wang, Wenjun; Li, Weiran; Cheng, Yuyan; Liu, Longdan; Chang, Yan; Wang, Zaixing; Li, Zenggang; Li, Longnian; Wu, Jianping; Fang, Ling; Shen, Changbing; Zhou, Fusheng; Liang, Bo; Chen, Gang; Li, Hui; Cui, Yong; Xu, Aie; Yang, Xueqin; Hao, Fei; Xu, Limin; Fan, Xing; Li, Yuzhen; Wu, Rina; Wang, Xiuli; Liu, Xiaoming; Zheng, Min; Song, Shunpeng; Ji, Bihua; Fang, Hong; Yu, Jianbin; Sun, Yongxin; Hui, Yan; Zhang, Furen; Yang, Rongya; Yang, Sen; Zhang, Xuejun
2015-01-01
Genome-wide association studies (GWASs) have reproducibly associated ∼40 susceptibility loci with psoriasis. However, the missing heritability is evident and the contributions of coding variants have not yet been systematically evaluated. Here, we present a large-scale whole-exome array analysis for psoriasis consisting of 42,760 individuals. We discover 16 SNPs within 15 new genes/loci associated with psoriasis, including C1orf141, ZNF683, TMC6, AIM2, IL1RL1, CASR, SON, ZFYVE16, MTHFR, CCDC129, ZNF143, AP5B1, SYNE2, IFNGR2 and 3q26.2-q27 (P<5.00 × 10−08). In addition, we also replicate four known susceptibility loci TNIP1, NFKBIA, IL12B and LCE3D–LCE3E. These susceptibility variants identified in the current study collectively account for 1.9% of the psoriasis heritability. The variant within AIM2 is predicted to impact protein structure. Our findings increase the number of genetic risk factors for psoriasis and highlight new and plausible biological pathways in psoriasis. PMID:25854761
Byrska-Bishop, Marta; Wallace, John; Frase, Alexander T; Ritchie, Marylyn D
2018-01-01
Abstract Motivation BioBin is an automated bioinformatics tool for the multi-level biological binning of sequence variants. Herein, we present a significant update to BioBin which expands the software to facilitate a comprehensive rare variant analysis and incorporates novel features and analysis enhancements. Results In BioBin 2.3, we extend our software tool by implementing statistical association testing, updating the binning algorithm, as well as incorporating novel analysis features providing for a robust, highly customizable, and unified rare variant analysis tool. Availability and implementation The BioBin software package is open source and freely available to users at http://www.ritchielab.com/software/biobin-download Contact mdritchie@geisinger.edu Supplementary information Supplementary data are available at Bioinformatics online. PMID:28968757
CHEK2*1100DELC Variant and Breast Cancer Risk
2006-10-01
AD_________________ Award Number: DAMD17-03-1-0774 TITLE: CHEK2 *1100DELC Variant and Breast...01-10-2006 2. REPORT TYPE Final 3. DATES COVERED (From - To) 15 Sep 03 – 14 Sep 06 4. TITLE AND SUBTITLE CHEK2 *1100DELC...SUPPLEMENTARY NOTES 14. ABSTRACT: We propose to examine the association between the CHEK2 *1100delC gene variant and breast cancer among BRCA1/2
Identification of pathogenic gene mutations in LMNA and MYBPC3 that alter RNA splicing.
Ito, Kaoru; Patel, Parth N; Gorham, Joshua M; McDonough, Barbara; DePalma, Steven R; Adler, Emily E; Lam, Lien; MacRae, Calum A; Mohiuddin, Syed M; Fatkin, Diane; Seidman, Christine E; Seidman, J G
2017-07-18
Genetic variants that cause haploinsufficiency account for many autosomal dominant (AD) disorders. Gene-based diagnosis classifies variants that alter canonical splice signals as pathogenic, but due to imperfect understanding of RNA splice signals other variants that may create or eliminate splice sites are often clinically classified as variants of unknown significance (VUS). To improve recognition of pathogenic splice-altering variants in AD disorders, we used computational tools to prioritize VUS and developed a cell-based minigene splicing assay to confirm aberrant splicing. Using this two-step procedure we evaluated all rare variants in two AD cardiomyopathy genes, lamin A/C ( LMNA ) and myosin binding protein C ( MYBPC3 ). We demonstrate that 13 LMNA and 35 MYBPC3 variants identified in cardiomyopathy patients alter RNA splicing, representing a 50% increase in the numbers of established damaging splice variants in these genes. Over half of these variants are annotated as VUS by clinical diagnostic laboratories. Familial analyses of one variant, a synonymous LMNA VUS, demonstrated segregation with cardiomyopathy affection status and altered cardiac LMNA splicing. Application of this strategy should improve diagnostic accuracy and variant classification in other haploinsufficient AD disorders.
Proposed variations of the stepped-wedge design can be used to accommodate multiple interventions
Lyons, Vivian H; Li, Lingyu; Hughes, James P; Rowhani-Rahbar, Ali
2018-01-01
Objective Stepped wedge design (SWD) cluster randomized trials have traditionally been used for evaluating a single intervention. We aimed to explore design variants suitable for evaluating multiple interventions in a SWD trial. Study Design and Setting We identified four specific variants of the traditional SWD that would allow two interventions to be conducted within a single cluster randomized trial: Concurrent, Replacement, Supplementation and Factorial SWDs. These variants were chosen to flexibly accommodate study characteristics that limit a one-size-fits-all approach for multiple interventions. Results In the Concurrent SWD, each cluster receives only one intervention, unlike the other variants. The Replacement SWD supports two interventions that will not or cannot be employed at the same time. The Supplementation SWD is appropriate when the second intervention requires the presence of the first intervention, and the Factorial SWD supports the evaluation of intervention interactions. The precision for estimating intervention effects varies across the four variants. Conclusion Selection of the appropriate design variant should be driven by the research question while considering the trade-off between the number of steps, number of clusters, restrictions for concurrent implementation of the interventions, lingering effects of each intervention, and precision of the intervention effect estimates. PMID:28412466
Investigation of the role of interleukin-1 receptor antagonist VNTR variant on the Behçet’s disease
Dursun, Gül; Demir, Helin Deniz; Karakuş, Nevin; Demir, Osman; Yiğit, Serbülent
2018-01-01
Objective Behçet’s disease (BD), a chronic multisystem inflammatory disorder, is mainly characterized by relapsing periods of a wide range of clinical symptoms. Several cytokine genes may play important roles in the pathogenesis of BD. Therefore, interleukin-1 receptor antagonist (IL-1Ra) gene 86bp variable number tandem repeat (VNTR) variant was investigated in patients with BD in a Turkish population. Methods One hundred nine patients (60 females, 49 males; the mean age±standard deviation [SD] was 36.56±9.571 years) with BD and one hundred healthy individuals (54 females, 46 males; the mean age±SD was 36.64±2.294 years) were examined in the study. For genotyping, polymerase chain reaction-restriction fragment length polymorphism analysis was employed. Data were analyzed using Statistical Package for Social Sciences (SPSS) 22.0 (IBM Corp.; Armonk, NY, USA) (p<0.05) Results The genotype distribution and allele frequencies of the IL-1Ra VNTR variant did not differ significantly between the patients and the controls (p>0.05). The frequency of the a1/a1, a1/a2 genotypes and a1, a2 alleles were the most common both in patients and healthy controls (p=0.37, p=0.26, and p=0.53, respectively). Also, no statistically significant difference was found between the IL-1Ra VNTR variant genotypes and clinical characteristics (p>0.05). Conclusion The results of this study do not support an association between the IL-1Ra VNTR variant and the risk of BD in a Turkish population. However, further studies of this variant with larger sample sizes and different ethnicities are required for confirmation. PMID:29657871
Wu, Ying; Waite, Lindsay L.; Jackson, Anne U.; Sheu, Wayne H-H.; Buyske, Steven; Absher, Devin; Arnett, Donna K.; Boerwinkle, Eric; Bonnycastle, Lori L.; Carty, Cara L.; Cheng, Iona; Cochran, Barbara; Croteau-Chonka, Damien C.; Dumitrescu, Logan; Eaton, Charles B.; Franceschini, Nora; Guo, Xiuqing; Henderson, Brian E.; Hindorff, Lucia A.; Kim, Eric; Kinnunen, Leena; Komulainen, Pirjo; Lee, Wen-Jane; Le Marchand, Loic; Lin, Yi; Lindström, Jaana; Lingaas-Holmen, Oddgeir; Mitchell, Sabrina L.; Narisu, Narisu; Robinson, Jennifer G.; Schumacher, Fred; Stančáková, Alena; Sundvall, Jouko; Sung, Yun-Ju; Swift, Amy J.; Wang, Wen-Chang; Wilkens, Lynne; Wilsgaard, Tom; Young, Alicia M.; Adair, Linda S.; Ballantyne, Christie M.; Bůžková, Petra; Chakravarti, Aravinda; Collins, Francis S.; Duggan, David; Feranil, Alan B.; Ho, Low-Tone; Hung, Yi-Jen; Hunt, Steven C.; Hveem, Kristian; Juang, Jyh-Ming J.; Kesäniemi, Antero Y.; Kuusisto, Johanna; Laakso, Markku; Lakka, Timo A.; Lee, I-Te; Leppert, Mark F.; Matise, Tara C.; Moilanen, Leena; Njølstad, Inger; Peters, Ulrike; Quertermous, Thomas; Rauramaa, Rainer; Rotter, Jerome I.; Saramies, Jouko; Tuomilehto, Jaakko; Uusitupa, Matti; Wang, Tzung-Dau; Mohlke, Karen L.
2013-01-01
Genome-wide association studies (GWAS) have identified ∼100 loci associated with blood lipid levels, but much of the trait heritability remains unexplained, and at most loci the identities of the trait-influencing variants remain unknown. We conducted a trans-ethnic fine-mapping study at 18, 22, and 18 GWAS loci on the Metabochip for their association with triglycerides (TG), high-density lipoprotein cholesterol (HDL-C), and low-density lipoprotein cholesterol (LDL-C), respectively, in individuals of African American (n = 6,832), East Asian (n = 9,449), and European (n = 10,829) ancestry. We aimed to identify the variants with strongest association at each locus, identify additional and population-specific signals, refine association signals, and assess the relative significance of previously described functional variants. Among the 58 loci, 33 exhibited evidence of association at P<1×10−4 in at least one ancestry group. Sequential conditional analyses revealed that ten, nine, and four loci in African Americans, Europeans, and East Asians, respectively, exhibited two or more signals. At these loci, accounting for all signals led to a 1.3- to 1.8-fold increase in the explained phenotypic variance compared to the strongest signals. Distinct signals across ancestry groups were identified at PCSK9 and APOA5. Trans-ethnic analyses narrowed the signals to smaller sets of variants at GCKR, PPP1R3B, ABO, LCAT, and ABCA1. Of 27 variants reported previously to have functional effects, 74% exhibited the strongest association at the respective signal. In conclusion, trans-ethnic high-density genotyping and analysis confirm the presence of allelic heterogeneity, allow the identification of population-specific variants, and limit the number of candidate SNPs for functional studies. PMID:23555291
Karolak, Justyna A; Gambin, Tomasz; Pitarque, Jose A; Molinari, Andrea; Jhangiani, Shalini; Stankiewicz, Pawel; Lupski, James R; Gajecka, Marzena
2017-01-01
Keratoconus (KTCN) is a protrusion and thinning of the cornea, resulting in impairment of visual function. The extreme genetic heterogeneity makes it difficult to discover factors unambiguously influencing the KTCN phenotype. In this study, we used whole-exome sequencing (WES) and Sanger sequencing to reduce the number of candidate genes at the 5q31.1–q35.3 locus and to prioritize other potentially relevant variants in an Ecuadorian family with KTCN. We applied WES in two affected KTCN individuals from the Ecuadorian family that showed a suggestive linkage between the KTCN phenotype and the 5q31.1–q35.3 locus. Putative variants identified by WES were further evaluated in this family using Sanger sequencing. Exome capture discovered a total of 173 rare (minor allele frequency <0.001 in control population) nonsynonymous variants in both affected individuals. Among them, 16 SNVs were selected for further evaluation. Segregation analysis revealed that variants c.475T>G in SKP1, c.671G>A in PROB1, and c.527G>A in IL17B in the 5q31.1–q35.3 linkage region, and c.850G>A in HKDC1 in the 10q22 locus completely segregated with the phenotype in the studied KTCN family. We demonstrate that a combination of various techniques significantly narrowed the studied genomic region and reduced the list of the putative exonic variants. Moreover, since this locus overlapped two other chromosomal regions previously recognized in distinct KTCN studies, our findings suggest that this 5q31.1–q35.3 locus might be linked with KTCN. PMID:27703147
Nemethova, Martina; Radvanszky, Jan; Kadasi, Ludevit; Ascher, David B; Pires, Douglas E V; Blundell, Tom L; Porfirio, Berardino; Mannoni, Alessandro; Santucci, Annalisa; Milucci, Lia; Sestini, Silvia; Biolcati, Gianfranco; Sorge, Fiammetta; Aurizi, Caterina; Aquaron, Robert; Alsbou, Mohammed; Lourenço, Charles Marques; Ramadevi, Kanakasabapathi; Ranganath, Lakshminarayan R; Gallagher, James A; van Kan, Christa; Hall, Anthony K; Olsson, Birgitta; Sireau, Nicolas; Ayoob, Hana; Timmis, Oliver G; Sang, Kim-Hanh Le Quan; Genovese, Federica; Imrich, Richard; Rovensky, Jozef; Srinivasaraghavan, Rangan; Bharadwaj, Shruthi K; Spiegel, Ronen; Zatkova, Andrea
2016-01-01
Alkaptonuria (AKU) is an autosomal recessive disorder caused by mutations in homogentisate-1,2-dioxygenase (HGD) gene leading to the deficiency of HGD enzyme activity. The DevelopAKUre project is underway to test nitisinone as a specific treatment to counteract this derangement of the phenylalanine-tyrosine catabolic pathway. We analysed DNA of 40 AKU patients enrolled for SONIA1, the first study in DevelopAKUre, and of 59 other AKU patients sent to our laboratory for molecular diagnostics. We identified 12 novel DNA variants: one was identified in patients from Brazil (c.557T>A), Slovakia (c.500C>T) and France (c.440T>C), three in patients from India (c.469+6T>C, c.650-85A>G, c.158G>A), and six in patients from Italy (c.742A>G, c.614G>A, c.1057A>C, c.752G>A, c.119A>C, c.926G>T). Thus, the total number of potential AKU-causing variants found in 380 patients reported in the HGD mutation database is now 129. Using mCSM and DUET, computational approaches based on the protein 3D structure, the novel missense variants are predicted to affect the activity of the enzyme by three mechanisms: decrease of stability of individual protomers, disruption of protomer-protomer interactions or modification of residues in the region of the active site. We also present an overview of AKU in Italy, where so far about 60 AKU cases are known and DNA analysis has been reported for 34 of them. In this rather small group, 26 different HGD variants affecting function were described, indicating rather high heterogeneity. Twelve of these variants seem to be specific for Italy.
Nemethova, Martina; Radvanszky, Jan; Kadasi, Ludevit; Ascher, David B; Pires, Douglas E V; Blundell, Tom L; Porfirio, Berardino; Mannoni, Alessandro; Santucci, Annalisa; Milucci, Lia; Sestini, Silvia; Biolcati, Gianfranco; Sorge, Fiammetta; Aurizi, Caterina; Aquaron, Robert; Alsbou, Mohammed; Marques Lourenço, Charles; Ramadevi, Kanakasabapathi; Ranganath, Lakshminarayan R; Gallagher, James A; van Kan, Christa; Hall, Anthony K; Olsson, Birgitta; Sireau, Nicolas; Ayoob, Hana; Timmis, Oliver G; Le Quan Sang, Kim-Hanh; Genovese, Federica; Imrich, Richard; Rovensky, Jozef; Srinivasaraghavan, Rangan; Bharadwaj, Shruthi K; Spiegel, Ronen; Zatkova, Andrea
2016-01-01
Alkaptonuria (AKU) is an autosomal recessive disorder caused by mutations in homogentisate-1,2-dioxygenase (HGD) gene leading to the deficiency of HGD enzyme activity. The DevelopAKUre project is underway to test nitisinone as a specific treatment to counteract this derangement of the phenylalanine-tyrosine catabolic pathway. We analysed DNA of 40 AKU patients enrolled for SONIA1, the first study in DevelopAKUre, and of 59 other AKU patients sent to our laboratory for molecular diagnostics. We identified 12 novel DNA variants: one was identified in patients from Brazil (c.557T>A), Slovakia (c.500C>T) and France (c.440T>C), three in patients from India (c.469+6T>C, c.650–85A>G, c.158G>A), and six in patients from Italy (c.742A>G, c.614G>A, c.1057A>C, c.752G>A, c.119A>C, c.926G>T). Thus, the total number of potential AKU-causing variants found in 380 patients reported in the HGD mutation database is now 129. Using mCSM and DUET, computational approaches based on the protein 3D structure, the novel missense variants are predicted to affect the activity of the enzyme by three mechanisms: decrease of stability of individual protomers, disruption of protomer-protomer interactions or modification of residues in the region of the active site. We also present an overview of AKU in Italy, where so far about 60 AKU cases are known and DNA analysis has been reported for 34 of them. In this rather small group, 26 different HGD variants affecting function were described, indicating rather high heterogeneity. Twelve of these variants seem to be specific for Italy. PMID:25804398
Genomic analysis identifies masqueraders of full-term cerebral palsy.
Takezawa, Yusuke; Kikuchi, Atsuo; Haginoya, Kazuhiro; Niihori, Tetsuya; Numata-Uematsu, Yurika; Inui, Takehiko; Yamamura-Suzuki, Saeko; Miyabayashi, Takuya; Anzai, Mai; Suzuki-Muromoto, Sato; Okubo, Yukimune; Endo, Wakaba; Togashi, Noriko; Kobayashi, Yasuko; Onuma, Akira; Funayama, Ryo; Shirota, Matsuyuki; Nakayama, Keiko; Aoki, Yoko; Kure, Shigeo
2018-05-01
Cerebral palsy is a common, heterogeneous neurodevelopmental disorder that causes movement and postural disabilities. Recent studies have suggested genetic diseases can be misdiagnosed as cerebral palsy. We hypothesized that two simple criteria, that is, full-term births and nonspecific brain MRI findings, are keys to extracting masqueraders among cerebral palsy cases due to the following: (1) preterm infants are susceptible to multiple environmental factors and therefore demonstrate an increased risk of cerebral palsy and (2) brain MRI assessment is essential for excluding environmental causes and other particular disorders. A total of 107 patients-all full-term births-without specific findings on brain MRI were identified among 897 patients diagnosed with cerebral palsy who were followed at our center. DNA samples were available for 17 of the 107 cases for trio whole-exome sequencing and array comparative genomic hybridization. We prioritized variants in genes known to be relevant in neurodevelopmental diseases and evaluated their pathogenicity according to the American College of Medical Genetics guidelines. Pathogenic/likely pathogenic candidate variants were identified in 9 of 17 cases (52.9%) within eight genes: CTNNB1 , CYP2U1 , SPAST , GNAO1 , CACNA1A , AMPD2 , STXBP1 , and SCN2A . Five identified variants had previously been reported. No pathogenic copy number variations were identified. The AMPD2 missense variant and the splice-site variants in CTNNB1 and AMPD2 were validated by in vitro functional experiments. The high rate of detecting causative genetic variants (52.9%) suggests that patients diagnosed with cerebral palsy in full-term births without specific MRI findings may include genetic diseases masquerading as cerebral palsy.
Holmes, E.C.; Stephenson, A.G.
2014-01-01
Determining the extent and structure of intra-host genetic diversity and the magnitude and impact of population bottlenecks is central to understanding the mechanisms of viral evolution. To determine the nature of viral evolution following systemic movement through a plant, we performed deep sequencing of 23 leaves that grew sequentially along a single Cucurbita pepo vine that was infected with zucchini yellow mosaic virus (ZYMV), and on a leaf that grew in on a side branch. Strikingly, of 112 genetic (i.e. sub-consensus) variants observed in the data set as a whole, only 22 were found in multiple leaves. Similarly, only three of the 13 variants present in the inoculating population were found in the subsequent leaves on the vine. Hence, it appears that systemic movement is characterized by sequential population bottlenecks, although not sufficient to reduce the population to a single virion as multiple variants were consistently transmitted between leaves. In addition, the number of variants within a leaf increases as a function of distance from the inoculated (source) leaf, suggesting that the circulating sap may serve as a continual source of virus. Notably, multiple mutational variants were observed in the cylindrical Inclusion (CI) protein (known to be involved in both cell-to-cell and systemic movement of the virus) that were present in multiple (19/24) leaf samples. These mutations resulted in a conformational change, suggesting that they might confer a selective advantage in systemic movement within the vine. Overall, these data reveal that bottlenecks occur during systemic movement, that variants circulate in the phloem sap throughout the infection process, and that important conformational changes in CI protein may arise during individual infections. PMID:25107623
Olsen, Rikke K J; Koňaříková, Eliška; Giancaspero, Teresa A; Mosegaard, Signe; Boczonadi, Veronika; Mataković, Lavinija; Veauville-Merllié, Alice; Terrile, Caterina; Schwarzmayr, Thomas; Haack, Tobias B; Auranen, Mari; Leone, Piero; Galluccio, Michele; Imbard, Apolline; Gutierrez-Rios, Purificacion; Palmfeldt, Johan; Graf, Elisabeth; Vianey-Saban, Christine; Oppenheim, Marcus; Schiff, Manuel; Pichard, Samia; Rigal, Odile; Pyle, Angela; Chinnery, Patrick F; Konstantopoulou, Vassiliki; Möslinger, Dorothea; Feichtinger, René G; Talim, Beril; Topaloglu, Haluk; Coskun, Turgay; Gucer, Safak; Botta, Annalisa; Pegoraro, Elena; Malena, Adriana; Vergani, Lodovica; Mazzà, Daniela; Zollino, Marcella; Ghezzi, Daniele; Acquaviva, Cecile; Tyni, Tiina; Boneh, Avihu; Meitinger, Thomas; Strom, Tim M; Gregersen, Niels; Mayr, Johannes A; Horvath, Rita; Barile, Maria; Prokisch, Holger
2016-06-02
Multiple acyl-CoA dehydrogenase deficiencies (MADDs) are a heterogeneous group of metabolic disorders with combined respiratory-chain deficiency and a neuromuscular phenotype. Despite recent advances in understanding the genetic basis of MADD, a number of cases remain unexplained. Here, we report clinically relevant variants in FLAD1, which encodes FAD synthase (FADS), as the cause of MADD and respiratory-chain dysfunction in nine individuals recruited from metabolic centers in six countries. In most individuals, we identified biallelic frameshift variants in the molybdopterin binding (MPTb) domain, located upstream of the FADS domain. Inasmuch as FADS is essential for cellular supply of FAD cofactors, the finding of biallelic frameshift variants was unexpected. Using RNA sequencing analysis combined with protein mass spectrometry, we discovered FLAD1 isoforms, which only encode the FADS domain. The existence of these isoforms might explain why affected individuals with biallelic FLAD1 frameshift variants still harbor substantial FADS activity. Another group of individuals with a milder phenotype responsive to riboflavin were shown to have single amino acid changes in the FADS domain. When produced in E. coli, these mutant FADS proteins resulted in impaired but detectable FADS activity; for one of the variant proteins, the addition of FAD significantly improved protein stability, arguing for a chaperone-like action similar to what has been reported in other riboflavin-responsive inborn errors of metabolism. In conclusion, our studies identify FLAD1 variants as a cause of potentially treatable inborn errors of metabolism manifesting with MADD and shed light on the mechanisms by which FADS ensures cellular FAD homeostasis. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.
Circadian gene variants and susceptibility to type 2 diabetes: a pilot study.
Kelly, M Ann; Rees, Simon D; Hydrie, M Zafar I; Shera, A Samad; Bellary, Srikanth; O'Hare, J Paul; Kumar, Sudhesh; Taheri, Shahrad; Basit, Abdul; Barnett, Anthony H
2012-01-01
Disruption of endogenous circadian rhythms has been shown to increase the risk of developing type 2 diabetes, suggesting that circadian genes might play a role in determining disease susceptibility. We present the results of a pilot study investigating the association between type 2 diabetes and selected single nucleotide polymorphisms (SNPs) in/near nine circadian genes. The variants were chosen based on their previously reported association with prostate cancer, a disease that has been suggested to have a genetic link with type 2 diabetes through a number of shared inherited risk determinants. The pilot study was performed using two genetically homogeneous Punjabi cohorts, one resident in the United Kingdom and one indigenous to Pakistan. Subjects with (N = 1732) and without (N = 1780) type 2 diabetes were genotyped for thirteen circadian variants using a competitive allele-specific polymerase chain reaction method. Associations between the SNPs and type 2 diabetes were investigated using logistic regression. The results were also combined with in silico data from other South Asian datasets (SAT2D consortium) and white European cohorts (DIAGRAM+) using meta-analysis. The rs7602358G allele near PER2 was negatively associated with type 2 diabetes in our Punjabi cohorts (combined odds ratio [OR] = 0.75 [0.66-0.86], p = 3.18 × 10(-5)), while the BMAL1 rs11022775T allele was associated with an increased risk of the disease (combined OR = 1.22 [1.07-1.39], p = 0.003). Neither of these associations was replicated in the SAT2D or DIAGRAM+ datasets, however. Meta-analysis of all the cohorts identified disease associations with two variants, rs2292912 in CRY2 and rs12315175 near CRY1, although statistical significance was nominal (combined OR = 1.05 [1.01-1.08], p = 0.008 and OR = 0.95 [0.91-0.99], p = 0.015 respectively). None of the selected circadian gene variants was associated with type 2 diabetes with study-wide significance after meta-analysis. The nominal association observed with the CRY2 SNP, however, complements previous findings and confirms a role for this locus in disease susceptibility.
Somatic Mosaicism: Implications for Disease and Transmission Genetics
Campbell, Ian M.; Shaw, Chad A.; Stankiewicz, Pawel; Lupski, James R.
2015-01-01
Nearly all of the genetic material among cells within an organism is identical. However, single nucleotide variants (SNVs), indels, copy number variants (CNVs), and other structural variants (SVs) continually accumulate as cells divide during development. This process results in an organism composed of countless cells, each with its own unique personal genome. Thus, every human is undoubtedly mosaic. Mosaic mutations can go unnoticed, underlie genetic disease or normal human variation, and may be transmitted to the next generation as constitutional variants. Here, we review the influence of the developmental timing of mutations, the mechanisms by which they arise, methods for detecting mosaic variants, and the risk of passing these mutations on to the next generation. PMID:25910407
Nho, Kwangsik; Kim, Sungeun; Horgusluoglu, Emrin; Risacher, Shannon L; Shen, Li; Kim, Dokyoon; Lee, Seunggeun; Foroud, Tatiana; Shaw, Leslie M; Trojanowski, John Q; Aisen, Paul S; Petersen, Ronald C; Jack, Clifford R; Weiner, Michael W; Green, Robert C; Toga, Arthur W; Saykin, Andrew J
2017-05-24
The APOE ε4 allele is the most significant common genetic risk factor for late-onset Alzheimer's disease (LOAD). The region surrounding APOE on chromosome 19 has also shown consistent association with LOAD. However, no common variants in the region remain significant after adjusting for APOE genotype. We report a rare variant association analysis of genes in the vicinity of APOE with cerebrospinal fluid (CSF) and neuroimaging biomarkers of LOAD. Whole genome sequencing (WGS) was performed on 817 blood DNA samples from the Alzheimer's Disease Neuroimaging Initiative (ADNI). Sequence data from 757 non-Hispanic Caucasian participants was used in the present analysis. We extracted all rare variants (MAF (minor allele frequency) < 0.05) within a 312 kb window in APOE's vicinity encompassing 12 genes. We assessed CSF and neuroimaging (MRI and PET) biomarkers as LOAD-related quantitative endophenotypes. Gene-based analyses of rare variants were performed using the optimal Sequence Kernel Association Test (SKAT-O). A total of 3,334 rare variants (MAF < 0.05) were found within the APOE region. Among them, 72 rare non-synonymous variants were observed. Eight genes spanning the APOE region were significantly associated with CSF Aβ 1-42 (p < 1.0 × 10 -3 ). After controlling for APOE genotype and adjusting for multiple comparisons, 4 genes (CBLC, BCAM, APOE, and RELB) remained significant. Whole-brain surface-based analysis identified highly significant clusters associated with rare variants of CBLC in the temporal lobe region including the entorhinal cortex, as well as frontal lobe regions. Whole-brain voxel-wise analysis of amyloid PET identified significant clusters in the bilateral frontal and parietal lobes showing associations of rare variants of RELB with cortical amyloid burden. Rare variants within genes spanning the APOE region are significantly associated with LOAD-related CSF Aβ 1-42 and neuroimaging biomarkers after adjusting for APOE genotype. These findings warrant further investigation and illustrate the role of next generation sequencing and quantitative endophenotypes in assessing rare variants which may help explain missing heritability in AD and other complex diseases.
Smith, Andrew J P; Deloukas, Panos; Munroe, Patricia B
2018-04-13
Over the last decade, genome-wide association studies (GWAS) have propelled the discovery of thousands of loci associated with complex diseases. The focus is now turning towards the function of these association signals, determining the causal variant(s) amongst those in strong linkage disequilibrium, and identifying their underlying mechanisms, such as long-range gene regulation. Genome-editing techniques utilising zinc-finger nucleases (ZFN), transcription activator-like effector nucleases (TALENs) and clustered regularly-interspaced short palindromic repeats with Cas9 nuclease (CRISPR-Cas9), are becoming the tools of choice to establish functionality for these variants, due to the ability to assess effects of single variants in vivo. This review will discuss examples of how these technologies have begun to aid functional analysis of GWAS loci for complex traits such as cardiovascular disease, type 2 diabetes, cancer, obesity and autoimmune disease. We focus on analysis of variants occurring within non-coding genomic regions, as these comprise the majority of GWAS variants, providing the greatest challenges to determining functionality, and compare editing strategies that provide different levels of evidence for variant functionality. The review describes molecular insights into some of these potentially causal variants, and how these may relate to the pathology of the trait, and look towards future directions for these technologies in post-GWAS analysis, such as base-editing.
NASA Astrophysics Data System (ADS)
Davydova, Tatyana; Zhutaeva, Evgeniya; Dubrovskaya, Tatyana
2017-10-01
Article considers the significance of the demographic forecast for the effective operation of the providing system of social and economic development of the urban transport infrastructure. Analysis of the factors which influence on the population of the city of Voronezh was performed and the population forecast for the year 2020 is presented on the basis of the classification by year of birth. Calculation was performed in three variants (with consideration of the use of classification by year of birth) in connection with an impact of modern social and economic situation on the negative tendencies formed in demographic processes. In the basis of variants were grounded different approaches to the dynamics of demographic processes. The main demographic indicators are the number of permanent residents, birth rates, death rates, migration rates. According to the results of the study, population of the urban district of the city of Voronezh is expected to increase in the specified period and migration inflow of the population has a dominant role in the formation in the formation of the number of the city population.
Morgan, Andrew P.; Didion, John P.; Doran, Anthony G.; Holt, James M.; McMillan, Leonard; Keane, Thomas M.; de Villena, Fernando Pardo-Manuel
2016-01-01
Wild-derived mouse inbred strains are becoming increasingly popular for complex traits analysis, evolutionary studies, and systems genetics. Here, we report the whole-genome sequencing of two wild-derived mouse inbred strains, LEWES/EiJ and ZALENDE/EiJ, of Mus musculus domesticus origin. These two inbred strains were selected based on their geographic origin, karyotype, and use in ongoing research. We generated 14× and 18× coverage sequence, respectively, and discovered over 1.1 million novel variants, most of which are private to one of these strains. This report expands the number of wild-derived inbred genomes in the Mus genus from six to eight. The sequence variation can be accessed via an online query tool; variant calls (VCF format) and alignments (BAM format) are available for download from a dedicated ftp site. Finally, the sequencing data have also been stored in a lossless, compressed, and indexed format using the multi-string Burrows-Wheeler transform. All data can be used without restriction. PMID:27765810
Alvarez-Lobos, Manuel; Arostegui, Juan I; Sans, Miquel; Tassies, Dolors; Plaza, Susana; Delgado, Salvadora; Lacy, Antonio M; Pique, Josep M; Yagüe, Jordi; Panés, Julián
2005-11-01
To study the predictive value of Nod2/CARD15 gene variants along with disease phenotypic characteristics for requirement of initial surgery and for surgical recurrence in Crohn's disease (CD). Nod2/CARD15 gene variants play an important role in the susceptibility to CD. Studies of genotype-phenotype relationship suggest that these variants are associated with development of intestinal strictures. Preliminary reports analyzing the association between these variants and need for surgery have produced inconsistent results. A total of 170 CD patients were included prospectively in the study and followed up regularly for a mean of 7.4 +/- 6.1 years. Clinical characteristics of CD, time and indication for surgery, and recurrence were registered. Nod2/CARD15 gene variants were determined by DNA sequencing analysis. Surgery for stricturing disease was significantly more frequent in patients with Nod2/CARD15 variants in the univariate analysis (odds ratio [OR], 3.63; 95% confidence interval [CI], 1.42-9.27), and it was required at an earlier time (P = 0.004). Only Nod2/CARD15 variants (OR, 3.58; 95% CI, 1.21-10.5) and stricturing phenotype at diagnosis of CD (OR, 9.34; 95% CI, 2.56-33.3) were independent predictive factors of initial surgery for stricturing lesions in the multivariate analysis. Among 70 patients that required surgery, postoperative recurrence was also more frequent in patients with Nod2/CARD15 variants in the univariate and multivariate analysis (OR, 3.29; 95% CI, 1.13-9.56), and reoperation was needed at an earlier time (P = 0.03). Nod2/CARD15 variants are associated with early initial surgery due to stenosis and with surgical recurrence in Crohn's disease. Patients with these variants could benefit from preventive and/or early therapeutic strategies.
Stepiński, Dariusz
2009-03-01
The nucleolar proteins, fibrillarin and nucleophosmin, have been identified immunofluorescently in the root meristematic cells of soybean seedlings under varying experimental conditions: at 25 degrees C (control), chilling at 10 degrees C for 3 h and 4 days and recovery from the chilling stress at 25 degrees C. In each experimental variant, the immunofluorescence signals were present solely at the nucleolar territories. Fluorescent staining for both proteins was mainly in the shape of circular domains that are assumed to correspond to the dense fibrillar component of the nucleoli. The fewest fluorescent domains were observed in the nucleoli of chilled plants, and the highest number was observed in the plants recovered after chilling. This difference in the number of circular domains in the nucleoli of each variant may indicate various levels of these proteins in each variant. Both the number of circular domains and the level of these nucleolar proteins changed with changes in the transcriptional activity of the nucleoli, with the more metabolically active cell having higher numbers of active areas in the nucleolus and higher levels of nucleolar proteins, and conversely. Electron microscopic studies revealed differences in the ultrastructure of the nucleoli in all experimental variants and confirmed that the number of fibrillar centres surrounded by dense fibrillar component was the lowest in the nucleoli of chilled plants, and the highest in the nucleoli of recovered seedlings.
Simino, Jeannette; Wang, Zhiying; Bressler, Jan; Chouraki, Vincent; Yang, Qiong; Younkin, Steven G; Seshadri, Sudha; Fornage, Myriam; Boerwinkle, Eric; Mosley, Thomas H
2017-01-01
We performed single-variant and gene-based association analyses of plasma amyloid-β (aβ) concentrations using whole exome sequence from 1,414 African and European Americans. Our goal was to identify genes that influence plasma aβ42 concentrations and aβ42:aβ40 ratios in late middle age (mean = 59 years), old age (mean = 77 years), or change over time (mean = 18 years). Plasma aβ measures were linearly regressed onto age, gender, APOE ε4 carrier status, and time elapsed between visits (fold-changes only) separately by race. Following inverse normal transformation of the residuals, seqMeta was used to conduct race-specific single-variant and gene-based association tests while adjusting for population structure. Linear regression models were fit on autosomal variants with minor allele frequencies (MAF)≥1%. T5 burden and Sequence Kernel Association (SKAT) gene-based tests assessed functional variants with MAF≤5%. Cross-race fixed effects meta-analyses were Bonferroni-corrected for the number of variants or genes tested. Seven genes were associated with aβ in late middle age or change over time; no associations were identified in old age. Single variants in KLKB1 (rs3733402; p = 4.33x10-10) and F12 (rs1801020; p = 3.89x10-8) were significantly associated with midlife aβ42 levels through cross-race meta-analysis; the KLKB1 variant replicated internally using 1,014 additional participants with exome chip. ITPRIP, PLIN2, and TSPAN18 were associated with the midlife aβ42:aβ40 ratio via the T5 test; TSPAN18 was significant via the cross-race meta-analysis, whereas ITPRIP and PLIN2 were European American-specific. NCOA1 and NT5C3B were associated with the midlife aβ42:aβ40 ratio and the fold-change in aβ42, respectively, via SKAT in African Americans. No associations replicated externally (N = 725). We discovered age-dependent genetic effects, established associations between vascular-related genes (KLKB1, F12, PLIN2) and midlife plasma aβ levels, and identified a plausible Alzheimer's Disease candidate gene (ITPRIP) influencing cell death. Plasma aβ concentrations may have dynamic biological determinants across the lifespan; plasma aβ study designs or analyses must consider age.
Identification of copy number variants in whole-genome data using Reference Coverage Profiles
Glusman, Gustavo; Severson, Alissa; Dhankani, Varsha; Robinson, Max; Farrah, Terry; Mauldin, Denise E.; Stittrich, Anna B.; Ament, Seth A.; Roach, Jared C.; Brunkow, Mary E.; Bodian, Dale L.; Vockley, Joseph G.; Shmulevich, Ilya; Niederhuber, John E.; Hood, Leroy
2015-01-01
The identification of DNA copy numbers from short-read sequencing data remains a challenge for both technical and algorithmic reasons. The raw data for these analyses are measured in tens to hundreds of gigabytes per genome; transmitting, storing, and analyzing such large files is cumbersome, particularly for methods that analyze several samples simultaneously. We developed a very efficient representation of depth of coverage (150–1000× compression) that enables such analyses. Current methods for analyzing variants in whole-genome sequencing (WGS) data frequently miss copy number variants (CNVs), particularly hemizygous deletions in the 1–100 kb range. To fill this gap, we developed a method to identify CNVs in individual genomes, based on comparison to joint profiles pre-computed from a large set of genomes. We analyzed depth of coverage in over 6000 high quality (>40×) genomes. The depth of coverage has strong sequence-specific fluctuations only partially explained by global parameters like %GC. To account for these fluctuations, we constructed multi-genome profiles representing the observed or inferred diploid depth of coverage at each position along the genome. These Reference Coverage Profiles (RCPs) take into account the diverse technologies and pipeline versions used. Normalization of the scaled coverage to the RCP followed by hidden Markov model (HMM) segmentation enables efficient detection of CNVs and large deletions in individual genomes. Use of pre-computed multi-genome coverage profiles improves our ability to analyze each individual genome. We make available RCPs and tools for performing these analyses on personal genomes. We expect the increased sensitivity and specificity for individual genome analysis to be critical for achieving clinical-grade genome interpretation. PMID:25741365
Variations on a theme of Lander and Waterman
DOE Office of Scientific and Technical Information (OSTI.GOV)
Speed, T.
1997-12-01
The original Lander and Waterman mathematical analysis was for fingerprinting random clones. Since that time, a number of variants of their theory have appeared, including ones which apply to mapping by anchoring random clones, and to non-random or directed clone mapping. The same theory is now widely used to devise random sequencing strategies. In this talk I will review these developments, and go on the discuss the theory required for directed sequencing strategies.
Couto, Ana Rita; Parreira, Bruna; Thomson, Russell; Soares, Marta; Power, Deborah M; Stankovich, Jim; Armas, Jácome Bruges; Brown, Matthew A
2017-01-01
Twelve families with exuberant and early-onset calcium pyrophosphate dehydrate chondrocalcinosis (CC) and diffuse idiopathic skeletal hyperostosis (DISH), hereafter designated DISH/CC, were identified in Terceira Island, the Azores, Portugal. Ninety-two (92) individuals from these families were selected for whole-genome-wide linkage analysis. An identity-by-descent (IBD) analysis was performed in 10 individuals from 5 of the investigated pedigrees. The chromosome area with the maximal logarithm of the odds score (1.32; P =0.007) was not identified using the IBD/identity-by-state (IBS) analysis; therefore, it was not investigated further. From the IBD/IBS analysis, two candidate genes, LEMD3 and RSPO4 , were identified and sequenced. Nine genetic variants were identified in the RSPO4 gene; one regulatory variant (rs146447064) was significantly more frequent in control individuals than in DISH/CC patients ( P =0.03). Four variants were identified in LEMD3 , and the rs201930700 variant was further investigated using segregation analysis. None of the genetic variants in RSPO4 or LEMD3 segregated within the studied families. Therefore, although a major genetic effect was shown to determine DISH/CC occurrence within these families, the specific genetic variants involved were not identified.
Couto, Ana Rita; Parreira, Bruna; Thomson, Russell; Soares, Marta; Power, Deborah M; Stankovich, Jim; Armas, Jácome Bruges; Brown, Matthew A
2017-01-01
Twelve families with exuberant and early-onset calcium pyrophosphate dehydrate chondrocalcinosis (CC) and diffuse idiopathic skeletal hyperostosis (DISH), hereafter designated DISH/CC, were identified in Terceira Island, the Azores, Portugal. Ninety-two (92) individuals from these families were selected for whole-genome-wide linkage analysis. An identity-by-descent (IBD) analysis was performed in 10 individuals from 5 of the investigated pedigrees. The chromosome area with the maximal logarithm of the odds score (1.32; P=0.007) was not identified using the IBD/identity-by-state (IBS) analysis; therefore, it was not investigated further. From the IBD/IBS analysis, two candidate genes, LEMD3 and RSPO4, were identified and sequenced. Nine genetic variants were identified in the RSPO4 gene; one regulatory variant (rs146447064) was significantly more frequent in control individuals than in DISH/CC patients (P=0.03). Four variants were identified in LEMD3, and the rs201930700 variant was further investigated using segregation analysis. None of the genetic variants in RSPO4 or LEMD3 segregated within the studied families. Therefore, although a major genetic effect was shown to determine DISH/CC occurrence within these families, the specific genetic variants involved were not identified. PMID:29104755
Valadas, Samantha Y O B; da Silva, Juliana I G; Lopes, Estela Gallucci; Keid, Lara B; Zwarg, Ticiana; de Oliveira, Alice S; Sanches, Thaís C; Joppert, Adriana M; Pena, Hilda F J; Oliveira, Tricia M F S; Ferreira, Helena L; Soares, Rodrigo M
2016-05-01
Although few species of Sarcocystis are known to use marsupials of the genus Didelphis as definitive host, an extensive diversity of alleles of surface antigen genes (sag2, sag3, and sag4) has been described in samples of didelphid opossums in Brazil. In this work, we studied 25 samples of Sarcocystis derived from gastrointestinal tract of opossums of the genus Didelphis by accessing the variability of sag2, sag3, sag4, gene encoding cytochrome b (cytB) and first internal transcribed spacer (ITS1). Reference samples of Sarcocystis neurona (SN138) and Sarcocystis falcatula (SF1) maintained in cell culture were also analyzed. We found four allele variants of cytB, seven allele variants of ITS1, 10 allele variants of sag2, 13 allele variants of sag3, and 6 allele variants of sag4. None of the sporocyst-derived sequences obtained from Brazilian opossums revealed 100% identity to SN138 at cytB gene, nor to SN138 or SF1 at ITS1 locus. In addition, none of the sag alleles were found identical to either SF1 or SN138 homologous sequences, and a high number of new sag allele types were found other than those previously described in Brazil. Out of ten sag2 alleles, four are novel, while eight out of 13 sag3 alleles are novel and one out of six sag4 alleles is novel. Further studies are needed to clarify if such a vast repertoire of allele variants of Sarcocystis is the consequence of re-assortments driven by sexual exchange, in order to form individuals with highly diverse characteristics, such as pathogenicity, host spectrum, among others or if it only represents allele variants of different species with different biological traits. Copyright © 2016 Elsevier Inc. All rights reserved.
López-Díez, Raquel; Rastrojo, Alberto; Villate, Olatz; Aguado, Begoña
2013-01-01
The receptor for advanced glycosylation end products (RAGE) is a multiligand receptor involved in diverse cell signaling pathways. Previous studies show that this gene expresses several splice variants in human, mouse, and dog. Alternative splicing (AS) plays an important role in expanding transcriptomic and proteomic diversity, and it has been related to disease. AS is also one of the main evolutionary mechanisms in mammalian genomes. However, limited information is available regarding the AS of RAGE in a wide context of mammalian tissues. In this study, we examined in detail the different RAGE mRNAs generated by AS from six mammals, including two primates (human and monkey), two artiodactyla (cow and pig), and two rodentia (mouse and rat) in 6–18 different tissues including fetal, adult, and tumor. By nested reverse transcription-polymerase chain reaction (RT-PCR) we identified a high number of splice variants including noncoding transcripts and predicted coding ones with different potential protein modifications affecting mainly the transmembrane and ligand-binding domains that could influence their biological function. However, analysis of RNA-seq data enabled detecting only the most abundant splice variants. More than 80% of the detected RT-PCR variants (87 of 101 transcripts) are novel (different exon/intron structure to the previously described ones), and interestingly, 20–60% of the total transcripts (depending on the species) are noncoding ones that present tissue specificity. Our results suggest that RAGE undergoes extensive AS in mammals, with different expression patterns among adult, fetal, and tumor tissues. Moreover, most splice variants seem to be species specific, especially the noncoding variants, with only two (canonical human Tv1-RAGE, and human N-truncated or Tv10-RAGE) conserved among the six different species. This could indicate a special evolution pattern of this gene at mRNA level. PMID:24273313
Canary: an atomic pipeline for clinical amplicon assays.
Doig, Kenneth D; Ellul, Jason; Fellowes, Andrew; Thompson, Ella R; Ryland, Georgina; Blombery, Piers; Papenfuss, Anthony T; Fox, Stephen B
2017-12-15
High throughput sequencing requires bioinformatics pipelines to process large volumes of data into meaningful variants that can be translated into a clinical report. These pipelines often suffer from a number of shortcomings: they lack robustness and have many components written in multiple languages, each with a variety of resource requirements. Pipeline components must be linked together with a workflow system to achieve the processing of FASTQ files through to a VCF file of variants. Crafting these pipelines requires considerable bioinformatics and IT skills beyond the reach of many clinical laboratories. Here we present Canary, a single program that can be run on a laptop, which takes FASTQ files from amplicon assays through to an annotated VCF file ready for clinical analysis. Canary can be installed and run with a single command using Docker containerization or run as a single JAR file on a wide range of platforms. Although it is a single utility, Canary performs all the functions present in more complex and unwieldy pipelines. All variants identified by Canary are 3' shifted and represented in their most parsimonious form to provide a consistent nomenclature, irrespective of sequencing variation. Further, proximate in-phase variants are represented as a single HGVS 'delins' variant. This allows for correct nomenclature and consequences to be ascribed to complex multi-nucleotide polymorphisms (MNPs), which are otherwise difficult to represent and interpret. Variants can also be annotated with hundreds of attributes sourced from MyVariant.info to give up to date details on pathogenicity, population statistics and in-silico predictors. Canary has been used at the Peter MacCallum Cancer Centre in Melbourne for the last 2 years for the processing of clinical sequencing data. By encapsulating clinical features in a single, easily installed executable, Canary makes sequencing more accessible to all pathology laboratories. Canary is available for download as source or a Docker image at https://github.com/PapenfussLab/Canary under a GPL-3.0 License.
Metzner, Karin J; Scherrer, Alexandra U; von Wyl, Viktor; Böni, Jürg; Yerly, Sabine; Klimkait, Thomas; Aubert, Vincent; Furrer, Hansjakob; Hirsch, Hans H; Vernazza, Pietro L; Cavassini, Matthias; Calmy, Alexandra; Bernasconi, Enos; Weber, Rainer; Günthard, Huldrych F
2014-09-24
The presence of minority nonnucleoside reverse transcriptase inhibitor (NNRTI)-resistant HIV-1 variants prior to antiretroviral therapy (ART) has been linked to virologic failure in treatment-naive patients. We performed a large retrospective study to determine the number of treatment failures that could have been prevented by implementing minority drug-resistant HIV-1 variant analyses in ART-naïve patients in whom no NNRTI resistance mutations were detected by routine resistance testing. Of 1608 patients in the Swiss HIV Cohort Study, who have initiated first-line ART with two nucleoside reverse transcriptase inhibitors (NRTIs) and one NNRTI before July 2008, 519 patients were eligible by means of HIV-1 subtype, viral load and sample availability. Key NNRTI drug resistance mutations K103N and Y181C were measured by allele-specific PCR in 208 of 519 randomly chosen patients. Minority K103N and Y181C drug resistance mutations were detected in five out of 190 (2.6%) and 10 out of 201 (5%) patients, respectively. Focusing on 183 patients for whom virologic success or failure could be examined, virologic failure occurred in seven out of 183 (3.8%) patients; minority K103N and/or Y181C variants were present prior to ART initiation in only two of those patients. The NNRTI-containing, first-line ART was effective in 10 patients with preexisting minority NNRTI-resistant HIV-1 variant. As revealed in settings of case-control studies, minority NNRTI-resistant HIV-1 variants can have an impact on ART. However, the implementation of minority NNRTI-resistant HIV-1 variant analysis in addition to genotypic resistance testing (GRT) cannot be recommended in routine clinical settings. Additional associated risk factors need to be discovered.
Kim, Taehyeung; Park, Ah Yeon; Baek, Younghwa; Cha, Seongwon
2017-01-01
Circulating lipid ratios are considered predictors of cardiovascular risks and metabolic syndrome, which cause coronary heart diseases. One constitutional type of Korean medicine prone to weight accumulation, the Tae-Eum type, predisposes the consumers to metabolic syndrome, hypertension, diabetes mellitus, etc. Here, we aimed to identify genetic variants for lipid ratios using a genome-wide association study (GWAS) and followed replication analysis in Koreans and constitutional subgroups. GWASs in 5,292 individuals of the Korean Genome and Epidemiology Study and replication analyses in 2,567 subjects of the Korea medicine Data Center were performed to identify genetic variants associated with triglyceride (TG) to HDL cholesterol (HDLC), LDL cholesterol (LDLC) to HDLC, and non-HDLC to HDLC ratios. For subgroup analysis, a computer-based constitution analysis tool was used to categorize the constitutional types of the subjects. In the discovery stage, seven variants in four loci, three variants in three loci, and two variants in one locus were associated with the ratios of log-transformed TG:HDLC (log[TG]:HDLC), LDLC:HDLC, and non-HDLC:HDLC, respectively. The associations of the GWAS variants with lipid ratios were replicated in the validation stage: for the log[TG]:HDLC ratio, rs6589566 near APOA5 and rs4244457 and rs6586891 near LPL; for the LDLC:HDLC ratio, rs4420638 near APOC1 and rs17445774 near C2orf47; and for the non-HDLC:HDLC ratio, rs6589566 near APOA5. Five of these six variants are known to be associated with TG, LDLC, and/or HDLC, but rs17445774 was newly identified to be involved in lipid level changes in this study. Constitutional subgroup analysis revealed effects of variants associated with log[TG]:HDLC and non-HDLC:HDLC ratios in both the Tae-Eum and non-Tae-Eum types, whereas the effect of the LDLC:HDLC ratio-associated variants remained only in the Tae-Eum type. In conclusion, we identified three log[TG]:HDLC ratio-associated variants, two LDLC:HDLC ratio-associated variants, and one non-HDLC:HDLC-associated variant in Koreans and the constitutional subgroups.
Kim, Taehyeung; Park, Ah Yeon; Baek, Younghwa
2017-01-01
Circulating lipid ratios are considered predictors of cardiovascular risks and metabolic syndrome, which cause coronary heart diseases. One constitutional type of Korean medicine prone to weight accumulation, the Tae-Eum type, predisposes the consumers to metabolic syndrome, hypertension, diabetes mellitus, etc. Here, we aimed to identify genetic variants for lipid ratios using a genome-wide association study (GWAS) and followed replication analysis in Koreans and constitutional subgroups. GWASs in 5,292 individuals of the Korean Genome and Epidemiology Study and replication analyses in 2,567 subjects of the Korea medicine Data Center were performed to identify genetic variants associated with triglyceride (TG) to HDL cholesterol (HDLC), LDL cholesterol (LDLC) to HDLC, and non-HDLC to HDLC ratios. For subgroup analysis, a computer-based constitution analysis tool was used to categorize the constitutional types of the subjects. In the discovery stage, seven variants in four loci, three variants in three loci, and two variants in one locus were associated with the ratios of log-transformed TG:HDLC (log[TG]:HDLC), LDLC:HDLC, and non-HDLC:HDLC, respectively. The associations of the GWAS variants with lipid ratios were replicated in the validation stage: for the log[TG]:HDLC ratio, rs6589566 near APOA5 and rs4244457 and rs6586891 near LPL; for the LDLC:HDLC ratio, rs4420638 near APOC1 and rs17445774 near C2orf47; and for the non-HDLC:HDLC ratio, rs6589566 near APOA5. Five of these six variants are known to be associated with TG, LDLC, and/or HDLC, but rs17445774 was newly identified to be involved in lipid level changes in this study. Constitutional subgroup analysis revealed effects of variants associated with log[TG]:HDLC and non-HDLC:HDLC ratios in both the Tae-Eum and non-Tae-Eum types, whereas the effect of the LDLC:HDLC ratio-associated variants remained only in the Tae-Eum type. In conclusion, we identified three log[TG]:HDLC ratio-associated variants, two LDLC:HDLC ratio-associated variants, and one non-HDLC:HDLC-associated variant in Koreans and the constitutional subgroups. PMID:28046027
A Protein Domain and Family Based Approach to Rare Variant Association Analysis.
Richardson, Tom G; Shihab, Hashem A; Rivas, Manuel A; McCarthy, Mark I; Campbell, Colin; Timpson, Nicholas J; Gaunt, Tom R
2016-01-01
It has become common practice to analyse large scale sequencing data with statistical approaches based around the aggregation of rare variants within the same gene. We applied a novel approach to rare variant analysis by collapsing variants together using protein domain and family coordinates, regarded to be a more discrete definition of a biologically functional unit. Using Pfam definitions, we collapsed rare variants (Minor Allele Frequency ≤ 1%) together in three different ways 1) variants within single genomic regions which map to individual protein domains 2) variants within two individual protein domain regions which are predicted to be responsible for a protein-protein interaction 3) all variants within combined regions from multiple genes responsible for coding the same protein domain (i.e. protein families). A conventional collapsing analysis using gene coordinates was also undertaken for comparison. We used UK10K sequence data and investigated associations between regions of variants and lipid traits using the sequence kernel association test (SKAT). We observed no strong evidence of association between regions of variants based on Pfam domain definitions and lipid traits. Quantile-Quantile plots illustrated that the overall distributions of p-values from the protein domain analyses were comparable to that of a conventional gene-based approach. Deviations from this distribution suggested that collapsing by either protein domain or gene definitions may be favourable depending on the trait analysed. We have collapsed rare variants together using protein domain and family coordinates to present an alternative approach over collapsing across conventionally used gene-based regions. Although no strong evidence of association was detected in these analyses, future studies may still find value in adopting these approaches to detect previously unidentified association signals.
van der Klift, Heleen M; Jansen, Anne M L; van der Steenstraten, Niki; Bik, Elsa C; Tops, Carli M J; Devilee, Peter; Wijnen, Juul T
2015-01-01
A subset of DNA variants causes genetic disease through aberrant splicing. Experimental splicing assays, either RT-PCR analyses of patient RNA or functional splicing reporter minigene assays, are required to evaluate the molecular nature of the splice defect. Here, we present minigene assays performed for 17 variants in the consensus splice site regions, 14 exonic variants outside these regions, and two deep intronic variants, all in the DNA mismatch-repair (MMR) genes MLH1, MSH2, MSH6, and PMS2, associated with Lynch syndrome. We also included two deep intronic variants in APC and PKD2. For one variant (MLH1 c.122A>G), our minigene assay and patient RNA analysis could not confirm the previously reported aberrant splicing. The aim of our study was to further investigate the concordance between minigene splicing assays and patient RNA analyses. For 30 variants results from patient RNA analyses were available, either performed by our laboratory or presented in literature. Some variants were deliberately included in this study because they resulted in multiple aberrant transcripts in patient RNA analysis, or caused a splice effect other than the prevalent exon skip. While both methods were completely concordant in the assessment of splice effects, four variants exhibited major differences in aberrant splice patterns. Based on the present and earlier studies, together showing an almost 100% concordance of minigene assays with patient RNA analyses, we discuss the weight given to minigene splicing assays in the current criteria proposed by InSiGHT for clinical classification of MMR variants. PMID:26247049
[Structural organization of 5S ribosomal DNA of Rosa rugosa].
Tynkevych, Iu O; Volkov, R A
2014-01-01
In order to clarify molecular organization of the genomic region encoding 5S rRNA in diploid species Rosa rugosa several 5S rDNA repeated units were cloned and sequenced. Analysis of the obtained sequences revealed that only one length variant of 5S rDNA repeated units, which contains intact promoter elements in the intergenic spacer region (IGS) and appears to be transcriptionally active is present in the genome. Additionally, a limited number of 5S rDNA pseudogenes lacking a portion of coding sequence and the complete IGS was detected. A high level of sequence similarity (from 93.7 to 97.5%) between the IGS of major 5S rDNA variants of East Asian R. rugosa and North American R. nitida was found indicating comparatively recent divergence of these species.
de Haas, Sanne; Delmar, Paul; Bansal, Aruna T; Moisse, Matthieu; Miles, David W; Leighl, Natasha; Escudier, Bernard; Van Cutsem, Eric; Carmeliet, Peter; Scherer, Stefan J; Pallaud, Celine; Lambrechts, Diether
2014-10-01
Despite extensive translational research, no validated biomarkers predictive of bevacizumab treatment outcome have been identified. We performed a meta-analysis of individual patient data from six randomized phase III trials in colorectal, pancreatic, lung, renal, breast, and gastric cancer to explore the potential relationships between 195 common genetic variants in the vascular endothelial growth factor (VEGF) pathway and bevacizumab treatment outcome. The analysis included 1,402 patients (716 bevacizumab-treated and 686 placebo-treated). Twenty variants were associated (P < 0.05) with progression-free survival (PFS) in bevacizumab-treated patients. Of these, 4 variants in EPAS1 survived correction for multiple testing (q < 0.05). Genotype-by-treatment interaction tests revealed that, across these 20 variants, 3 variants in VEGF-C (rs12510099), EPAS1 (rs4953344), and IL8RA (rs2234671) were potentially predictive (P < 0.05), but not resistant to multiple testing (q > 0.05). A weak genotype-by-treatment interaction effect was also observed for rs699946 in VEGF-A, whereas Bayesian genewise analysis revealed that genetic variability in VHL was associated with PFS in the bevacizumab arm (q < 0.05). Variants in VEGF-A, EPAS1, and VHL were located in expression quantitative loci derived from lymphoblastoid cell lines, indicating that they affect the expression levels of their respective gene. This large genetic analysis suggests that variants in VEGF-A, EPAS1, IL8RA, VHL, and VEGF-C have potential value in predicting bevacizumab treatment outcome across tumor types. Although these associations did not survive correction for multiple testing in a genotype-by-interaction analysis, they are among the strongest predictive effects reported to date for genetic variants and bevacizumab efficacy.
Screening for rare variants in the PNPLA3 gene in obese liver biopsy patients.
Zegers, Doreen; Verrijken, An; Francque, Sven; de Freitas, Fenna; Beckers, Sigri; Aerts, Evi; Ruppert, Martin; Hubens, Guy; Michielsen, Peter; Van Hul, Wim; Van Gaal, Luc F
2016-12-01
Previous research has clearly implicated the PNPLA3 gene in the etiology of nonalcoholic fatty liver disease as a polymorphism in the gene was found to be robustly associated to the disease. However, data on the involvement of rare PNPLA3 variants in the development of nonalcoholic fatty liver disease (NAFLD) is currently limited. Therefore, we performed an extensive mutation analysis study on a cohort of obese liver biopsy patients to determine PNPLA3 variation and its correlation with fatty liver disease. We screened the entire coding region of the PNPLA3 gene in DNA samples of 393 obese liver biopsy patients with varying degrees of fatty liver disease. Mutation analysis was performed by high-resolution melting curve analysis in combination with direct sequencing. We identified several common polymorphisms as well as one rare synonymous variant (c.867G>A rs139896256), one rare intronic variant (c.979+13C>T) and 3 nonsynonymous coding variants (p.A76T, p.A104V and p.T200M) in the PNPLA3 gene. In silico analysis indicated that the p.A104V variant will probably have no functional effect, whereas for the p.A76T and p.T200M variant a possible pathogenic effect is suggested. Overall, we showed that novel variants in PNPLA3 are very rare in our liver biopsy cohort, thereby indicating that their impact on the etiology of NAFLD is probably limited. Nevertheless, for the three rare coding variants that were identified in patients with advanced liver disease, further functional characterization will be essential to verify their potential disease causality. Copyright © 2016 Elsevier Masson SAS. All rights reserved.
Rea, Matthew; Jiang, Tingting; Eleazer, Rebekah; Eckstein, Meredith; Marshall, Alan G.; Fondufe-Mittendorf, Yvonne N.
2016-01-01
Exposure to inorganic arsenic, a ubiquitous environmental toxic metalloid, leads to carcinogenesis. However, the mechanism is unknown. Several studies have shown that inorganic arsenic exposure alters specific gene expression patterns, possibly through alterations in chromatin structure. While most studies on understanding the mechanism of chromatin-mediated gene regulation have focused on histone post-translational modifications, the role of histone variants remains largely unknown. Incorporation of histone variants alters the functional properties of chromatin. To understand the global dynamics of chromatin structure and function in arsenic-mediated carcinogenesis, analysis of the histone variants incorporated into the nucleosome and their covalent modifications is required. Here we report the first global mass spectrometric analysis of histone H2B variants as cells undergo arsenic-mediated epithelial to mesenchymal transition. We used electron capture dissociation-based top-down tandem mass spectrometry analysis validated with quantitative reverse transcription real-time polymerase chain reaction to identify changes in the expression levels of H2B variants in inorganic arsenic-mediated epithelial-mesenchymal transition. We identified changes in the expression levels of specific histone H2B variants in two cell types, which are dependent on dose and length of exposure of inorganic arsenic. In particular, we found increases in H2B variants H2B1H/1K/1C/1J/1O and H2B2E/2F, and significant decreases in H2B1N/1D/1B as cells undergo inorganic arsenic-mediated epithelial-mesenchymal transition. The analysis of these histone variants provides a first step toward an understanding of the functional significance of the diversity of histone structures, especially in inorganic arsenic-mediated gene expression and carcinogenesis. PMID:27169413
Zeil, Catharina; Widmann, Michael; Fademrecht, Silvia; Vogel, Constantin; Pleiss, Jürgen
2016-05-01
The Lactamase Engineering Database (www.LacED.uni-stuttgart.de) was developed to facilitate the classification and analysis of TEM β-lactamases. The current version contains 474 TEM variants. Two hundred fifty-nine variants form a large scale-free network of highly connected point mutants. The network was divided into three subnetworks which were enriched by single phenotypes: one network with predominantly 2be and two networks with 2br phenotypes. Fifteen positions were found to be highly variable, contributing to the majority of the observed variants. Since it is expected that a considerable fraction of the theoretical sequence space is functional, the currently sequenced 474 variants represent only the tip of the iceberg of functional TEM β-lactamase variants which form a huge natural reservoir of highly interconnected variants. Almost 50% of the variants are part of a quartet. Thus, two single mutations that result in functional enzymes can be combined into a functional protein. Most of these quartets consist of the same phenotype, or the mutations are additive with respect to the phenotype. By predicting quartets from triplets, 3,916 unknown variants were constructed. Eighty-seven variants complement multiple quartets and therefore have a high probability of being functional. The construction of a TEM β-lactamase network and subsequent analyses by clustering and quartet prediction are valuable tools to gain new insights into the viable sequence space of TEM β-lactamases and to predict their phenotype. The highly connected sequence space of TEM β-lactamases is ideally suited to network analysis and demonstrates the strengths of network analysis over tree reconstruction methods. Copyright © 2016, American Society for Microbiology. All Rights Reserved.
Jiang, Yue; Turinsky, Andrei L.; Brudno, Michael
2015-01-01
With the development of High-Throughput Sequencing (HTS) thousands of human genomes have now been sequenced. Whenever different studies analyze the same genome they usually agree on the amount of single-nucleotide polymorphisms, but differ dramatically on the number of insertion and deletion variants (indels). Furthermore, there is evidence that indels are often severely under-reported. In this manuscript we derive the total number of indel variants in a human genome by combining data from different sequencing technologies, while assessing the indel detection accuracy. Our estimate of approximately 1 million indels in a Yoruban genome is much higher than the results reported in several recent HTS studies. We identify two key sources of difficulties in indel detection: the insufficient coverage, read length or alignment quality; and the presence of repeats, including short interspersed elements and homopolymers/dimers. We quantify the effect of these factors on indel detection. The quality of sequencing data plays a major role in improving indel detection by HTS methods. However, many indels exist in long homopolymers and repeats, where their detection is severely impeded. The true number of indel events is likely even higher than our current estimates, and new techniques and technologies will be required to detect them. PMID:26130710
Mingo, Janire; Erramuzpe, Asier; Luna, Sandra; Aurtenetxe, Olaia; Amo, Laura; Diez, Ibai; Schepens, Jan T. G.; Hendriks, Wiljan J. A. J.; Cortés, Jesús M.; Pulido, Rafael
2016-01-01
Site-directed mutagenesis (SDM) is a powerful tool to create defined collections of protein variants for experimental and clinical purposes, but effectiveness is compromised when a large number of mutations is required. We present here a one-tube-only standardized SDM approach that generates comprehensive collections of amino acid substitution variants, including scanning- and single site-multiple mutations. The approach combines unified mutagenic primer design with the mixing of multiple distinct primer pairs and/or plasmid templates to increase the yield of a single inverse-PCR mutagenesis reaction. Also, a user-friendly program for automatic design of standardized primers for Ala-scanning mutagenesis is made available. Experimental results were compared with a modeling approach together with stochastic simulation data. For single site-multiple mutagenesis purposes and for simultaneous mutagenesis in different plasmid backgrounds, combination of primer sets and/or plasmid templates in a single reaction tube yielded the distinct mutations in a stochastic fashion. For scanning mutagenesis, we found that a combination of overlapping primer sets in a single PCR reaction allowed the yield of different individual mutations, although this yield did not necessarily follow a stochastic trend. Double mutants were generated when the overlap of primer pairs was below 60%. Our results illustrate that one-tube-only SDM effectively reduces the number of reactions required in large-scale mutagenesis strategies, facilitating the generation of comprehensive collections of protein variants suitable for functional analysis. PMID:27548698
Linkage Disequilibrium and Inversion-Typing of the Drosophila melanogaster Genome Reference Panel
Houle, David; Márquez, Eladio J.
2015-01-01
We calculated the linkage disequilibrium between all pairs of variants in the Drosophila Genome Reference Panel with minor allele count ≥5. We used r2 ≥ 0.5 as the cutoff for a highly correlated SNP. We make available the list of all highly correlated SNPs for use in association studies. Seventy-six percent of variant SNPs are highly correlated with at least one other SNP, and the mean number of highly correlated SNPs per variant over the whole genome is 83.9. Disequilibrium between distant SNPs is also common when minor allele frequency (MAF) is low: 37% of SNPs with MAF < 0.1 are highly correlated with SNPs more than 100 kb distant. Although SNPs within regions with polymorphic inversions are highly correlated with somewhat larger numbers of SNPs, and these correlated SNPs are on average farther away, the probability that a SNP in such regions is highly correlated with at least one other SNP is very similar to SNPs outside inversions. Previous karyotyping of the DGRP lines has been inconsistent, and we used LD and genotype to investigate these discrepancies. When previous studies agreed on inversion karyotype, our analysis was almost perfectly concordant with those assignments. In discordant cases, and for inversion heterozygotes, our results suggest errors in two previous analyses or discordance between genotype and karyotype. Heterozygosities of chromosome arms are, in many cases, surprisingly highly correlated, suggesting strong epsistatic selection during the inbreeding and maintenance of the DGRP lines. PMID:26068573
Linkage Disequilibrium and Inversion-Typing of the Drosophila melanogaster Genome Reference Panel.
Houle, David; Márquez, Eladio J
2015-06-10
We calculated the linkage disequilibrium between all pairs of variants in the Drosophila Genome Reference Panel with minor allele count ≥5. We used r(2) ≥ 0.5 as the cutoff for a highly correlated SNP. We make available the list of all highly correlated SNPs for use in association studies. Seventy-six percent of variant SNPs are highly correlated with at least one other SNP, and the mean number of highly correlated SNPs per variant over the whole genome is 83.9. Disequilibrium between distant SNPs is also common when minor allele frequency (MAF) is low: 37% of SNPs with MAF < 0.1 are highly correlated with SNPs more than 100 kb distant. Although SNPs within regions with polymorphic inversions are highly correlated with somewhat larger numbers of SNPs, and these correlated SNPs are on average farther away, the probability that a SNP in such regions is highly correlated with at least one other SNP is very similar to SNPs outside inversions. Previous karyotyping of the DGRP lines has been inconsistent, and we used LD and genotype to investigate these discrepancies. When previous studies agreed on inversion karyotype, our analysis was almost perfectly concordant with those assignments. In discordant cases, and for inversion heterozygotes, our results suggest errors in two previous analyses or discordance between genotype and karyotype. Heterozygosities of chromosome arms are, in many cases, surprisingly highly correlated, suggesting strong epsistatic selection during the inbreeding and maintenance of the DGRP lines. Copyright © 2015 Houle and Márquez.
Gao, Zhiyong; Liu, Baiwei; Huo, Da; Yan, Hanqiu; Jia, Lei; Du, Yiwei; Qian, Haikun; Yang, Yang; Wang, Xiaoli; Li, Jie; Wang, Quanyi
2015-12-18
Norovirus (NoV) is a leading cause of sporadic cases and outbreaks of acute gastroenteritis (AGE). Increased NoV activity was observed in Beijing, China during winter 2014-2015; therefore, we examined the epidemiological patterns and genetic characteristics of NoV in the sporadic cases and outbreaks. The weekly number of infectious diarrhea cases reported by all hospitals in Beijing was analyzed through the China information system for disease control and prevention. Fecal specimens were collected from the outbreaks and outpatients with AGE, and GI and GII NoVs were detected using real time reverse transcription polymerase chain reaction. The partial capsid genes and RNA-dependent RNA polymerase (RdRp) genes of NoV were both amplified and sequenced, and genotyping and phylogenetic analyses were performed. Between December 2014 and March 2015, the number of infectious diarrhea cases in Beijing (10,626 cases) increased by 35.6% over that of the previous year (7835 cases), and the detection rate of NoV (29.8%, 191/640) among outpatients with AGE was significantly higher than in the previous year (12.9%, 79/613) (χ(2) = 53.252, P < 0.001). Between November 2014 and March 2015, 35 outbreaks of AGE were reported in Beijing, and NoVs were detected in 33 outbreaks, all of which belonged to the GII genogroup. NoVs were sequenced and genotyped in 22 outbreaks, among which 20 were caused by a novel GII.17 strain. Among outpatients with AGE, this novel GII.17 strain was first detected in an outpatient in August 2014, and it replaced GII.4 Sydney_2012 as the predominant variant between December 2014 and March 2015. A phylogenetic analysis of the capsid genes and RdRp genes revealed that this novel GII.17 strain was distinct from previously identified GII variants, and it was recently designated as GII.P17_GII.17. This variant was further clustered into two sub-groups, named GII.17_2012 and GII.17_2014. During winter 2014-2015, GII.17_2014 caused the majority of AGE outbreaks in China and Japan. During winter 2014-2015, a novel NoV GII.17 variant replaced the GII.4 variant Sydney 2012 as the predominant strain in Beijing, China and caused increased NoV activity.
Song, Dandan; Li, Ning; Liao, Lejian
2015-01-01
Due to the generation of enormous amounts of data at both lower costs as well as in shorter times, whole-exome sequencing technologies provide dramatic opportunities for identifying disease genes implicated in Mendelian disorders. Since upwards of thousands genomic variants can be sequenced in each exome, it is challenging to filter pathogenic variants in protein coding regions and reduce the number of missing true variants. Therefore, an automatic and efficient pipeline for finding disease variants in Mendelian disorders is designed by exploiting a combination of variants filtering steps to analyze the family-based exome sequencing approach. Recent studies on the Freeman-Sheldon disease are revisited and show that the proposed method outperforms other existing candidate gene identification methods.
Zintzaras, Elias; Doxani, Chrysoula; Rodopoulou, Paraskevi; Bakalos, Georgios; Ziogas, Dimitris C; Ziakas, Panayiotis; Voulgarelis, Michael
2012-04-01
Acute lymphoblastic leukemia (ALL) is a complex disease with genetic background. The genetic association studies (GAS) that investigated the association between ALL and the MTHFR C677T and A1298C gene variants have produced contradictory or inconclusive results. In order to decrease the uncertainty of estimated genetic risk effects, a meticulous meta-analysis of published GAS related the variants in the MTFHR gene with susceptibility to ALL was conducted. The risk effects were estimated based on the odds ratio (OR) of the allele contrast and the generalized odds ratio (OR(G)). Cumulative and recursive cumulative meta-analyses were also performed. The analysis showed marginal significant association for the C677T variant, overall [OR=0.91 (0.82-1.00) and OR(G)=0.89 (0.79-1.01)], and in Whites [OR=0.88 (0.77-0.99) and OR(G)=0.85 (0.73-0.99)]. The A1298C variant produced non-significant results. For both variants, the cumulative meta-analysis did not show a trend of association as evidence accumulates and the recursive cumulative meta-analysis indicated lack of sufficient evidence for denying or claiming an association. The current evidence is not sufficient to draw definite conclusions regarding the association of MTHFR variants and development of ALL. Copyright © 2011 Elsevier Ltd. All rights reserved.
USDA-ARS?s Scientific Manuscript database
To facilitate further evaluation of pheromone biosynthesis activating neuropeptide receptor (PBANR) functionality and regulation, we generated cultured insect cell lines stably expressing a number of fluorescent Bombyx mori PBANR (BommoPBANR) and Pseudaletia separata PBANR (PsesePBANR) variants incl...
RAPTR-SV: a hybrid method for the detection of structural variants
USDA-ARS?s Scientific Manuscript database
Motivation: Identification of Structural Variants (SV) in sequence data results in a large number of false positive calls using existing software, which overburdens subsequent validation. Results: Simulations using RAPTR-SV and another software package that uses a similar algorithm for SV detection...
Shibuta, K; Abe, M; Suzuki, T
1994-01-01
The K variant of human butyrylcholinesterase is caused by a G/A transition in the butyrylcholinesterase gene, which neither creates nor destroys any restriction site. In an attempt to detect the K variant both simply and rapidly, we developed a two step method of "PCR primer introduced restriction analysis" (PCR-PIRA). The first step was used to introduce a new Fun4HI site into the normal allele for a screening test, while the second step was performed to create a new MaeIII site on the variant allele for a specific test. This method thus enabled us to distinguish clearly the K variant from the normal allele, and also showed that the frequency of the K variant allele is 0.164 in the Japanese population. Images PMID:7966197
BlackOPs: increasing confidence in variant detection through mappability filtering.
Cabanski, Christopher R; Wilkerson, Matthew D; Soloway, Matthew; Parker, Joel S; Liu, Jinze; Prins, Jan F; Marron, J S; Perou, Charles M; Hayes, D Neil
2013-10-01
Identifying variants using high-throughput sequencing data is currently a challenge because true biological variants can be indistinguishable from technical artifacts. One source of technical artifact results from incorrectly aligning experimentally observed sequences to their true genomic origin ('mismapping') and inferring differences in mismapped sequences to be true variants. We developed BlackOPs, an open-source tool that simulates experimental RNA-seq and DNA whole exome sequences derived from the reference genome, aligns these sequences by custom parameters, detects variants and outputs a blacklist of positions and alleles caused by mismapping. Blacklists contain thousands of artifact variants that are indistinguishable from true variants and, for a given sample, are expected to be almost completely false positives. We show that these blacklist positions are specific to the alignment algorithm and read length used, and BlackOPs allows users to generate a blacklist specific to their experimental setup. We queried the dbSNP and COSMIC variant databases and found numerous variants indistinguishable from mapping errors. We demonstrate how filtering against blacklist positions reduces the number of potential false variants using an RNA-seq glioblastoma cell line data set. In summary, accounting for mapping-caused variants tuned to experimental setups reduces false positives and, therefore, improves genome characterization by high-throughput sequencing.
The Genetic Landscape of Renal Complications in Type 1 Diabetes
Sandholm, Niina; Van Zuydam, Natalie; Ahlqvist, Emma; Juliusdottir, Thorhildur; Deshmukh, Harshal A.; Rayner, N. William; Di Camillo, Barbara; Forsblom, Carol; Fadista, Joao; Ziemek, Daniel; Salem, Rany M.; Hiraki, Linda T.; Pezzolesi, Marcus; Trégouët, David; Dahlström, Emma; Valo, Erkka; Oskolkov, Nikolay; Ladenvall, Claes; Marcovecchio, M. Loredana; Cooper, Jason; Sambo, Francesco; Malovini, Alberto; Manfrini, Marco; McKnight, Amy Jayne; Lajer, Maria; Harjutsalo, Valma; Gordin, Daniel; Parkkonen, Maija; Lyssenko, Valeriya; McKeigue, Paul M.; Rich, Stephen S.; Brosnan, Mary Julia; Fauman, Eric; Bellazzi, Riccardo; Rossing, Peter; Hadjadj, Samy; Krolewski, Andrzej; Paterson, Andrew D.; Hirschhorn, Joel N.; Maxwell, Alexander P.; Cobelli, Claudio; Colhoun, Helen M.; Groop, Leif; McCarthy, Mark I.
2017-01-01
Diabetes is the leading cause of ESRD. Despite evidence for a substantial heritability of diabetic kidney disease, efforts to identify genetic susceptibility variants have had limited success. We extended previous efforts in three dimensions, examining a more comprehensive set of genetic variants in larger numbers of subjects with type 1 diabetes characterized for a wider range of cross-sectional diabetic kidney disease phenotypes. In 2843 subjects, we estimated that the heritability of diabetic kidney disease was 35% (P=6.4×10−3). Genome-wide association analysis and replication in 12,540 individuals identified no single variants reaching stringent levels of significance and, despite excellent power, provided little independent confirmation of previously published associated variants. Whole-exome sequencing in 997 subjects failed to identify any large-effect coding alleles of lower frequency influencing the risk of diabetic kidney disease. However, sets of alleles increasing body mass index (P=2.2×10−5) and the risk of type 2 diabetes (P=6.1×10−4) associated with the risk of diabetic kidney disease. We also found genome-wide genetic correlation between diabetic kidney disease and failure at smoking cessation (P=1.1×10−4). Pathway analysis implicated ascorbate and aldarate metabolism (P=9.0×10−6), and pentose and glucuronate interconversions (P=3.0×10−6) in pathogenesis of diabetic kidney disease. These data provide further evidence for the role of genetic factors influencing diabetic kidney disease in those with type 1 diabetes and highlight some key pathways that may be responsible. Altogether these results reveal important biology behind the major cause of kidney disease. PMID:27647854
Kennedy, Amy E; Kamdar, Kala Y; Lupo, Philip J; Okcu, M Fatih; Scheurer, Michael E; Baum, Marianna K; Dorak, M Tevfik
2014-09-01
Hereditary hemochromatosis (HFE) variants correlating with body iron levels have shown associations with cancer risk, including childhood acute lymphoblastic leukemia (ALL). Using a multi-ethnic sample of cases and controls from Houston, TX, we examined two HFE variants (rs1800562 and rs1799945), one transferrin receptor gene (TFRC) variant (rs3817672) and three additional iron regulatory gene (IRG) variants (SLC11A2 rs422982; TMPRSS6 rs855791 and rs733655) for their associations with childhood ALL. Being positive for either of the HFE variants yielded a modestly elevated odds ratio (OR) for childhood ALL risk in males (1.40, 95% CI=0.83-2.35), which increased to 2.96 (95% CI=1.29-6.80) in the presence of a particular TFRC genotype for rs3817672 (P interaction=0.04). The TFRC genotype also showed an ethnicity-specific association, with increased risk observed in non-Hispanic Whites (OR=2.54, 95% CI=1.05-6.12; P interaction with ethnicity=0.02). The three additional IRG SNPs all showed individual risk associations with childhood ALL in males (OR=1.52-2.60). A polygenic model based on the number of variant alleles in five IRG SNPs revealed a linear increase in risk among males with the increasing number of variants possessed (OR=2.0 per incremental change, 95% CI=1.29-3.12; P=0.002). Our results replicated previous HFE risk associations with childhood ALL in a US population and demonstrated novel associations for IRG SNPs, thereby strengthening the hypothesis that iron excess mediated by genetic variants contributes to childhood ALL risk. Copyright © 2014 Elsevier Ltd. All rights reserved.
2009-01-01
Background Array genomic hybridization is being used clinically to detect pathogenic copy number variants in children with intellectual disability and other birth defects. However, there is no agreement regarding the kind of array, the distribution of probes across the genome, or the resolution that is most appropriate for clinical use. Results We performed 500 K Affymetrix GeneChip® array genomic hybridization in 100 idiopathic intellectual disability trios, each comprised of a child with intellectual disability of unknown cause and both unaffected parents. We found pathogenic genomic imbalance in 16 of these 100 individuals with idiopathic intellectual disability. In comparison, we had found pathogenic genomic imbalance in 11 of 100 children with idiopathic intellectual disability in a previous cohort who had been studied by 100 K GeneChip® array genomic hybridization. Among 54 intellectual disability trios selected from the previous cohort who were re-tested with 500 K GeneChip® array genomic hybridization, we identified all 10 previously-detected pathogenic genomic alterations and at least one additional pathogenic copy number variant that had not been detected with 100 K GeneChip® array genomic hybridization. Many benign copy number variants, including one that was de novo, were also detected with 500 K array genomic hybridization, but it was possible to distinguish the benign and pathogenic copy number variants with confidence in all but 3 (1.9%) of the 154 intellectual disability trios studied. Conclusion Affymetrix GeneChip® 500 K array genomic hybridization detected pathogenic genomic imbalance in 10 of 10 patients with idiopathic developmental disability in whom 100 K GeneChip® array genomic hybridization had found genomic imbalance, 1 of 44 patients in whom 100 K GeneChip® array genomic hybridization had found no abnormality, and 16 of 100 patients who had not previously been tested. Effective clinical interpretation of these studies requires considerable skill and experience. PMID:19917086
Mercatanti, Alberto; Lodovichi, Samuele; Cervelli, Tiziana; Galli, Alvaro
2017-12-01
Evaluation of the functional impact of cancer-associated missense variants is more difficult than for protein-truncating mutations and consequently standard guidelines for the interpretation of sequence variants have been recently proposed. A number of algorithms and software products were developed to predict the impact of cancer-associated missense mutations on protein structure and function. Importantly, direct assessment of the variants using high-throughput functional assays using simple genetic systems can help in speeding up the functional evaluation of newly identified cancer-associated variants. We developed the web tool CRIMEtoYHU (CTY) to help geneticists in the evaluation of the functional impact of cancer-associated missense variants. Humans and the yeast Saccharomyces cerevisiae share thousands of protein-coding genes although they have diverged for a billion years. Therefore, yeast humanization can be helpful in deciphering the functional consequences of human genetic variants found in cancer and give information on the pathogenicity of missense variants. To humanize specific positions within yeast genes, human and yeast genes have to share functional homology. If a mutation in a specific residue is associated with a particular phenotype in humans, a similar substitution in the yeast counterpart may reveal its effect at the organism level. CTY simultaneously finds yeast homologous genes, identifies the corresponding variants and determines the transferability of human variants to yeast counterparts by assigning a reliability score (RS) that may be predictive for the validity of a functional assay. CTY analyzes newly identified mutations or retrieves mutations reported in the COSMIC database, provides information about the functional conservation between yeast and human and shows the mutation distribution in human genes. CTY analyzes also newly found mutations and aborts when no yeast homologue is found. Then, on the basis of the protein domain localization and functional conservation between yeast and human, the selected variants are ranked by the RS. The RS is assigned by an algorithm that computes functional data, type of mutation, chemistry of amino acid substitution and the degree of mutation transferability between human and yeast protein. Mutations giving a positive RS are highly transferable to yeast and, therefore, yeast functional assays will be more predictable. To validate the web application, we have analyzed 8078 cancer-associated variants located in 31 genes that have a yeast homologue. More than 50% of variants are transferable to yeast. Incidentally, 88% of all transferable mutations have a reliability score >0. Moreover, we analyzed by CTY 72 functionally validated missense variants located in yeast genes at positions corresponding to the human cancer-associated variants. All these variants gave a positive RS. To further validate CTY, we analyzed 3949 protein variants (with positive RS) by the predictive algorithm PROVEAN. This analysis shows that yeast-based functional assays will be more predictable for the variants with positive RS. We believe that CTY could be an important resource for the cancer research community by providing information concerning the functional impact of specific mutations, as well as for the design of functional assays useful for decision support in precision medicine. © FEMS 2017. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Miller, Marcus J.; Burrage, Lindsay C.; Gibson, James B.; Strenk, Meghan E.; Lose, Edward J.; Bick, David P.; Elsea, Sarah H.; Sutton, V. Reid; Sun, Qin; Graham, Brett H.; Craigen, William J.; Zhang, Victor Wei; Wong, Lee-Jun C.
2016-01-01
Very long chain acyl-coA dehydrogenase deficiency (VLCADD) is an autosomal recessive inborn error of fatty acid oxidation detected by newborn screening (NBS). Follow-up molecular analyses are often required to clarify VLCADD-suggestive NBS results, but to date the outcome of these studies are not well described for the general screen-positive population. In the following study, we report the molecular findings for 693 unrelated patients that sequentially received Sanger sequence analysis of ACADVL as a result of a positive NBS for VLCADD. Highlighting the variable molecular underpinnings of this disorder, we identified 94 different pathogenic ACADVL variants (40 novel), as well as 134 variants of unknown clinical significance (VUSs). Evidence for the pathogenicity of a subset of recurrent VUSs was provided using multiple in silico analyses. Surprisingly, the most frequent finding in our cohort was carrier status, 57% all individuals had a single pathogenic variant or VUS. This result was further supported by follow-up array and/or acylcarnitine analysis that failed to provide evidence of a second pathogenic allele. Notably, exon-targeted array analysis of 131 individuals screen positive for VLCADD failed to identify copy number changes in ACADVL thus suggesting this test has a low yield in the setting of NBS follow-up. While no genotype was common, the c.848T>C (p.V283A) pathogenic variant was clearly the most frequent; at least one copy was found in ∼10% of all individuals with a positive NBS. Clinical and biochemical data for seven unrelated patients homozygous for the p.V283A allele suggests that it results in a mild phenotype that responds well to standard treatment, but hypoglycemia can occur. Collectively, our data illustrate the molecular heterogeneity of VLCADD and provide novel insight into the outcomes of NBS for this disorder. PMID:26385305
Statistical method to compare massive parallel sequencing pipelines.
Elsensohn, M H; Leblay, N; Dimassi, S; Campan-Fournier, A; Labalme, A; Roucher-Boulez, F; Sanlaville, D; Lesca, G; Bardel, C; Roy, P
2017-03-01
Today, sequencing is frequently carried out by Massive Parallel Sequencing (MPS) that cuts drastically sequencing time and expenses. Nevertheless, Sanger sequencing remains the main validation method to confirm the presence of variants. The analysis of MPS data involves the development of several bioinformatic tools, academic or commercial. We present here a statistical method to compare MPS pipelines and test it in a comparison between an academic (BWA-GATK) and a commercial pipeline (TMAP-NextGENe®), with and without reference to a gold standard (here, Sanger sequencing), on a panel of 41 genes in 43 epileptic patients. This method used the number of variants to fit log-linear models for pairwise agreements between pipelines. To assess the heterogeneity of the margins and the odds ratios of agreement, four log-linear models were used: a full model, a homogeneous-margin model, a model with single odds ratio for all patients, and a model with single intercept. Then a log-linear mixed model was fitted considering the biological variability as a random effect. Among the 390,339 base-pairs sequenced, TMAP-NextGENe® and BWA-GATK found, on average, 2253.49 and 1857.14 variants (single nucleotide variants and indels), respectively. Against the gold standard, the pipelines had similar sensitivities (63.47% vs. 63.42%) and close but significantly different specificities (99.57% vs. 99.65%; p < 0.001). Same-trend results were obtained when only single nucleotide variants were considered (99.98% specificity and 76.81% sensitivity for both pipelines). The method allows thus pipeline comparison and selection. It is generalizable to all types of MPS data and all pipelines.
Hughes, Paul; Deng, Wenjie; Olson, Scott C; Coombs, Robert W; Chung, Michael H; Frenkel, Lisa M
2016-03-01
Accurate analysis of minor populations of drug-resistant HIV requires analysis of a sufficient number of viral templates. We assessed the effect of experimental conditions on the analysis of HIV pol 454 pyrosequences generated from plasma using (1) the "Insertion-deletion (indel) and Carry Forward Correction" (ICC) pipeline, which clusters sequence reads using a nonsubstitution approach and can correct for indels and carry forward errors, and (2) the "Primer Identification (ID)" method, which facilitates construction of a consensus sequence to correct for sequencing errors and allelic skewing. The Primer ID and ICC methods produced similar estimates of viral diversity, but differed in the number of sequence variants generated. Sequence preparation for ICC was comparably simple, but was limited by an inability to assess the number of templates analyzed and allelic skewing. The more costly Primer ID method corrected for allelic skewing and provided the number of viral templates analyzed, which revealed that amplifiable HIV templates varied across specimens and did not correlate with clinical viral load. This latter observation highlights the value of the Primer ID method, which by determining the number of templates amplified, enables more accurate assessment of minority species in the virus population, which may be relevant to prescribing effective antiretroviral therapy.
Tebel, Katrin; Boldt, Vivien; Steininger, Anne; Port, Matthias; Ebert, Grit; Ullmann, Reinhard
2017-01-06
The analysis of DNA copy number variants (CNV) has increasing impact in the field of genetic diagnostics and research. However, the interpretation of CNV data derived from high resolution array CGH or NGS platforms is complicated by the considerable variability of the human genome. Therefore, tools for multidimensional data analysis and comparison of patient cohorts are needed to assist in the discrimination of clinically relevant CNVs from others. We developed GenomeCAT, a standalone Java application for the analysis and integrative visualization of CNVs. GenomeCAT is composed of three modules dedicated to the inspection of single cases, comparative analysis of multidimensional data and group comparisons aiming at the identification of recurrent aberrations in patients sharing the same phenotype, respectively. Its flexible import options ease the comparative analysis of own results derived from microarray or NGS platforms with data from literature or public depositories. Multidimensional data obtained from different experiment types can be merged into a common data matrix to enable common visualization and analysis. All results are stored in the integrated MySQL database, but can also be exported as tab delimited files for further statistical calculations in external programs. GenomeCAT offers a broad spectrum of visualization and analysis tools that assist in the evaluation of CNVs in the context of other experiment data and annotations. The use of GenomeCAT does not require any specialized computer skills. The various R packages implemented for data analysis are fully integrated into GenomeCATs graphical user interface and the installation process is supported by a wizard. The flexibility in terms of data import and export in combination with the ability to create a common data matrix makes the program also well suited as an interface between genomic data from heterogeneous sources and external software tools. Due to the modular architecture the functionality of GenomeCAT can be easily extended by further R packages or customized plug-ins to meet future requirements.
A Glimpse into the Satellite DNA Library in Characidae Fish (Teleostei, Characiformes)
Utsunomia, Ricardo; Ruiz-Ruano, Francisco J.; Silva, Duílio M. Z. A.; Serrano, Érica A.; Rosa, Ivana F.; Scudeler, Patrícia E. S.; Hashimoto, Diogo T.; Oliveira, Claudio; Camacho, Juan Pedro M.; Foresti, Fausto
2017-01-01
Satellite DNA (satDNA) is an abundant fraction of repetitive DNA in eukaryotic genomes and plays an important role in genome organization and evolution. In general, satDNA sequences follow a concerted evolutionary pattern through the intragenomic homogenization of different repeat units. In addition, the satDNA library hypothesis predicts that related species share a series of satDNA variants descended from a common ancestor species, with differential amplification of different satDNA variants. The finding of a same satDNA family in species belonging to different genera within Characidae fish provided the opportunity to test both concerted evolution and library hypotheses. For this purpose, we analyzed here sequence variation and abundance of this satDNA family in ten species, by a combination of next generation sequencing (NGS), PCR and Sanger sequencing, and fluorescence in situ hybridization (FISH). We found extensive between-species variation for the number and size of pericentromeric FISH signals. At genomic level, the analysis of 1000s of DNA sequences obtained by Illumina sequencing and PCR amplification allowed defining 150 haplotypes which were linked in a common minimum spanning tree, where different patterns of concerted evolution were apparent. This also provided a glimpse into the satDNA library of this group of species. In consistency with the library hypothesis, different variants for this satDNA showed high differences in abundance between species, from highly abundant to simply relictual variants. PMID:28855916
Distribution of Bartonella henselae Variants in Patients, Reservoir Hosts and Vectors in Spain
Gil, Horacio; Escudero, Raquel; Pons, Inmaculada; Rodríguez-Vargas, Manuela; García-Esteban, Coral; Rodríguez-Moreno, Isabel; García-Amil, Cristina; Lobo, Bruno; Valcárcel, Félix; Pérez, Azucena; Jiménez, Santos; Jado, Isabel; Juste, Ramón; Segura, Ferrán; Anda, Pedro
2013-01-01
We have studied the diversity of B. henselae circulating in patients, reservoir hosts and vectors in Spain. In total, we have fully characterized 53 clinical samples from 46 patients, as well as 78 B. henselae isolates obtained from 35 cats from La Rioja and Catalonia (northeastern Spain), four positive cat blood samples from which no isolates were obtained, and three positive fleas by Multiple Locus Sequence Typing and Multiple Locus Variable Number Tandem Repeats Analysis. This study represents the largest series of human cases characterized with these methods, with 10 different sequence types and 41 MLVA profiles. Two of the sequence types and 35 of the profiles were not described previously. Most of the B. henselae variants belonged to ST5. Also, we have identified a common profile (72) which is well distributed in Spain and was found to persist over time. Indeed, this profile seems to be the origin from which most of the variants identified in this study have been generated. In addition, ST5, ST6 and ST9 were found associated with felines, whereas ST1, ST5 and ST8 were the most frequent sequence types found infecting humans. Interestingly, some of the feline associated variants never found on patients were located in a separate clade, which could represent a group of strains less pathogenic for humans. PMID:23874563
Comprehensive genomic analysis of patients with disorders of cerebral cortical development.
Wiszniewski, Wojciech; Gawlinski, Pawel; Gambin, Tomasz; Bekiesinska-Figatowska, Monika; Obersztyn, Ewa; Antczak-Marach, Dorota; Akdemir, Zeynep Hande Coban; Harel, Tamar; Karaca, Ender; Jurek, Marta; Sobecka, Katarzyna; Nowakowska, Beata; Kruk, Malgorzata; Terczynska, Iwona; Goszczanska-Ciuchta, Alicja; Rudzka-Dybala, Mariola; Jamroz, Ewa; Pyrkosz, Antoni; Jakubiuk-Tomaszuk, Anna; Iwanowski, Piotr; Gieruszczak-Bialek, Dorota; Piotrowicz, Malgorzata; Sasiadek, Maria; Kochanowska, Iwona; Gurda, Barbara; Steinborn, Barbara; Dawidziuk, Mateusz; Castaneda, Jennifer; Wlasienko, Pawel; Bezniakow, Natalia; Jhangiani, Shalini N; Hoffman-Zacharska, Dorota; Bal, Jerzy; Szczepanik, Elzbieta; Boerwinkle, Eric; Gibbs, Richard A; Lupski, James R
2018-04-30
Malformations of cortical development (MCDs) manifest with structural brain anomalies that lead to neurologic sequelae, including epilepsy, cerebral palsy, developmental delay, and intellectual disability. To investigate the underlying genetic architecture of patients with disorders of cerebral cortical development, a cohort of 54 patients demonstrating neuroradiologic signs of MCDs was investigated. Individual genomes were interrogated for single-nucleotide variants (SNV) and copy number variants (CNV) with whole-exome sequencing and chromosomal microarray studies. Variation affecting known MCDs-associated genes was found in 16/54 cases, including 11 patients with SNV, 2 patients with CNV, and 3 patients with both CNV and SNV, at distinct loci. Diagnostic pathogenic SNV and potentially damaging variants of unknown significance (VUS) were identified in two groups of seven individuals each. We demonstrated that de novo variants are important among patients with MCDs as they were identified in 10/16 individuals with a molecular diagnosis. Three patients showed changes in known MCDs genes and a clinical phenotype beyond the usual characteristics observed, i.e., phenotypic expansion, for a particular known disease gene clinical entity. We also discovered 2 likely candidate genes, CDH4, and ASTN1, with human and animal studies supporting their roles in brain development, and 5 potential candidate genes. Our findings emphasize genetic heterogeneity of MCDs disorders and postulate potential novel candidate genes involved in cerebral cortical development.
Loconsole, Giuliana; Onelge, Nuket; Yokomi, Raymond K; Kubaa, Raied Abou; Savino, Vito; Saponari, Maria
2013-01-01
The RNA genome of pathogenic and non-pathogenic variants of citrus Hop stunt viroid (HSVd) differ by five to six nucleotides located within the variable (V) domain referred to as the "cachexia expression motif". Sensitive hosts such as mandarin and its hybrids are seriously affected by cachexia disease. Current methods to differentiate HSVd variants rely on lengthy greenhouse biological indexing on Parson's Special mandarin and/or direct nucleotide sequence analysis of amplicons from RT-PCR of HSVd-infected plants. Two independent high throughput assays to segregate HSVd variants by real-time RT-PCR and High-Resolution Melting Temperature (HRM) analysis were developed: one based on EVAGreen dye; the other based on TaqMan probes. Primers for both assays targeted three differentiating nucleotides in the V domain which separated HSVd variants into three clusters by distinct melting temperatures with a confidence level higher than 98%. The accuracy of the HRM assays were validated by nucleotide sequencing of representative samples within each HRM cluster and by testing 45 HSVd-infected field trees from California, Italy, Spain, Syria and Turkey. To our knowledge, this is the first report of a rapid and sensitive approach to detect and differentiate HSVd variants associated with different biological behaviors. Although, HSVd is found in several crops including citrus, cachexia variants are restricted to some citrus-growing areas, particularly the Mediterranean Region. Rapid diagnosis for cachexia and non-cachexia variants is, thus, important for the management of HSVd in citrus and reduces the need for bioindexing and sequencing analysis. Copyright © 2013 Elsevier Ltd. All rights reserved.
Smith, Paul M.; Elson, Joanna L.; Greaves, Laura C.; Wortmann, Saskia B.; Rodenburg, Richard J.T.; Lightowlers, Robert N.; Chrzanowska-Lightowlers, Zofia M.A.; Taylor, Robert W.; Vila-Sanjurjo, Antón
2014-01-01
Mutations of mitochondrial DNA are linked to many human diseases. Despite the identification of a large number of variants in the mitochondrially encoded rRNA (mt-rRNA) genes, the evidence supporting their pathogenicity is, at best, circumstantial. Establishing the pathogenicity of these variations is of major diagnostic importance. Here, we aim to estimate the disruptive effect of mt-rRNA variations on the function of the mitochondrial ribosome. In the absence of direct biochemical methods to study the effect of mt-rRNA variations, we relied on the universal conservation of the rRNA fold to infer their disruptive potential. Our method, named heterologous inferential analysis or HIA, combines conservational information with functional and structural data obtained from heterologous ribosomal sources. Thus, HIA's predictive power is superior to the traditional reliance on simple conservation indexes. By using HIA, we have been able to evaluate the disruptive potential for a subset of uncharacterized 12S mt-rRNA variations. Our analysis revealed the existence of variations in the rRNA component of the human mitoribosome with different degrees of disruptive power. In cases where sufficient information regarding the genetic and pathological manifestation of the mitochondrial phenotype is available, HIA data can be used to predict the pathogenicity of mt-rRNA mutations. In other cases, HIA analysis will allow the prioritization of variants for additional investigation. Eventually, HIA-inspired analysis of potentially pathogenic mt-rRNA variations, in the context of a scoring system specifically designed for these variants, could lead to a powerful diagnostic tool. PMID:24092330
Genetic Characterization of Circulating African Swine Fever Viruses in Nigeria (2007-2015).
Luka, P D; Achenbach, J E; Mwiine, F N; Lamien, C E; Shamaki, D; Unger, H; Erume, J
2017-10-01
Sequencing and analysis of three discrete genome regions of African swine fever viruses (ASFV) from archival samples collected in 2007-2011 and active and passive surveillance between 2012 and 2015 in Nigeria were carried out. Analysis was conducted by genotyping of three single-copy African swine fever (ASF) genes. The E183L and B646L genes that encode structural proteins p54 and p72, respectively, were utilized to delineate genotypes before intragenotypic resolution by characterization of the tetrameric amino acid repeat region within the hypervariable central variable region of the B602L gene. The results showed no variation in the p72 and p54 gene regions sequenced. Phylogeny of p72 sequences revealed that all the Nigerian isolates belonged to genotype I, while that of the p54 recovered the Ia genotype. Analysis of B602L gene revealed the differences in the number of tetrameric repeats. Four new variants (Tet-15, Tet-17a, Tet-17b and Tet-48) were recovered, while a fifth variant (Tet-20) was the most widely distributed in the country displacing Tet-36 reported previously in 2003-2006. The viruses responsible for ASF outbreaks in Nigeria are from very closely related but mutated variants of the virus that have been circulating since 1997. A practical implication of the genetic variability of the Nigerian viral isolates in this study is the need for continuous sampling and analysis of circulating viruses, which will provide epidemiological information on the evolution of ASFV in the field versus new incursion for informed strategic control of the disease in the country. © 2016 Blackwell Verlag GmbH.
Hesse, Andrew N; Bevilacqua, Jennifer; Shankar, Kritika; Reddi, Honey V
2018-05-16
Epilepsy is a diverse neurological condition with extreme genetic and phenotypic heterogeneity. The introduction of next-generation sequencing into the clinical laboratory has made it possible to investigate hundreds of associated genes simultaneously for a patient, even in the absence of a clearly defined syndrome. This has resulted in the detection of rare and novel mutations at a rate well beyond our ability to characterize their effects. This retrospective study reviews genotype data in the context of available phenotypic information on 305 patients spanning the epileptic spectrum to identify established and novel patterns of correlation. Our epilepsy panel comprising 377 genes was used to sequence 305 patients referred for genetic testing. Qualifying variants were annotated with phenotypic data obtained from either the test requisition form or supporting clinical documentation. Observed phenotypes were compared with established phenotypes in OMIM, published literature and the ILAEs 2010 report on genetic testing to assess congruity with known gene aberrations. We identified a number of novel and recognized genetic variants consistent with established epileptic phenotypes. Forty-one pathogenic or predicted deleterious variants were detected in 39 patients with accompanying clinical documentation. Twenty-five of these variants across 15 genes were novel. Furthermore, evaluation of phenotype data for 194 patients with variants of unknown significance in genes with autosomal dominant and X-linked disease inheritance elucidated potentially disease-causing variants that were not currently characterized in the literature. Assessment of key genotype-phenotype correlations from our cohort provide insight into variant classification, as well as the importance of including ILAE recommended genes as part of minimum panel content for comprehensive epilepsy tests. Many of the reported VUSs are likely genuine pathogenic variants driving the observed phenotypes, but not enough evidence is available for assertive classifications. Similar studies will provide more utility via mounting independent genotype-phenotype data from unrelated patients. The possible outcome would be a better molecular diagnostic product, with fewer indeterminate reports containing only VUSs. Copyright © 2018. Published by Elsevier B.V.
Palmer, Nicholette D; Musani, Solomon K; Yerges-Armstrong, Laura M; Feitosa, Mary F; Bielak, Lawrence F; Hernaez, Ruben; Kahali, Bratati; Carr, J Jeffrey; Harris, Tamara B; Jhun, Min A; Kardia, Sharon LR; Langefeld, Carl D; Mosley, Thomas H; Norris, Jill M; Smith, Albert V; Taylor, Herman A; Wagenknecht, Lynne E; Liu, Jiankang; Borecki, Ingrid B; Peyser, Patricia A; Speliotes, Elizabeth K
2013-01-01
Nonalcoholic Fatty Liver Disease (NAFLD) is an obesity-related condition affecting over 50% of individuals in some populations and is expected to become the number one cause of liver disease worldwide by 2020. Common, robustly associated genetic variants in/near five genes were identified for hepatic steatosis, a quantifiable component of NAFLD, in European-ancestry individuals. Here we tested whether these variants were associated with hepatic steatosis in African and/or Hispanic Americans and fine-mapped the observed association signals. We measured hepatic steatosis using computed tomography in five African-American (n=3124) and one Hispanic-American (n=849) cohorts. All analyses controlled for variation in age, age2, gender, alcoholic drinks, and population substructure. Heritability of hepatic steatosis was estimated in three cohorts. Variants in/near PNPLA3, NCAN, LYPLAL1, GCKR, and PPP1R3B were tested for association with hepatic steatosis using a regression framework in each cohort and meta-analyzed. Fine-mapping across African-American cohorts was conducted using meta-analysis. African- and Hispanic-American cohorts were 33.9/37.5% male, with average age of 58.6/42.6 years and body mass index of 31.8/28.9kg/m2, respectively. Hepatic steatosis was 0.20–0.34 heritable in African-and Hispanic-American families (p<0.02 in each cohort). Variants in or near PNPLA3, NCAN, GCKR, PPP1R3B in African Americans and PNPLA3 and PPP1R3B in Hispanic Americans were significantly associated with hepatic steatosis; however, allele frequency and effect size varied across ancestries. Fine-mapping in African Americans highlighted missense variants at PNPLA3 and GCKR and redefined the association region at LYPLAL1. Conclusions We show for the first time that multiple genetic variants are associated with hepatic steatosis across ancestries and explain a substantial proportion of the genetic predisposition in African and Hispanic Americans. Missense variants in PNPLA3 and GCKR are likely functional across multiple ancestries. PMID:23564467
Palmer, Nicholette D; Musani, Solomon K; Yerges-Armstrong, Laura M; Feitosa, Mary F; Bielak, Lawrence F; Hernaez, Ruben; Kahali, Bratati; Carr, J Jeffrey; Harris, Tamara B; Jhun, Min A; Kardia, Sharon L R; Langefeld, Carl D; Mosley, Thomas H; Norris, Jill M; Smith, Albert V; Taylor, Herman A; Wagenknecht, Lynne E; Liu, Jiankang; Borecki, Ingrid B; Peyser, Patricia A; Speliotes, Elizabeth K
2013-09-01
Nonalcoholic fatty liver disease (NAFLD) is an obesity-related condition affecting over 50% of individuals in some populations and is expected to become the number one cause of liver disease worldwide by 2020. Common, robustly associated genetic variants in/near five genes were identified for hepatic steatosis, a quantifiable component of NAFLD, in European ancestry individuals. Here we tested whether these variants were associated with hepatic steatosis in African- and/or Hispanic-Americans and fine-mapped the observed association signals. We measured hepatic steatosis using computed tomography in five African American (n = 3,124) and one Hispanic American (n = 849) cohorts. All analyses controlled for variation in age, age(2) , gender, alcoholic drinks, and population substructure. Heritability of hepatic steatosis was estimated in three cohorts. Variants in/near PNPLA3, NCAN, LYPLAL1, GCKR, and PPP1R3B were tested for association with hepatic steatosis using a regression framework in each cohort and meta-analyzed. Fine-mapping across African American cohorts was conducted using meta-analysis. African- and Hispanic-American cohorts were 33.9/37.5% male, with average age of 58.6/42.6 years and body mass index of 31.8/28.9 kg/m(2) , respectively. Hepatic steatosis was 0.20-0.34 heritable in African- and Hispanic-American families (P < 0.02 in each cohort). Variants in or near PNPLA3, NCAN, GCKR, PPP1R3B in African Americans and PNPLA3 and PPP1R3B in Hispanic Americans were significantly associated with hepatic steatosis; however, allele frequency and effect size varied across ancestries. Fine-mapping in African Americans highlighted missense variants at PNPLA3 and GCKR and redefined the association region at LYPLAL1. Multiple genetic variants are associated with hepatic steatosis across ancestries. This explains a substantial proportion of the genetic predisposition in African- and Hispanic-Americans. Missense variants in PNPLA3 and GCKR are likely functional across multiple ancestries. © 2013 by the American Association for the Study of Liver Diseases.
Whole-Exome Sequencing Study of Thyrotropin-Secreting Pituitary Adenomas.
Sapkota, Santosh; Horiguchi, Kazuhiko; Tosaka, Masahiko; Yamada, Syozo; Yamada, Masanobu
2017-02-01
Thyrotropin (TSH)-secreting pituitary adenomas (TSHomas) are a rare cause of hyperthyroidism, and the genetic aberrations responsible remain unknown. To identify somatic genetic abnormalities in TSHomas. A single-nucleotide polymorphism (SNP) array analysis was performed on 8 TSHomas. Four tumors with no allelic losses or limited loss of heterozygosity were selected, and whole-exome sequencing was performed, including their corresponding blood samples. Somatic variants were confirmed by Sanger sequencing. A set of 8 tumors was also assessed to validate candidate genes. Twelve patients with sporadic TSHomas were examined. The overall performance of whole-exome sequencing was good, with an average coverage of each base in the targeted region of 97.6%. Six DNA variants were confirmed as candidate driver mutations, with an average of 1.5 somatic mutations per tumor. No mutations were recurrent. Two of these mutations were found in genes with an established role in malignant tumorigenesis (SMOX and SYTL3), and 4 had unknown roles (ZSCAN23, ASTN2, R3HDM2, and CWH43). Similarly, an SNP array analysis revealed frequent chromosomal regions of copy number gains, including recurrent gains at loci harboring 4 of these 6 genes. Several candidate somatic mutations and changes in copy numbers for TSHomas were identified. The results showed no recurrence of mutations in the tumors studied but a low number of mutations, thereby highlighting their benign nature. Further studies on a larger cohort of TSHomas, along with the use of epigenetic and transcriptomic approaches, may reveal the underlying genetic lesions. Copyright © 2017 by the Endocrine Society
Role of 2 common variants of 5HT2A gene in medication overuse headache.
Terrazzino, Salvatore; Sances, Grazia; Balsamo, Francesca; Viana, Michele; Monaco, Francesco; Bellomo, Giorgio; Martignoni, Emilia; Tassorelli, Cristina; Nappi, Giuseppe; Canonico, Pier Luigi; Genazzani, Armando A
2010-11-01
The aim of the present study was to evaluate a possible involvement of 2 polymorphisms of the serotonin 5HT2A receptor gene (A-1438G and C516T) as risk factors for medication overuse headache (MOH) and whether the presence of these polymorphic variants might determine differences within MOH patients in monthly drug consumption. Despite a growing scientific interest in the mechanisms underlying the pathophysiology of MOH, few studies have focused on the role of genetics in the development of the disease, as well as on the genetic determinants of the inter-individual variability in the number of drug doses taken per month. Our study was performed by polymerase chain reaction (PCR) and PCR-restriction fragment length polymorphism on genomic DNA extracted from peripheral blood of 227 MOH patients and 312 control subjects. Genotype-specific risks were estimated as odds ratios with associated 95% confidence intervals by unconditional logistic regression and adjusted for age and gender. A stepwise multiple linear regression analysis was employed to identify significant predictors of the number of drug doses taken per month. No significant association was found between 5HT2A A and 1438G and C516T gene polymorphisms and MOH risk. In contrast, a higher consumption of monthly drug doses was observed among 516T 5HT2A carriers (median 50, range 13-120) compared to 516CC patients (median 30, range 12-128) (Mann-Whitney U-test, P = .018). In the stepwise multiple regression analysis, C516T 5HT2A polymorphism (P = .018) and class of overused drug (P = .047) emerged as significant, independent predictors of the monthly drug consumption in MOH patients. Although our results do not support a major role of the A-1438G and C516T polymorphic variants of the 5HT2A gene in the susceptibility of MOH, our findings support an influence of the C516T polymorphism on the number of symptomatic drug doses taken and, possibly, on the drug-seeking behavior in these patients. © 2010 American Headache Society.
Skums, Pavel; Campo, David S; Dimitrova, Zoya; Vaughan, Gilberto; Lau, Daryl T; Khudyakov, Yury
Hepatitis C virus (HCV) is a major cause of liver disease world-wide. Current interferon and ribavirin (IFN/RBV) therapy is effective in 50%-60% of patients. HCV exists in infected patients as a large viral population of intra-host variants (quasispecies), which may be differentially resistant to interferon treatment. We present a method for measuring differential interferon resistance of HCV quasispecies based on mathematical modeling and analysis of HCV population dynamics during the first hours of interferon therapy. The mathematical models showed that individual intra-host HCV variants have a wide range of resistance to IFN treatment in each patient. Analysis of differential IFN resistance among intra-host HCV variants allows for accurate prediction of response to IFN therapy. The models strongly suggest that resistance to interferon may vary broadly among closely related variants in infected hosts and therapy outcome may be defined by a single or a few variants irrespective of their frequency in the intra-host HCV population before treatment.
Wang, Longfei; Lee, Sungyoung; Gim, Jungsoo; Qiao, Dandi; Cho, Michael; Elston, Robert C; Silverman, Edwin K; Won, Sungho
2016-09-01
Family-based designs have been repeatedly shown to be powerful in detecting the significant rare variants associated with human diseases. Furthermore, human diseases are often defined by the outcomes of multiple phenotypes, and thus we expect multivariate family-based analyses may be very efficient in detecting associations with rare variants. However, few statistical methods implementing this strategy have been developed for family-based designs. In this report, we describe one such implementation: the multivariate family-based rare variant association tool (mFARVAT). mFARVAT is a quasi-likelihood-based score test for rare variant association analysis with multiple phenotypes, and tests both homogeneous and heterogeneous effects of each variant on multiple phenotypes. Simulation results show that the proposed method is generally robust and efficient for various disease models, and we identify some promising candidate genes associated with chronic obstructive pulmonary disease. The software of mFARVAT is freely available at http://healthstat.snu.ac.kr/software/mfarvat/, implemented in C++ and supported on Linux and MS Windows. © 2016 WILEY PERIODICALS, INC.
Stability of a jet in confined pressure-driven biphasic flows at low reynolds numbers.
Guillot, Pierre; Colin, Annie; Utada, Andrew S; Ajdari, Armand
2007-09-07
Motivated by its importance for microfluidic applications, we study the stability of jets formed by pressure-driven concentric biphasic flows in cylindrical capillaries. The specificity of this variant of the classical Rayleigh-Plateau instability is the role of the geometry which imposes confinement and Poiseuille flow profiles. We experimentally evidence a transition between situations where the flow takes the form of a jet and regimes where drops are produced. We describe this as the transition from convective to absolute instability, within a simple linear analysis using lubrication theory for flows at low Reynolds number, and reach remarkable agreement with the data.
Cusick, Matthew F; Libbey, Jane E; Cox Gill, Joan; Fujinami, Robert S; Eckels, David D
2013-01-01
Aim To determine whether modulation of T-cell responses by naturally occurring viral variants caused an increase in numbers of Tregs in HCV-infected patients. Patients, materials & methods Human peripheral blood mononuclear cells, having proliferative responses to a wild-type HCV-specific CD4+ T-cell epitope, were used to quantify, via proliferative assays, flow cytometry and class II tetramers, the effects of naturally occurring viral variants arising in the immunodominant epitope. Results In combination, the wild-type and variant peptides led to enhanced suppression of an anti-HCV T-cell response. The variant had a lower avidity for the wild-type-specific CD4+ T cell. Variant-stimulated CD4+ T cells had increased Foxp3, compared with wild-type-stimulated cells. Conclusion A stable viral variant from a chronic HCV subject was able to induce Tregs in multiple individuals that responded to the wild-type HCV-specific CD4+ T-cell epitope. PMID:24421862
Proposal for the nomenclature of human plasminogen (PLG) polymorphism.
Skoda, U; Bertrams, J; Dykes, D; Eiberg, H; Hobart, M; Hummel, K; Kühnl, P; Mauff, G; Nakamura, S; Nishimukai, H
1986-01-01
Since its discovery, human plasminogen (PLG) polymorphism has received widespread acceptance in population genetics and forensic haematology. Due to the large number of variant alleles described, a PLG reference typing and Plasminogen Symposium was held, at which a nomenclature proposal was inaugurated. The technology of comparing PLG variants was based on isoelectric focusing and subsequent detection by caseinolytic overlay and 'Western' blotting. Typing results permitted comparison of so far described variant designations and resulted in a new nomenclature proposal for PLG polymorphism. It is recommended that the two most common alleles found in all investigated races be called: PLG*A (previously also PLG*1) and PLG*B (previously also PLG*2), the known variants with acidic pI: PLG*A1 to *A3, intermediate variants: PLG*M1 to *M5, PLG*M5 being functionally inactive, and basic variants: PLG*B1 to *B3. For future classification of newly discovered variants, samples should be compared at any of the laboratories participating in the reference typing.
Nomenclature for alleles of the thiopurine methyltransferase gene
Appell, Malin L.; Berg, Jonathan; Duley, John; Evans, William E.; Kennedy, Martin A.; Lennard, Lynne; Marinaki, Tony; McLeod, Howard L.; Relling, Mary V.; Schaeffeler, Elke; Schwab, Matthias; Weinshilboum, Richard; Yeoh, Allen E.J.; McDonagh, Ellen M.; Hebert, Joan M.; Klein, Teri E.; Coulthard, Sally A.
2013-01-01
The drug-metabolizing enzyme thiopurine methyltransferase (TPMT) has become one of the best examples of pharmacogenomics to be translated into routine clinical practice. TPMT metabolizes the thiopurines 6-mercaptopurine, 6-thioguanine, and azathioprine, drugs that are widely used for treatment of acute leukemias, inflammatory bowel diseases, and other disorders of immune regulation. Since the discovery of genetic polymorphisms in the TPMT gene, many sequence variants that cause a decreased enzyme activity have been identified and characterized. Increasingly, to optimize dose, pretreatment determination of TPMT status before commencing thiopurine therapy is now routine in many countries. Novel TPMT sequence variants are currently numbered sequentially using PubMed as a source of information; however, this has caused some problems as exemplified by two instances in which authors’ articles appeared on PubMed at the same time, resulting in the same allele numbers given to different polymorphisms. Hence, there is an urgent need to establish an order and consensus to the numbering of known and novel TPMT sequence variants. To address this problem, a TPMT nomenclature committee was formed in 2010, to define the nomenclature and numbering of novel variants for the TPMT gene. A website (http://www.imh.liu.se/tpmtalleles) serves as a platform for this work. Researchers are encouraged to submit novel TPMT alleles to the committee for designation and reservation of unique allele numbers. The committee has decided to renumber two alleles: nucleotide position 106 (G > A) from TPMT*24 to TPMT*30 and position 611 (T > C, rs79901429) from TPMT*28 to TPMT*31. Nomenclature for all other known alleles remains unchanged. PMID:23407052
Linkage of osteoporosis to chromosome 20p12 and association to BMP2.
Styrkarsdottir, Unnur; Cazier, Jean-Baptiste; Kong, Augustine; Rolfsson, Ottar; Larsen, Helene; Bjarnadottir, Emma; Johannsdottir, Vala D; Sigurdardottir, Margret S; Bagger, Yu; Christiansen, Claus; Reynisdottir, Inga; Grant, Struan F A; Jonasson, Kristjan; Frigge, Michael L; Gulcher, Jeffrey R; Sigurdsson, Gunnar; Stefansson, Kari
2003-12-01
Osteoporotic fractures are a major cause of morbidity and mortality in ageing populations. Osteoporosis, defined as low bone mineral density (BMD) and associated fractures, have significant genetic components that are largely unknown. Linkage analysis in a large number of extended osteoporosis families in Iceland, using a phenotype that combines osteoporotic fractures and BMD measurements, showed linkage to Chromosome 20p12.3 (multipoint allele-sharing LOD, 5.10; p value, 6.3 x 10(-7)), results that are statistically significant after adjusting for the number of phenotypes tested and the genome-wide search. A follow-up association analysis using closely spaced polymorphic markers was performed. Three variants in the bone morphogenetic protein 2 (BMP2) gene, a missense polymorphism and two anonymous single nucleotide polymorphism haplotypes, were determined to be associated with osteoporosis in the Icelandic patients. The association is seen with many definitions of an osteoporotic phenotype, including osteoporotic fractures as well as low BMD, both before and after menopause. A replication study with a Danish cohort of postmenopausal women was conducted to confirm the contribution of the three identified variants. In conclusion, we find that a region on the short arm of Chromosome 20 contains a gene or genes that appear to be a major risk factor for osteoporosis and osteoporotic fractures, and our evidence supports the view that BMP2 is at least one of these genes.
Stark, Zornitza; Dashnow, Harriet; Lunke, Sebastian; Tan, Tiong Y; Yeung, Alison; Sadedin, Simon; Thorne, Natalie; Macciocca, Ivan; Gaff, Clara; Oshlack, Alicia; White, Susan M; James, Paul A
2017-11-01
Rapid identification of clinically significant variants is key to the successful application of next generation sequencing technologies in clinical practice. The Melbourne Genomics Health Alliance (MGHA) variant prioritization framework employs a gene prioritization index based on clinician-generated a priori gene lists, and a variant prioritization index (VPI) based on rarity, conservation and protein effect. We used data from 80 patients who underwent singleton whole exome sequencing (WES) to test the ability of the framework to rank causative variants highly, and compared it against the performance of other gene and variant prioritization tools. Causative variants were identified in 59 of the patients. Using the MGHA prioritization framework the average rank of the causative variant was 2.24, with 76% ranked as the top priority variant, and 90% ranked within the top five. Using clinician-generated gene lists resulted in ranking causative variants an average of 8.2 positions higher than prioritization based on variant properties alone. This clinically driven prioritization approach significantly outperformed purely computational tools, placing a greater proportion of causative variants top or in the top 5 (permutation P-value=0.001). Clinicians included 40 of the 49 WES diagnoses in their a priori list of differential diagnoses (81%). The lists generated by PhenoTips and Phenomizer contained 14 (29%) and 18 (37%) of these diagnoses respectively. These results highlight the benefits of clinically led variant prioritization in increasing the efficiency of singleton WES data analysis and have important implications for developing models for the funding and delivery of genomic services.
How to interpret a healthcare economic analysis.
Brown, Melissa M; Brown, Gary C
2005-06-01
The purpose of the review is to present guidelines to help the clinician to interpret healthcare economic analyses and review pertinent recent analysis in the ophthalmic literature. There are four variants of healthcare economic analyses: (1) cost-minimization analysis; (2) cost-benefit analysis; (3) cost-effectiveness analysis and (4) cost-utility analysis. Cost-utility utility analysis has assumed an increasingly important role in healthcare, with increasing number of analyses occurring in the peer-reviewed ophthalmic literature. These include cost-utility analyses of cataract surgery in the first and second eyes, amblyopia treatment, and cost-utility analyses encompassing the vitreoretinal interventions of the following: (1) laser photocoagulation for exudative macular degeneration; (2) laser treatment for diabetic retinopathy; (3) laser photocoagulation for branch retinal vein obstruction; (4) diabetic vitrectomy; (5) treatment of proliferative retinopathy of prematurity and (6) treatment of retinal detachment associated with proliferative vitreoretinopathy. As an increasing number of cost-utility analyses become available they will provide the information system for the practice of value-based medicine, or medicine based upon the patient-perceived value conferred by interventions. Increasing numbers of cost-utility analysis in the ophthalmic literature suggest that ophthalmic interventions, including vitreoretinal interventions, are cost effective. Cost-utility analysis is a major tool in value-based medicine, the practice of medicine based upon the patient-perceived value conferred by healthcare interventions.
Factors influencing success of clinical genome sequencing across a broad spectrum of disorders
Lise, Stefano; Broxholme, John; Cazier, Jean-Baptiste; Rimmer, Andy; Kanapin, Alexander; Lunter, Gerton; Fiddy, Simon; Allan, Chris; Aricescu, A. Radu; Attar, Moustafa; Babbs, Christian; Becq, Jennifer; Beeson, David; Bento, Celeste; Bignell, Patricia; Blair, Edward; Buckle, Veronica J; Bull, Katherine; Cais, Ondrej; Cario, Holger; Chapel, Helen; Copley, Richard R; Cornall, Richard; Craft, Jude; Dahan, Karin; Davenport, Emma E; Dendrou, Calliope; Devuyst, Olivier; Fenwick, Aimée L; Flint, Jonathan; Fugger, Lars; Gilbert, Rodney D; Goriely, Anne; Green, Angie; Greger, Ingo H.; Grocock, Russell; Gruszczyk, Anja V; Hastings, Robert; Hatton, Edouard; Higgs, Doug; Hill, Adrian; Holmes, Chris; Howard, Malcolm; Hughes, Linda; Humburg, Peter; Johnson, David; Karpe, Fredrik; Kingsbury, Zoya; Kini, Usha; Knight, Julian C; Krohn, Jonathan; Lamble, Sarah; Langman, Craig; Lonie, Lorne; Luck, Joshua; McCarthy, Davis; McGowan, Simon J; McMullin, Mary Frances; Miller, Kerry A; Murray, Lisa; Németh, Andrea H; Nesbit, M Andrew; Nutt, David; Ormondroyd, Elizabeth; Oturai, Annette Bang; Pagnamenta, Alistair; Patel, Smita Y; Percy, Melanie; Petousi, Nayia; Piazza, Paolo; Piret, Sian E; Polanco-Echeverry, Guadalupe; Popitsch, Niko; Powrie, Fiona; Pugh, Chris; Quek, Lynn; Robbins, Peter A; Robson, Kathryn; Russo, Alexandra; Sahgal, Natasha; van Schouwenburg, Pauline A; Schuh, Anna; Silverman, Earl; Simmons, Alison; Sørensen, Per Soelberg; Sweeney, Elizabeth; Taylor, John; Thakker, Rajesh V; Tomlinson, Ian; Trebes, Amy; Twigg, Stephen RF; Uhlig, Holm H; Vyas, Paresh; Vyse, Tim; Wall, Steven A; Watkins, Hugh; Whyte, Michael P; Witty, Lorna; Wright, Ben; Yau, Chris; Buck, David; Humphray, Sean; Ratcliffe, Peter J; Bell, John I; Wilkie, Andrew OM; Bentley, David; Donnelly, Peter; McVean, Gilean
2015-01-01
To assess factors influencing the success of whole genome sequencing for mainstream clinical diagnosis, we sequenced 217 individuals from 156 independent cases across a broad spectrum of disorders in whom prior screening had identified no pathogenic variants. We quantified the number of candidate variants identified using different strategies for variant calling, filtering, annotation and prioritisation. We found that jointly calling variants across samples, filtering against both local and external databases, deploying multiple annotation tools and using familial transmission above biological plausibility contributed to accuracy. Overall, we identified disease causing variants in 21% of cases, rising to 34% (23/68) for Mendelian disorders and 57% (8/14) in trios. We also discovered 32 potentially clinically actionable variants in 18 genes unrelated to the referral disorder, though only four were ultimately considered reportable. Our results demonstrate the value of genome sequencing for routine clinical diagnosis, but also highlight many outstanding challenges. PMID:25985138
Validation of a next-generation sequencing assay for clinical molecular oncology.
Cottrell, Catherine E; Al-Kateb, Hussam; Bredemeyer, Andrew J; Duncavage, Eric J; Spencer, David H; Abel, Haley J; Lockwood, Christina M; Hagemann, Ian S; O'Guin, Stephanie M; Burcea, Lauren C; Sawyer, Christopher S; Oschwald, Dayna M; Stratman, Jennifer L; Sher, Dorie A; Johnson, Mark R; Brown, Justin T; Cliften, Paul F; George, Bijoy; McIntosh, Leslie D; Shrivastava, Savita; Nguyen, Tudung T; Payton, Jacqueline E; Watson, Mark A; Crosby, Seth D; Head, Richard D; Mitra, Robi D; Nagarajan, Rakesh; Kulkarni, Shashikant; Seibert, Karen; Virgin, Herbert W; Milbrandt, Jeffrey; Pfeifer, John D
2014-01-01
Currently, oncology testing includes molecular studies and cytogenetic analysis to detect genetic aberrations of clinical significance. Next-generation sequencing (NGS) allows rapid analysis of multiple genes for clinically actionable somatic variants. The WUCaMP assay uses targeted capture for NGS analysis of 25 cancer-associated genes to detect mutations at actionable loci. We present clinical validation of the assay and a detailed framework for design and validation of similar clinical assays. Deep sequencing of 78 tumor specimens (≥ 1000× average unique coverage across the capture region) achieved high sensitivity for detecting somatic variants at low allele fraction (AF). Validation revealed sensitivities and specificities of 100% for detection of single-nucleotide variants (SNVs) within coding regions, compared with SNP array sequence data (95% CI = 83.4-100.0 for sensitivity and 94.2-100.0 for specificity) or whole-genome sequencing (95% CI = 89.1-100.0 for sensitivity and 99.9-100.0 for specificity) of HapMap samples. Sensitivity for detecting variants at an observed 10% AF was 100% (95% CI = 93.2-100.0) in HapMap mixes. Analysis of 15 masked specimens harboring clinically reported variants yielded concordant calls for 13/13 variants at AF of ≥ 15%. The WUCaMP assay is a robust and sensitive method to detect somatic variants of clinical significance in molecular oncology laboratories, with reduced time and cost of genetic analysis allowing for strategic patient management. Copyright © 2014 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.
Rabies surveillance in the United States during 2009.
Blanton, Jesse D; Palmer, Dustyn; Rupprecht, Charles E
2010-09-15
During 2009, 49 states and Puerto Rico reported 6,690 rabid animals and 4 human rabies cases to the CDC, representing a 2.2% decrease from the 6,841 rabid animals and 2 human cases reported in 2008. Approximately 92% of reported rabid animals were wildlife. Relative contributions by the major animal groups were as follows: 2,327 (34.8%) raccoons, 1,625 (24.3%) bats, 1,603 (24.0%) skunks, 504 (75%) foxes, 300 (4.5%) cats, 81 (1.2%) dogs, and 74 (1.1%) cattle. Compared with 2008, numbers of rabid raccoons and bats that were reported decreased, whereas numbers of rabid skunks, foxes, cats, cattle, dogs, and horses that were reported increased. Fewer rabid raccoons, compared with 2008, were reported by 12 of the 20 eastern states where raccoon rabies is enzootic, and number of rabid raccoons decreased by 2.6% overall nationally. Despite a 10% decrease in the number of rabid bats that were reported and a decrease in the total number of bats submitted for testing, bats were the second most commonly submitted animal, behind cats, during 2009. The number of rabid skunks that were reported increased by 0.9% overall. The proportion of rabid skunks in which infection was attributed to the raccoon rabies virus variant decreased from 473% in 2008 to 40.9% in 2009, resulting in a 12.7% increase in the number of rabid skunks infected with a skunk rabies virus variant. The number of rabid foxes increased 11.0% overall from the previous year. Four cases of rabies involving humans were reported from Texas, Indiana, Virginia, and Michigan. The Texas case represented the first presumptive abortive human rabies case, with the patient recovering after the onset of symptoms without intensive care. The Indiana and Michigan cases were associated with bat rabies virus variants. The human rabies case in Virginia was associated with a canine rabies virus variant acquired during the patient's travel to India.
Genome-wide association study of age at menarche in African-American women
Demerath, Ellen W.; Liu, Ching-Ti; Franceschini, Nora; Chen, Gary; Palmer, Julie R.; Smith, Erin N.; Chen, Christina T.L.; Ambrosone, Christine B.; Arnold, Alice M.; Bandera, Elisa V.; Berenson, Gerald S.; Bernstein, Leslie; Britton, Angela; Cappola, Anne R.; Carlson, Christopher S.; Chanock, Stephen J.; Chen, Wei; Chen, Zhao; Deming, Sandra L.; Elks, Cathy E.; Evans, Michelle K.; Gajdos, Zofia; Henderson, Brian E.; Hu, Jennifer J.; Ingles, Sue; John, Esther M.; Kerr, Kathleen F.; Kolonel, Laurence N.; Le Marchand, Loic; Lu, Xiaoning; Millikan, Robert C.; Musani, Solomon K.; Nock, Nora L.; North, Kari; Nyante, Sarah; Press, Michael F.; Rodriquez-Gil, Jorge L.; Ruiz-Narvaez, Edward A.; Schork, Nicholas J.; Srinivasan, Sathanur R.; Woods, Nancy F.; Zheng, Wei; Ziegler, Regina G.; Zonderman, Alan; Heiss, Gerardo; Gwen Windham, B.; Wellons, Melissa; Murray, Sarah S.; Nalls, Michael; Pastinen, Tomi; Rajkovic, Aleksandar; Hirschhorn, Joel; Adrienne Cupples, L.; Kooperberg, Charles; Murabito, Joanne M.; Haiman, Christopher A.
2013-01-01
African-American (AA) women have earlier menarche on average than women of European ancestry (EA), and earlier menarche is a risk factor for obesity and type 2 diabetes among other chronic diseases. Identification of common genetic variants associated with age at menarche has a potential value in pointing to the genetic pathways underlying chronic disease risk, yet comprehensive genome-wide studies of age at menarche are lacking for AA women. In this study, we tested the genome-wide association of self-reported age at menarche with common single-nucleotide polymorphisms (SNPs) in a total of 18 089 AA women in 15 studies using an additive genetic linear regression model, adjusting for year of birth and population stratification, followed by inverse-variance weighted meta-analysis (Stage 1). Top meta-analysis results were then tested in an independent sample of 2850 women (Stage 2). First, while no SNP passed the pre-specified P < 5 × 10−8 threshold for significance in Stage 1, suggestive associations were found for variants near FLRT2 and PIK3R1, and conditional analysis identified two independent SNPs (rs339978 and rs980000) in or near RORA, strengthening the support for this suggestive locus identified in EA women. Secondly, an investigation of SNPs in 42 previously identified menarche loci in EA women demonstrated that 25 (60%) of them contained variants significantly associated with menarche in AA women. The findings provide the first evidence of cross-ethnic generalization of menarche loci identified to date, and suggest a number of novel biological links to menarche timing in AA women. PMID:23599027
Welderufael, B G; Løvendahl, Peter; de Koning, Dirk-Jan; Janss, Lucas L G; Fikse, W F
2018-01-01
Because mastitis is very frequent and unavoidable, adding recovery information into the analysis for genetic evaluation of mastitis is of great interest from economical and animal welfare point of view. Here we have performed genome-wide association studies (GWAS) to identify associated single nucleotide polymorphisms (SNPs) and investigate the genetic background not only for susceptibility to - but also for recoverability from mastitis. Somatic cell count records from 993 Danish Holstein cows genotyped for a total of 39378 autosomal SNP markers were used for the association analysis. Single SNP regression analysis was performed using the statistical software package DMU. Substitution effect of each SNP was tested with a t -test and a genome-wide significance level of P -value < 10 -4 was used to declare significant SNP-trait association. A number of significant SNP variants were identified for both traits. Many of the SNP variants associated either with susceptibility to - or recoverability from mastitis were located in or very near to genes that have been reported for their role in the immune system. Genes involved in lymphocyte developments (e.g., MAST3 and STAB2 ) and genes involved in macrophage recruitment and regulation of inflammations ( PDGFD and PTX3 ) were suggested as possible causal genes for susceptibility to - and recoverability from mastitis, respectively. However, this is the first GWAS study for recoverability from mastitis and our results need to be validated. The findings in the current study are, therefore, a starting point for further investigations in identifying causal genetic variants or chromosomal regions for both susceptibility to - and recoverability from mastitis.
Liu, Yong; Cao, Yu; Li, Yaxiong; Lei, Dongyun; Li, Lin; Hou, Zong Liu; Han, Shen; Meng, Mingyao; Shi, Jianlin; Zhang, Yayong; Wang, Yi; Niu, Zhaoyi; Xie, Yanhua; Xiao, Benshan; Wang, Yuanfei; Li, Xiao; Yang, Lirong
2018-01-01
Background Recently, mutations in several genes have been described to be associated with sporadic ASD, but some genetic variants remain to be identified. The aim of this study was to use whole-exome sequencing (WES) combined with bioinformatics analysis to identify novel genetic variants in cases of sporadic congenital ASD, followed by validation by Sanger sequencing. Material/Methods Five Han patients with secundum ASD were recruited, and their tissue samples were analyzed by WES, followed by verification by Sanger sequencing of tissue and blood samples. Further evaluation using blood samples included 452 additional patients with sporadic secundum ASD (212 male and 240 female patients) and 519 healthy subjects (252 male and 267 female subjects) for further verification by a multiplexed MassARRAY system. Bioinformatic analyses were performed to identify novel genetic variants associated with sporadic ASD. Results From five patients with sporadic ASD, a total of 181,762 genomic variants in 33 exon loci, validated by Sanger sequencing, were selected and underwent MassARRAY analysis in 452 patients with ASD and 519 healthy subjects. Three loci with high mutation frequencies, the 138665410 FOXL2 gene variant, the 23862952 MYH6 gene variant, and the 71098693 HYDIN gene variant were found to be significantly associated with sporadic ASD (P<0.05); variants in FOXL2 and MYH6 were found in patients with isolated, sporadic ASD (P<5×10−4). Conclusions This was the first study that demonstrated variants in FOXL2 and HYDIN associated with sporadic ASD, and supported the use of WES and bioinformatics analysis to identify disease-associated mutations. PMID:29505555
Liu, Yong; Cao, Yu; Li, Yaxiong; Lei, Dongyun; Li, Lin; Hou, Zong Liu; Han, Shen; Meng, Mingyao; Shi, Jianlin; Zhang, Yayong; Wang, Yi; Niu, Zhaoyi; Xie, Yanhua; Xiao, Benshan; Wang, Yuanfei; Li, Xiao; Yang, Lirong; Wang, Wenju; Jiang, Lihong
2018-03-05
BACKGROUND Recently, mutations in several genes have been described to be associated with sporadic ASD, but some genetic variants remain to be identified. The aim of this study was to use whole-exome sequencing (WES) combined with bioinformatics analysis to identify novel genetic variants in cases of sporadic congenital ASD, followed by validation by Sanger sequencing. MATERIAL AND METHODS Five Han patients with secundum ASD were recruited, and their tissue samples were analyzed by WES, followed by verification by Sanger sequencing of tissue and blood samples. Further evaluation using blood samples included 452 additional patients with sporadic secundum ASD (212 male and 240 female patients) and 519 healthy subjects (252 male and 267 female subjects) for further verification by a multiplexed MassARRAY system. Bioinformatic analyses were performed to identify novel genetic variants associated with sporadic ASD. RESULTS From five patients with sporadic ASD, a total of 181,762 genomic variants in 33 exon loci, validated by Sanger sequencing, were selected and underwent MassARRAY analysis in 452 patients with ASD and 519 healthy subjects. Three loci with high mutation frequencies, the 138665410 FOXL2 gene variant, the 23862952 MYH6 gene variant, and the 71098693 HYDIN gene variant were found to be significantly associated with sporadic ASD (P<0.05); variants in FOXL2 and MYH6 were found in patients with isolated, sporadic ASD (P<5×10^-4). CONCLUSIONS This was the first study that demonstrated variants in FOXL2 and HYDIN associated with sporadic ASD, and supported the use of WES and bioinformatics analysis to identify disease-associated mutations.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Golbus, Jessica R.; Puckelwartz, Megan J.; Dellefave-Castillo, Lisa
Background—Cardiomyopathy is highly heritable but genetically diverse. At present, genetic testing for cardiomyopathy uses targeted sequencing to simultaneously assess the coding regions of more than 50 genes. New genes are routinely added to panels to improve the diagnostic yield. With the anticipated $1000 genome, it is expected that genetic testing will shift towards comprehensive genome sequencing accompanied by targeted gene analysis. Therefore, we assessed the reliability of whole genome sequencing and targeted analysis to identify cardiomyopathy variants in 11 subjects with cardiomyopathy. Methods and Results—Whole genome sequencing with an average of 37× coverage was combined with targeted analysis focused onmore » 204 genes linked to cardiomyopathy. Genetic variants were scored using multiple prediction algorithms combined with frequency data from public databases. This pipeline yielded 1-14 potentially pathogenic variants per individual. Variants were further analyzed using clinical criteria and/or segregation analysis. Three of three previously identified primary mutations were detected by this analysis. In six subjects for whom the primary mutation was previously unknown, we identified mutations that segregated with disease, had clinical correlates, and/or had additional pathological correlation to provide evidence for causality. For two subjects with previously known primary mutations, we identified additional variants that may act as modifiers of disease severity. In total, we identified the likely pathological mutation in 9 of 11 (82%) subjects. We conclude that these pilot data demonstrate that ~30-40× coverage whole genome sequencing combined with targeted analysis is feasible and sensitive to identify rare variants in cardiomyopathy-associated genes.« less
Kwan, Elizabeth X.; Wang, Xiaobin S.; Amemiya, Haley M.; Brewer, Bonita J.; Raghuraman, M. K.
2016-01-01
The Saccharomyces cerevisiae ribosomal DNA (rDNA) locus is known to exhibit greater instability relative to the rest of the genome. However, wild-type cells preferentially maintain a stable number of rDNA copies, suggesting underlying genetic control of the size of this locus. We performed a screen of a subset of the Yeast Knock-Out (YKO) single gene deletion collection to identify genetic regulators of this locus and to determine if rDNA copy number correlates with yeast replicative lifespan. While we found no correlation between replicative lifespan and rDNA size, we identified 64 candidate strains with significant rDNA copy number differences. However, in the process of validating candidate rDNA variants, we observed that independent isolates of our de novo gene deletion strains had unsolicited but significant changes in rDNA copy number. Moreover, we were not able to recapitulate rDNA phenotypes from the YKO yeast deletion collection. Instead, we found that the standard lithium acetate transformation protocol is a significant source of rDNA copy number variation, with lithium acetate exposure being the treatment causing variable rDNA copy number events after transformation. As the effects of variable rDNA copy number are being increasingly reported, our finding that rDNA is affected by lithium acetate exposure suggested that rDNA copy number variants may be influential passenger mutations in standard strain construction in S. cerevisiae. PMID:27449518
Kwan, Elizabeth X; Wang, Xiaobin S; Amemiya, Haley M; Brewer, Bonita J; Raghuraman, M K
2016-09-08
The Saccharomyces cerevisiae ribosomal DNA (rDNA) locus is known to exhibit greater instability relative to the rest of the genome. However, wild-type cells preferentially maintain a stable number of rDNA copies, suggesting underlying genetic control of the size of this locus. We performed a screen of a subset of the Yeast Knock-Out (YKO) single gene deletion collection to identify genetic regulators of this locus and to determine if rDNA copy number correlates with yeast replicative lifespan. While we found no correlation between replicative lifespan and rDNA size, we identified 64 candidate strains with significant rDNA copy number differences. However, in the process of validating candidate rDNA variants, we observed that independent isolates of our de novo gene deletion strains had unsolicited but significant changes in rDNA copy number. Moreover, we were not able to recapitulate rDNA phenotypes from the YKO yeast deletion collection. Instead, we found that the standard lithium acetate transformation protocol is a significant source of rDNA copy number variation, with lithium acetate exposure being the treatment causing variable rDNA copy number events after transformation. As the effects of variable rDNA copy number are being increasingly reported, our finding that rDNA is affected by lithium acetate exposure suggested that rDNA copy number variants may be influential passenger mutations in standard strain construction in S. cerevisiae. Copyright © 2016 Kwan et al.
Validation of copy number variants associated with prostate cancer risk and prognosis.
Blackburn, August; Wilson, Desiree; Gelfond, Jonathan; Yao, Li; Hernandez, Javier; Thompson, Ian M; Leach, Robin J; Lehman, Donna M
2014-01-01
Two recent studies have reported novel heritable copy number variants on chromosomes 2p, 15q, and 12q to be associated with prostate cancer (PCa) risk in non-Hispanic Caucasians. The goal of this study was to determine whether these findings could be independently confirmed in the Caucasian population from the South Texas area. The study subjects consisted of participants of the San Antonio Biomarkers of Risk for PCa cohort and additional cases ascertained in the same metropolitan area. We genotyped all 7 of the reported copy number variants using real-time quantitative polymerase chain reaction in 1,536 (317 cases and 1,219 controls) non-Hispanic Caucasian men, and additionally, we genotyped 632 (191 cases and 441 controls) Hispanic Caucasian men for one of these variants, a deletion on 2p24.3. Association of the deletion on 2p24.3 with overall PCa risk did not meet our significance criteria but was consistent with previous reports (odds ratio, 1.40; 95% confidence interval 0.99-2.00; P = 0.06). Among Hispanic Caucasians, this deletion is much less prevalent (minor allele frequencies of 0.059 and 0.024 in non-Hispanic and Hispanic Caucasians, respectively) and did not show evidence of association with risk for PCa. Interestingly, among non-Hispanic Caucasians, carrying a homozygous deletion of 2p24.3 was significantly associated with high-grade PCa as defined by Gleason score sum ≥8 (odds ratio, 27.99; 95% confidence interval 1.99-392.6; P = 0.007 [the Fisher exact test]). The remaining 6 copy number variable regions either were not polymorphic in our cohort of non-Hispanic Caucasians or showed no evidence of association. Our findings are consistent with the reported observation that a heritable deletion on 2p24.3 is associated with PCa risk in non-Hispanic Caucasians. Additionally, our observations indicate that the 2p24.3 variant is associated with risk for high-grade PCa in a recessive manner. We were unable to replicate any association with PCa for the variants on chromosomes 15q and 12q, which may be explained by regional population differences in low frequency variants and disease heterogeneity. Published by Elsevier Inc.
Identifying Causal Variants at Loci with Multiple Signals of Association
Hormozdiari, Farhad; Kostem, Emrah; Kang, Eun Yong; Pasaniuc, Bogdan; Eskin, Eleazar
2014-01-01
Although genome-wide association studies have successfully identified thousands of risk loci for complex traits, only a handful of the biologically causal variants, responsible for association at these loci, have been successfully identified. Current statistical methods for identifying causal variants at risk loci either use the strength of the association signal in an iterative conditioning framework or estimate probabilities for variants to be causal. A main drawback of existing methods is that they rely on the simplifying assumption of a single causal variant at each risk locus, which is typically invalid at many risk loci. In this work, we propose a new statistical framework that allows for the possibility of an arbitrary number of causal variants when estimating the posterior probability of a variant being causal. A direct benefit of our approach is that we predict a set of variants for each locus that under reasonable assumptions will contain all of the true causal variants with a high confidence level (e.g., 95%) even when the locus contains multiple causal variants. We use simulations to show that our approach provides 20–50% improvement in our ability to identify the causal variants compared to the existing methods at loci harboring multiple causal variants. We validate our approach using empirical data from an expression QTL study of CHI3L2 to identify new causal variants that affect gene expression at this locus. CAVIAR is publicly available online at http://genetics.cs.ucla.edu/caviar/. PMID:25104515
Identifying causal variants at loci with multiple signals of association.
Hormozdiari, Farhad; Kostem, Emrah; Kang, Eun Yong; Pasaniuc, Bogdan; Eskin, Eleazar
2014-10-01
Although genome-wide association studies have successfully identified thousands of risk loci for complex traits, only a handful of the biologically causal variants, responsible for association at these loci, have been successfully identified. Current statistical methods for identifying causal variants at risk loci either use the strength of the association signal in an iterative conditioning framework or estimate probabilities for variants to be causal. A main drawback of existing methods is that they rely on the simplifying assumption of a single causal variant at each risk locus, which is typically invalid at many risk loci. In this work, we propose a new statistical framework that allows for the possibility of an arbitrary number of causal variants when estimating the posterior probability of a variant being causal. A direct benefit of our approach is that we predict a set of variants for each locus that under reasonable assumptions will contain all of the true causal variants with a high confidence level (e.g., 95%) even when the locus contains multiple causal variants. We use simulations to show that our approach provides 20-50% improvement in our ability to identify the causal variants compared to the existing methods at loci harboring multiple causal variants. We validate our approach using empirical data from an expression QTL study of CHI3L2 to identify new causal variants that affect gene expression at this locus. CAVIAR is publicly available online at http://genetics.cs.ucla.edu/caviar/. Copyright © 2014 by the Genetics Society of America.
Cooper-Knock, Johnathan; Robins, Henry; Niedermoser, Isabell; Wyles, Matthew; Heath, Paul R; Higginbottom, Adrian; Walsh, Theresa; Kazoka, Mbombe; Ince, Paul G; Hautbergue, Guillaume M; McDermott, Christopher J; Kirby, Janine; Shaw, Pamela J
2017-01-01
Amyotrophic lateral sclerosis (ALS) is underpinned by an oligogenic rare variant architecture. Identified genetic variants of ALS include RNA-binding proteins containing prion-like domains (PrLDs). We hypothesized that screening genes encoding additional similar proteins will yield novel genetic causes of ALS. The most common genetic variant of ALS patients is a G4C2-repeat expansion within C9ORF72 . We have shown that G4C2-repeat RNA sequesters RNA-binding proteins. A logical consequence of this is that loss-of-function mutations in G4C2-binding partners might contribute to ALS pathogenesis independently of and/or synergistically with C9ORF72 expansions. Targeted sequencing of genomic DNA encoding either RNA-binding proteins or known ALS genes ( n = 274 genes) was performed in ALS patients to identify rare deleterious genetic variants and explore genotype-phenotype relationships. Genomic DNA was extracted from 103 ALS patients including 42 familial ALS patients and 61 young-onset (average age of onset 41 years) sporadic ALS patients; patients were chosen to maximize the probability of identifying genetic causes of ALS. Thirteen patients carried a G4C2-repeat expansion of C9ORF72 . We identified 42 patients with rare deleterious variants; 6 patients carried more than one variant. Twelve mutations were discovered in known ALS genes which served as a validation of our strategy. Rare deleterious variants in RNA-binding proteins were significantly enriched in ALS patients compared to control frequencies ( p = 5.31E-18). Nineteen patients featured at least one variant in a RNA-binding protein containing a PrLD. The number of variants per patient correlated with rate of disease progression ( t -test, p = 0.033). We identified eighteen patients with a single variant in a G4C2-repeat binding protein. Patients with a G4C2-binding protein variant in combination with a C9ORF72 expansion had a significantly faster disease course ( t -test, p = 0.025). Our data are consistent with an oligogenic model of ALS. We provide evidence for a number of entirely novel genetic variants of ALS caused by mutations in RNA-binding proteins. Moreover we show that these mutations act synergistically with each other and with C9ORF72 expansions to modify the clinical phenotype of ALS. A key finding is that this synergy is present only between functionally interacting variants. This work has significant implications for ALS therapy development.
Targeted Analysis of Whole Genome Sequence Data to Diagnose Genetic Cardiomyopathy
Golbus, Jessica R.; Puckelwartz, Megan J.; Dellefave-Castillo, Lisa; ...
2014-09-01
Background—Cardiomyopathy is highly heritable but genetically diverse. At present, genetic testing for cardiomyopathy uses targeted sequencing to simultaneously assess the coding regions of more than 50 genes. New genes are routinely added to panels to improve the diagnostic yield. With the anticipated $1000 genome, it is expected that genetic testing will shift towards comprehensive genome sequencing accompanied by targeted gene analysis. Therefore, we assessed the reliability of whole genome sequencing and targeted analysis to identify cardiomyopathy variants in 11 subjects with cardiomyopathy. Methods and Results—Whole genome sequencing with an average of 37× coverage was combined with targeted analysis focused onmore » 204 genes linked to cardiomyopathy. Genetic variants were scored using multiple prediction algorithms combined with frequency data from public databases. This pipeline yielded 1-14 potentially pathogenic variants per individual. Variants were further analyzed using clinical criteria and/or segregation analysis. Three of three previously identified primary mutations were detected by this analysis. In six subjects for whom the primary mutation was previously unknown, we identified mutations that segregated with disease, had clinical correlates, and/or had additional pathological correlation to provide evidence for causality. For two subjects with previously known primary mutations, we identified additional variants that may act as modifiers of disease severity. In total, we identified the likely pathological mutation in 9 of 11 (82%) subjects. We conclude that these pilot data demonstrate that ~30-40× coverage whole genome sequencing combined with targeted analysis is feasible and sensitive to identify rare variants in cardiomyopathy-associated genes.« less
2014-01-01
Background Genome wide association studies (GWAS) in most cattle breeds result in large genomic intervals of significant associations making it difficult to identify causal mutations. This is due to the extensive, low-level linkage disequilibrium within a cattle breed. As there is less linkage disequilibrium across breeds, multibreed GWAS may improve precision of causal variant mapping. Here we test this hypothesis in a Holstein and Jersey cattle data set with 17,925 individuals with records for production and functional traits and 632,003 SNP markers. Results By using a cross validation strategy within the Holstein and Jersey data sets, we were able to identify and confirm a large number of QTL. As expected, the precision of mapping these QTL within the breeds was limited. In the multibreed analysis, we found that many loci were not segregating in both breeds. This was partly an artefact of power of the experiments, with the number of QTL shared between the breeds generally increasing with trait heritability. False discovery rates suggest that the multibreed analysis was less powerful than between breed analyses, in terms of how much genetic variance was explained by the detected QTL. However, the multibreed analysis could more accurately pinpoint the location of the well-described mutations affecting milk production such as DGAT1. Further, the significant SNP in the multibreed analysis were significantly enriched in genes regions, to a considerably greater extent than was observed in the single breed analyses. In addition, we have refined QTL on BTA5 and BTA19 to very small intervals and identified a small number of potential candidate genes in these, as well as in a number of other regions. Conclusion Where QTL are segregating across breed, multibreed GWAS can refine these to reasonably small genomic intervals. However, such QTL appear to represent only a fraction of the genetic variation. Our results suggest a significant proportion of QTL affecting milk production segregate within rather than across breeds, at least for Holstein and Jersey cattle. PMID:24456127
Zhou, Shuangyan; Shi, Danfeng; Liu, Xuewei; Liu, Huanxiang; Yao, Xiaojun
2016-02-24
Recent studies uncovered a novel protective prion protein variant: V127 variant, which was reported intrinsically resistant to prion conversion and propagation. However, the structural basis of its protective effect is still unknown. To uncover the origin of the protective role of V127 variant, molecular dynamics simulations were performed to explore the influence of G127V mutation on two key processes of prion propagation: dimerization and fibril formation. The simulation results indicate V127 variant is unfavorable to form dimer by reducing the main-chain H-bond interactions. The simulations of formed fibrils consisting of β1 strand prove V127 variant will make the formed fibril become unstable and disorder. The weaker interaction energies between layers and reduced H-bonds number for V127 variant reveal this mutation is unfavorable to the formation of stable fibril. Consequently, we find V127 variant is not only unfavorable to the formation of dimer but also unfavorable to the formation of stable core and fibril, which can explain the mechanism on the protective role of V127 variant from the molecular level. Our findings can deepen the understanding of prion disease and may guide the design of peptide mimetics or small molecule to mimic the protective effect of V127 variant.
Method of generating ploynucleotides encoding enhanced folding variants
Bradbury, Andrew M.; Kiss, Csaba; Waldo, Geoffrey S.
2017-05-02
The invention provides directed evolution methods for improving the folding, solubility and stability (including thermostability) characteristics of polypeptides. In one aspect, the invention provides a method for generating folding and stability-enhanced variants of proteins, including but not limited to fluorescent proteins, chromophoric proteins and enzymes. In another aspect, the invention provides methods for generating thermostable variants of a target protein or polypeptide via an internal destabilization baiting strategy. Internally destabilization a protein of interest is achieved by inserting a heterologous, folding-destabilizing sequence (folding interference domain) within DNA encoding the protein of interest, evolving the protein sequences adjacent to the heterologous insertion to overcome the destabilization (using any number of mutagenesis methods), thereby creating a library of variants. The variants in the library are expressed, and those with enhanced folding characteristics selected.
Ion Mobility Separation of Variant Histone Tails Extending to the “Middle-down” Range
Shvartsburg, Alexandre A.; Zheng, Yupeng; Smith, Richard D.; Kelleher, Neil L.
2012-01-01
Differential ion mobility spectrometry (FAIMS) can baseline-resolve multiple variants of post-translationally modified peptides extending to the 3 - 4 kDa range, which differ in the localization of a PTM as small as acetylation. Essentially orthogonal separations for different charge states expand the total peak capacity in proportion to the number of observed states that increases for longer polypeptides. This might enable resolving localization variants for yet larger peptides and even intact proteins. PMID:22559289
The UCL low-density lipoprotein receptor gene variant database: pathogenicity update
Futema, Marta; Whittall, Ros; Taylor-Beadling, Alison; Williams, Maggie; den Dunnen, Johan T; Humphries, Steve E
2017-01-01
Background Familial hypercholesterolaemia (OMIM 143890) is most frequently caused by variations in the low-density lipoprotein receptor (LDLR) gene. Predicting whether novel variants are pathogenic may not be straightforward, especially for missense and synonymous variants. In 2013, the Association of Clinical Genetic Scientists published guidelines for the classification of variants, with categories 1 and 2 representing clearly not or unlikely pathogenic, respectively, 3 representing variants of unknown significance (VUS), and 4 and 5 representing likely to be or clearly pathogenic, respectively. Here, we update the University College London (UCL) LDLR variant database according to these guidelines. Methods PubMed searches and alerts were used to identify novel LDLR variants for inclusion in the database. Standard in silico tools were used to predict potential pathogenicity. Variants were designated as class 4/5 only when the predictions from the different programs were concordant and as class 3 when predictions were discordant. Results The updated database (http://www.lovd.nl/LDLR) now includes 2925 curated variants, representing 1707 independent events. All 129 nonsense variants, 337 small frame-shifting and 117/118 large rearrangements were classified as 4 or 5. Of the 795 missense variants, 115 were in classes 1 and 2, 605 in class 4 and 75 in class 3. 111/181 intronic variants, 4/34 synonymous variants and 14/37 promoter variants were assigned to classes 4 or 5. Overall, 112 (7%) of reported variants were class 3. Conclusions This study updates the LDLR variant database and identifies a number of reported VUS where additional family and in vitro studies will be required to confirm or refute their pathogenicity. PMID:27821657
Genome-wide gene–gene interaction analysis for next-generation sequencing
Zhao, Jinying; Zhu, Yun; Xiong, Momiao
2016-01-01
The critical barrier in interaction analysis for next-generation sequencing (NGS) data is that the traditional pairwise interaction analysis that is suitable for common variants is difficult to apply to rare variants because of their prohibitive computational time, large number of tests and low power. The great challenges for successful detection of interactions with NGS data are (1) the demands in the paradigm of changes in interaction analysis; (2) severe multiple testing; and (3) heavy computations. To meet these challenges, we shift the paradigm of interaction analysis between two SNPs to interaction analysis between two genomic regions. In other words, we take a gene as a unit of analysis and use functional data analysis techniques as dimensional reduction tools to develop a novel statistic to collectively test interaction between all possible pairs of SNPs within two genome regions. By intensive simulations, we demonstrate that the functional logistic regression for interaction analysis has the correct type 1 error rates and higher power to detect interaction than the currently used methods. The proposed method was applied to a coronary artery disease dataset from the Wellcome Trust Case Control Consortium (WTCCC) study and the Framingham Heart Study (FHS) dataset, and the early-onset myocardial infarction (EOMI) exome sequence datasets with European origin from the NHLBI's Exome Sequencing Project. We discovered that 6 of 27 pairs of significantly interacted genes in the FHS were replicated in the independent WTCCC study and 24 pairs of significantly interacted genes after applying Bonferroni correction in the EOMI study. PMID:26173972
A Bioinformatics Workflow for Variant Peptide Detection in Shotgun Proteomics*
Li, Jing; Su, Zengliu; Ma, Ze-Qiang; Slebos, Robbert J. C.; Halvey, Patrick; Tabb, David L.; Liebler, Daniel C.; Pao, William; Zhang, Bing
2011-01-01
Shotgun proteomics data analysis usually relies on database search. However, commonly used protein sequence databases do not contain information on protein variants and thus prevent variant peptides and proteins from been identified. Including known coding variations into protein sequence databases could help alleviate this problem. Based on our recently published human Cancer Proteome Variation Database, we have created a protein sequence database that comprehensively annotates thousands of cancer-related coding variants collected in the Cancer Proteome Variation Database as well as noncancer-specific ones from the Single Nucleotide Polymorphism Database (dbSNP). Using this database, we then developed a data analysis workflow for variant peptide identification in shotgun proteomics. The high risk of false positive variant identifications was addressed by a modified false discovery rate estimation method. Analysis of colorectal cancer cell lines SW480, RKO, and HCT-116 revealed a total of 81 peptides that contain either noncancer-specific or cancer-related variations. Twenty-three out of 26 variants randomly selected from the 81 were confirmed by genomic sequencing. We further applied the workflow on data sets from three individual colorectal tumor specimens. A total of 204 distinct variant peptides were detected, and five carried known cancer-related mutations. Each individual showed a specific pattern of cancer-related mutations, suggesting potential use of this type of information for personalized medicine. Compatibility of the workflow has been tested with four popular database search engines including Sequest, Mascot, X!Tandem, and MyriMatch. In summary, we have developed a workflow that effectively uses existing genomic data to enable variant peptide detection in proteomics. PMID:21389108
Oncodomains: A protein domain-centric framework for analyzing rare variants in tumor samples
Peterson, Thomas A.; Park, Junyong
2017-01-01
The fight against cancer is hindered by its highly heterogeneous nature. Genome-wide sequencing studies have shown that individual malignancies contain many mutations that range from those commonly found in tumor genomes to rare somatic variants present only in a small fraction of lesions. Such rare somatic variants dominate the landscape of genomic mutations in cancer, yet efforts to correlate somatic mutations found in one or few individuals with functional roles have been largely unsuccessful. Traditional methods for identifying somatic variants that drive cancer are ‘gene-centric’ in that they consider only somatic variants within a particular gene and make no comparison to other similar genes in the same family that may play a similar role in cancer. In this work, we present oncodomain hotspots, a new ‘domain-centric’ method for identifying clusters of somatic mutations across entire gene families using protein domain models. Our analysis confirms that our approach creates a framework for leveraging structural and functional information encapsulated by protein domains into the analysis of somatic variants in cancer, enabling the assessment of even rare somatic variants by comparison to similar genes. Our results reveal a vast landscape of somatic variants that act at the level of domain families altering pathways known to be involved with cancer such as protein phosphorylation, signaling, gene regulation, and cell metabolism. Due to oncodomain hotspots’ unique ability to assess rare variants, we expect our method to become an important tool for the analysis of sequenced tumor genomes, complementing existing methods. PMID:28426665
Accurate clinical detection of exon copy number variants in a targeted NGS panel using DECoN.
Fowler, Anna; Mahamdallie, Shazia; Ruark, Elise; Seal, Sheila; Ramsay, Emma; Clarke, Matthew; Uddin, Imran; Wylie, Harriet; Strydom, Ann; Lunter, Gerton; Rahman, Nazneen
2016-11-25
Background: Targeted next generation sequencing (NGS) panels are increasingly being used in clinical genomics to increase capacity, throughput and affordability of gene testing. Identifying whole exon deletions or duplications (termed exon copy number variants, 'exon CNVs') in exon-targeted NGS panels has proved challenging, particularly for single exon CNVs. Methods: We developed a tool for the Detection of Exon Copy Number variants (DECoN), which is optimised for analysis of exon-targeted NGS panels in the clinical setting. We evaluated DECoN performance using 96 samples with independently validated exon CNV data. We performed simulations to evaluate DECoN detection performance of single exon CNVs and to evaluate performance using different coverage levels and sample numbers. Finally, we implemented DECoN in a clinical laboratory that tests BRCA1 and BRCA2 with the TruSight Cancer Panel (TSCP). We used DECoN to analyse 1,919 samples, validating exon CNV detections by multiplex ligation-dependent probe amplification (MLPA). Results: In the evaluation set, DECoN achieved 100% sensitivity and 99% specificity for BRCA exon CNVs, including identification of 8 single exon CNVs. DECoN also identified 14/15 exon CNVs in 8 other genes. Simulations of all possible BRCA single exon CNVs gave a mean sensitivity of 98% for deletions and 95% for duplications. DECoN performance remained excellent with different levels of coverage and sample numbers; sensitivity and specificity was >98% with the typical NGS run parameters. In the clinical pipeline, DECoN automatically analyses pools of 48 samples at a time, taking 24 minutes per pool, on average. DECoN detected 24 BRCA exon CNVs, of which 23 were confirmed by MLPA, giving a false discovery rate of 4%. Specificity was 99.7%. Conclusions: DECoN is a fast, accurate, exon CNV detection tool readily implementable in research and clinical NGS pipelines. It has high sensitivity and specificity and acceptable false discovery rate. DECoN is freely available at www.icr.ac.uk/decon.
Greer, Justin B; Khuri, Sawsan; Fieber, Lynne A
2017-01-11
The neurotransmitter L-Glutamate (L-Glu) acting at ionotropic L-Glu receptors (iGluR) conveys fast excitatory signal transmission in the nervous systems of all animals. iGluR-dependent neurotransmission is a key component of the synaptic plasticity that underlies learning and memory. During learning, two subtypes of iGluR, α-Amino-3-hydroxy-5-methyl-4-isoxazolepropionic acid receptors (AMPAR) and N-methyl-D-aspartate receptors (NMDAR), are dynamically regulated postsynaptically in vertebrates. Invertebrate organisms such as Aplysia californica (Aplysia) are well-studied models for iGluR-mediated function, yet no studies to date have analyzed the evolutionary relationships between iGluR genes in these species and those in vertebrates, to identify genes that may mediate plasticity. We conducted a thorough phylogenetic analysis spanning Bilateria to elucidate these relationships. The expression status of iGluR genes in the Aplysia nervous system was also examined. Our analysis shows that ancestral genes for both NMDAR and AMPAR subtypes were present in the common bilaterian ancestor. NMDAR genes show very high conservation in motifs responsible for forming the conductance pore of the ion channel. The number of NMDAR subunits is greater in vertebrates due to an increased number of splice variants and an increased number of genes, likely due to gene duplication events. AMPAR subunits form an orthologous group, and there is high variability in the number of AMPAR genes in each species due to extensive taxon specific gene gain and loss. qPCR results show that all 12 Aplysia iGluR subunits are expressed in all nervous system ganglia. Orthologous NMDAR subunits in all species studied suggests conserved function across Bilateria, and potentially a conserved mechanism of neuroplasticity and learning. Vertebrates display an increased number of NMDAR genes and splice variants, which may play a role in their greater diversity of physiological responses. Extensive gene gain and loss of AMPAR genes may result in different physiological properties that are taxon specific. Our results suggest a significant role for L-Glu mediated responses throughout the Aplysia nervous system, consistent with L-Glu's role as the primary excitatory neurotransmitter.
VCF-Explorer: filtering and analysing whole genome VCF files.
Akgün, Mete; Demirci, Hüseyin
2017-11-01
The decreasing cost in high-throughput technologies led to a number of sequencing projects consisting of thousands of whole genomes. The paradigm shift from exome to whole genome brings a significant increase in the size of output files. Most of the existing tools which are developed to analyse exome files are not adequate for larger VCF files produced by whole genome studies. In this work we present VCF-Explorer, a variant analysis software capable of handling large files. Memory efficiency and avoiding computationally costly pre-processing step enable to carry out the analysis to be performed with ordinary computers. VCF-Explorer provides an easy to use environment where users can define various types of queries based on variant and sample genotype level annotations. VCF-Explorer can be run in different environments and computational platforms ranging from a standard laptop to a high performance server. VCF-Explorer is freely available at: http://vcfexplorer.sourceforge.net/. mete.akgun@tubitak.gov.tr. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com
NASA Astrophysics Data System (ADS)
Abuzahra, M. A. M.; Jakaria; Listyarini, K.; Furqon, A.; Sumantri, C.; Uddin, M. J.; Gunawan, A.
2018-05-01
High-throughput RNA sequencing (RNA-Seq) reveals new challenges for the detection of transcriptome variants (SNPs) in different tissues and species. The aims of this study was to characterize a SNP discovery analysis in the sheep meat odour and flavour transcriptome using RNA-Seq. Six liver samples from divergent sheep meat odour and flavour were analyzed using the Illumina Genome Hiseq 2500 Analyzer. The SNP detection analysis revealed 142 SNPs in sheep meat samples, and a large number of those corresponded to differences between high and low sheep meat odour and flavour ovis genome assembly OAR v4.0. Among them, about 90.4% of genes had multiple polymorphisms within 12 genes (JAML, ANGPTL8, LOC101103463, SEPW1, SCN5A, LOC101113036, DOCK6, GTSE1, KIF12, KCTD17, KANK2, CYP2A6). Several of the SNPs (JAML, CYP2A6, SEPW1, and KIF12) found in this study could be included as suitable markers in genotyping platforms to perform association analyses in commercial populations and apply genomic selection protocols in the sheep meat production.
2012-01-01
Introduction CD226 genetic variants have been associated with a number of autoimmune diseases and recently with systemic sclerosis (SSc). The aim of this study was to test the influence of CD226 loci in SSc susceptibility, clinical phenotypes and autoantibody status in a large multicenter European population. Methods A total of seven European populations of Caucasian ancestry were included, comprising 2,131 patients with SSc and 3,966 healthy controls. Three CD226 single nucleotide polymorphisms (SNPs), rs763361, rs3479968 and rs727088, were genotyped using Taqman 5'allelic discrimination assays. Results Pooled analyses showed no evidence of association of the three SNPs, neither with the global disease nor with the analyzed subphenotypes. However, haplotype block analysis revealed a significant association for the TCG haplotype (SNP order: rs763361, rs34794968, rs727088) with lung fibrosis positive patients (PBonf = 3.18E-02 OR 1.27 (1.05 to 1.54)). Conclusion Our data suggest that the tested genetic variants do not individually influence SSc susceptibility but a CD226 three-variant haplotype is related with genetic predisposition to SSc-related pulmonary fibrosis. PMID:22531499