Sample records for number variant cnv

  1. SG-ADVISER CNV: copy-number variant annotation and interpretation.

    PubMed

    Erikson, Galina A; Deshpande, Neha; Kesavan, Balachandar G; Torkamani, Ali

    2015-09-01

    Copy-number variants have been associated with a variety of diseases, especially cancer, autism, schizophrenia, and developmental delay. The majority of clinically relevant events occur de novo, necessitating the interpretation of novel events. In this light, we present the Scripps Genome ADVISER CNV annotation pipeline and Web server, which aims to fill the gap between copy number variant detection and interpretation by performing in-depth annotations and functional predictions for copy number variants. The Scripps Genome ADVISER CNV suite includes a Web server interface to a high-performance computing environment for calculations of annotations and a table-based user interface that allows for the execution of numerous annotation-based variant filtration strategies and statistics. The annotation results include details regarding location, impact on the coding portion of genes, allele frequency information (including allele frequencies from the Scripps Wellderly cohort), and overlap information with other reference data sets (including ClinVar, DGV, DECIPHER). A summary variant classification is produced (ADVISER score) based on the American College of Medical Genetics and Genomics scoring guidelines. We demonstrate >90% sensitivity/specificity for detection of pathogenic events. Scripps Genome ADVISER CNV is designed to allow users with no prior bioinformatics expertise to manipulate large volumes of copy-number variant data. Scripps Genome ADVISER CNV is available at http://genomics.scripps.edu/ADVISER/.

  2. RefCNV: Identification of Gene-Based Copy Number Variants Using Whole Exome Sequencing.

    PubMed

    Chang, Lun-Ching; Das, Biswajit; Lih, Chih-Jian; Si, Han; Camalier, Corinne E; McGregor, Paul M; Polley, Eric

    2016-01-01

    With rapid advances in DNA sequencing technologies, whole exome sequencing (WES) has become a popular approach for detecting somatic mutations in oncology studies. The initial intent of WES was to characterize single nucleotide variants, but it was observed that the number of sequencing reads that mapped to a genomic region correlated with the DNA copy number variants (CNVs). We propose a method RefCNV that uses a reference set to estimate the distribution of the coverage for each exon. The construction of the reference set includes an evaluation of the sources of variability in the coverage distribution. We observed that the processing steps had an impact on the coverage distribution. For each exon, we compared the observed coverage with the expected normal coverage. Thresholds for determining CNVs were selected to control the false-positive error rate. RefCNV prediction correlated significantly (r = 0.96-0.86) with CNV measured by digital polymerase chain reaction for MET (7q31), EGFR (7p12), or ERBB2 (17q12) in 13 tumor cell lines. The genome-wide CNV analysis showed a good overall correlation (Spearman's coefficient = 0.82) between RefCNV estimation and publicly available CNV data in Cancer Cell Line Encyclopedia. RefCNV also showed better performance than three other CNV estimation methods in genome-wide CNV analysis.

  3. nbCNV: a multi-constrained optimization model for discovering copy number variants in single-cell sequencing data.

    PubMed

    Zhang, Changsheng; Cai, Hongmin; Huang, Jingying; Song, Yan

    2016-09-17

    Variations in DNA copy number have an important contribution to the development of several diseases, including autism, schizophrenia and cancer. Single-cell sequencing technology allows the dissection of genomic heterogeneity at the single-cell level, thereby providing important evolutionary information about cancer cells. In contrast to traditional bulk sequencing, single-cell sequencing requires the amplification of the whole genome of a single cell to accumulate enough samples for sequencing. However, the amplification process inevitably introduces amplification bias, resulting in an over-dispersing portion of the sequencing data. Recent study has manifested that the over-dispersed portion of the single-cell sequencing data could be well modelled by negative binomial distributions. We developed a read-depth based method, nbCNV to detect the copy number variants (CNVs). The nbCNV method uses two constraints-sparsity and smoothness to fit the CNV patterns under the assumption that the read signals are negatively binomially distributed. The problem of CNV detection was formulated as a quadratic optimization problem, and was solved by an efficient numerical solution based on the classical alternating direction minimization method. Extensive experiments to compare nbCNV with existing benchmark models were conducted on both simulated data and empirical single-cell sequencing data. The results of those experiments demonstrate that nbCNV achieves superior performance and high robustness for the detection of CNVs in single-cell sequencing data.

  4. cnvScan: a CNV screening and annotation tool to improve the clinical utility of computational CNV prediction from exome sequencing data.

    PubMed

    Samarakoon, Pubudu Saneth; Sorte, Hanne Sørmo; Stray-Pedersen, Asbjørg; Rødningen, Olaug Kristin; Rognes, Torbjørn; Lyle, Robert

    2016-01-14

    With advances in next generation sequencing technology and analysis methods, single nucleotide variants (SNVs) and indels can be detected with high sensitivity and specificity in exome sequencing data. Recent studies have demonstrated the ability to detect disease-causing copy number variants (CNVs) in exome sequencing data. However, exonic CNV prediction programs have shown high false positive CNV counts, which is the major limiting factor for the applicability of these programs in clinical studies. We have developed a tool (cnvScan) to improve the clinical utility of computational CNV prediction in exome data. cnvScan can accept input from any CNV prediction program. cnvScan consists of two steps: CNV screening and CNV annotation. CNV screening evaluates CNV prediction using quality scores and refines this using an in-house CNV database, which greatly reduces the false positive rate. The annotation step provides functionally and clinically relevant information using multiple source datasets. We assessed the performance of cnvScan on CNV predictions from five different prediction programs using 64 exomes from Primary Immunodeficiency (PIDD) patients, and identified PIDD-causing CNVs in three individuals from two different families. In summary, cnvScan reduces the time and effort required to detect disease-causing CNVs by reducing the false positive count and providing annotation. This improves the clinical utility of CNV detection in exome data.

  5. CNV Workshop: an integrated platform for high-throughput copy number variation discovery and clinical diagnostics.

    PubMed

    Gai, Xiaowu; Perin, Juan C; Murphy, Kevin; O'Hara, Ryan; D'arcy, Monica; Wenocur, Adam; Xie, Hongbo M; Rappaport, Eric F; Shaikh, Tamim H; White, Peter S

    2010-02-04

    Recent studies have shown that copy number variations (CNVs) are frequent in higher eukaryotes and associated with a substantial portion of inherited and acquired risk for various human diseases. The increasing availability of high-resolution genome surveillance platforms provides opportunity for rapidly assessing research and clinical samples for CNV content, as well as for determining the potential pathogenicity of identified variants. However, few informatics tools for accurate and efficient CNV detection and assessment currently exist. We developed a suite of software tools and resources (CNV Workshop) for automated, genome-wide CNV detection from a variety of SNP array platforms. CNV Workshop includes three major components: detection, annotation, and presentation of structural variants from genome array data. CNV detection utilizes a robust and genotype-specific extension of the Circular Binary Segmentation algorithm, and the use of additional detection algorithms is supported. Predicted CNVs are captured in a MySQL database that supports cohort-based projects and incorporates a secure user authentication layer and user/admin roles. To assist with determination of pathogenicity, detected CNVs are also annotated automatically for gene content, known disease loci, and gene-based literature references. Results are easily queried, sorted, filtered, and visualized via a web-based presentation layer that includes a GBrowse-based graphical representation of CNV content and relevant public data, integration with the UCSC Genome Browser, and tabular displays of genomic attributes for each CNV. To our knowledge, CNV Workshop represents the first cohesive and convenient platform for detection, annotation, and assessment of the biological and clinical significance of structural variants. CNV Workshop has been successfully utilized for assessment of genomic variation in healthy individuals and disease cohorts and is an ideal platform for coordinating multiple associated

  6. CNV-RF Is a Random Forest-Based Copy Number Variation Detection Method Using Next-Generation Sequencing.

    PubMed

    Onsongo, Getiria; Baughn, Linda B; Bower, Matthew; Henzler, Christine; Schomaker, Matthew; Silverstein, Kevin A T; Thyagarajan, Bharat

    2016-11-01

    Simultaneous detection of small copy number variations (CNVs) (<0.5 kb) and single-nucleotide variants in clinically significant genes is of great interest for clinical laboratories. The analytical variability in next-generation sequencing (NGS) and artifacts in coverage data because of issues with mappability along with lack of robust bioinformatics tools for CNV detection have limited the utility of targeted NGS data to identify CNVs. We describe the development and implementation of a bioinformatics algorithm, copy number variation-random forest (CNV-RF), that incorporates a machine learning component to identify CNVs from targeted NGS data. Using CNV-RF, we identified 12 of 13 deletions in samples with known CNVs, two cases with duplications, and identified novel deletions in 22 additional cases. Furthermore, no CNVs were identified among 60 genes in 14 cases with normal copy number and no CNVs were identified in another 104 patients with clinical suspicion of CNVs. All positive deletions and duplications were confirmed using a quantitative PCR method. CNV-RF also detected heterozygous deletions and duplications with a specificity of 50% across 4813 genes. The ability of CNV-RF to detect clinically relevant CNVs with a high degree of sensitivity along with confirmation using a low-cost quantitative PCR method provides a framework for providing comprehensive NGS-based CNV/single-nucleotide variant detection in a clinical molecular diagnostics laboratory. Copyright © 2016 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.

  7. CNV-based genome wide association study reveals additional variants contributing to meat quality in swine

    USDA-ARS?s Scientific Manuscript database

    Pork quality is important both to the meat processing industry and consumers’ purchasing attitudes. Copy number variation (CNV) is a burgeoning kind of variant that may influence meat quality. Herein, a genome-wide association study (GWAS) was performed between CNVs and meat quality traits in swine....

  8. Genome-wide copy number variation (CNV) detection in Nelore cattle reveals highly frequent variants in genome regions harboring QTLs affecting production traits.

    PubMed

    da Silva, Joaquim Manoel; Giachetto, Poliana Fernanda; da Silva, Luiz Otávio; Cintra, Leandro Carrijo; Paiva, Samuel Rezende; Yamagishi, Michel Eduardo Beleza; Caetano, Alexandre Rodrigues

    2016-06-13

    Copy number variations (CNVs) have been shown to account for substantial portions of observed genomic variation and have been associated with qualitative and quantitative traits and the onset of disease in a number of species. Information from high-resolution studies to detect, characterize and estimate population-specific variant frequencies will facilitate the incorporation of CNVs in genomic studies to identify genes affecting traits of importance. Genome-wide CNVs were detected in high-density single nucleotide polymorphism (SNP) genotyping data from 1,717 Nelore (Bos indicus) cattle, and in NGS data from eight key ancestral bulls. A total of 68,007 and 12,786 distinct CNVs were observed, respectively. Cross-comparisons of results obtained for the eight resequenced animals revealed that 92 % of the CNVs were observed in both datasets, while 62 % of all detected CNVs were observed to overlap with previously validated cattle copy number variant regions (CNVRs). Observed CNVs were used for obtaining breed-specific CNV frequencies and identification of CNVRs, which were subsequently used for gene annotation. A total of 688 of the detected CNVRs were observed to overlap with 286 non-redundant QTLs associated with important production traits in cattle. All of 34 CNVs previously reported to be associated with milk production traits in Holsteins were also observed in Nelore cattle. Comparisons of estimated frequencies of these CNVs in the two breeds revealed 14, 13, 6 and 14 regions in high (>20 %), low (<20 %) and divergent (NEL > HOL, NEL < HOL) frequencies, respectively. Obtained results significantly enriched the bovine CNV map and enabled the identification of variants that are potentially associated with traits under selection in Nelore cattle, particularly in genome regions harboring QTLs affecting production traits.

  9. ParseCNV integrative copy number variation association software with quality tracking

    PubMed Central

    Glessner, Joseph T.; Li, Jin; Hakonarson, Hakon

    2013-01-01

    A number of copy number variation (CNV) calling algorithms exist; however, comprehensive software tools for CNV association studies are lacking. We describe ParseCNV, unique software that takes CNV calls and creates probe-based statistics for CNV occurrence in both case–control design and in family based studies addressing both de novo and inheritance events, which are then summarized based on CNV regions (CNVRs). CNVRs are defined in a dynamic manner to allow for a complex CNV overlap while maintaining precise association region. Using this approach, we avoid failure to converge and non-monotonic curve fitting weaknesses of programs, such as CNVtools and CNVassoc, and although Plink is easy to use, it only provides combined CNV state probe-based statistics, not state-specific CNVRs. Existing CNV association methods do not provide any quality tracking information to filter confident associations, a key issue which is fully addressed by ParseCNV. In addition, uncertainty in CNV calls underlying CNV associations is evaluated to verify significant results, including CNV overlap profiles, genomic context, number of probes supporting the CNV and single-probe intensities. When optimal quality control parameters are followed using ParseCNV, 90% of CNVs validate by polymerase chain reaction, an often problematic stage because of inadequate significant association review. ParseCNV is freely available at http://parsecnv.sourceforge.net. PMID:23293001

  10. ParseCNV integrative copy number variation association software with quality tracking.

    PubMed

    Glessner, Joseph T; Li, Jin; Hakonarson, Hakon

    2013-03-01

    A number of copy number variation (CNV) calling algorithms exist; however, comprehensive software tools for CNV association studies are lacking. We describe ParseCNV, unique software that takes CNV calls and creates probe-based statistics for CNV occurrence in both case-control design and in family based studies addressing both de novo and inheritance events, which are then summarized based on CNV regions (CNVRs). CNVRs are defined in a dynamic manner to allow for a complex CNV overlap while maintaining precise association region. Using this approach, we avoid failure to converge and non-monotonic curve fitting weaknesses of programs, such as CNVtools and CNVassoc, and although Plink is easy to use, it only provides combined CNV state probe-based statistics, not state-specific CNVRs. Existing CNV association methods do not provide any quality tracking information to filter confident associations, a key issue which is fully addressed by ParseCNV. In addition, uncertainty in CNV calls underlying CNV associations is evaluated to verify significant results, including CNV overlap profiles, genomic context, number of probes supporting the CNV and single-probe intensities. When optimal quality control parameters are followed using ParseCNV, 90% of CNVs validate by polymerase chain reaction, an often problematic stage because of inadequate significant association review. ParseCNV is freely available at http://parsecnv.sourceforge.net.

  11. The effect of algorithms on copy number variant detection.

    PubMed

    Tsuang, Debby W; Millard, Steven P; Ely, Benjamin; Chi, Peter; Wang, Kenneth; Raskind, Wendy H; Kim, Sulgi; Brkanac, Zoran; Yu, Chang-En

    2010-12-30

    The detection of copy number variants (CNVs) and the results of CNV-disease association studies rely on how CNVs are defined, and because array-based technologies can only infer CNVs, CNV-calling algorithms can produce vastly different findings. Several authors have noted the large-scale variability between CNV-detection methods, as well as the substantial false positive and false negative rates associated with those methods. In this study, we use variations of four common algorithms for CNV detection (PennCNV, QuantiSNP, HMMSeg, and cnvPartition) and two definitions of overlap (any overlap and an overlap of at least 40% of the smaller CNV) to illustrate the effects of varying algorithms and definitions of overlap on CNV discovery. We used a 56 K Illumina genotyping array enriched for CNV regions to generate hybridization intensities and allele frequencies for 48 Caucasian schizophrenia cases and 48 age-, ethnicity-, and gender-matched control subjects. No algorithm found a difference in CNV burden between the two groups. However, the total number of CNVs called ranged from 102 to 3,765 across algorithms. The mean CNV size ranged from 46 kb to 787 kb, and the average number of CNVs per subject ranged from 1 to 39. The number of novel CNVs not previously reported in normal subjects ranged from 0 to 212. Motivated by the availability of multiple publicly available genome-wide SNP arrays, investigators are conducting numerous analyses to identify putative additional CNVs in complex genetic disorders. However, the number of CNVs identified in array-based studies, and whether these CNVs are novel or valid, will depend on the algorithm(s) used. Thus, given the variety of methods used, there will be many false positives and false negatives. Both guidelines for the identification of CNVs inferred from high-density arrays and the establishment of a gold standard for validation of CNVs are needed.

  12. Comprehensive performance comparison of high-resolution array platforms for genome-wide Copy Number Variation (CNV) analysis in humans.

    PubMed

    Haraksingh, Rajini R; Abyzov, Alexej; Urban, Alexander Eckehart

    2017-04-24

    High-resolution microarray technology is routinely used in basic research and clinical practice to efficiently detect copy number variants (CNVs) across the entire human genome. A new generation of arrays combining high probe densities with optimized designs will comprise essential tools for genome analysis in the coming years. We systematically compared the genome-wide CNV detection power of all 17 available array designs from the Affymetrix, Agilent, and Illumina platforms by hybridizing the well-characterized genome of 1000 Genomes Project subject NA12878 to all arrays, and performing data analysis using both manufacturer-recommended and platform-independent software. We benchmarked the resulting CNV call sets from each array using a gold standard set of CNVs for this genome derived from 1000 Genomes Project whole genome sequencing data. The arrays tested comprise both SNP and aCGH platforms with varying designs and contain between ~0.5 to ~4.6 million probes. Across the arrays CNV detection varied widely in number of CNV calls (4-489), CNV size range (~40 bp to ~8 Mbp), and percentage of non-validated CNVs (0-86%). We discovered strikingly strong effects of specific array design principles on performance. For example, some SNP array designs with the largest numbers of probes and extensive exonic coverage produced a considerable number of CNV calls that could not be validated, compared to designs with probe numbers that are sometimes an order of magnitude smaller. This effect was only partially ameliorated using different analysis software and optimizing data analysis parameters. High-resolution microarrays will continue to be used as reliable, cost- and time-efficient tools for CNV analysis. However, different applications tolerate different limitations in CNV detection. Our study quantified how these arrays differ in total number and size range of detected CNVs as well as sensitivity, and determined how each array balances these attributes. This analysis will

  13. Sex chromosome aneuploidies and copy-number variants: a further explanation for neurodevelopmental prognosis variability?

    PubMed

    Le Gall, Jessica; Nizon, Mathilde; Pichon, Olivier; Andrieux, Joris; Audebert-Bellanger, Séverine; Baron, Sabine; Beneteau, Claire; Bilan, Frédéric; Boute, Odile; Busa, Tiffany; Cormier-Daire, Valérie; Ferec, Claude; Fradin, Mélanie; Gilbert-Dussardier, Brigitte; Jaillard, Sylvie; Jønch, Aia; Martin-Coignard, Dominique; Mercier, Sandra; Moutton, Sébastien; Rooryck, Caroline; Schaefer, Elise; Vincent, Marie; Sanlaville, Damien; Le Caignec, Cédric; Jacquemont, Sébastien; David, Albert; Isidor, Bertrand

    2017-08-01

    Sex chromosome aneuploidies (SCA) is a group of conditions in which individuals have an abnormal number of sex chromosomes. SCA, such as Klinefelter's syndrome, XYY syndrome, and Triple X syndrome are associated with a large range of neurological outcome. Another genetic event such as another cytogenetic abnormality may explain a part of this variable expressivity. In this study, we have recruited fourteen patients with intellectual disability or developmental delay carrying SCA associated with a copy-number variant (CNV). In our cohort (four patients 47,XXY, four patients 47,XXX, and six patients 47,XYY), seven patients were carrying a pathogenic CNV, two a likely pathogenic CNV and five a variant of uncertain significance. Our analysis suggests that CNV might be considered as an additional independent genetic factor for intellectual disability and developmental delay for patients with SCA and neurodevelopmental disorder.

  14. NanoStringNormCNV: pre-processing of NanoString CNV data.

    PubMed

    Sendorek, Dorota H; Lalonde, Emilie; Yao, Cindy Q; Sabelnykova, Veronica Y; Bristow, Robert G; Boutros, Paul C

    2018-03-15

    The NanoString System is a well-established technology for measuring RNA and DNA abundance. Although it can estimate copy number variation, relatively few tools support analysis of these data. To address this gap, we created NanoStringNormCNV, an R package for pre-processing and copy number variant calling from NanoString data. This package implements algorithms for pre-processing, quality-control, normalization and copy number variation detection. A series of reporting and data visualization methods support exploratory analyses. To demonstrate its utility, we apply it to a new dataset of 96 genes profiled on 41 prostate tumour and 24 matched normal samples. NanoStringNormCNV is implemented in R and is freely available at http://labs.oicr.on.ca/boutros-lab/software/nanostringnormcnv. paul.boutros@oicr.on.ca. Supplementary data are available at Bioinformatics online.

  15. An Organismal CNV Mutator Phenotype Restricted to Early Human Development.

    PubMed

    Liu, Pengfei; Yuan, Bo; Carvalho, Claudia M B; Wuster, Arthur; Walter, Klaudia; Zhang, Ling; Gambin, Tomasz; Chong, Zechen; Campbell, Ian M; Coban Akdemir, Zeynep; Gelowani, Violet; Writzl, Karin; Bacino, Carlos A; Lindsay, Sarah J; Withers, Marjorie; Gonzaga-Jauregui, Claudia; Wiszniewska, Joanna; Scull, Jennifer; Stankiewicz, Paweł; Jhangiani, Shalini N; Muzny, Donna M; Zhang, Feng; Chen, Ken; Gibbs, Richard A; Rautenstrauss, Bernd; Cheung, Sau Wai; Smith, Janice; Breman, Amy; Shaw, Chad A; Patel, Ankita; Hurles, Matthew E; Lupski, James R

    2017-02-23

    De novo copy number variants (dnCNVs) arising at multiple loci in a personal genome have usually been considered to reflect cancer somatic genomic instabilities. We describe a multiple dnCNV (MdnCNV) phenomenon in which individuals with genomic disorders carry five to ten constitutional dnCNVs. These CNVs originate from independent formation incidences, are predominantly tandem duplications or complex gains, exhibit breakpoint junction features reminiscent of replicative repair, and show increased de novo point mutations flanking the rearrangement junctions. The active CNV mutation shower appears to be restricted to a transient perizygotic period. We propose that a defect in the CNV formation process is responsible for the "CNV-mutator state," and this state is dampened after early embryogenesis. The constitutional MdnCNV phenomenon resembles chromosomal instability in various cancers. Investigations of this phenomenon may provide unique access to understanding genomic disorders, structural variant mutagenesis, human evolution, and cancer biology. Copyright © 2017 Elsevier Inc. All rights reserved.

  16. "They Can't Find Anything Wrong with Him, Yet": Mothers' experiences of parenting an infant with a prenatally diagnosed copy number variant (CNV).

    PubMed

    Werner-Lin, Allison; Walser, Sarah; Barg, Frances K; Bernhardt, Barbara A

    2017-02-01

    Chromosome microarray (CMA) testing is used widely in prenatal settings. Some copy number variants (CNVs) detected using CMA are associated with variable or uncertain phenotype and/or possible neurocognitive involvement. Little is known about parenting an infant following such findings. Researchers conducted interviews with 23 mothers of infants diagnosed prenatally with a potentially pathogenic CNV to elicit perspectives on the child's development and disclosure of results to others. Interviews were audiotaped and analyzed for common themes. Most respondents reported their infants were developing typically. The majority expressed concern about their child's future development given the CNV. They reassured themselves their child was unaffected by: comparing him/her to siblings, scrutinizing the child's appearance and behavior, or following provider reassurances. Even without developmental and neurological concerns, some remained acutely observant of their child's neurocognitive development, leading to enrollment in early intervention or ongoing medical assessments. Mothers who were unconcerned stated they would likely attribute atypical behavior or developmental to the CNV. All interviewees shared the result with pediatricians, relatives, or friends, and many shared across groups. Most shared information with pregnant friends considering prenatal testing, but withheld partial or full information from family members due to stigma, lack of understanding, inability to explain the CNV, or presumptions that the child was unaffected. Research must address the long-term consequences of returning uncertain results for parent-child bonding and costs of ongoing assessment and early intervention for typically developing children. Follow up appointments will permit providers to screen for anxiety and assuage worry in the absence of symptoms. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.

  17. A method for rapid, targeted CNV genotyping identifies rare variants associated with neurocognitive disease.

    PubMed

    Mefford, Heather C; Cooper, Gregory M; Zerr, Troy; Smith, Joshua D; Baker, Carl; Shafer, Neil; Thorland, Erik C; Skinner, Cindy; Schwartz, Charles E; Nickerson, Deborah A; Eichler, Evan E

    2009-09-01

    Copy-number variants (CNVs) are substantial contributors to human disease. A central challenge in CNV-disease association studies is to characterize the pathogenicity of rare and possibly incompletely penetrant events, which requires the accurate detection of rare CNVs in large numbers of individuals. Cost and throughput issues limit our ability to perform these studies. We have adapted the Illumina BeadXpress SNP genotyping assay and developed an algorithm, SNP-Conditional OUTlier detection (SCOUT), to rapidly and accurately detect both rare and common CNVs in large cohorts. This approach is customizable, cost effective, highly parallelized, and largely automated. We applied this method to screen 69 loci in 1105 children with unexplained intellectual disability, identifying pathogenic variants in 3.1% of these individuals and potentially pathogenic variants in an additional 2.3%. We identified seven individuals (0.7%) with a deletion of 16p11.2, which has been previously associated with autism. Our results widen the phenotypic spectrum of these deletions to include intellectual disability without autism. We also detected 1.65-3.4 Mbp duplications at 16p13.11 in 1.1% of affected individuals and 350 kbp deletions at 15q11.2, near the Prader-Willi/Angelman syndrome critical region, in 0.8% of affected individuals. Compared to published CNVs in controls they are significantly (P = 4.7 x 10(-5) and 0.003, respectively) enriched in these children, supporting previously published hypotheses that they are neurocognitive disease risk factors. More generally, this approach offers a previously unavailable balance between customization, cost, and throughput for analysis of CNVs and should prove valuable for targeted CNV detection in both research and diagnostic settings.

  18. Improved Multiplex Ligation-dependent Probe Amplification (i-MLPA) for rapid copy number variant (CNV) detection.

    PubMed

    Saxena, Sonal; Gowdhaman, Kavitha; Kkani, Poornima; Vennapusa, Bhavyasri; Rama Subramanian, Chellamuthu; Ganesh Kumar, S; Mohan, Kommu Naga

    2015-10-23

    In Multiplex Ligation-dependent Probe Amplification (MLPA), copy number variants (CNVs) for specific genes are identified after normalization of the amounts of PCR products from ligated reference probes hybridized to genomic regions that are ideally free from normal variation. However, we observed ambiguous calls for two reference probes in an investigation of the human 15q11.2 region by MLPA among 20 controls, due to the presence of single nucleotide polymorphisms (SNPs) in the probe-binding regions. Further in silico analysis revealed that 18 out of 19 reference probes hybridize to regions subject to variation, underlining the requirement for designing new reference probes against variation-free regions. An improved MLPA (i-MLPA) method was developed by generating a new set of reference probes to reduce the chances of ambiguous calls and new reagents that reduce hybridization times to 30 min from 16h to obtain MLPA ratio data within 6h. Using i-MLPA, we screened 240 schizophrenia patients for CNVs in 15q11.2 region. Three deletions and two duplications were identified among the 240 schizophrenia patients. No variation was observed for the new reference probes. Taken together, i-MLPA procedure helps obtaining non-ambiguous CNV calls within 6h without compromising accuracy. Copyright © 2015 Elsevier B.V. All rights reserved.

  19. Haplotype Phasing and Inheritance of Copy Number Variants in Nuclear Families

    PubMed Central

    Palta, Priit; Kaplinski, Lauris; Nagirnaja, Liina; Veidenberg, Andres; Möls, Märt; Nelis, Mari; Esko, Tõnu; Metspalu, Andres; Laan, Maris; Remm, Maido

    2015-01-01

    DNA copy number variants (CNVs) that alter the copy number of a particular DNA segment in the genome play an important role in human phenotypic variability and disease susceptibility. A number of CNVs overlapping with genes have been shown to confer risk to a variety of human diseases thus highlighting the relevance of addressing the variability of CNVs at a higher resolution. So far, it has not been possible to deterministically infer the allelic composition of different haplotypes present within the CNV regions. We have developed a novel computational method, called PiCNV, which enables to resolve the haplotype sequence composition within CNV regions in nuclear families based on SNP genotyping microarray data. The algorithm allows to i) phase normal and CNV-carrying haplotypes in the copy number variable regions, ii) resolve the allelic copies of rearranged DNA sequence within the haplotypes and iii) infer the heritability of identified haplotypes in trios or larger nuclear families. To our knowledge this is the first program available that can deterministically phase null, mono-, di-, tri- and tetraploid genotypes in CNV loci. We applied our method to study the composition and inheritance of haplotypes in CNV regions of 30 HapMap Yoruban trios and 34 Estonian families. For 93.6% of the CNV loci, PiCNV enabled to unambiguously phase normal and CNV-carrying haplotypes and follow their transmission in the corresponding families. Furthermore, allelic composition analysis identified the co-occurrence of alternative allelic copies within 66.7% of haplotypes carrying copy number gains. We also observed less frequent transmission of CNV-carrying haplotypes from parents to children compared to normal haplotypes and identified an emergence of several de novo deletions and duplications in the offspring. PMID:25853576

  20. Haplotype phasing and inheritance of copy number variants in nuclear families.

    PubMed

    Palta, Priit; Kaplinski, Lauris; Nagirnaja, Liina; Veidenberg, Andres; Möls, Märt; Nelis, Mari; Esko, Tõnu; Metspalu, Andres; Laan, Maris; Remm, Maido

    2015-01-01

    DNA copy number variants (CNVs) that alter the copy number of a particular DNA segment in the genome play an important role in human phenotypic variability and disease susceptibility. A number of CNVs overlapping with genes have been shown to confer risk to a variety of human diseases thus highlighting the relevance of addressing the variability of CNVs at a higher resolution. So far, it has not been possible to deterministically infer the allelic composition of different haplotypes present within the CNV regions. We have developed a novel computational method, called PiCNV, which enables to resolve the haplotype sequence composition within CNV regions in nuclear families based on SNP genotyping microarray data. The algorithm allows to i) phase normal and CNV-carrying haplotypes in the copy number variable regions, ii) resolve the allelic copies of rearranged DNA sequence within the haplotypes and iii) infer the heritability of identified haplotypes in trios or larger nuclear families. To our knowledge this is the first program available that can deterministically phase null, mono-, di-, tri- and tetraploid genotypes in CNV loci. We applied our method to study the composition and inheritance of haplotypes in CNV regions of 30 HapMap Yoruban trios and 34 Estonian families. For 93.6% of the CNV loci, PiCNV enabled to unambiguously phase normal and CNV-carrying haplotypes and follow their transmission in the corresponding families. Furthermore, allelic composition analysis identified the co-occurrence of alternative allelic copies within 66.7% of haplotypes carrying copy number gains. We also observed less frequent transmission of CNV-carrying haplotypes from parents to children compared to normal haplotypes and identified an emergence of several de novo deletions and duplications in the offspring.

  1. Genetic Structures of Copy Number Variants Revealed by Genotyping Single Sperm

    PubMed Central

    Luo, Minjie; Cui, Xiangfeng; Fredman, David; Brookes, Anthony J.; Azaro, Marco A.; Greenawalt, Danielle M.; Hu, Guohong; Wang, Hui-Yun; Tereshchenko, Irina V.; Lin, Yong; Shentu, Yue; Gao, Richeng; Shen, Li; Li, Honghua

    2009-01-01

    Background Copy number variants (CNVs) occupy a significant portion of the human genome and may have important roles in meiotic recombination, human genome evolution and gene expression. Many genetic diseases may be underlain by CNVs. However, because of the presence of their multiple copies, variability in copy numbers and the diploidy of the human genome, detailed genetic structure of CNVs cannot be readily studied by available techniques. Methodology/Principal Findings Single sperm samples were used as the primary subjects for the study so that CNV haplotypes in the sperm donors could be studied individually. Forty-eight CNVs characterized in a previous study were analyzed using a microarray-based high-throughput genotyping method after multiplex amplification. Seventeen single nucleotide polymorphisms (SNPs) were also included as controls. Two single-base variants, either allelic or paralogous, could be discriminated for all markers. Microarray data were used to resolve SNP alleles and CNV haplotypes, to quantitatively assess the numbers and compositions of the paralogous segments in each CNV haplotype. Conclusions/Significance This is the first study of the genetic structure of CNVs on a large scale. Resulting information may help understand evolution of the human genome, gain insight into many genetic processes, and discriminate between CNVs and SNPs. The highly sensitive high-throughput experimental system with haploid sperm samples as subjects may be used to facilitate detailed large-scale CNV analysis. PMID:19384415

  2. Evaluation of three read-depth based CNV detection tools using whole-exome sequencing data.

    PubMed

    Yao, Ruen; Zhang, Cheng; Yu, Tingting; Li, Niu; Hu, Xuyun; Wang, Xiumin; Wang, Jian; Shen, Yiping

    2017-01-01

    Whole exome sequencing (WES) has been widely accepted as a robust and cost-effective approach for clinical genetic testing of small sequence variants. Detection of copy number variants (CNV) within WES data have become possible through the development of various algorithms and software programs that utilize read-depth as the main information. The aim of this study was to evaluate three commonly used, WES read-depth based CNV detection programs using high-resolution chromosomal microarray analysis (CMA) as a standard. Paired CMA and WES data were acquired for 45 samples. A total of 219 CNVs (size ranged from 2.3 kb - 35 mb) identified on three CMA platforms (Affymetrix, Agilent and Illumina) were used as standards. CNVs were called from WES data using XHMM, CoNIFER, and CNVnator with modified settings. All three software packages detected an elevated proportion of small variants (< 20 kb) compared to CMA. XHMM and CoNIFER had poor detection sensitivity (22.2 and 14.6%), which correlated with the number of capturing probes involved. CNVnator detected most variants and had better sensitivity (87.7%); however, suffered from an overwhelming detection of small CNVs below 20 kb, which required further confirmation. Size estimation of variants was exaggerated by CNVnator and understated by XHMM and CoNIFER. Low concordances of CNV, detected by three different read-depth based programs, indicate the immature status of WES-based CNV detection. Low sensitivity and uncertain specificity of WES-based CNV detection in comparison with CMA based CNV detection suggests that CMA will continue to play an important role in detecting clinical grade CNV in the NGS era, which is largely based on WES.

  3. Assessment of circulating copy number variant detection for cancer screening.

    PubMed

    Molparia, Bhuvan; Nichani, Eshaan; Torkamani, Ali

    2017-01-01

    Current high-sensitivity cancer screening methods, largely utilizing correlative biomarkers, suffer from false positive rates that lead to unnecessary medical procedures and debatable public health benefit overall. Detection of circulating tumor DNA (ctDNA), a causal biomarker, has the potential to revolutionize cancer screening. Thus far, the majority of ctDNA studies have focused on detection of tumor-specific point mutations after cancer diagnosis for the purpose of post-treatment surveillance. However, ctDNA point mutation detection methods developed to date likely lack either the scope or analytical sensitivity necessary to be useful for cancer screening, due to the low (<1%) ctDNA fraction derived from early stage tumors. On the other hand, tumor-derived copy number variant (CNV) detection is hypothetically a superior means of ctDNA-based cancer screening for many tumor types, given that, relative to point mutations, each individual tumor CNV contributes a much larger number of ctDNA fragments to the overall pool of circulating free DNA (cfDNA). A small number of studies have demonstrated the potential of ctDNA CNV-based screening in select cancer types. Here we perform an in silico assessment of the potential for ctDNA CNV-based cancer screening across many common cancers, and suggest ctDNA CNV detection shows promise as a broad cancer screening methodology.

  4. Impact of constitutional copy number variants on biological pathway evolution.

    PubMed

    Poptsova, Maria; Banerjee, Samprit; Gokcumen, Omer; Rubin, Mark A; Demichelis, Francesca

    2013-01-23

    Inherited Copy Number Variants (CNVs) can modulate the expression levels of individual genes. However, little is known about how CNVs alter biological pathways and how this varies across different populations. To trace potential evolutionary changes of well-described biological pathways, we jointly queried the genomes and the transcriptomes of a collection of individuals with Caucasian, Asian or Yoruban descent combining high-resolution array and sequencing data. We implemented an enrichment analysis of pathways accounting for CNVs and genes sizes and detected significant enrichment not only in signal transduction and extracellular biological processes, but also in metabolism pathways. Upon the estimation of CNV population differentiation (CNVs with different polymorphism frequencies across populations), we evaluated that 22% of the pathways contain at least one gene that is proximal to a CNV (CNV-gene pair) that shows significant population differentiation. The majority of these CNV-gene pairs belong to signal transduction pathways and 6% of the CNV-gene pairs show statistical association between the copy number states and the transcript levels. The analysis suggested possible examples of positive selection within individual populations including NF-kB, MAPK signaling pathways, and Alu/L1 retrotransposition factors. Altogether, our results suggest that constitutional CNVs may modulate subtle pathway changes through specific pathway enzymes, which may become fixed in some populations.

  5. Hidden Markov Model-Based CNV Detection Algorithms for Illumina Genotyping Microarrays.

    PubMed

    Seiser, Eric L; Innocenti, Federico

    2014-01-01

    Somatic alterations in DNA copy number have been well studied in numerous malignancies, yet the role of germline DNA copy number variation in cancer is still emerging. Genotyping microarrays generate allele-specific signal intensities to determine genotype, but may also be used to infer DNA copy number using additional computational approaches. Numerous tools have been developed to analyze Illumina genotype microarray data for copy number variant (CNV) discovery, although commonly utilized algorithms freely available to the public employ approaches based upon the use of hidden Markov models (HMMs). QuantiSNP, PennCNV, and GenoCN utilize HMMs with six copy number states but vary in how transition and emission probabilities are calculated. Performance of these CNV detection algorithms has been shown to be variable between both genotyping platforms and data sets, although HMM approaches generally outperform other current methods. Low sensitivity is prevalent with HMM-based algorithms, suggesting the need for continued improvement in CNV detection methodologies.

  6. Clinical relevance of small copy-number variants in chromosomal microarray clinical testing.

    PubMed

    Hollenbeck, Dana; Williams, Crescenda L; Drazba, Kathryn; Descartes, Maria; Korf, Bruce R; Rutledge, S Lane; Lose, Edward J; Robin, Nathaniel H; Carroll, Andrew J; Mikhail, Fady M

    2017-04-01

    The 2010 consensus statement on diagnostic chromosomal microarray (CMA) testing recommended an array resolution ≥400 kb throughout the genome as a balance of analytical and clinical sensitivity. In spite of the clear evidence for pathogenicity of large copy-number variants (CNVs) in neurodevelopmental disorders and/or congenital anomalies, the significance of small, nonrecurrent CNVs (<500 kb) has not been well established in a clinical setting. We investigated the clinical significance of all nonpolymorphic small, nonrecurrent CNVs (<500 kb) in patients referred for CMA clinical testing over a period of 6 years, from 2009 to 2014 (a total of 4,417 patients). We excluded from our study patients with benign or likely benign CNVs and patients with only recurrent microdeletions/microduplications <500 kb. In total, 383 patients (8.67%) were found to carry at least one small, nonrecurrent CNV, of whom 176 patients (3.98%) had one small CNV classified as a variant of uncertain significance (VUS), 45 (1.02%) had two or more small VUS CNVs, 20 (0.45%) had one small VUS CNV and a recurrent CNV, 113 (2.56%) had one small pathogenic or likely pathogenic CNV, 17 (0.38%) had two or more small pathogenic or likely pathogenic CNVs, and 12 (0.27%) had one small pathogenic or likely pathogenic CNV and a recurrent CNV. Within the pathogenic group, 80 of 142 patients (56% of all small pathogenic CNV cases) were found to have a single whole-gene or exonic deletion. The themes that emerged from our study are presented in the Discussion section. Our study demonstrates the diagnostic clinical relevance of small, nonrecurrent CNVs <500 kb during CMA clinical testing and underscores the need for careful clinical interpretation of these CNVs.Genet Med 19 4, 377-385.

  7. Impact of constitutional copy number variants on biological pathway evolution

    PubMed Central

    2013-01-01

    Background Inherited Copy Number Variants (CNVs) can modulate the expression levels of individual genes. However, little is known about how CNVs alter biological pathways and how this varies across different populations. To trace potential evolutionary changes of well-described biological pathways, we jointly queried the genomes and the transcriptomes of a collection of individuals with Caucasian, Asian or Yoruban descent combining high-resolution array and sequencing data. Results We implemented an enrichment analysis of pathways accounting for CNVs and genes sizes and detected significant enrichment not only in signal transduction and extracellular biological processes, but also in metabolism pathways. Upon the estimation of CNV population differentiation (CNVs with different polymorphism frequencies across populations), we evaluated that 22% of the pathways contain at least one gene that is proximal to a CNV (CNV-gene pair) that shows significant population differentiation. The majority of these CNV-gene pairs belong to signal transduction pathways and 6% of the CNV-gene pairs show statistical association between the copy number states and the transcript levels. Conclusions The analysis suggested possible examples of positive selection within individual populations including NF-kB, MAPK signaling pathways, and Alu/L1 retrotransposition factors. Altogether, our results suggest that constitutional CNVs may modulate subtle pathway changes through specific pathway enzymes, which may become fixed in some populations. PMID:23342974

  8. CNV-seq, a new method to detect copy number variation using high-throughput sequencing.

    PubMed

    Xie, Chao; Tammi, Martti T

    2009-03-06

    DNA copy number variation (CNV) has been recognized as an important source of genetic variation. Array comparative genomic hybridization (aCGH) is commonly used for CNV detection, but the microarray platform has a number of inherent limitations. Here, we describe a method to detect copy number variation using shotgun sequencing, CNV-seq. The method is based on a robust statistical model that describes the complete analysis procedure and allows the computation of essential confidence values for detection of CNV. Our results show that the number of reads, not the length of the reads is the key factor determining the resolution of detection. This favors the next-generation sequencing methods that rapidly produce large amount of short reads. Simulation of various sequencing methods with coverage between 0.1x to 8x show overall specificity between 91.7 - 99.9%, and sensitivity between 72.2 - 96.5%. We also show the results for assessment of CNV between two individual human genomes.

  9. New quality measure for SNP array based CNV detection.

    PubMed

    Macé, A; Tuke, M A; Beckmann, J S; Lin, L; Jacquemont, S; Weedon, M N; Reymond, A; Kutalik, Z

    2016-11-01

    Only a few large systematic studies have evaluated the impact of copy number variants (CNVs) on common diseases. Several million individuals have been genotyped on single nucleotide variation arrays, which could be used for genome-wide CNVs association studies. However, CNV calls remain prone to false positives and only empirical filtering strategies exist in the literature. To overcome this issue, we defined a new quality score (QS) estimating the probability of a CNV called by PennCNV to be confirmed by other software. Out-of-sample comparison showed that the correlation between the consensus CNV status and the QS is twice as high as it is for any previously proposed CNV filters. ROC curves displayed an AUC higher than 0.8 and simulations showed an increase up to 20% in statistical power when using QS in comparison to other filtering strategies. Superior performance was confirmed also for alternative consensus CNV definition and through improving known CNV-trait associations. http://goo.gl/T6yuFM CONTACT: zoltan.kutalik@unil.ch or aurelien@mace@unil.chSupplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  10. Identification of copy number variants in horses.

    PubMed

    Doan, Ryan; Cohen, Noah; Harrington, Jessica; Veazey, Kylee; Veazy, Kylee; Juras, Rytis; Cothran, Gus; McCue, Molly E; Skow, Loren; Dindot, Scott V

    2012-05-01

    Copy number variants (CNVs) represent a substantial source of genetic variation in mammals. However, the occurrence of CNVs in horses and their subsequent impact on phenotypic variation is unknown. We performed a study to identify CNVs in 16 horses representing 15 distinct breeds (Equus caballus) and an individual gray donkey (Equus asinus) using a whole-exome tiling array and the array comparative genomic hybridization methodology. We identified 2368 CNVs ranging in size from 197 bp to 3.5 Mb. Merging identical CNVs from each animal yielded 775 CNV regions (CNVRs), involving 1707 protein- and RNA-coding genes. The number of CNVs per animal ranged from 55 to 347, with median and mean sizes of CNVs of 5.3 kb and 99.4 kb, respectively. Approximately 6% of the genes investigated were affected by a CNV. Biological process enrichment analysis indicated CNVs primarily affected genes involved in sensory perception, signal transduction, and metabolism. CNVs also were identified in genes regulating blood group antigens, coat color, fecundity, lactation, keratin formation, neuronal homeostasis, and height in other species. Collectively, these data are the first report of copy number variation in horses and suggest that CNVs are common in the horse genome and may modulate biological processes underlying different traits observed among horses and horse breeds.

  11. MSeq-CNV: accurate detection of Copy Number Variation from Sequencing of Multiple samples.

    PubMed

    Malekpour, Seyed Amir; Pezeshk, Hamid; Sadeghi, Mehdi

    2018-03-05

    Currently a few tools are capable of detecting genome-wide Copy Number Variations (CNVs) based on sequencing of multiple samples. Although aberrations in mate pair insertion sizes provide additional hints for the CNV detection based on multiple samples, the majority of the current tools rely only on the depth of coverage. Here, we propose a new algorithm (MSeq-CNV) which allows detecting common CNVs across multiple samples. MSeq-CNV applies a mixture density for modeling aberrations in depth of coverage and abnormalities in the mate pair insertion sizes. Each component in this mixture density applies a Binomial distribution for modeling the number of mate pairs with aberration in the insertion size and also a Poisson distribution for emitting the read counts, in each genomic position. MSeq-CNV is applied on simulated data and also on real data of six HapMap individuals with high-coverage sequencing, in 1000 Genomes Project. These individuals include a CEU trio of European ancestry and a YRI trio of Nigerian ethnicity. Ancestry of these individuals is studied by clustering the identified CNVs. MSeq-CNV is also applied for detecting CNVs in two samples with low-coverage sequencing in 1000 Genomes Project and six samples form the Simons Genome Diversity Project.

  12. A stochastic inference of de novo CNV detection and association test in multiplex schizophrenia families.

    PubMed

    Wang, Shi-Heng; Chen, Wei J; Tsai, Yu-Chin; Huang, Yung-Hsiang; Hwu, Hai-Gwo; Hsiao, Chuhsing K

    2013-01-01

    The copy number variation (CNV) is a type of genetic variation in the genome. It is measured based on signal intensity measures and can be assessed repeatedly to reduce the uncertainty in PCR-based typing. Studies have shown that CNVs may lead to phenotypic variation and modification of disease expression. Various challenges exist, however, in the exploration of CNV-disease association. Here we construct latent variables to infer the discrete CNV values and to estimate the probability of mutations. In addition, we propose to pool rare variants to increase the statistical power and we conduct family studies to mitigate the computational burden in determining the composition of CNVs on each chromosome. To explore in a stochastic sense the association between the collapsing CNV variants and disease status, we utilize a Bayesian hierarchical model incorporating the mutation parameters. This model assigns integers in a probabilistic sense to the quantitatively measured copy numbers, and is able to test simultaneously the association for all variants of interest in a regression framework. This integrative model can account for the uncertainty in copy number assignment and differentiate if the variation was de novo or inherited on the basis of posterior probabilities. For family studies, this model can accommodate the dependence within family members and among repeated CNV data. Moreover, the Mendelian rule can be assumed under this model and yet the genetic variation, including de novo and inherited variation, can still be included and quantified directly for each individual. Finally, simulation studies show that this model has high true positive and low false positive rates in the detection of de novo mutation.

  13. Simultaneous mutation and copy number variation (CNV) detection by multiplex PCR-based GS-FLX sequencing.

    PubMed

    Goossens, Dirk; Moens, Lotte N; Nelis, Eva; Lenaerts, An-Sofie; Glassee, Wim; Kalbe, Andreas; Frey, Bruno; Kopal, Guido; De Jonghe, Peter; De Rijk, Peter; Del-Favero, Jurgen

    2009-03-01

    We evaluated multiplex PCR amplification as a front-end for high-throughput sequencing, to widen the applicability of massive parallel sequencers for the detailed analysis of complex genomes. Using multiplex PCR reactions, we sequenced the complete coding regions of seven genes implicated in peripheral neuropathies in 40 individuals on a GS-FLX genome sequencer (Roche). The resulting dataset showed highly specific and uniform amplification. Comparison of the GS-FLX sequencing data with the dataset generated by Sanger sequencing confirmed the detection of all variants present and proved the sensitivity of the method for mutation detection. In addition, we showed that we could exploit the multiplexed PCR amplicons to determine individual copy number variation (CNV), increasing the spectrum of detected variations to both genetic and genomic variants. We conclude that our straightforward procedure substantially expands the applicability of the massive parallel sequencers for sequencing projects of a moderate number of amplicons (50-500) with typical applications in resequencing exons in positional or functional candidate regions and molecular genetic diagnostics. 2008 Wiley-Liss, Inc.

  14. The ICR96 exon CNV validation series: a resource for orthogonal assessment of exon CNV calling in NGS data.

    PubMed

    Mahamdallie, Shazia; Ruark, Elise; Yost, Shawn; Ramsay, Emma; Uddin, Imran; Wylie, Harriett; Elliott, Anna; Strydom, Ann; Renwick, Anthony; Seal, Sheila; Rahman, Nazneen

    2017-01-01

    Detection of deletions and duplications of whole exons (exon CNVs) is a key requirement of genetic testing. Accurate detection of this variant type has proved very challenging in targeted next-generation sequencing (NGS) data, particularly if only a single exon is involved. Many different NGS exon CNV calling methods have been developed over the last five years. Such methods are usually evaluated using simulated and/or in-house data due to a lack of publicly-available datasets with orthogonally generated results. This hinders tool comparisons, transparency and reproducibility. To provide a community resource for assessment of exon CNV calling methods in targeted NGS data, we here present the ICR96 exon CNV validation series. The dataset includes high-quality sequencing data from a targeted NGS assay (the TruSight Cancer Panel) together with Multiplex Ligation-dependent Probe Amplification (MLPA) results for 96 independent samples. 66 samples contain at least one validated exon CNV and 30 samples have validated negative results for exon CNVs in 26 genes. The dataset includes 46 exon CNVs in BRCA1 , BRCA2 , TP53 , MLH1 , MSH2 , MSH6 , PMS2 , EPCAM or PTEN , giving excellent representation of the cancer predisposition genes most frequently tested in clinical practice. Moreover, the validated exon CNVs include 25 single exon CNVs, the most difficult type of exon CNV to detect. The FASTQ files for the ICR96 exon CNV validation series can be accessed through the European-Genome phenome Archive (EGA) under the accession number EGAS00001002428.

  15. Evaluation of copy-number variants as modifiers of breast and ovarian cancer risk for BRCA1 pathogenic variant carriers

    PubMed Central

    Walker, Logan C; Marquart, Louise; Pearson, John F; Wiggins, George A R; O'Mara, Tracy A; Parsons, Michael T; Barrowdale, Daniel; McGuffog, Lesley; Dennis, Joe; Benitez, Javier; Slavin, Thomas P; Radice, Paolo; Frost, Debra; Godwin, Andrew K; Meindl, Alfons; Schmutzler, Rita Katharina; Isaacs, Claudine; Peshkin, Beth N; Caldes, Trinidad; Hogervorst, Frans BL; Lazaro, Conxi; Jakubowska, Anna; Montagna, Marco; Chen, Xiaoqing; Offit, Kenneth; Hulick, Peter J; Andrulis, Irene L; Lindblom, Annika; Nussbaum, Robert L; Nathanson, Katherine L; Chenevix-Trench, Georgia; Antoniou, Antonis C; Couch, Fergus J; Spurdle, Amanda B

    2017-01-01

    Genome-wide studies of patients carrying pathogenic variants (mutations) in BRCA1 or BRCA2 have reported strong associations between single-nucleotide polymorphisms (SNPs) and cancer risk. To conduct the first genome-wide association analysis of copy-number variants (CNVs) with breast or ovarian cancer risk in a cohort of 2500 BRCA1 pathogenic variant carriers, CNV discovery was performed using multiple calling algorithms and Illumina 610k SNP array data from a previously published genome-wide association study. Our analysis, which focused on functionally disruptive genomic deletions overlapping gene regions, identified a number of loci associated with risk of breast or ovarian cancer for BRCA1 pathogenic variant carriers. Despite only including putative deletions called by at least two or more algorithms, detection of selected CNVs by ancillary molecular technologies only confirmed 40% of predicted common (>1% allele frequency) variants. These include four loci that were associated (unadjusted P<0.05) with breast cancer (GTF2H2, ZNF385B, NAALADL2 and PSG5), and two loci associated with ovarian cancer (CYP2A7 and OR2A1). An interesting finding from this study was an association of a validated CNV deletion at the CYP2A7 locus (19q13.2) with decreased ovarian cancer risk (relative risk=0.50, P=0.007). Genomic analysis found this deletion coincides with a region displaying strong regulatory potential in ovarian tissue, but not in breast epithelial cells. This study highlighted the need to verify CNVs in vitro, but also provides evidence that experimentally validated CNVs (with plausible biological consequences) can modify risk of breast or ovarian cancer in BRCA1 pathogenic variant carriers. PMID:28145423

  16. Insights on the functional impact of microRNAs present in autism-associated copy number variants.

    PubMed

    Vaishnavi, Varadarajan; Manikandan, Mayakannan; Tiwary, Basant K; Munirajan, Arasambattu Kannan

    2013-01-01

    Autism spectrum disorder is a complex neurodevelopmental disorder that appears during the first three years of infancy and lasts throughout a person's life. Recently a large category of genomic structural variants, denoted as copy number variants (CNVs), were established to be a major contributor of the pathophysiology of autism. To date almost all studies have focussed only on the genes present in the CNV loci, but the impact of non-coding regulatory microRNAs (miRNAs) present in these regions remain largely unexplored. Hence we attempted to elucidate the biological and functional significance of miRNAs present in autism-associated CNV loci and their target genes by using a series of computational tools. We demonstrate that nearly 11% of the CNV loci harbor miRNAs and a few of these miRNAs were previously reported to be associated with autism. A systematic analysis of the CNV-miRNAs based on their interactions with the target genes enabled the identification of top 10 miRNAs namely hsa-miR-590-3p, hsa-miR-944, hsa-miR-570, hsa-miR-34a, hsa-miR-124, hsa-miR-548f, hsa-miR-429, hsa-miR-200b, hsa-miR-195 and hsa-miR-497 as hub molecules. Further, the CNV-miRNAs formed a regulatory loop with transcription factors and their downstream target genes, and annotation of these target genes indicated their functional involvement in neurodevelopment and synapse. Moreover, miRNAs present in deleted and duplicated CNV loci may explain the difference in dosage of the crucial genes controlled by them. These CNV-miRNAs can also impair the global processing and biogenesis of all miRNAs by targeting key molecules in the miRNA pathway. To our knowledge, this is the first report to highlight the significance of CNV-microRNAs and their target genes to contribute towards the genetic heterogeneity and phenotypic variability of autism.

  17. Computational tools for copy number variation (CNV) detection using next-generation sequencing data: features and perspectives.

    PubMed

    Zhao, Min; Wang, Qingguo; Wang, Quan; Jia, Peilin; Zhao, Zhongming

    2013-01-01

    Copy number variation (CNV) is a prevalent form of critical genetic variation that leads to an abnormal number of copies of large genomic regions in a cell. Microarray-based comparative genome hybridization (arrayCGH) or genotyping arrays have been standard technologies to detect large regions subject to copy number changes in genomes until most recently high-resolution sequence data can be analyzed by next-generation sequencing (NGS). During the last several years, NGS-based analysis has been widely applied to identify CNVs in both healthy and diseased individuals. Correspondingly, the strong demand for NGS-based CNV analyses has fuelled development of numerous computational methods and tools for CNV detection. In this article, we review the recent advances in computational methods pertaining to CNV detection using whole genome and whole exome sequencing data. Additionally, we discuss their strengths and weaknesses and suggest directions for future development.

  18. Computational tools for copy number variation (CNV) detection using next-generation sequencing data: features and perspectives

    PubMed Central

    2013-01-01

    Copy number variation (CNV) is a prevalent form of critical genetic variation that leads to an abnormal number of copies of large genomic regions in a cell. Microarray-based comparative genome hybridization (arrayCGH) or genotyping arrays have been standard technologies to detect large regions subject to copy number changes in genomes until most recently high-resolution sequence data can be analyzed by next-generation sequencing (NGS). During the last several years, NGS-based analysis has been widely applied to identify CNVs in both healthy and diseased individuals. Correspondingly, the strong demand for NGS-based CNV analyses has fuelled development of numerous computational methods and tools for CNV detection. In this article, we review the recent advances in computational methods pertaining to CNV detection using whole genome and whole exome sequencing data. Additionally, we discuss their strengths and weaknesses and suggest directions for future development. PMID:24564169

  19. Clinical Validation of Copy Number Variant Detection from Targeted Next-Generation Sequencing Panels.

    PubMed

    Kerkhof, Jennifer; Schenkel, Laila C; Reilly, Jack; McRobbie, Sheri; Aref-Eshghi, Erfan; Stuart, Alan; Rupar, C Anthony; Adams, Paul; Hegele, Robert A; Lin, Hanxin; Rodenhiser, David; Knoll, Joan; Ainsworth, Peter J; Sadikovic, Bekim

    2017-11-01

    Next-generation sequencing (NGS) technology has rapidly replaced Sanger sequencing in the assessment of sequence variations in clinical genetics laboratories. One major limitation of current NGS approaches is the ability to detect copy number variations (CNVs) approximately >50 bp. Because these represent a major mutational burden in many genetic disorders, parallel CNV assessment using alternate supplemental methods, along with the NGS analysis, is normally required, resulting in increased labor, costs, and turnaround times. The objective of this study was to clinically validate a novel CNV detection algorithm using targeted clinical NGS gene panel data. We have applied this approach in a retrospective cohort of 391 samples and a prospective cohort of 2375 samples and found a 100% sensitivity (95% CI, 89%-100%) for 37 unique events and a high degree of specificity to detect CNVs across nine distinct targeted NGS gene panels. This NGS CNV pipeline enables stand-alone first-tier assessment for CNV and sequence variants in a clinical laboratory setting, dispensing with the need for parallel CNV analysis using classic techniques, such as microarray, long-range PCR, or multiplex ligation-dependent probe amplification. This NGS CNV pipeline can also be applied to the assessment of complex genomic regions, including pseudogenic DNA sequences, such as the PMS2CL gene, and to mitochondrial genome heteroplasmy detection. Copyright © 2017 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.

  20. CNV analysis in the Lithuanian population.

    PubMed

    Urnikyte, A; Domarkiene, I; Stoma, S; Ambrozaityte, L; Uktveryte, I; Meskiene, R; Kasiulevičius, V; Burokiene, N; Kučinskas, V

    2016-05-04

    Although copy number variation (CNV) has received much attention, knowledge about the characteristics of CNVs such as occurrence rate and distribution in the genome between populations and within the same population is still insufficient. In this study, Illumina 770 K HumanOmniExpress-12 v1.0 (and v1.1) arrays were used to examine the diversity and distribution of CNVs in 286 unrelated individuals from the two main ethnolinguistic groups of the Lithuanian population (Aukštaičiai and Žemaičiai) (see Additional file 3). For primary data analysis, the Illumina GenomeStudio™ Genotyping Module v1.9 and two algorithms, cnvPartition 3.2.0 and QuantiSNP 2.0, were used to identify high-confidence CNVs. A total of 478 autosomal CNVs were detected by both algorithms, and those were clustered in 87 copy number variation regions (CNVRs), spanning ~12.5 Mb of the genome (see Table 1). At least 8.6 % of the CNVRs were unique and had not been reported in the Database of Genomic Variants. Most CNVRs (57.5 %) were rare, with a frequency of <1 %, whereas common CNVRs with at least 5 % frequency made up only 1.1 % of all CNVRs identified. About 49 % of non-singleton CNVRs were shared between Aukštaičiai and Žemaičiai, and the remaining CNVRs were specific to each group. Many of the CNVs detected (66 %) overlapped with known UCSC gene regions. The ethnolinguistic groups of the Lithuanian population could not be differentiated based on CNV profiles, which may reflect their geographical proximity and suggest the homogeneity of the Lithuanian population. In addition, putative novel CNVs unique to the Lithuanian population were identified. The results of our study enhance the CNV map of the Lithuanian population.

  1. CNV-WebStore: online CNV analysis, storage and interpretation.

    PubMed

    Vandeweyer, Geert; Reyniers, Edwin; Wuyts, Wim; Rooms, Liesbeth; Kooy, R Frank

    2011-01-05

    Microarray technology allows the analysis of genomic aberrations at an ever increasing resolution, making functional interpretation of these vast amounts of data the main bottleneck in routine implementation of high resolution array platforms, and emphasising the need for a centralised and easy to use CNV data management and interpretation system. We present CNV-WebStore, an online platform to streamline the processing and downstream interpretation of microarray data in a clinical context, tailored towards but not limited to the Illumina BeadArray platform. Provided analysis tools include CNV analsyis, parent of origin and uniparental disomy detection. Interpretation tools include data visualisation, gene prioritisation, automated PubMed searching, linking data to several genome browsers and annotation of CNVs based on several public databases. Finally a module is provided for uniform reporting of results. CNV-WebStore is able to present copy number data in an intuitive way to both lab technicians and clinicians, making it a useful tool in daily clinical practice.

  2. Exploring the feasibility of using copy number variants as genetic markers through large-scale whole genome sequencing experiments

    USDA-ARS?s Scientific Manuscript database

    Copy number variants (CNV) are large scale duplications or deletions of genomic sequence that are caused by a diverse set of molecular phenomena that are distinct from single nucleotide polymorphism (SNP) formation. Due to their different mechanisms of formation, CNVs are often difficult to track us...

  3. The Role of Constitutional Copy Number Variants in Breast Cancer

    PubMed Central

    Walker, Logan C.; Wiggins, George A.R.; Pearson, John F.

    2015-01-01

    Constitutional copy number variants (CNVs) include inherited and de novo deviations from a diploid state at a defined genomic region. These variants contribute significantly to genetic variation and disease in humans, including breast cancer susceptibility. Identification of genetic risk factors for breast cancer in recent years has been dominated by the use of genome-wide technologies, such as single nucleotide polymorphism (SNP)-arrays, with a significant focus on single nucleotide variants. To date, these large datasets have been underutilised for generating genome-wide CNV profiles despite offering a massive resource for assessing the contribution of these structural variants to breast cancer risk. Technical challenges remain in determining the location and distribution of CNVs across the human genome due to the accuracy of computational prediction algorithms and resolution of the array data. Moreover, better methods are required for interpreting the functional effect of newly discovered CNVs. In this review, we explore current and future application of SNP array technology to assess rare and common CNVs in association with breast cancer risk in humans. PMID:27600231

  4. Accuracy of CNV Detection from GWAS Data.

    PubMed

    Zhang, Dandan; Qian, Yudong; Akula, Nirmala; Alliey-Rodriguez, Ney; Tang, Jinsong; Gershon, Elliot S; Liu, Chunyu

    2011-01-13

    Several computer programs are available for detecting copy number variants (CNVs) using genome-wide SNP arrays. We evaluated the performance of four CNV detection software suites--Birdsuite, Partek, HelixTree, and PennCNV-Affy--in the identification of both rare and common CNVs. Each program's performance was assessed in two ways. The first was its recovery rate, i.e., its ability to call 893 CNVs previously identified in eight HapMap samples by paired-end sequencing of whole-genome fosmid clones, and 51,440 CNVs identified by array Comparative Genome Hybridization (aCGH) followed by validation procedures, in 90 HapMap CEU samples. The second evaluation was program performance calling rare and common CNVs in the Bipolar Genome Study (BiGS) data set (1001 bipolar cases and 1033 controls, all of European ancestry) as measured by the Affymetrix SNP 6.0 array. Accuracy in calling rare CNVs was assessed by positive predictive value, based on the proportion of rare CNVs validated by quantitative real-time PCR (qPCR), while accuracy in calling common CNVs was assessed by false positive/false negative rates based on qPCR validation results from a subset of common CNVs. Birdsuite recovered the highest percentages of known HapMap CNVs containing >20 markers in two reference CNV datasets. The recovery rate increased with decreased CNV frequency. In the tested rare CNV data, Birdsuite and Partek had higher positive predictive values than the other software suites. In a test of three common CNVs in the BiGS dataset, Birdsuite's call was 98.8% consistent with qPCR quantification in one CNV region, but the other two regions showed an unacceptable degree of accuracy. We found relatively poor consistency between the two "gold standards," the sequence data of Kidd et al., and aCGH data of Conrad et al. Algorithms for calling CNVs especially common ones need substantial improvement, and a "gold standard" for detection of CNVs remains to be established.

  5. Analysis of copy number variants by three detection algorithms and their association with body size in horses.

    PubMed

    Metzger, Julia; Philipp, Ute; Lopes, Maria Susana; da Camara Machado, Artur; Felicetti, Michela; Silvestrelli, Maurizio; Distl, Ottmar

    2013-07-18

    Copy number variants (CNVs) have been shown to play an important role in genetic diversity of mammals and in the development of many complex phenotypic traits. The aim of this study was to perform a standard comparative evaluation of CNVs in horses using three different CNV detection programs and to identify genomic regions associated with body size in horses. Analysis was performed using the Illumina Equine SNP50 genotyping beadchip for 854 horses. CNVs were detected by three different algorithms, CNVPartition, PennCNV and QuantiSNP. Comparative analysis revealed 50 CNVs that affected 153 different genes mainly involved in sensory perception, signal transduction and cellular components. Genome-wide association analysis for body size showed highly significant deleted regions on ECA1, ECA8 and ECA9. Homologous regions to the detected CNVs on ECA1 and ECA9 have also been shown to be correlated with human height. Comparative analysis of CNV detection algorithms was useful to increase the specificity of CNV detection but had certain limitations dependent on the detection tool. GWAS revealed genome-wide associated CNVs for body size in horses.

  6. CNV-TV: a robust method to discover copy number variation from short sequencing reads.

    PubMed

    Duan, Junbo; Zhang, Ji-Gang; Deng, Hong-Wen; Wang, Yu-Ping

    2013-05-02

    Copy number variation (CNV) is an important structural variation (SV) in human genome. Various studies have shown that CNVs are associated with complex diseases. Traditional CNV detection methods such as fluorescence in situ hybridization (FISH) and array comparative genomic hybridization (aCGH) suffer from low resolution. The next generation sequencing (NGS) technique promises a higher resolution detection of CNVs and several methods were recently proposed for realizing such a promise. However, the performances of these methods are not robust under some conditions, e.g., some of them may fail to detect CNVs of short sizes. There has been a strong demand for reliable detection of CNVs from high resolution NGS data. A novel and robust method to detect CNV from short sequencing reads is proposed in this study. The detection of CNV is modeled as a change-point detection from the read depth (RD) signal derived from the NGS, which is fitted with a total variation (TV) penalized least squares model. The performance (e.g., sensitivity and specificity) of the proposed approach are evaluated by comparison with several recently published methods on both simulated and real data from the 1000 Genomes Project. The experimental results showed that both the true positive rate and false positive rate of the proposed detection method do not change significantly for CNVs with different copy numbers and lengthes, when compared with several existing methods. Therefore, our proposed approach results in a more reliable detection of CNVs than the existing methods.

  7. GStream: Improving SNP and CNV Coverage on Genome-Wide Association Studies

    PubMed Central

    Alonso, Arnald; Marsal, Sara; Tortosa, Raül; Canela-Xandri, Oriol; Julià, Antonio

    2013-01-01

    We present GStream, a method that combines genome-wide SNP and CNV genotyping in the Illumina microarray platform with unprecedented accuracy. This new method outperforms previous well-established SNP genotyping software. More importantly, the CNV calling algorithm of GStream dramatically improves the results obtained by previous state-of-the-art methods and yields an accuracy that is close to that obtained by purely CNV-oriented technologies like Comparative Genomic Hybridization (CGH). We demonstrate the superior performance of GStream using microarray data generated from HapMap samples. Using the reference CNV calls generated by the 1000 Genomes Project (1KGP) and well-known studies on whole genome CNV characterization based either on CGH or genotyping microarray technologies, we show that GStream can increase the number of reliably detected variants up to 25% compared to previously developed methods. Furthermore, the increased genome coverage provided by GStream allows the discovery of CNVs in close linkage disequilibrium with SNPs, previously associated with disease risk in published Genome-Wide Association Studies (GWAS). These results could provide important insights into the biological mechanism underlying the detected disease risk association. With GStream, large-scale GWAS will not only benefit from the combined genotyping of SNPs and CNVs at an unprecedented accuracy, but will also take advantage of the computational efficiency of the method. PMID:23844243

  8. CNV-ROC: A cost effective, computer-aided analytical performance evaluator of chromosomal microarrays.

    PubMed

    Goodman, Corey W; Major, Heather J; Walls, William D; Sheffield, Val C; Casavant, Thomas L; Darbro, Benjamin W

    2015-04-01

    Chromosomal microarrays (CMAs) are routinely used in both research and clinical laboratories; yet, little attention has been given to the estimation of genome-wide true and false negatives during the assessment of these assays and how such information could be used to calibrate various algorithmic metrics to improve performance. Low-throughput, locus-specific methods such as fluorescence in situ hybridization (FISH), quantitative PCR (qPCR), or multiplex ligation-dependent probe amplification (MLPA) preclude rigorous calibration of various metrics used by copy number variant (CNV) detection algorithms. To aid this task, we have established a comparative methodology, CNV-ROC, which is capable of performing a high throughput, low cost, analysis of CMAs that takes into consideration genome-wide true and false negatives. CNV-ROC uses a higher resolution microarray to confirm calls from a lower resolution microarray and provides for a true measure of genome-wide performance metrics at the resolution offered by microarray testing. CNV-ROC also provides for a very precise comparison of CNV calls between two microarray platforms without the need to establish an arbitrary degree of overlap. Comparison of CNVs across microarrays is done on a per-probe basis and receiver operator characteristic (ROC) analysis is used to calibrate algorithmic metrics, such as log2 ratio threshold, to enhance CNV calling performance. CNV-ROC addresses a critical and consistently overlooked aspect of analytical assessments of genome-wide techniques like CMAs which is the measurement and use of genome-wide true and false negative data for the calculation of performance metrics and comparison of CNV profiles between different microarray experiments. Copyright © 2015 Elsevier Inc. All rights reserved.

  9. CNV-ROC: A cost effective, computer-aided analytical performance evaluator of chromosomal microarrays

    PubMed Central

    Goodman, Corey W.; Major, Heather J.; Walls, William D.; Sheffield, Val C.; Casavant, Thomas L.; Darbro, Benjamin W.

    2016-01-01

    Chromosomal microarrays (CMAs) are routinely used in both research and clinical laboratories; yet, little attention has been given to the estimation of genome-wide true and false negatives during the assessment of these assays and how such information could be used to calibrate various algorithmic metrics to improve performance. Low-throughput, locus-specific methods such as fluorescence in situ hybridization (FISH), quantitative PCR (qPCR), or multiplex ligation-dependent probe amplification (MLPA) preclude rigorous calibration of various metrics used by copy number variant (CNV) detection algorithms. To aid this task, we have established a comparative methodology, CNV-ROC, which is capable of performing a high throughput, low cost, analysis of CMAs that takes into consideration genome-wide true and false negatives. CNV-ROC uses a higher resolution microarray to confirm calls from a lower resolution microarray and provides for a true measure of genome-wide performance metrics at the resolution offered by microarray testing. CNV-ROC also provides for a very precise comparison of CNV calls between two microarray platforms without the need to establish an arbitrary degree of overlap. Comparison of CNVs across microarrays is done on a per-probe basis and receiver operator characteristic (ROC) analysis is used to calibrate algorithmic metrics, such as log2 ratio threshold, to enhance CNV calling performance. CNV-ROC addresses a critical and consistently overlooked aspect of analytical assessments of genome-wide techniques like CMAs which is the measurement and use of genome-wide true and false negative data for the calculation of performance metrics and comparison of CNV profiles between different microarray experiments. PMID:25595567

  10. Genome-wide copy number variation (CNV) in patients with autoimmune Addison's disease

    PubMed Central

    2011-01-01

    Background Addison's disease (AD) is caused by an autoimmune destruction of the adrenal cortex. The pathogenesis is multi-factorial, involving genetic components and hitherto unknown environmental factors. The aim of the present study was to investigate if gene dosage in the form of copy number variation (CNV) could add to the repertoire of genetic susceptibility to autoimmune AD. Methods A genome-wide study using the Affymetrix GeneChip® Genome-Wide Human SNP Array 6.0 was conducted in 26 patients with AD. CNVs in selected genes were further investigated in a larger material of patients with autoimmune AD (n = 352) and healthy controls (n = 353) by duplex Taqman real-time polymerase chain reaction assays. Results We found that low copy number of UGT2B28 was significantly more frequent in AD patients compared to controls; conversely high copy number of ADAM3A was associated with AD. Conclusions We have identified two novel CNV associations to ADAM3A and UGT2B28 in AD. The mechanism by which this susceptibility is conferred is at present unclear, but may involve steroid inactivation (UGT2B28) and T cell maturation (ADAM3A). Characterization of these proteins may unravel novel information on the pathogenesis of autoimmunity. PMID:21851588

  11. Detection of clinically relevant copy-number variants by exome sequencing in a large cohort of genetic disorders

    PubMed Central

    Pfundt, Rolph; del Rosario, Marisol; Vissers, Lisenka E.L.M.; Kwint, Michael P.; Janssen, Irene M.; de Leeuw, Nicole; Yntema, Helger G.; Nelen, Marcel R.; Lugtenberg, Dorien; Kamsteeg, Erik-Jan; Wieskamp, Nienke; Stegmann, Alexander P.A.; Stevens, Servi J.C.; Rodenburg, Richard J.T.; Simons, Annet; Mensenkamp, Arjen R.; Rinne, Tuula; Gilissen, Christian; Scheffer, Hans; Veltman, Joris A.; Hehir-Kwa, Jayne Y.

    2017-01-01

    Purpose: Copy-number variation is a common source of genomic variation and an important genetic cause of disease. Microarray-based analysis of copy-number variants (CNVs) has become a first-tier diagnostic test for patients with neurodevelopmental disorders, with a diagnostic yield of 10–20%. However, for most other genetic disorders, the role of CNVs is less clear and most diagnostic genetic studies are generally limited to the study of single-nucleotide variants (SNVs) and other small variants. With the introduction of exome and genome sequencing, it is now possible to detect both SNVs and CNVs using an exome- or genome-wide approach with a single test. Methods: We performed exome-based read-depth CNV screening on data from 2,603 patients affected by a range of genetic disorders for which exome sequencing was performed in a diagnostic setting. Results: In total, 123 clinically relevant CNVs ranging in size from 727 bp to 15.3 Mb were detected, which resulted in 51 conclusive diagnoses and an overall increase in diagnostic yield of ~2% (ranging from 0 to –5.8% per disorder). Conclusions: This study shows that CNVs play an important role in a broad range of genetic disorders and that detection via exome-based CNV profiling results in an increase in the diagnostic yield without additional testing, bringing us closer to single-test genomics. Genet Med advance online publication 27 October 2016 PMID:28574513

  12. Continuing difficulties in interpreting CNV data: lessons from a genome-wide CNV association study of Australian HNPCC/lynch syndrome patients

    PubMed Central

    2013-01-01

    Background Hereditary non-polyposis colorectal cancer (HNPCC)/Lynch syndrome (LS) is a cancer syndrome characterised by early-onset epithelial cancers, especially colorectal cancer (CRC) and endometrial cancer. The aim of the current study was to use SNP-array technology to identify genomic aberrations which could contribute to the increased risk of cancer in HNPCC/LS patients. Methods Individuals diagnosed with HNPCC/LS (100) and healthy controls (384) were genotyped using the Illumina Human610-Quad SNP-arrays. Copy number variation (CNV) calling and association analyses were performed using Nexus software, with significant results validated using QuantiSNP. TaqMan Copy-Number assays were used for verification of CNVs showing significant association with HNPCC/LS identified by both software programs. Results We detected copy number (CN) gains associated with HNPCC/LS status on chromosome 7q11.21 (28% cases and 0% controls, Nexus; p = 3.60E-20 and QuantiSNP; p < 1.00E-16) and 16p11.2 (46% in cases, while a CN loss was observed in 23% of controls, Nexus; p = 4.93E-21 and QuantiSNP; p = 5.00E-06) via in silico analyses. TaqMan Copy-Number assay was used for validation of CNVs showing significant association with HNPCC/LS. In addition, CNV burden (total CNV length, average CNV length and number of observed CNV events) was significantly greater in cases compared to controls. Conclusion A greater CNV burden was identified in HNPCC/LS cases compared to controls supporting the notion of higher genomic instability in these patients. One intergenic locus on chromosome 7q11.21 is possibly associated with HNPCC/LS and deserves further investigation. The results from this study highlight the complexities of fluorescent based CNV analyses. The inefficiency of both CNV detection methods to reproducibly detect observed CNVs demonstrates the need for sequence data to be considered alongside intensity data to avoid false positive results. PMID:23531357

  13. Continuing difficulties in interpreting CNV data: lessons from a genome-wide CNV association study of Australian HNPCC/lynch syndrome patients.

    PubMed

    Talseth-Palmer, Bente A; Holliday, Elizabeth G; Evans, Tiffany-Jane; McEvoy, Mark; Attia, John; Grice, Desma M; Masson, Amy L; Meldrum, Cliff; Spigelman, Allan; Scott, Rodney J

    2013-03-26

    Hereditary non-polyposis colorectal cancer (HNPCC)/Lynch syndrome (LS) is a cancer syndrome characterised by early-onset epithelial cancers, especially colorectal cancer (CRC) and endometrial cancer. The aim of the current study was to use SNP-array technology to identify genomic aberrations which could contribute to the increased risk of cancer in HNPCC/LS patients. Individuals diagnosed with HNPCC/LS (100) and healthy controls (384) were genotyped using the Illumina Human610-Quad SNP-arrays. Copy number variation (CNV) calling and association analyses were performed using Nexus software, with significant results validated using QuantiSNP. TaqMan Copy-Number assays were used for verification of CNVs showing significant association with HNPCC/LS identified by both software programs. We detected copy number (CN) gains associated with HNPCC/LS status on chromosome 7q11.21 (28% cases and 0% controls, Nexus; p =3.60E-20 and QuantiSNP; p < 1.00E-16) and 16p11.2 (46% in cases, while a CN loss was observed in 23% of controls, Nexus; p = 4.93E-21 and QuantiSNP; p = 5.00E-06) via in silico analyses. TaqMan Copy-Number assay was used for validation of CNVs showing significant association with HNPCC/LS. In addition, CNV burden (total CNV length, average CNV length and number of observed CNV events) was significantly greater in cases compared to controls. A greater CNV burden was identified in HNPCC/LS cases compared to controls supporting the notion of higher genomic instability in these patients. One intergenic locus on chromosome 7q11.21 is possibly associated with HNPCC/LS and deserves further investigation. The results from this study highlight the complexities of fluorescent based CNV analyses. The inefficiency of both CNV detection methods to reproducibly detect observed CNVs demonstrates the need for sequence data to be considered alongside intensity data to avoid false positive results.

  14. Accurate clinical detection of exon copy number variants in a targeted NGS panel using DECoN.

    PubMed

    Fowler, Anna; Mahamdallie, Shazia; Ruark, Elise; Seal, Sheila; Ramsay, Emma; Clarke, Matthew; Uddin, Imran; Wylie, Harriet; Strydom, Ann; Lunter, Gerton; Rahman, Nazneen

    2016-11-25

    Background: Targeted next generation sequencing (NGS) panels are increasingly being used in clinical genomics to increase capacity, throughput and affordability of gene testing. Identifying whole exon deletions or duplications (termed exon copy number variants, 'exon CNVs') in exon-targeted NGS panels has proved challenging, particularly for single exon CNVs.  Methods: We developed a tool for the Detection of Exon Copy Number variants (DECoN), which is optimised for analysis of exon-targeted NGS panels in the clinical setting. We evaluated DECoN performance using 96 samples with independently validated exon CNV data. We performed simulations to evaluate DECoN detection performance of single exon CNVs and to evaluate performance using different coverage levels and sample numbers. Finally, we implemented DECoN in a clinical laboratory that tests BRCA1 and BRCA2 with the TruSight Cancer Panel (TSCP). We used DECoN to analyse 1,919 samples, validating exon CNV detections by multiplex ligation-dependent probe amplification (MLPA).  Results: In the evaluation set, DECoN achieved 100% sensitivity and 99% specificity for BRCA exon CNVs, including identification of 8 single exon CNVs. DECoN also identified 14/15 exon CNVs in 8 other genes. Simulations of all possible BRCA single exon CNVs gave a mean sensitivity of 98% for deletions and 95% for duplications. DECoN performance remained excellent with different levels of coverage and sample numbers; sensitivity and specificity was >98% with the typical NGS run parameters. In the clinical pipeline, DECoN automatically analyses pools of 48 samples at a time, taking 24 minutes per pool, on average. DECoN detected 24 BRCA exon CNVs, of which 23 were confirmed by MLPA, giving a false discovery rate of 4%. Specificity was 99.7%.  Conclusions: DECoN is a fast, accurate, exon CNV detection tool readily implementable in research and clinical NGS pipelines. It has high sensitivity and specificity and acceptable false discovery rate

  15. Ready to clone: CNV detection and breakpoint fine-mapping in breast and ovarian cancer susceptibility genes by high-resolution array CGH.

    PubMed

    Hackmann, Karl; Kuhlee, Franziska; Betcheva-Krajcir, Elitza; Kahlert, Anne-Karin; Mackenroth, Luisa; Klink, Barbara; Di Donato, Nataliya; Tzschach, Andreas; Kast, Karin; Wimberger, Pauline; Schrock, Evelin; Rump, Andreas

    2016-10-01

    Detection of predisposing copy number variants (CNV) in 330 families affected with hereditary breast and ovarian cancer (HBOC). In order to complement mutation detection with Illumina's TruSight Cancer panel, we designed a customized high-resolution 8 × 60k array for CGH (aCGH) that covers all 94 genes from the panel. Copy number variants with immediate clinical relevance were detected in 12 families (3.6%). Besides 3 known CNVs in CHEK2, RAD51C, and BRCA1, we identified 3 novel pathogenic CNVs in BRCA1 (deletion of exons 4-13, deletion of exons 12-18) and ATM (deletion exons 57-63) plus an intragenic duplication of BRCA2 (exons 3-11) and an intronic BRCA1 variant with unknown pathogenicity. The precision of high-resolution aCGH enabled straight forward breakpoint amplification of a BRCA1 deletion which subsequently allowed for fast and economic CNV verification in family members of the index patient. Furthermore, we used our aCGH data to validate an algorithm that was able to detect all identified copy number changes from next-generation sequencing (NGS) data. Copy number detection is a mandatory analysis in HBOC families at least if no predisposing mutations were found by sequencing. Currently, high-resolution array CGH is our first choice of method of analysis due to unmatched detection precision. Although it seems possible to detect CNV from sequencing data, there currently is no satisfying tool to do so in a routine diagnostic setting.

  16. Contribution of Rare Copy Number Variants to Isolated Human Malformations

    PubMed Central

    Serra-Juhé, Clara; Rodríguez-Santiago, Benjamín; Cuscó, Ivon; Vendrell, Teresa; Camats, Núria; Torán, Núria; Pérez-Jurado, Luis A.

    2012-01-01

    Background Congenital malformations are present in approximately 2–3% of liveborn babies and 20% of stillborn fetuses. The mechanisms underlying the majority of sporadic and isolated congenital malformations are poorly understood, although it is hypothesized that the accumulation of rare genetic, genomic and epigenetic variants converge to deregulate developmental networks. Methodology/Principal Findings We selected samples from 95 fetuses with congenital malformations not ascribed to a specific syndrome (68 with isolated malformations, 27 with multiple malformations). Karyotyping and Multiplex Ligation-dependent Probe Amplification (MLPA) discarded recurrent genomic and cytogenetic rearrangements. DNA extracted from the affected tissue (46%) or from lung or liver (54%) was analyzed by molecular karyotyping. Validations and inheritance were obtained by MLPA. We identified 22 rare copy number variants (CNV) [>100 kb, either absent (n = 7) or very uncommon (n = 15, <1/2,000) in the control population] in 20/95 fetuses with congenital malformations (21%), including 11 deletions and 11 duplications. One of the 9 tested rearrangements was de novo while the remaining were inherited from a healthy parent. The highest frequency was observed in fetuses with heart hypoplasia (8/17, 62.5%), with two events previously related with the phenotype. Double events hitting candidate genes were detected in two samples with brain malformations. Globally, the burden of deletions was significantly higher in fetuses with malformations compared to controls. Conclusions/Significance Our data reveal a significant contribution of rare deletion-type CNV, mostly inherited but also de novo, to human congenital malformations, especially heart hypoplasia, and reinforce the hypothesis of a multifactorial etiology in most cases. PMID:23056206

  17. Genome-wide copy number variant analysis reveals variants associated with 10 diverse production traits in Holstein cattle

    USDA-ARS?s Scientific Manuscript database

    Copy number variation (CNV) is an important type of genetic variation contributing to phenotypic differences among mammals and may serve as an alternative molecular marker to single nucleotide polymorphism (SNP) for genome-wide association study (GWAS). Recently, GWAS analysis using CNV has been app...

  18. Large transcription units unify copy number variants and common fragile sites arising under replication stress.

    PubMed

    Wilson, Thomas E; Arlt, Martin F; Park, So Hae; Rajendran, Sountharia; Paulsen, Michelle; Ljungman, Mats; Glover, Thomas W

    2015-02-01

    Copy number variants (CNVs) resulting from genomic deletions and duplications and common fragile sites (CFSs) seen as breaks on metaphase chromosomes are distinct forms of structural chromosome instability precipitated by replication inhibition. Although they share a common induction mechanism, it is not known how CNVs and CFSs are related or why some genomic loci are much more prone to their occurrence. Here we compare large sets of de novo CNVs and CFSs in several experimental cell systems to each other and to overlapping genomic features. We first show that CNV hotpots and CFSs occurred at the same human loci within a given cultured cell line. Bru-seq nascent RNA sequencing further demonstrated that although genomic regions with low CNV frequencies were enriched in transcribed genes, the CNV hotpots that matched CFSs specifically corresponded to the largest active transcription units in both human and mouse cells. Consistently, active transcription units >1 Mb were robust cell-type-specific predictors of induced CNV hotspots and CFS loci. Unlike most transcribed genes, these very large transcription units replicated late and organized deletion and duplication CNVs into their transcribed and flanking regions, respectively, supporting a role for transcription in replication-dependent lesion formation. These results indicate that active large transcription units drive extreme locus- and cell-type-specific genomic instability under replication stress, resulting in both CNVs and CFSs as different manifestations of perturbed replication dynamics. © 2015 Wilson et al.; Published by Cold Spring Harbor Laboratory Press.

  19. Large transcription units unify copy number variants and common fragile sites arising under replication stress

    PubMed Central

    Park, So Hae; Rajendran, Sountharia; Paulsen, Michelle; Ljungman, Mats; Glover, Thomas W.

    2015-01-01

    Copy number variants (CNVs) resulting from genomic deletions and duplications and common fragile sites (CFSs) seen as breaks on metaphase chromosomes are distinct forms of structural chromosome instability precipitated by replication inhibition. Although they share a common induction mechanism, it is not known how CNVs and CFSs are related or why some genomic loci are much more prone to their occurrence. Here we compare large sets of de novo CNVs and CFSs in several experimental cell systems to each other and to overlapping genomic features. We first show that CNV hotpots and CFSs occurred at the same human loci within a given cultured cell line. Bru-seq nascent RNA sequencing further demonstrated that although genomic regions with low CNV frequencies were enriched in transcribed genes, the CNV hotpots that matched CFSs specifically corresponded to the largest active transcription units in both human and mouse cells. Consistently, active transcription units >1 Mb were robust cell-type-specific predictors of induced CNV hotspots and CFS loci. Unlike most transcribed genes, these very large transcription units replicated late and organized deletion and duplication CNVs into their transcribed and flanking regions, respectively, supporting a role for transcription in replication-dependent lesion formation. These results indicate that active large transcription units drive extreme locus- and cell-type-specific genomic instability under replication stress, resulting in both CNVs and CFSs as different manifestations of perturbed replication dynamics. PMID:25373142

  20. Contribution of copy number variants to schizophrenia from a genome-wide study of 41,321 subjects.

    PubMed

    Marshall, Christian R; Howrigan, Daniel P; Merico, Daniele; Thiruvahindrapuram, Bhooma; Wu, Wenting; Greer, Douglas S; Antaki, Danny; Shetty, Aniket; Holmans, Peter A; Pinto, Dalila; Gujral, Madhusudan; Brandler, William M; Malhotra, Dheeraj; Wang, Zhouzhi; Fajarado, Karin V Fuentes; Maile, Michelle S; Ripke, Stephan; Agartz, Ingrid; Albus, Margot; Alexander, Madeline; Amin, Farooq; Atkins, Joshua; Bacanu, Silviu A; Belliveau, Richard A; Bergen, Sarah E; Bertalan, Marcelo; Bevilacqua, Elizabeth; Bigdeli, Tim B; Black, Donald W; Bruggeman, Richard; Buccola, Nancy G; Buckner, Randy L; Bulik-Sullivan, Brendan; Byerley, William; Cahn, Wiepke; Cai, Guiqing; Cairns, Murray J; Campion, Dominique; Cantor, Rita M; Carr, Vaughan J; Carrera, Noa; Catts, Stanley V; Chambert, Kimberley D; Cheng, Wei; Cloninger, C Robert; Cohen, David; Cormican, Paul; Craddock, Nick; Crespo-Facorro, Benedicto; Crowley, James J; Curtis, David; Davidson, Michael; Davis, Kenneth L; Degenhardt, Franziska; Del Favero, Jurgen; DeLisi, Lynn E; Dikeos, Dimitris; Dinan, Timothy; Djurovic, Srdjan; Donohoe, Gary; Drapeau, Elodie; Duan, Jubao; Dudbridge, Frank; Eichhammer, Peter; Eriksson, Johan; Escott-Price, Valentina; Essioux, Laurent; Fanous, Ayman H; Farh, Kai-How; Farrell, Martilias S; Frank, Josef; Franke, Lude; Freedman, Robert; Freimer, Nelson B; Friedman, Joseph I; Forstner, Andreas J; Fromer, Menachem; Genovese, Giulio; Georgieva, Lyudmila; Gershon, Elliot S; Giegling, Ina; Giusti-Rodríguez, Paola; Godard, Stephanie; Goldstein, Jacqueline I; Gratten, Jacob; de Haan, Lieuwe; Hamshere, Marian L; Hansen, Mark; Hansen, Thomas; Haroutunian, Vahram; Hartmann, Annette M; Henskens, Frans A; Herms, Stefan; Hirschhorn, Joel N; Hoffmann, Per; Hofman, Andrea; Huang, Hailiang; Ikeda, Masashi; Joa, Inge; Kähler, Anna K; Kahn, René S; Kalaydjieva, Luba; Karjalainen, Juha; Kavanagh, David; Keller, Matthew C; Kelly, Brian J; Kennedy, James L; Kim, Yunjung; Knowles, James A; Konte, Bettina; Laurent, Claudine; Lee, Phil; Lee, S Hong; Legge, Sophie E; Lerer, Bernard; Levy, Deborah L; Liang, Kung-Yee; Lieberman, Jeffrey; Lönnqvist, Jouko; Loughland, Carmel M; Magnusson, Patrik K E; Maher, Brion S; Maier, Wolfgang; Mallet, Jacques; Mattheisen, Manuel; Mattingsdal, Morten; McCarley, Robert W; McDonald, Colm; McIntosh, Andrew M; Meier, Sandra; Meijer, Carin J; Melle, Ingrid; Mesholam-Gately, Raquelle I; Metspalu, Andres; Michie, Patricia T; Milani, Lili; Milanova, Vihra; Mokrab, Younes; Morris, Derek W; Müller-Myhsok, Bertram; Murphy, Kieran C; Murray, Robin M; Myin-Germeys, Inez; Nenadic, Igor; Nertney, Deborah A; Nestadt, Gerald; Nicodemus, Kristin K; Nisenbaum, Laura; Nordin, Annelie; O'Callaghan, Eadbhard; O'Dushlaine, Colm; Oh, Sang-Yun; Olincy, Ann; Olsen, Line; O'Neill, F Anthony; Van Os, Jim; Pantelis, Christos; Papadimitriou, George N; Parkhomenko, Elena; Pato, Michele T; Paunio, Tiina; Perkins, Diana O; Pers, Tune H; Pietiläinen, Olli; Pimm, Jonathan; Pocklington, Andrew J; Powell, John; Price, Alkes; Pulver, Ann E; Purcell, Shaun M; Quested, Digby; Rasmussen, Henrik B; Reichenberg, Abraham; Reimers, Mark A; Richards, Alexander L; Roffman, Joshua L; Roussos, Panos; Ruderfer, Douglas M; Salomaa, Veikko; Sanders, Alan R; Savitz, Adam; Schall, Ulrich; Schulze, Thomas G; Schwab, Sibylle G; Scolnick, Edward M; Scott, Rodney J; Seidman, Larry J; Shi, Jianxin; Silverman, Jeremy M; Smoller, Jordan W; Söderman, Erik; Spencer, Chris C A; Stahl, Eli A; Strengman, Eric; Strohmaier, Jana; Stroup, T Scott; Suvisaari, Jaana; Svrakic, Dragan M; Szatkiewicz, Jin P; Thirumalai, Srinivas; Tooney, Paul A; Veijola, Juha; Visscher, Peter M; Waddington, John; Walsh, Dermot; Webb, Bradley T; Weiser, Mark; Wildenauer, Dieter B; Williams, Nigel M; Williams, Stephanie; Witt, Stephanie H; Wolen, Aaron R; Wormley, Brandon K; Wray, Naomi R; Wu, Jing Qin; Zai, Clement C; Adolfsson, Rolf; Andreassen, Ole A; Blackwood, Douglas H R; Bramon, Elvira; Buxbaum, Joseph D; Cichon, Sven; Collier, David A; Corvin, Aiden; Daly, Mark J; Darvasi, Ariel; Domenici, Enrico; Esko, Tõnu; Gejman, Pablo V; Gill, Michael; Gurling, Hugh; Hultman, Christina M; Iwata, Nakao; Jablensky, Assen V; Jönsson, Erik G; Kendler, Kenneth S; Kirov, George; Knight, Jo; Levinson, Douglas F; Li, Qingqin S; McCarroll, Steven A; McQuillin, Andrew; Moran, Jennifer L; Mowry, Bryan J; Nöthen, Markus M; Ophoff, Roel A; Owen, Michael J; Palotie, Aarno; Pato, Carlos N; Petryshen, Tracey L; Posthuma, Danielle; Rietschel, Marcella; Riley, Brien P; Rujescu, Dan; Sklar, Pamela; St Clair, David; Walters, James T R; Werge, Thomas; Sullivan, Patrick F; O'Donovan, Michael C; Scherer, Stephen W; Neale, Benjamin M; Sebat, Jonathan

    2017-01-01

    Copy number variants (CNVs) have been strongly implicated in the genetic etiology of schizophrenia (SCZ). However, genome-wide investigation of the contribution of CNV to risk has been hampered by limited sample sizes. We sought to address this obstacle by applying a centralized analysis pipeline to a SCZ cohort of 21,094 cases and 20,227 controls. A global enrichment of CNV burden was observed in cases (odds ratio (OR) = 1.11, P = 5.7 × 10 -15 ), which persisted after excluding loci implicated in previous studies (OR = 1.07, P = 1.7 × 10 -6 ). CNV burden was enriched for genes associated with synaptic function (OR = 1.68, P = 2.8 × 10 -11 ) and neurobehavioral phenotypes in mouse (OR = 1.18, P = 7.3 × 10 -5 ). Genome-wide significant evidence was obtained for eight loci, including 1q21.1, 2p16.3 (NRXN1), 3q29, 7q11.2, 15q13.3, distal 16p11.2, proximal 16p11.2 and 22q11.2. Suggestive support was found for eight additional candidate susceptibility and protective loci, which consisted predominantly of CNVs mediated by nonallelic homologous recombination.

  1. Genovar: a detection and visualization tool for genomic variants.

    PubMed

    Jung, Kwang Su; Moon, Sanghoon; Kim, Young Jin; Kim, Bong-Jo; Park, Kiejung

    2012-05-08

    Along with single nucleotide polymorphisms (SNPs), copy number variation (CNV) is considered an important source of genetic variation associated with disease susceptibility. Despite the importance of CNV, the tools currently available for its analysis often produce false positive results due to limitations such as low resolution of array platforms, platform specificity, and the type of CNV. To resolve this problem, spurious signals must be separated from true signals by visual inspection. None of the previously reported CNV analysis tools support this function and the simultaneous visualization of comparative genomic hybridization arrays (aCGH) and sequence alignment. The purpose of the present study was to develop a useful program for the efficient detection and visualization of CNV regions that enables the manual exclusion of erroneous signals. A JAVA-based stand-alone program called Genovar was developed. To ascertain whether a detected CNV region is a novel variant, Genovar compares the detected CNV regions with previously reported CNV regions using the Database of Genomic Variants (DGV, http://projects.tcag.ca/variation) and the Single Nucleotide Polymorphism Database (dbSNP). The current version of Genovar is capable of visualizing genomic data from sources such as the aCGH data file and sequence alignment format files. Genovar is freely accessible and provides a user-friendly graphic user interface (GUI) to facilitate the detection of CNV regions. The program also provides comprehensive information to help in the elimination of spurious signals by visual inspection, making Genovar a valuable tool for reducing false positive CNV results. http://genovar.sourceforge.net/.

  2. A structural variant in the 5’-flanking region of the TWIST2 gene affects melanocyte development in belted cattle

    PubMed Central

    Drögemüller, Cord; Jagannathan, Vidhya; Keller, Irene; Wüthrich, Daniel; Bruggmann, Rémy; Schütz, Ekkehard; Demmel, Steffi; Moser, Simon; Signer-Hasler, Heidi; Pieńkowska-Schelling, Aldona; Schelling, Claude; Sande, Marcos; Rongen, Ronald

    2017-01-01

    Belted cattle have a circular belt of unpigmented hair and skin around their midsection. The belt is inherited as a monogenic autosomal dominant trait. We mapped the causative variant to a 37 kb segment on bovine chromosome 3. Whole genome sequence data of 2 belted and 130 control cattle yielded only one private genetic variant in the critical interval in the two belted animals. The belt-associated variant was a copy number variant (CNV) involving the quadruplication of a 6 kb non-coding sequence located approximately 16 kb upstream of the TWIST2 gene. Increased copy numbers at this CNV were strongly associated with the belt phenotype in a cohort of 333 cases and 1322 controls. We hypothesized that the CNV causes aberrant expression of TWIST2 during neural crest development, which might negatively affect melanoblasts. Functional studies showed that ectopic expression of bovine TWIST2 in neural crest in transgenic zebrafish led to a decrease in melanocyte numbers. Our results thus implicate an unsuspected involvement of TWIST2 in regulating pigmentation and reveal a non-coding CNV underlying a captivating Mendelian character. PMID:28658273

  3. Copy number variants in patients with short stature

    PubMed Central

    van Duyvenvoorde, Hermine A; Lui, Julian C; Kant, Sarina G; Oostdijk, Wilma; Gijsbers, Antoinet CJ; Hoffer, Mariëtte JV; Karperien, Marcel; Walenkamp, Marie JE; Noordam, Cees; Voorhoeve, Paul G; Mericq, Verónica; Pereira, Alberto M; Claahsen-van de Grinten, Hedi L; van Gool, Sandy A; Breuning, Martijn H; Losekoot, Monique; Baron, Jeffrey; Ruivenkamp, Claudia AL; Wit, Jan M

    2014-01-01

    Height is a highly heritable and classic polygenic trait. Recent genome-wide association studies (GWAS) have revealed that at least 180 genetic variants influence adult height. However, these variants explain only about 10% of the phenotypic variation in height. Genetic analysis of short individuals can lead to the discovery of novel rare gene defects with a large effect on growth. In an effort to identify novel genes associated with short stature, genome-wide analysis for copy number variants (CNVs), using single-nucleotide polymorphism arrays, in 162 patients (149 families) with short stature was performed. Segregation analysis was performed if possible, and genes in CNVs were compared with information from GWAS, gene expression in rodents' growth plates and published information. CNVs were detected in 40 families. In six families, a known cause of short stature was found (SHOX deletion or duplication, IGF1R deletion), in two combined with a de novo potentially pathogenic CNV. Thirty-three families had one or more potentially pathogenic CNVs (n=40). In 24 of these families, segregation analysis could be performed, identifying three de novo CNVs and nine CNVs segregating with short stature. Four were located near loci associated with height in GWAS (ADAMTS17, TULP4, PRKG2/BMP3 and PAPPA). Besides six CNVs known to be causative for short stature, 40 CNVs with possible pathogenicity were identified. Segregation studies and bioinformatics analysis suggested various potential candidate genes. PMID:24065112

  4. Copy Number Variations Detection: Unravelling the Problem in Tangible Aspects.

    PubMed

    do Nascimento, Francisco; Guimaraes, Katia S

    2017-01-01

    In the midst of the important genomic variants associated to the susceptibility and resistance to complex diseases, Copy Number Variations (CNV) has emerged as a prevalent class of structural variation. Following the flood of next-generation sequencing data, numerous tools publicly available have been developed to provide computational strategies to identify CNV at improved accuracy. This review goes beyond scrutinizing the main approaches widely used for structural variants detection in general, including Split-Read, Paired-End Mapping, Read-Depth, and Assembly-based. In this paper, (1) we characterize the relevant technical details around the detection of CNV, which can affect the estimation of breakpoints and number of copies, (2) we pinpoint the most important insights related to GC-content and mappability biases, and (3) we discuss the paramount caveats in the tools evaluation process. The points brought out in this study emphasize common assumptions, a variety of possible limitations, valuable insights, and directions for desirable contributions to the state-of-the-art in CNV detection tools.

  5. Assessment of large copy number variants in patients with apparently isolated congenital left-sided cardiac lesions reveals clinically relevant genomic events.

    PubMed

    Hanchard, Neil A; Umana, Luis A; D'Alessandro, Lisa; Azamian, Mahshid; Poopola, Mojisola; Morris, Shaine A; Fernbach, Susan; Lalani, Seema R; Towbin, Jeffrey A; Zender, Gloria A; Fitzgerald-Butt, Sara; Garg, Vidu; Bowman, Jessica; Zapata, Gladys; Hernandez, Patricia; Arrington, Cammon B; Furthner, Dieter; Prakash, Siddharth K; Bowles, Neil E; McBride, Kim L; Belmont, John W

    2017-08-01

    Congenital left-sided cardiac lesions (LSLs) are a significant contributor to the mortality and morbidity of congenital heart disease (CHD). Structural copy number variants (CNVs) have been implicated in LSL without extra-cardiac features; however, non-penetrance and variable expressivity have created uncertainty over the use of CNV analyses in such patients. High-density SNP microarray genotyping data were used to infer large, likely-pathogenic, autosomal CNVs in a cohort of 1,139 probands with LSL and their families. CNVs were molecularly confirmed and the medical records of individual carriers reviewed. The gene content of novel CNVs was then compared with public CNV data from CHD patients. Large CNVs (>1 MB) were observed in 33 probands (∼3%). Six of these were de novo and 14 were not observed in the only available parent sample. Associated cardiac phenotypes spanned a broad spectrum without clear predilection. Candidate CNVs were largely non-recurrent, associated with heterozygous loss of copy number, and overlapped known CHD genomic regions. Novel CNV regions were enriched for cardiac development genes, including seven that have not been previously associated with human CHD. CNV analysis can be a clinically useful and molecularly informative tool in LSLs without obvious extra-cardiac defects, and may identify a clinically relevant genomic disorder in a small but important proportion of these individuals. © 2017 Wiley Periodicals, Inc.

  6. Biological relevance of CNV calling methods using familial relatedness including monozygotic twins.

    PubMed

    Castellani, Christina A; Melka, Melkaye G; Wishart, Andrea E; Locke, M Elizabeth O; Awamleh, Zain; O'Reilly, Richard L; Singh, Shiva M

    2014-04-21

    Studies involving the analysis of structural variation including Copy Number Variation (CNV) have recently exploded in the literature. Furthermore, CNVs have been associated with a number of complex diseases and neurodevelopmental disorders. Common methods for CNV detection use SNP, CNV, or CGH arrays, where the signal intensities of consecutive probes are used to define the number of copies associated with a given genomic region. These practices pose a number of challenges that interfere with the ability of available methods to accurately call CNVs. It has, therefore, become necessary to develop experimental protocols to test the reliability of CNV calling methods from microarray data so that researchers can properly discriminate biologically relevant data from noise. We have developed a workflow for the integration of data from multiple CNV calling algorithms using the same array results. It uses four CNV calling programs: PennCNV (PC), Affymetrix® Genotyping Console™ (AGC), Partek® Genomics Suite™ (PGS) and Golden Helix SVS™ (GH) to analyze CEL files from the Affymetrix® Human SNP 6.0 Array™. To assess the relative suitability of each program, we used individuals of known genetic relationships. We found significant differences in CNV calls obtained by different CNV calling programs. Although the programs showed variable patterns of CNVs in the same individuals, their distribution in individuals of different degrees of genetic relatedness has allowed us to offer two suggestions. The first involves the use of multiple algorithms for the detection of the largest possible number of CNVs, and the second suggests the use of PennCNV over all other methods when the use of only one software program is desirable.

  7. Contribution of copy number variants to schizophrenia from a genome-wide study of 41,321 subjects

    PubMed Central

    Marshall, Christian R.; Howrigan, Daniel P.; Merico, Daniele; Thiruvahindrapuram, Bhooma; Wu, Wenting; Greer, Douglas S.; Antaki, Danny; Shetty, Aniket; Holmans, Peter A.; Pinto, Dalila; Gujral, Madhusudan; Brandler, William M.; Malhotra, Dheeraj; Wang, Zhouzhi; Fajarado, Karin V. Fuentes; Maile, Michelle S.; Ripke, Stephan; Agartz, Ingrid; Albus, Margot; Alexander, Madeline; Amin, Farooq; Atkins, Joshua; Bacanu, Silviu A.; Belliveau, Richard A.; Bergen, Sarah E.; Bertalan, Marcelo; Bevilacqua, Elizabeth; Bigdeli, Tim B.; Black, Donald W.; Bruggeman, Richard; Buccola, Nancy G.; Buckner, Randy L.; Bulik-Sullivan, Brendan; Byerley, William; Cahn, Wiepke; Cai, Guiqing; Cairns, Murray J.; Campion, Dominique; Cantor, Rita M.; Carr, Vaughan J.; Carrera, Noa; Catts, Stanley V.; Chambert, Kimberley D.; Cheng, Wei; Cloninger, C. Robert; Cohen, David; Cormican, Paul; Craddock, Nick; Crespo-Facorro, Benedicto; Crowley, James J.; Curtis, David; Davidson, Michael; Davis, Kenneth L; Degenhardt, Franziska; Del Favero, Jurgen; DeLisi, Lynn E.; Dikeos, Dimitris; Dinan, Timothy; Djurovic, Srdjan; Donohoe, Gary; Drapeau, Elodie; Duan, Jubao; Dudbridge, Frank; Eichhammer, Peter; Eriksson, Johan; Escott-Price, Valentina; Essioux, Laurent; Fanous, Ayman H.; Farh, Kai-How; Farrell, Martilias S.; Frank, Josef; Franke, Lude; Freedman, Robert; Freimer, Nelson B.; Friedman, Joseph I.; Forstner, Andreas J.; Fromer, Menachem; Genovese, Giulio; Georgieva, Lyudmila; Gershon, Elliot S.; Giegling, Ina; Giusti-Rodríguez, Paola; Godard, Stephanie; Goldstein, Jacqueline I.; Gratten, Jacob; de Haan, Lieuwe; Hamshere, Marian L.; Hansen, Mark; Hansen, Thomas; Haroutunian, Vahram; Hartmann, Annette M.; Henskens, Frans A.; Herms, Stefan; Hirschhorn, Joel N.; Hoffmann, Per; Hofman, Andrea; Huang, Hailiang; Ikeda, Masashi; Joa, Inge; Kähler, Anna K; Kahn, René S; Kalaydjieva, Luba; Karjalainen, Juha; Kavanagh, David; Keller, Matthew C.; Kelly, Brian J.; Kennedy, James L.; Kim, Yunjung; Knowles, James A.; Konte, Bettina; Laurent, Claudine; Lee, Phil; Lee, S. Hong; Legge, Sophie E.; Lerer, Bernard; Levy, Deborah L.; Liang, Kung-Yee; Lieberman, Jeffrey; Lönnqvist, Jouko; Loughland, Carmel M.; Magnusson, Patrik K.E.; Maher, Brion S.; Maier, Wolfgang; Mallet, Jacques; Mattheisen, Manuel; Mattingsdal, Morten; McCarley, Robert W; McDonald, Colm; McIntosh, Andrew M.; Meier, Sandra; Meijer, Carin J.; Melle, Ingrid; Mesholam-Gately, Raquelle I.; Metspalu, Andres; Michie, Patricia T.; Milani, Lili; Milanova, Vihra; Mokrab, Younes; Morris, Derek W.; Müller-Myhsok, Bertram; Murphy, Kieran C.; Murray, Robin M.; Myin-Germeys, Inez; Nenadic, Igor; Nertney, Deborah A.; Nestadt, Gerald; Nicodemus, Kristin K.; Nisenbaum, Laura; Nordin, Annelie; O’Callaghan, Eadbhard; O’Dushlaine, Colm; Oh, Sang-Yun; Olincy, Ann; Olsen, Line; O’Neill, F. Anthony; Van Os, Jim; Pantelis, Christos; Papadimitriou, George N.; Parkhomenko, Elena; Pato, Michele T.; Paunio, Tiina; Perkins, Diana O.; Pers, Tune H.; Pietiläinen, Olli; Pimm, Jonathan; Pocklington, Andrew J.; Powell, John; Price, Alkes; Pulver, Ann E.; Purcell, Shaun M.; Quested, Digby; Rasmussen, Henrik B.; Reichenberg, Abraham; Reimers, Mark A.; Richards, Alexander L.; Roffman, Joshua L.; Roussos, Panos; Ruderfer, Douglas M.; Salomaa, Veikko; Sanders, Alan R.; Savitz, Adam; Schall, Ulrich; Schulze, Thomas G.; Schwab, Sibylle G.; Scolnick, Edward M.; Scott, Rodney J.; Seidman, Larry J.; Shi, Jianxin; Silverman, Jeremy M.; Smoller, Jordan W.; Söderman, Erik; Spencer, Chris C.A.; Stahl, Eli A.; Strengman, Eric; Strohmaier, Jana; Stroup, T. Scott; Suvisaari, Jaana; Svrakic, Dragan M.; Szatkiewicz, Jin P.; Thirumalai, Srinivas; Tooney, Paul A.; Veijola, Juha; Visscher, Peter M.; Waddington, John; Walsh, Dermot; Webb, Bradley T.; Weiser, Mark; Wildenauer, Dieter B.; Williams, Nigel M.; Williams, Stephanie; Witt, Stephanie H.; Wolen, Aaron R.; Wormley, Brandon K.; Wray, Naomi R; Wu, Jing Qin; Zai, Clement C.; Adolfsson, Rolf; Andreassen, Ole A.; Blackwood, Douglas H.R.; Bramon, Elvira; Buxbaum, Joseph D.; Cichon, Sven; Collier, David A; Corvin, Aiden; Daly, Mark J.; Darvasi, Ariel; Domenici, Enrico; Esko, Tõnu; Gejman, Pablo V.; Gill, Michael; Gurling, Hugh; Hultman, Christina M.; Iwata, Nakao; Jablensky, Assen V.; Jönsson, Erik G; Kendler, Kenneth S; Kirov, George; Knight, Jo; Levinson, Douglas F.; Li, Qingqin S; McCarroll, Steven A; McQuillin, Andrew; Moran, Jennifer L.; Mowry, Bryan J.; Nöthen, Markus M.; Ophoff, Roel A.; Owen, Michael J.; Palotie, Aarno; Pato, Carlos N.; Petryshen, Tracey L.; Posthuma, Danielle; Rietschel, Marcella; Riley, Brien P.; Rujescu, Dan; Sklar, Pamela; St. Clair, David; Walters, James T.R.; Werge, Thomas; Sullivan, Patrick F.; O’Donovan, Michael C; Scherer, Stephen W.; Neale, Benjamin M.; Sebat, Jonathan

    2017-01-01

    Copy number variants (CNVs) have been strongly implicated in the genetic etiology of schizophrenia (SCZ). However, genome-wide investigation of the contribution of CNV to risk has been hampered by limited sample sizes. We sought to address this obstacle by applying a centralized analysis pipeline to a SCZ cohort of 21,094 cases and 20,227 controls. A global enrichment of CNV burden was observed in cases (OR=1.11, P=5.7×10−15), which persisted after excluding loci implicated in previous studies (OR=1.07, P=1.7 ×10−6). CNV burden was enriched for genes associated with synaptic function (OR = 1.68, P = 2.8 ×10−11) and neurobehavioral phenotypes in mouse (OR = 1.18, P= 7.3 ×10−5). Genome-wide significant evidence was obtained for eight loci, including 1q21.1, 2p16.3 (NRXN1), 3q29, 7q11.2, 15q13.3, distal 16p11.2, proximal 16p11.2 and 22q11.2. Suggestive support was found for eight additional candidate susceptibility and protective loci, which consisted predominantly of CNVs mediated by non-allelic homologous recombination. PMID:27869829

  8. The utility of copy number variation (CNV) in studies of hypertension-related left ventricular hypertrophy (LVH): rationale, potential and challenges.

    PubMed

    Boonpeng, Hoh; Yusoff, Khalid

    2013-03-01

    The ultimate goal of human genetics is to understand the role of genome variation in elucidating human traits and diseases. Besides single nucleotide polymorphism (SNP), copy number variation (CNV), defined as gains or losses of a DNA segment larger than 1 kb, has recently emerged as an important tool in understanding heritable source of human genomic differences. It has been shown to contribute to genetic susceptibility of various common and complex diseases. Despite a handful of publications, its role in cardiovascular diseases remains largely unknown. Here, we deliberate on the currently available technologies for CNV detection. The possible utility and the potential roles of CNV in exploring the mechanisms of cardiac remodeling in hypertension will also be addressed. Finally, we discuss the challenges for investigations of CNV in cardiovascular diseases and its possible implications in diagnosis of hypertension-related left ventricular hypertrophy (LVH).

  9. Assessing copy number from exome sequencing and exome array CGH based on CNV spectrum in a large clinical cohort.

    PubMed

    Retterer, Kyle; Scuffins, Julie; Schmidt, Daniel; Lewis, Rachel; Pineda-Alvarez, Daniel; Stafford, Amanda; Schmidt, Lindsay; Warren, Stephanie; Gibellini, Federica; Kondakova, Anastasia; Blair, Amanda; Bale, Sherri; Matyakhina, Ludmila; Meck, Jeanne; Aradhya, Swaroop; Haverfield, Eden

    2015-08-01

    Detection of copy-number variation (CNV) is important for investigating many genetic disorders. Testing a large clinical cohort by array comparative genomic hybridization provides a deep perspective on the spectrum of pathogenic CNV. In this context, we describe a bioinformatics approach to extract CNV information from whole-exome sequencing and demonstrate its utility in clinical testing. Exon-focused arrays and whole-genome chromosomal microarray analysis were used to test 14,228 and 14,000 individuals, respectively. Based on these results, we developed an algorithm to detect deletions/duplications in whole-exome sequencing data and a novel whole-exome array. In the exon array cohort, we observed a positive detection rate of 2.4% (25 duplications, 318 deletions), of which 39% involved one or two exons. Chromosomal microarray analysis identified 3,345 CNVs affecting single genes (18%). We demonstrate that our whole-exome sequencing algorithm resolves CNVs of three or more exons. These results demonstrate the clinical utility of single-exon resolution in CNV assays. Our whole-exome sequencing algorithm approaches this resolution but is complemented by a whole-exome array to unambiguously identify intragenic CNVs and single-exon changes. These data illustrate the next advancements in CNV analysis through whole-exome sequencing and whole-exome array.Genet Med 17 8, 623-629.

  10. Genomic copy number variants: evidence for association with antibody response to anthrax vaccine adsorbed.

    PubMed

    Falola, Michael I; Wiener, Howard W; Wineinger, Nathan E; Cutter, Gary R; Kimberly, Robert P; Edberg, Jeffrey C; Arnett, Donna K; Kaslow, Richard A; Tang, Jianming; Shrestha, Sadeep

    2013-01-01

    Anthrax and its etiologic agent remain a biological threat. Anthrax vaccine is highly effective, but vaccine-induced IgG antibody responses vary widely following required doses of vaccinations. Such variation can be related to genetic factors, especially genomic copy number variants (CNVs) that are known to be enriched among genes with immunologic function. We have tested this hypothesis in two study populations from a clinical trial of anthrax vaccination. We performed CNV-based genome-wide association analyses separately on 794 European Americans and 200 African-Americans. Antibodies to protective antigen were measured at week 8 (early response) and week 30 (peak response) using an enzyme-linked immunosorbent assay. We used DNA microarray data (Affymetrix 6.0) and two CNV detection algorithms, hidden markov model (PennCNV) and circular binary segmentation (GeneSpring) to determine CNVs in all individuals. Multivariable regression analyses were used to identify CNV-specific associations after adjusting for relevant non-genetic covariates. Within the 22 autosomal chromosomes, 2,943 non-overlapping CNV regions were detected by both algorithms. Genomic insertions containing HLA-DRB5, DRB1 and DQA1/DRA genes in the major histocompatibility complex (MHC) region (chromosome 6p21.3) were moderately associated with elevated early antibody response (β = 0.14, p = 1.78×10(-3)) among European Americans, and the strongest association was observed between peak antibody response and a segmental insertion on chromosome 1, containing NBPF4, NBPF5, STXMP3, CLCC1, and GPSM2 genes (β = 1.66, p = 6.06×10(-5)). For African-Americans, segmental deletions spanning PRR20, PCDH17 and PCH68 genes on chromosome 13 were associated with elevated early antibody production (β = 0.18, p = 4.47×10(-5)). Population-specific findings aside, one genomic insertion on chromosome 17 (containing NSF, ARL17 and LRRC37A genes) was associated with elevated peak antibody

  11. Copy number variants and genetic polymorphisms in TBX21, GATA3, Rorc, Foxp3 and susceptibility to Behcet's disease and Vogt-Koyanagi-Harada syndrome.

    PubMed

    Liao, Dan; Hou, Shengping; Zhang, Jun; Fang, Jing; Liu, Yunjia; Bai, Lin; Cao, Qingfeng; Kijlstra, Aize; Yang, Peizeng

    2015-04-15

    This study aimed to investigate the role of genetic variants including single nucleotide polymorphisms (SNPs) and copy number variants (CNVs) of TBX21, GATA3, Rorc and Foxp3 genes in Behcet's disease (BD) and Vogt-Koyanagi-Harada (VKH) syndrome in a Chinese Han population. Genotyping of 25 SNPs was performed by iPLEX system (Sequenom) or polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP). TaqMan real time PCR was used to assess CNVs. The expression of Rorc and Foxp3 were examined by real-time PCR and cytokine production was measured by ELISA. High Rorc CNV was associated with the susceptibility to BD (P = 8.99 × 10(-8), OR = 3.0), and low Foxp3 CNV predisposed to BD in female patients (P = 1.92 × 10(-5), OR = 3.1). CNVs for the investigated genes were not altered in VKH syndrome. Further functional studies demonstrated that the relative mRNA expression levels of Rorc were increased in individuals with high Rorc copy number, but not for Foxp3. Increased production of IL-1β and IL-6 was found in individuals carrying a high CNV of Rorc. Our study showed that high CNVs of Rorc and low CNVs of Foxp3 confer risk for BD but not for VKH syndrome. The tested 25 SNPs in TBX21, GATA3, Rorc and Foxp3 did not associate with BD and VKH syndrome.

  12. Rare copy number variants in patients with congenital conotruncal heart defects.

    PubMed

    Xie, Hongbo M; Werner, Petra; Stambolian, Dwight; Bailey-Wilson, Joan E; Hakonarson, Hakon; White, Peter S; Taylor, Deanne M; Goldmuntz, Elizabeth

    2017-03-01

    Previous studies using different cardiac phenotypes, technologies and designs suggest a burden of large, rare or de novo copy number variants (CNVs) in subjects with congenital heart defects. We sought to identify disease-related CNVs, candidate genes, and functional pathways in a large number of cases with conotruncal and related defects that carried no known genetic syndrome. Cases and control samples were divided into two cohorts and genotyped to assess each subject's CNV content. Analyses were performed to ascertain differences in overall CNV prevalence and to identify enrichment of specific genes and functional pathways in conotruncal cases relative to healthy controls. Only findings present in both cohorts are presented. From 973 total conotruncal cases, a burden of rare CNVs was detected in both cohorts. Candidate genes from rare CNVs found in both cohorts were identified based on their association with cardiac development or disease, and/or their reported disruption in published studies. Functional and pathway analyses revealed significant enrichment of terms involved in either heart or early embryonic development. Our study tested one of the largest cohorts specifically with cardiac conotruncal and related defects. These results confirm and extend previous findings that CNVs contribute to disease risk for congenital heart defects in general and conotruncal defects in particular. As disease heterogeneity renders identification of single recurrent genes or loci difficult, functional pathway and gene regulation network analyses appear to be more informative. Birth Defects Research 109:271-295, 2017. © 2017 Wiley Periodicals, Inc. © 2017 Wiley Periodicals, Inc.

  13. Rare copy number variants in neuropsychiatric disorders: Specific phenotype or not?

    PubMed

    Van Den Bossche, Maarten J; Johnstone, Mandy; Strazisar, Mojca; Pickard, Benjamin S; Goossens, Dirk; Lenaerts, An-Sofie; De Zutter, Sonia; Nordin, Annelie; Norrback, Karl-Fredrik; Mendlewicz, Julien; Souery, Daniel; De Rijk, Peter; Sabbe, Bernard G; Adolfsson, Rolf; Blackwood, Douglas; Del-Favero, Jurgen

    2012-10-01

    From a number of genome-wide association studies it was shown that de novo and/or rare copy number variants (CNVs) are found at an increased frequency in neuropsychiatric diseases. In this study we examined the prevalence of CNVs in six genomic regions (1q21.1, 2p16.3, 3q29, 15q11.2, 15q13.3, and 16p11.2) previously implicated in neuropsychiatric diseases. Hereto, a cohort of four neuropsychiatric disorders (schizophrenia, bipolar disorder, major depressive disorder, and intellectual disability) and control individuals from three different populations was used in combination with Multilpex Amplicon Quantifiaction (MAQ) assays, capable of high resolution (kb range) and custom-tailored CNV detection. Our results confirm the etiological candidacy of the six selected CNV regions for neuropsychiatric diseases. It is possible that CNVs in these regions can result in disturbed brain development and in this way lead to an increased susceptibility for different neuropsychiatric disorders, dependent on additional genetic and environmental factors. Our results also suggest that the neurodevelopmental component is larger in the etiology of schizophrenia and intellectual disability than in mood disorders. Finally, our data suggest that deletions are in general more pathogenic than duplications. Given the high frequency of the examined CNVs (1-2%) in patients of different neuropsychiatric disorders, screening of large cohorts with an affordable and feasible method like the MAQ assays used in this study is likely to result in important progress in unraveling the genetic factors leading to an increased susceptibility for several psychiatric disorders. 2012 Wiley Periodicals, Inc

  14. Multiplexed direct genomic selection (MDiGS): a pooled BAC capture approach for highly accurate CNV and SNP/INDEL detection.

    PubMed

    Alvarado, David M; Yang, Ping; Druley, Todd E; Lovett, Michael; Gurnett, Christina A

    2014-06-01

    Despite declining sequencing costs, few methods are available for cost-effective single-nucleotide polymorphism (SNP), insertion/deletion (INDEL) and copy number variation (CNV) discovery in a single assay. Commercially available methods require a high investment to a specific region and are only cost-effective for large samples. Here, we introduce a novel, flexible approach for multiplexed targeted sequencing and CNV analysis of large genomic regions called multiplexed direct genomic selection (MDiGS). MDiGS combines biotinylated bacterial artificial chromosome (BAC) capture and multiplexed pooled capture for SNP/INDEL and CNV detection of 96 multiplexed samples on a single MiSeq run. MDiGS is advantageous over other methods for CNV detection because pooled sample capture and hybridization to large contiguous BAC baits reduces sample and probe hybridization variability inherent in other methods. We performed MDiGS capture for three chromosomal regions consisting of ∼ 550 kb of coding and non-coding sequence with DNA from 253 patients with congenital lower limb disorders. PITX1 nonsense and HOXC11 S191F missense mutations were identified that segregate in clubfoot families. Using a novel pooled-capture reference strategy, we identified recurrent chromosome chr17q23.1q23.2 duplications and small HOXC 5' cluster deletions (51 kb and 12 kb). Given the current interest in coding and non-coding variants in human disease, MDiGS fulfills a niche for comprehensive and low-cost evaluation of CNVs, coding, and non-coding variants across candidate regions of interest. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  15. Detection and assessment of copy number variation using PacBio long-read and Illumina sequencing in New Zealand dairy cattle.

    PubMed

    Couldrey, C; Keehan, M; Johnson, T; Tiplady, K; Winkelman, A; Littlejohn, M D; Scott, A; Kemper, K E; Hayes, B; Davis, S R; Spelman, R J

    2017-07-01

    Single nucleotide polymorphisms have been the DNA variant of choice for genomic prediction, largely because of the ease of single nucleotide polymorphism genotype collection. In contrast, structural variants (SV), which include copy number variants (CNV), translocations, insertions, and inversions, have eluded easy detection and characterization, particularly in nonhuman species. However, evidence increasingly shows that SV not only contribute a substantial proportion of genetic variation but also have significant influence on phenotypes. Here we present the discovery of CNV in a prominent New Zealand dairy bull using long-read PacBio (Pacific Biosciences, Menlo Park, CA) sequencing technology and the Sniffles SV discovery tool (version 0.0.1; https://github.com/fritzsedlazeck/Sniffles). The CNV identified from long reads were compared with CNV discovered in the same bull from Illumina sequencing using CNVnator (read depth-based tool; Illumina Inc., San Diego, CA) as a means of validation. Subsequently, further validation was undertaken using whole-genome Illumina sequencing of 556 cattle representing the wider New Zealand dairy cattle population. Very limited overlap was observed in CNV discovered from the 2 sequencing platforms, in part because of the differences in size of CNV detected. Only a few CNV were therefore able to be validated using this approach. However, the ability to use CNVnator to genotype the 557 cattle for copy number across all regions identified as putative CNV allowed a genome-wide assessment of transmission level of copy number based on pedigree. The more highly transmissible a putative CNV region was observed to be, the more likely the distribution of copy number was multimodal across the 557 sequenced animals. Furthermore, visual assessment of highly transmissible CNV regions provided evidence supporting the presence of CNV across the sequenced animals. This transmission-based approach was able to confirm a subset of CNV that segregates

  16. Fast Bayesian Inference of Copy Number Variants using Hidden Markov Models with Wavelet Compression

    PubMed Central

    Wiedenhoeft, John; Brugel, Eric; Schliep, Alexander

    2016-01-01

    By integrating Haar wavelets with Hidden Markov Models, we achieve drastically reduced running times for Bayesian inference using Forward-Backward Gibbs sampling. We show that this improves detection of genomic copy number variants (CNV) in array CGH experiments compared to the state-of-the-art, including standard Gibbs sampling. The method concentrates computational effort on chromosomal segments which are difficult to call, by dynamically and adaptively recomputing consecutive blocks of observations likely to share a copy number. This makes routine diagnostic use and re-analysis of legacy data collections feasible; to this end, we also propose an effective automatic prior. An open source software implementation of our method is available at http://schlieplab.org/Software/HaMMLET/ (DOI: 10.5281/zenodo.46262). This paper was selected for oral presentation at RECOMB 2016, and an abstract is published in the conference proceedings. PMID:27177143

  17. Comparative analyses across cattle breeds reveal the pitfalls caused by artificial and lineage-differential copy number variations

    USDA-ARS?s Scientific Manuscript database

    Copy number variations (CNV) are well known genomic variants, which often complicate structural and functional genomics studies. Here, we integrated the CNV region (CNVR) result detected from 1,682 Nellore cattle with the equivalent result derived from the Bovine HapMap samples. Through comparing CN...

  18. Genome-wide analysis of CNV (copy number variation) and their associations with narcolepsy in a Japanese population.

    PubMed

    Yamasaki, Maria; Miyagawa, Taku; Toyoda, Hiromi; Khor, Seik-Soon; Koike, Asako; Nitta, Aino; Akiyama, Kumi; Sasaki, Tsukasa; Honda, Yutaka; Honda, Makoto; Tokunaga, Katsushi

    2014-05-01

    In humans, narcolepsy with cataplexy (narcolepsy) is a sleep disorder that is characterized by sleepiness, cataplexy and rapid eye movement (REM) sleep abnormalities. Narcolepsy is caused by a reduction in the number of neurons that produce hypocretin (orexin) neuropeptide. Both genetic and environmental factors contribute to the development of narcolepsy.Rare and large copy number variations (CNVs) reportedly play a role in the etiology of a number of neuropsychiatric disorders. Narcolepsy is considered a neurological disorder; therefore, we sought to investigate any possible association between rare and large CNVs and human narcolepsy. We used DNA microarray data and a CNV detection software application, PennCNV-Affy, to detect CNVs in 426 Japanese narcoleptic patients and 562 healthy individuals. Overall, we found a significant enrichment of rare and large CNVs (frequency ≤1%, size ≥100 kb) in the patients (case-control ratio of CNV count=1.54, P=5.00 × 10(-4)). Next, we extended a region-based association analysis by including CNVs with its size ≥30 kb. Rare and large CNVs in PARK2 region showed a significant association with narcolepsy. Four patients were assessed to carry duplications of the gene region, whereas no controls carried the duplication, which was further confirmed by quantitative PCR assay. This duplication was also found in 2 essential hypersomnia (EHS) patients out of 171 patients. Furthermore, a pathway analysis revealed enrichments of gene disruptions by rare and large CNVs in immune response, acetyltransferase activity, cell cycle regulation and regulation of cell development. This study constitutes the first report on the risk association between multiple rare and large CNVs and the pathogenesis of narcolepsy. In the future, replication studies are needed to confirm the associations.

  19. Detection Copy Number Variants from NGS with Sparse and Smooth Constraints.

    PubMed

    Zhang, Yue; Cheung, Yiu-Ming; Xu, Bo; Su, Weifeng

    2017-01-01

    It is known that copy number variations (CNVs) are associated with complex diseases and particular tumor types, thus reliable identification of CNVs is of great potential value. Recent advances in next generation sequencing (NGS) data analysis have helped manifest the richness of CNV information. However, the performances of these methods are not consistent. Reliably finding CNVs in NGS data in an efficient way remains a challenging topic, worthy of further investigation. Accordingly, we tackle the problem by formulating CNVs identification into a quadratic optimization problem involving two constraints. By imposing the constraints of sparsity and smoothness, the reconstructed read depth signal from NGS is anticipated to fit the CNVs patterns more accurately. An efficient numerical solution tailored from alternating direction minimization (ADM) framework is elaborated. We demonstrate the advantages of the proposed method, namely ADM-CNV, by comparing it with six popular CNV detection methods using synthetic, simulated, and empirical sequencing data. It is shown that the proposed approach can successfully reconstruct CNV patterns from raw data, and achieve superior or comparable performance in detection of the CNVs compared to the existing counterparts.

  20. Effective normalization for copy number variation detection from whole genome sequencing.

    PubMed

    Janevski, Angel; Varadan, Vinay; Kamalakaran, Sitharthan; Banerjee, Nilanjana; Dimitrova, Nevenka

    2012-01-01

    Whole genome sequencing enables a high resolution view of the human genome and provides unique insights into genome structure at an unprecedented scale. There have been a number of tools to infer copy number variation in the genome. These tools, while validated, also include a number of parameters that are configurable to genome data being analyzed. These algorithms allow for normalization to account for individual and population-specific effects on individual genome CNV estimates but the impact of these changes on the estimated CNVs is not well characterized. We evaluate in detail the effect of normalization methodologies in two CNV algorithms FREEC and CNV-seq using whole genome sequencing data from 8 individuals spanning four populations. We apply FREEC and CNV-seq to a sequencing data set consisting of 8 genomes. We use multiple configurations corresponding to different read-count normalization methodologies in FREEC, and statistically characterize the concordance of the CNV calls between FREEC configurations and the analogous output from CNV-seq. The normalization methodologies evaluated in FREEC are: GC content, mappability and control genome. We further stratify the concordance analysis within genic, non-genic, and a collection of validated variant regions. The GC content normalization methodology generates the highest number of altered copy number regions. Both mappability and control genome normalization reduce the total number and length of copy number regions. Mappability normalization yields Jaccard indices in the 0.07 - 0.3 range, whereas using a control genome normalization yields Jaccard index values around 0.4 with normalization based on GC content. The most critical impact of using mappability as a normalization factor is substantial reduction of deletion CNV calls. The output of another method based on control genome normalization, CNV-seq, resulted in comparable CNV call profiles, and substantial agreement in variable gene and CNV region calls

  1. Identification of copy number variation in French dairy and beef breeds using next-generation sequencing.

    PubMed

    Letaief, Rabia; Rebours, Emmanuelle; Grohs, Cécile; Meersseman, Cédric; Fritz, Sébastien; Trouilh, Lidwine; Esquerré, Diane; Barbieri, Johanna; Klopp, Christophe; Philippe, Romain; Blanquet, Véronique; Boichard, Didier; Rocha, Dominique; Boussaha, Mekki

    2017-10-24

    Copy number variations (CNV) are known to play a major role in genetic variability and disease pathogenesis in several species including cattle. In this study, we report the identification and characterization of CNV in eight French beef and dairy breeds using whole-genome sequence data from 200 animals. Bioinformatics analyses to search for CNV were carried out using four different but complementary tools and we validated a subset of the CNV by both in silico and experimental approaches. We report the identification and localization of 4178 putative deletion-only, duplication-only and CNV regions, which cover 6% of the bovine autosomal genome; they were validated by two in silico approaches and/or experimentally validated using array-based comparative genomic hybridization and single nucleotide polymorphism genotyping arrays. The size of these variants ranged from 334 bp to 7.7 Mb, with an average size of ~ 54 kb. Of these 4178 variants, 3940 were deletions, 67 were duplications and 171 corresponded to both deletions and duplications, which were defined as potential CNV regions. Gene content analysis revealed that, among these variants, 1100 deletions and duplications encompassed 1803 known genes, which affect a wide spectrum of molecular functions, and 1095 overlapped with known QTL regions. Our study is a large-scale survey of CNV in eight French dairy and beef breeds. These CNV will be useful to study the link between genetic variability and economically important traits, and to improve our knowledge on the genomic architecture of cattle.

  2. Genome-wide copy number variant analysis in Holstein cattle reveals variants associated with 10 production traits including residual feed intake and dry matter intake

    USDA-ARS?s Scientific Manuscript database

    Copy number variation (CNV) is an important type of genetic variation contributing to phenotypic differences among mammals and may serve as an alternative molecular marker to single nucleotide polymorphism (SNP) for genome-wide association study (GWAS). Recently, GWAS analysis using CNV has been app...

  3. Whole-genome CNV analysis: advances in computational approaches.

    PubMed

    Pirooznia, Mehdi; Goes, Fernando S; Zandi, Peter P

    2015-01-01

    Accumulating evidence indicates that DNA copy number variation (CNV) is likely to make a significant contribution to human diversity and also play an important role in disease susceptibility. Recent advances in genome sequencing technologies have enabled the characterization of a variety of genomic features, including CNVs. This has led to the development of several bioinformatics approaches to detect CNVs from next-generation sequencing data. Here, we review recent advances in CNV detection from whole genome sequencing. We discuss the informatics approaches and current computational tools that have been developed as well as their strengths and limitations. This review will assist researchers and analysts in choosing the most suitable tools for CNV analysis as well as provide suggestions for new directions in future development.

  4. Genome-wide association study of preeclampsia detects novel maternal single nucleotide polymorphisms and copy-number variants in subsets of the Hyperglycemia and Adverse Pregnancy Outcome (HAPO) study cohort

    PubMed Central

    Zhao, Linlu; Bracken, Michael B.; DeWan, Andrew T.

    2013-01-01

    Summary A genome-wide association study was undertaken to identify maternal single nucleotide polymorphisms (SNPs) and copy-number variants (CNVs) associated with preeclampsia. Case-control analysis was performed on 1070 Afro-Caribbean (n=21 cases and 1049 controls) and 723 Hispanic (n=62 cases and 661 controls) mothers and 1257 mothers of European ancestry (n=50 cases and 1207 controls) from the Hyperglycemia and Adverse Pregnancy Outcome (HAPO) study. European ancestry subjects were genotyped on Illumina Human610-Quad and Afro-Caribbean and Hispanic subjects were genotyped on Illumina Human1M-Duo BeadChip microarrays. Genome-wide SNP data were analyzed using PLINK. CNVs were called using three detection algorithms (GNOSIS, PennCNV, and QuantiSNP), merged using CNVision, and then screened using stringent criteria. SNP and CNV findings were compared to those of the Study of Pregnancy Hypertension in Iowa (SOPHIA), an independent preeclampsia case-control dataset of Caucasian mothers (n=177 cases and 116 controls). A list of top SNPs were identified for each of the HAPO ethnic groups, but none reached Bonferroni-corrected significance. Novel candidate CNVs showing enrichment among preeclampsia cases were also identified in each of the three ethnic groups. Several variants were suggestively replicated in SOPHIA. The discovered SNPs and copy-number variable regions present interesting candidate genetic variants for preeclampsia that warrant further replication and investigation. PMID:23551011

  5. SCRIB and PUF60 Are Primary Drivers of the Multisystemic Phenotypes of the 8q24.3 Copy-Number Variant

    PubMed Central

    Dauber, Andrew; Golzio, Christelle; Guenot, Cécile; Jodelka, Francine M.; Kibaek, Maria; Kjaergaard, Susanne; Leheup, Bruno; Martinet, Danielle; Nowaczyk, Malgorzata J.M.; Rosenfeld, Jill A.; Zeesman, Susan; Zunich, Janice; Beckmann, Jacques S.; Hirschhorn, Joel N.; Hastings, Michelle L.; Jacquemont, Sebastien; Katsanis, Nicholas

    2013-01-01

    Copy-number variants (CNVs) represent a significant interpretative challenge, given that each CNV typically affects the dosage of multiple genes. Here we report on five individuals with coloboma, microcephaly, developmental delay, short stature, and craniofacial, cardiac, and renal defects who harbor overlapping microdeletions on 8q24.3. Fine mapping localized a commonly deleted 78 kb region that contains three genes: SCRIB, NRBP2, and PUF60. In vivo dissection of the CNV showed discrete contributions of the planar cell polarity effector SCRIB and the splicing factor PUF60 to the syndromic phenotype, and the combinatorial suppression of both genes exacerbated some, but not all, phenotypic components. Consistent with these findings, we identified an individual with microcephaly, short stature, intellectual disability, and heart defects with a de novo c.505C>T variant leading to a p.His169Tyr change in PUF60. Functional testing of this allele in vivo and in vitro showed that the mutation perturbs the relative dosage of two PUF60 isoforms and, subsequently, the splicing efficiency of downstream PUF60 targets. These data inform the functions of two genes not associated previously with human genetic disease and demonstrate how CNVs can exhibit complex genetic architecture, with the phenotype being the amalgam of both discrete dosage dysfunction of single transcripts and also of binary genetic interactions. PMID:24140112

  6. Detection of genome-wide copy number variants in myeloid malignancies using next-generation sequencing.

    PubMed

    Shen, Wei; Paxton, Christian N; Szankasi, Philippe; Longhurst, Maria; Schumacher, Jonathan A; Frizzell, Kimberly A; Sorrells, Shelly M; Clayton, Adam L; Jattani, Rakhi P; Patel, Jay L; Toydemir, Reha; Kelley, Todd W; Xu, Xinjie

    2018-04-01

    Genetic abnormalities, including copy number variants (CNV), copy number neutral loss of heterozygosity (CN-LOH) and gene mutations, underlie the pathogenesis of myeloid malignancies and serve as important diagnostic, prognostic and/or therapeutic markers. Currently, multiple testing strategies are required for comprehensive genetic testing in myeloid malignancies. The aim of this proof-of-principle study was to investigate the feasibility of combining detection of genome-wide large CNVs, CN-LOH and targeted gene mutations into a single assay using next-generation sequencing (NGS). For genome-wide CNV detection, we designed a single nucleotide polymorphism (SNP) sequencing backbone with 22 762 SNP regions evenly distributed across the entire genome. For targeted mutation detection, 62 frequently mutated genes in myeloid malignancies were targeted. We combined this SNP sequencing backbone with a targeted mutation panel, and sequenced 9 healthy individuals and 16 patients with myeloid malignancies using NGS. We detected 52 somatic CNVs, 11 instances of CN-LOH and 39 oncogenic mutations in the 16 patients with myeloid malignancies, and none in the 9 healthy individuals. All CNVs and CN-LOH were confirmed by SNP microarray analysis. We describe a genome-wide SNP sequencing backbone which allows for sensitive detection of genome-wide CNVs and CN-LOH using NGS. This proof-of-principle study has demonstrated that this strategy can provide more comprehensive genetic profiling for patients with myeloid malignancies using a single assay. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2018. All rights reserved. No commercial use is permitted unless otherwise expressly granted.

  7. Systematic assessment of the performance of whole-genome amplification for SNP/CNV detection and β-thalassemia genotyping.

    PubMed

    He, Fei; Zhou, Wanjun; Cai, Ren; Yan, Tizhen; Xu, Xiangmin

    2018-04-01

    In this study, we aimed to assess the performance of two whole-genome amplification methods, multiple displacement amplification (MDA), and multiple annealing and looping-based amplification cycle (MALBAC), for β-thalassemia genotyping and single-nucleotide polymorphism (SNP)/copy-number variant (CNV) detection using two DNA sequencing assays. We collected peripheral blood, cell lines, and discarded embryos, and carried out MALBAC and MDA on single-cell and five-cell samples. We detected and statistically analyzed differences in the amplification efficiency, positive predictive value, sensitivity, allele dropout (ADO) rate, SNPs, and CV values between the two methods. Through Sanger sequencing at the single-cell and five-cell levels, we showed that both the amplification rate and ADO rate of MDA were better than those using MALBAC, and the sensitivity and positive predictive value obtained from MDA were higher than those from MALBAC for β-thalassemia genotyping. Using next-generation sequencing (NGS) at the single-cell level, we confirmed that MDA has better properties than MALBAC for SNP detection. However, MALBAC was more stable and homogeneous than MDA using low-depth NGS at the single-cell level for CNV detection. We conclude that MALBAC is the better option for CNV detection, while MDA is better suited for SNV detection.

  8. Family-Based Benchmarking of Copy Number Variation Detection Software.

    PubMed

    Nutsua, Marcel Elie; Fischer, Annegret; Nebel, Almut; Hofmann, Sylvia; Schreiber, Stefan; Krawczak, Michael; Nothnagel, Michael

    2015-01-01

    The analysis of structural variants, in particular of copy-number variations (CNVs), has proven valuable in unraveling the genetic basis of human diseases. Hence, a large number of algorithms have been developed for the detection of CNVs in SNP array signal intensity data. Using the European and African HapMap trio data, we undertook a comparative evaluation of six commonly used CNV detection software tools, namely Affymetrix Power Tools (APT), QuantiSNP, PennCNV, GLAD, R-gada and VEGA, and assessed their level of pair-wise prediction concordance. The tool-specific CNV prediction accuracy was assessed in silico by way of intra-familial validation. Software tools differed greatly in terms of the number and length of the CNVs predicted as well as the number of markers included in a CNV. All software tools predicted substantially more deletions than duplications. Intra-familial validation revealed consistently low levels of prediction accuracy as measured by the proportion of validated CNVs (34-60%). Moreover, up to 20% of apparent family-based validations were found to be due to chance alone. Software using Hidden Markov models (HMM) showed a trend to predict fewer CNVs than segmentation-based algorithms albeit with greater validity. PennCNV yielded the highest prediction accuracy (60.9%). Finally, the pairwise concordance of CNV prediction was found to vary widely with the software tools involved. We recommend HMM-based software, in particular PennCNV, rather than segmentation-based algorithms when validity is the primary concern of CNV detection. QuantiSNP may be used as an additional tool to detect sets of CNVs not detectable by the other tools. Our study also reemphasizes the need for laboratory-based validation, such as qPCR, of CNVs predicted in silico.

  9. Exploration of large, rare copy number variants associated with psychiatric and neurodevelopmental disorders in individuals with anorexia nervosa.

    PubMed

    Yilmaz, Zeynep; Szatkiewicz, Jin P; Crowley, James J; Ancalade, NaEshia; Brandys, Marek K; van Elburg, Annemarie; de Kovel, Carolien G F; Adan, Roger A H; Hinney, Anke; Hebebrand, Johannes; Gratacos, Monica; Fernandez-Aranda, Fernando; Escaramis, Georgia; Gonzalez, Juan R; Estivill, Xavier; Zeggini, Eleftheria; Sullivan, Patrick F; Bulik, Cynthia M

    2017-08-01

    Anorexia nervosa (AN) is a serious and heritable psychiatric disorder. To date, studies of copy number variants (CNVs) have been limited and inconclusive because of small sample sizes. We conducted a case-only genome-wide CNV survey in 1983 female AN cases included in the Genetic Consortium for Anorexia Nervosa. Following stringent quality control procedures, we investigated whether pathogenic CNVs in regions previously implicated in psychiatric and neurodevelopmental disorders were present in AN cases. We observed two instances of the well-established pathogenic CNVs in AN cases. In addition, one case had a deletion in the 13q12 region, overlapping with a deletion reported previously in two AN cases. As a secondary aim, we also examined our sample for CNVs over 1 Mbp in size. Out of the 40 instances of such large CNVs that were not implicated previously for AN or neuropsychiatric phenotypes, two of them contained genes with previous neuropsychiatric associations, and only five of them had no associated reports in public CNV databases. Although ours is the largest study of its kind in AN, larger datasets are needed to comprehensively assess the role of CNVs in the etiology of AN.

  10. CNV-CH: A Convex Hull Based Segmentation Approach to Detect Copy Number Variations (CNV) Using Next-Generation Sequencing Data

    PubMed Central

    De, Rajat K.

    2015-01-01

    Copy number variation (CNV) is a form of structural alteration in the mammalian DNA sequence, which are associated with many complex neurological diseases as well as cancer. The development of next generation sequencing (NGS) technology provides us a new dimension towards detection of genomic locations with copy number variations. Here we develop an algorithm for detecting CNVs, which is based on depth of coverage data generated by NGS technology. In this work, we have used a novel way to represent the read count data as a two dimensional geometrical point. A key aspect of detecting the regions with CNVs, is to devise a proper segmentation algorithm that will distinguish the genomic locations having a significant difference in read count data. We have designed a new segmentation approach in this context, using convex hull algorithm on the geometrical representation of read count data. To our knowledge, most algorithms have used a single distribution model of read count data, but here in our approach, we have considered the read count data to follow two different distribution models independently, which adds to the robustness of detection of CNVs. In addition, our algorithm calls CNVs based on the multiple sample analysis approach resulting in a low false discovery rate with high precision. PMID:26291322

  11. CNV-CH: A Convex Hull Based Segmentation Approach to Detect Copy Number Variations (CNV) Using Next-Generation Sequencing Data.

    PubMed

    Sinha, Rituparna; Samaddar, Sandip; De, Rajat K

    2015-01-01

    Copy number variation (CNV) is a form of structural alteration in the mammalian DNA sequence, which are associated with many complex neurological diseases as well as cancer. The development of next generation sequencing (NGS) technology provides us a new dimension towards detection of genomic locations with copy number variations. Here we develop an algorithm for detecting CNVs, which is based on depth of coverage data generated by NGS technology. In this work, we have used a novel way to represent the read count data as a two dimensional geometrical point. A key aspect of detecting the regions with CNVs, is to devise a proper segmentation algorithm that will distinguish the genomic locations having a significant difference in read count data. We have designed a new segmentation approach in this context, using convex hull algorithm on the geometrical representation of read count data. To our knowledge, most algorithms have used a single distribution model of read count data, but here in our approach, we have considered the read count data to follow two different distribution models independently, which adds to the robustness of detection of CNVs. In addition, our algorithm calls CNVs based on the multiple sample analysis approach resulting in a low false discovery rate with high precision.

  12. Genome-Wide Copy Number Variation Association Analyses for Age at Menarche

    PubMed Central

    Li, Jian; Pan, Rong; Shen, Hui; Tian, Qing; Zhou, Yu; Liu, Yong-Jun

    2012-01-01

    Context: Menarche is a significant physiological event for women. Age at menarche (AAM) is a heritable trait associated with many common female diseases. The genetic basis and the mechanism for AAM are largely unknown. Copy number variation (CNV) is a common type of genetic variation underlying human complex traits. The importance of CNV to AAM variation is unclear. Objective: The objective of the study was to identify CNV important to AAM variation. Design: We performed the first genome-wide CNV study of AAM in 1654 Caucasian females using Affymetrix human single-nucleotide polymorphism 6.0 array. We also replicated our findings in another Chinese cohort containing 752 women. Results: We identified a CNV, variation_38399, in the 2q14.2 region, for association with AAM (P = 1.03 × 10−3). The CNV has two variants (one copy and two copy), with a mean AAM of 14.00 yr and 12.90 yr, respectively. Interestingly, in a Chinese sample containing 752 women, this CNV has been replicated both with a marginally significant P = 0.090 and with a same direction of effect (a lower copy number for a later AAM). The CNV is located approximately 75 kb upstream of the diazepam binding inhibitor (DBI), a gene known to regulate estrogen levels, a key factor for menarche. Conclusion: Our findings for the first time identified a novel CNV and suggested the DBI-mediated endocrinological pathway as a potential mechanism for AAM regulation. PMID:22904172

  13. Detection, breakpoint identification and detailed characterisation of a CNV at the FRA16D site using SNP assays.

    PubMed

    Winchester, L; Newbury, D F; Monaco, A P; Ragoussis, J

    2008-01-01

    Copy Number Variants (CNV) and other submicroscopic structural changes are now recognised to be widespread across the human genome. We show that SNP data generated for association study can be utilised for the identification of deletion CNVs. During analysis of data for an SNP association study for Specific Language Impairment (SLI) a deletion was identified. SLI adversely affects the language development of children in the absence of any obvious cause. Previous studies have found linkage to a region on chromosome 16. The deletion was located in a known fragile site FRA16D in intron 5-6 of the WWOX gene (also known as FOR). Changes in the FRA16D site have been previously linked to cancer and are often characterised in cell lines. A long-range PCR assay was used to confirm the existence of the deletion. We also show the breakpoint identification and large-scale characterisation of this CNV in a normal human sample set. Copyright 2009 S. Karger AG, Basel.

  14. CYP1B1 copy number variation is not a major contributor to primary congenital glaucoma.

    PubMed

    Souzeau, Emmanuelle; Hayes, Melanie; Ruddle, Jonathan B; Elder, James E; Staffieri, Sandra E; Kearns, Lisa S; Mackey, David A; Zhou, Tiger; Ridge, Bronwyn; Burdon, Kathryn P; Dubowsky, Andrew; Craig, Jamie E

    2015-01-01

    To evaluate the prevalence and the diagnostic utility of testing for CYP1B1 copy number variation (CNV) in primary congenital glaucoma (PCG) cases unexplained by CYP1B1 point mutations in The Australian and New Zealand Registry of Advanced Glaucoma. In total, 50 PCG cases either heterozygous for disease-causing variants or with no CYP1B1 sequence variants were included in the study. CYP1B1 CNV was analyzed by Multiplex Ligation-dependent Probe Amplification (MLPA). No deletions or duplications were found in any of the cases. This is the first study to report on CYP1B1 CNV in PCG cases. Our findings show that this mechanism is not a major contributor to the phenotype and is of limited diagnostic utility.

  15. Assessing the impact of copy number variants on miRNA genes in autism by Monte Carlo simulation.

    PubMed

    Marrale, Maurizio; Albanese, Nadia Ninfa; Calì, Francesco; Romano, Valentino

    2014-01-01

    Autism Spectrum Disorders (ASDs) are childhood neurodevelopmental disorders with complex genetic origins. Previous studies have investigated the role of de novo Copy Number Variants (CNVs) and microRNAs as important but distinct etiological factors in ASD. We developed a novel computational procedure to assess the potential pathogenic role of microRNA genes overlapping de novo CNVs in ASD patients. Here we show that for chromosomes # 1, 2 and 22 the actual number of miRNA loci affected by de novo CNVs in patients was found significantly higher than that estimated by Monte Carlo simulation of random CNV events. Out of 24 miRNA genes over-represented in CNVs from these three chromosomes only hsa-mir-4436b-1 and hsa-mir-4436b-2 have not been detected in CNVs from non-autistic subjects as reported in the Database of Genomic Variants. Altogether the results reported in this study represent a first step towards a full understanding of how a dysregulated expression of the 24 miRNAs genes affect neurodevelopment in autism. We also propose that the procedure used in this study can be effectively applied to CNVs/miRNA genes association data in other genomic disorders beyond autism.

  16. Application of Nexus copy number software for CNV detection and analysis.

    PubMed

    Darvishi, Katayoon

    2010-04-01

    Among human structural genomic variation, copy number variants (CNVs) are the most frequently known component, comprised of gains/losses of DNA segments that are generally 1 kb in length or longer. Array-based comparative genomic hybridization (aCGH) has emerged as a powerful tool for detecting genomic copy number variants (CNVs). With the rapid increase in the density of array technology and with the adaptation of new high-throughput technology, a reliable and computationally scalable method for accurate mapping of recurring DNA copy number aberrations has become a main focus in research. Here we introduce Nexus Copy Number software, a platform-independent tool, to analyze the output files of all types of commercial and custom-made comparative genomic hybridization (CGH) and single-nucleotide polymorphism (SNP) arrays, such as those manufactured by Affymetrix, Agilent Technologies, Illumina, and Roche NimbleGen. It also supports data generated by various array image-analysis software tools such as GenePix, ImaGene, and BlueFuse. (c) 2010 by John Wiley & Sons, Inc.

  17. Increased frequency of de novo copy number variants in congenital heart disease by integrative analysis of single nucleotide polymorphism array and exome sequence data.

    PubMed

    Glessner, Joseph T; Bick, Alexander G; Ito, Kaoru; Homsy, Jason; Rodriguez-Murillo, Laura; Fromer, Menachem; Mazaika, Erica; Vardarajan, Badri; Italia, Michael; Leipzig, Jeremy; DePalma, Steven R; Golhar, Ryan; Sanders, Stephan J; Yamrom, Boris; Ronemus, Michael; Iossifov, Ivan; Willsey, A Jeremy; State, Matthew W; Kaltman, Jonathan R; White, Peter S; Shen, Yufeng; Warburton, Dorothy; Brueckner, Martina; Seidman, Christine; Goldmuntz, Elizabeth; Gelb, Bruce D; Lifton, Richard; Seidman, Jonathan; Hakonarson, Hakon; Chung, Wendy K

    2014-10-24

    Congenital heart disease (CHD) is among the most common birth defects. Most cases are of unknown pathogenesis. To determine the contribution of de novo copy number variants (CNVs) in the pathogenesis of sporadic CHD. We studied 538 CHD trios using genome-wide dense single nucleotide polymorphism arrays and whole exome sequencing. Results were experimentally validated using digital droplet polymerase chain reaction. We compared validated CNVs in CHD cases with CNVs in 1301 healthy control trios. The 2 complementary high-resolution technologies identified 63 validated de novo CNVs in 51 CHD cases. A significant increase in CNV burden was observed when comparing CHD trios with healthy trios, using either single nucleotide polymorphism array (P=7×10(-5); odds ratio, 4.6) or whole exome sequencing data (P=6×10(-4); odds ratio, 3.5) and remained after removing 16% of de novo CNV loci previously reported as pathogenic (P=0.02; odds ratio, 2.7). We observed recurrent de novo CNVs on 15q11.2 encompassing CYFIP1, NIPA1, and NIPA2 and single de novo CNVs encompassing DUSP1, JUN, JUP, MED15, MED9, PTPRE SREBF1, TOP2A, and ZEB2, genes that interact with established CHD proteins NKX2-5 and GATA4. Integrating de novo variants in whole exome sequencing and CNV data suggests that ETS1 is the pathogenic gene altered by 11q24.2-q25 deletions in Jacobsen syndrome and that CTBP2 is the pathogenic gene in 10q subtelomeric deletions. We demonstrate a significantly increased frequency of rare de novo CNVs in CHD patients compared with healthy controls and suggest several novel genetic loci for CHD. © 2014 American Heart Association, Inc.

  18. Analysis of Extreme Phenotype Bulk Copy Number Variation (XP-CNV) Identified the Association of rp1 with Resistance to Goss's Wilt of Maize.

    PubMed

    Hu, Ying; Ren, Jie; Peng, Zhao; Umana, Arnoldo A; Le, Ha; Danilova, Tatiana; Fu, Junjie; Wang, Haiyan; Robertson, Alison; Hulbert, Scot H; White, Frank F; Liu, Sanzhen

    2018-01-01

    Goss's wilt (GW) of maize is caused by the Gram-positive bacterium Clavibacter michiganensis subsp. nebraskensis (Cmn) and has spread in recent years throughout the Great Plains, posing a threat to production. The genetic basis of plant resistance is unknown. Here, a simple method for quantifying disease symptoms was developed and used to select cohorts of highly resistant and highly susceptible lines known as extreme phenotypes (XP). Copy number variation (CNV) analyses using whole genome sequences of bulked XP revealed 141 genes containing CNV between the two XP groups. The CNV genes include the previously identified common rust resistant locus rp1 . Multiple Rp1 accessions with distinct rp1 haplotypes in an otherwise susceptible accession exhibited hypersensitive responses upon inoculation. GW provides an excellent system for the genetic dissection of diseases caused by closely related subspecies of C. michiganesis . Further work will facilitate breeding strategies to control GW and provide needed insight into the resistance mechanism of important related diseases such as bacterial canker of tomato and bacterial ring rot of potato.

  19. UGT2B17 and SULT1A1 gene copy number variation (CNV) detection by LabChip microfluidic technology.

    PubMed

    Gaedigk, Andrea; Gaedigk, Roger; Leeder, J Steven

    2010-05-01

    Gene copy number variations (CNVs) are increasingly recognized to play important roles in the expression of genes and hence on their respective enzymatic activities. This has been demonstrated for a number of drug metabolizing genes, such as UDP-glucuronosyltransferases 2B17 (UGT2B17) and sulfotransferase 1A1 (SULT1A1), which are subject to genetic heterogeneity, including CNV. Quantitative assays to assess gene copy number are therefore becoming an integral part of accurate genotype assessment and phenotype prediction. In this study, we evaluated a microfluidics-based system, the Bio-Rad Experion system, to determine the power and utility of this platform to detect UGT2B17 and SULT1A1 CNV in DNA samples derived from blood and tissue. UGT2B17 is known to present with 0, 1 or 2 and SULT1A1 with up to 5 gene copies. Distinct clustering (p<0.001) into copy number groups was achieved for both genes. DNA samples derived from blood exhibited less inter-run variability compared to DNA samples obtained from liver tissue. This variability may be caused by tissue-specific PCR inhibitors as it could be overcome by using DNA from another tissue, or after the DNA had undergone whole genome amplification. This method produced results comparable to those reported for other quantitative test platforms.

  20. The landscape of inherited and de novo copy number variants in a plasmodium falciparum genetic cross

    PubMed Central

    2011-01-01

    Background Copy number is a major source of genome variation with important evolutionary implications. Consequently, it is essential to determine copy number variant (CNV) behavior, distributions and frequencies across genomes to understand their origins in both evolutionary and generational time frames. We use comparative genomic hybridization (CGH) microarray and the resolution provided by a segregating population of cloned progeny lines of the malaria parasite, Plasmodium falciparum, to identify and analyze the inheritance of 170 genome-wide CNVs. Results We describe CNVs in progeny clones derived from both Mendelian (i.e. inherited) and non-Mendelian mechanisms. Forty-five CNVs were present in the parent lines and segregated in the progeny population. Furthermore, extensive variation that did not conform to strict Mendelian inheritance patterns was observed. 124 CNVs were called in one or more progeny but in neither parent: we observed CNVs in more than one progeny clone that were not identified in either parent, located more frequently in the telomeric-subtelomeric regions of chromosomes and singleton de novo CNVs distributed evenly throughout the genome. Linkage analysis of CNVs revealed dynamic copy number fluctuations and suggested mechanisms that could have generated them. Five of 12 previously identified expression quantitative trait loci (eQTL) hotspots coincide with CNVs, demonstrating the potential for broad influence of CNV on the transcriptional program and phenotypic variation. Conclusions CNVs are a significant source of segregating and de novo genome variation involving hundreds of genes. Examination of progeny genome segments provides a framework to assess the extent and possible origins of CNVs. This segregating genetic system reveals the breadth, distribution and dynamics of CNVs in a surprisingly plastic parasite genome, providing a new perspective on the sources of diversity in parasite populations. PMID:21936954

  1. Genome-Wide Mapping of Copy Number Variation in Humans: Comparative Analysis of High Resolution Array Platforms

    PubMed Central

    Haraksingh, Rajini R.; Abyzov, Alexej; Gerstein, Mark; Urban, Alexander E.; Snyder, Michael

    2011-01-01

    Accurate and efficient genome-wide detection of copy number variants (CNVs) is essential for understanding human genomic variation, genome-wide CNV association type studies, cytogenetics research and diagnostics, and independent validation of CNVs identified from sequencing based technologies. Numerous, array-based platforms for CNV detection exist utilizing array Comparative Genome Hybridization (aCGH), Single Nucleotide Polymorphism (SNP) genotyping or both. We have quantitatively assessed the abilities of twelve leading genome-wide CNV detection platforms to accurately detect Gold Standard sets of CNVs in the genome of HapMap CEU sample NA12878, and found significant differences in performance. The technologies analyzed were the NimbleGen 4.2 M, 2.1 M and 3×720 K Whole Genome and CNV focused arrays, the Agilent 1×1 M CGH and High Resolution and 2×400 K CNV and SNP+CGH arrays, the Illumina Human Omni1Quad array and the Affymetrix SNP 6.0 array. The Gold Standards used were a 1000 Genomes Project sequencing-based set of 3997 validated CNVs and an ultra high-resolution aCGH-based set of 756 validated CNVs. We found that sensitivity, total number, size range and breakpoint resolution of CNV calls were highest for CNV focused arrays. Our results are important for cost effective CNV detection and validation for both basic and clinical applications. PMID:22140474

  2. Evaluation of somatic copy number estimation tools for whole-exome sequencing data.

    PubMed

    Nam, Jae-Yong; Kim, Nayoung K D; Kim, Sang Cheol; Joung, Je-Gun; Xi, Ruibin; Lee, Semin; Park, Peter J; Park, Woong-Yang

    2016-03-01

    Whole-exome sequencing (WES) has become a standard method for detecting genetic variants in human diseases. Although the primary use of WES data has been the identification of single nucleotide variations and indels, these data also offer a possibility of detecting copy number variations (CNVs) at high resolution. However, WES data have uneven read coverage along the genome owing to the target capture step, and the development of a robust WES-based CNV tool is challenging. Here, we evaluate six WES somatic CNV detection tools: ADTEx, CONTRA, Control-FREEC, EXCAVATOR, ExomeCNV and Varscan2. Using WES data from 50 kidney chromophobe, 50 bladder urothelial carcinoma, and 50 stomach adenocarcinoma patients from The Cancer Genome Atlas, we compared the CNV calls from the six tools with a reference CNV set that was identified by both single nucleotide polymorphism array 6.0 and whole-genome sequencing data. We found that these algorithms gave highly variable results: visual inspection reveals significant differences between the WES-based segmentation profiles and the reference profile, as well as among the WES-based profiles. Using a 50% overlap criterion, 13-77% of WES CNV calls were covered by CNVs from the reference set, up to 21% of the copy gains were called as losses or vice versa, and dramatic differences in CNV sizes and CNV numbers were observed. Overall, ADTEx and EXCAVATOR had the best performance with relatively high precision and sensitivity. We suggest that the current algorithms for somatic CNV detection from WES data are limited in their performance and that more robust algorithms are needed. © The Author 2015. Published by Oxford University Press. For Permissions, please email: journals.permissions@oup.com.

  3. Population-genetic properties of differentiated copy number variations in cattle.

    PubMed

    Xu, Lingyang; Hou, Yali; Bickhart, Derek M; Zhou, Yang; Hay, El Hamidi Abdel; Song, Jiuzhou; Sonstegard, Tad S; Van Tassell, Curtis P; Liu, George E

    2016-03-23

    While single nucleotide polymorphism (SNP) is typically the variant of choice for population genetics, copy number variation (CNV) which comprises insertion, deletion and duplication of genomic sequence, is an informative type of genetic variation. CNVs have been shown to be both common in mammals and important for understanding the relationship between genotype and phenotype. However, CNV differentiation, selection and its population genetic properties are not well understood across diverse populations. We performed a population genetics survey based on CNVs derived from the BovineHD SNP array data of eight distinct cattle breeds. We generated high resolution results that show geographical patterns of variations and genome-wide admixture proportions within and among breeds. Similar to the previous SNP-based studies, our CNV-based results displayed a strong correlation of population structure and geographical location. By conducting three pairwise comparisons among European taurine, African taurine, and indicine groups, we further identified 78 unique CNV regions that were highly differentiated, some of which might be due to selection. These CNV regions overlapped with genes involved in traits related to parasite resistance, immunity response, body size, fertility, and milk production. Our results characterize CNV diversity among cattle populations and provide a list of lineage-differentiated CNVs.

  4. Shape-based retrieval of CNV regions in read coverage data.

    PubMed

    Hong, Sangkyun; Yoon, Jeehee; Hong, Dongwan; Lee, Unjoo; Kim, Baeksop; Park, Sanghyun

    2014-01-01

    This study proposes a novel copy number variation (CNV) detection method, CNV_shape, based on variations in the shape of the read coverage data which are obtained from millions of short reads aligned to a reference sequence. The proposed method carries out two transforms, mean shift transform and mean slope transform, to extract the shape of a CNV more precisely from real human data, which are vulnerable to experimental and biological noises. The mean shift transform is a procedure for gaining a preliminary estimation of the CNVs by statistically evaluating moving averages of given read coverage data. The mean slope transform extracts candidate CNVs by filtering out non-stationary sub-regions from each of the primary CNVs pre-estimated in the mean shift procedure. Each of the candidate CNVs is merged with neighbours depending on the merging score to be finally identified as a putative CNV, where the merging score is estimated by the ratio of the positions with non-zero values of the mean shift transform to the total length of the region including two neighbouring candidate CNVs and the interval between them. The proposed CNV detection method was validated experimentally with simulated data and real human data. The simulated data with coverage in the range of 1x to 10x were generated for various sampling sizes and p-values. Five individual human genomes were used as real human data. The results show that relatively small CNVs (> 1 kbp) can be detected from low coverage (> 1.7x) data. The results also reveal that, in contrast to conventional methods, performance improvement from 8.18 to 87.90% was achieved in CNV_shape. The outcomes suggest that the proposed method is very effective in reducing noises inherent in real data as well as in detecting CNVs of various sizes and types.

  5. Contribution of Global Rare Copy-Number Variants to the Risk of Sporadic Congenital Heart Disease

    PubMed Central

    Soemedi, Rachel; Wilson, Ian J.; Bentham, Jamie; Darlay, Rebecca; Töpf, Ana; Zelenika, Diana; Cosgrove, Catherine; Setchfield, Kerry; Thornborough, Chris; Granados-Riveron, Javier; Blue, Gillian M.; Breckpot, Jeroen; Hellens, Stephen; Zwolinkski, Simon; Glen, Elise; Mamasoula, Chrysovalanto; Rahman, Thahira J.; Hall, Darroch; Rauch, Anita; Devriendt, Koenraad; Gewillig, Marc; O’ Sullivan, John; Winlaw, David S.; Bu’Lock, Frances; Brook, J. David; Bhattacharya, Shoumo; Lathrop, Mark; Santibanez-Koref, Mauro; Cordell, Heather J.; Goodship, Judith A.; Keavney, Bernard D.

    2012-01-01

    Previous studies have shown that copy-number variants (CNVs) contribute to the risk of complex developmental phenotypes. However, the contribution of global CNV burden to the risk of sporadic congenital heart disease (CHD) remains incompletely defined. We generated genome-wide CNV data by using Illumina 660W-Quad SNP arrays in 2,256 individuals with CHD, 283 trio CHD-affected families, and 1,538 controls. We found association of rare genic deletions with CHD risk (odds ratio [OR] = 1.8, p = 0.0008). Rare deletions in study participants with CHD had higher gene content (p = 0.001) with higher haploinsufficiency scores (p = 0.03) than they did in controls, and they were enriched with Wnt-signaling genes (p = 1 × 10−5). Recurrent 15q11.2 deletions were associated with CHD risk (OR = 8.2, p = 0.02). Rare de novo CNVs were observed in ∼5% of CHD trios; 10 out of 11 occurred on the paternally transmitted chromosome (p = 0.01). Some of the rare de novo CNVs spanned genes known to be involved in heart development (e.g., HAND2 and GJA5). Rare genic deletions contribute ∼4% of the population-attributable risk of sporadic CHD. Second to previously described CNVs at 1q21.1, deletions at 15q11.2 and those implicating Wnt signaling are the most significant contributors to the risk of sporadic CHD. Rare de novo CNVs identified in CHD trios exhibit paternal origin bias. PMID:22939634

  6. Analysis of Copy Number Variants on Chromosome 21 in Down Syndrome-Associated Congenital Heart Defects.

    PubMed

    Rambo-Martin, Benjamin L; Mulle, Jennifer G; Cutler, David J; Bean, Lora J H; Rosser, Tracie C; Dooley, Kenneth J; Cua, Clifford; Capone, George; Maslen, Cheryl L; Reeves, Roger H; Sherman, Stephanie L; Zwick, Michael E

    2018-01-04

    One in five people with Down syndrome (DS) are born with an atrioventricular septal defect (AVSD), an incidence 2000 times higher than in the euploid population. The genetic loci that contribute to this risk are poorly understood. In this study, we tested two hypotheses: (1) individuals with DS carrying chromosome 21 copy number variants (CNVs) that interrupt exons may be protected from AVSD, because these CNVs return AVSD susceptibility loci back to disomy, and (2) individuals with DS carrying chromosome 21 genes spanned by microduplications are at greater risk for AVSD because these microduplications boost the dosage of AVSD susceptibility loci beyond a tolerable threshold. We tested 198 case individuals with DS+AVSD, and 211 control individuals with DS and a normal heart, using a custom microarray with dense probes tiled on chromosome 21 for array CGH (aCGH). We found that neither an individual chromosome 21 CNV nor any individual gene intersected by a CNV was associated with AVSD in DS. Burden analyses revealed that African American controls had more bases covered by rare deletions than did African American cases. Inversely, we found that Caucasian cases had more genes intersected by rare duplications than did Caucasian controls. We also showed that previously DS+AVSD (DS and a complete AVSD)-associated common CNVs on chromosome 21 failed to replicate. This research adds to the swell of evidence indicating that DS-associated AVSD is similarly heterogeneous, as is AVSD in the euploid population. Copyright © 2018 Rambo-Martin et al.

  7. Increasing the yield in targeted next-generation sequencing by implicating CNV analysis, non-coding exons and the overall variant load: the example of retinal dystrophies.

    PubMed

    Eisenberger, Tobias; Neuhaus, Christine; Khan, Arif O; Decker, Christian; Preising, Markus N; Friedburg, Christoph; Bieg, Anika; Gliem, Martin; Charbel Issa, Peter; Holz, Frank G; Baig, Shahid M; Hellenbroich, Yorck; Galvez, Alberto; Platzer, Konrad; Wollnik, Bernd; Laddach, Nadja; Ghaffari, Saeed Reza; Rafati, Maryam; Botzenhart, Elke; Tinschert, Sigrid; Börger, Doris; Bohring, Axel; Schreml, Julia; Körtge-Jung, Stefani; Schell-Apacik, Chayim; Bakur, Khadijah; Al-Aama, Jumana Y; Neuhann, Teresa; Herkenrath, Peter; Nürnberg, Gudrun; Nürnberg, Peter; Davis, John S; Gal, Andreas; Bergmann, Carsten; Lorenz, Birgit; Bolz, Hanno J

    2013-01-01

    Retinitis pigmentosa (RP) and Leber congenital amaurosis (LCA) are major causes of blindness. They result from mutations in many genes which has long hampered comprehensive genetic analysis. Recently, targeted next-generation sequencing (NGS) has proven useful to overcome this limitation. To uncover "hidden mutations" such as copy number variations (CNVs) and mutations in non-coding regions, we extended the use of NGS data by quantitative readout for the exons of 55 RP and LCA genes in 126 patients, and by including non-coding 5' exons. We detected several causative CNVs which were key to the diagnosis in hitherto unsolved constellations, e.g. hemizygous point mutations in consanguineous families, and CNVs complemented apparently monoallelic recessive alleles. Mutations of non-coding exon 1 of EYS revealed its contribution to disease. In view of the high carrier frequency for retinal disease gene mutations in the general population, we considered the overall variant load in each patient to assess if a mutation was causative or reflected accidental carriership in patients with mutations in several genes or with single recessive alleles. For example, truncating mutations in RP1, a gene implicated in both recessive and dominant RP, were causative in biallelic constellations, unrelated to disease when heterozygous on a biallelic mutation background of another gene, or even non-pathogenic if close to the C-terminus. Patients with mutations in several loci were common, but without evidence for di- or oligogenic inheritance. Although the number of targeted genes was low compared to previous studies, the mutation detection rate was highest (70%) which likely results from completeness and depth of coverage, and quantitative data analysis. CNV analysis should routinely be applied in targeted NGS, and mutations in non-coding exons give reason to systematically include 5'-UTRs in disease gene or exome panels. Consideration of all variants is indispensable because even

  8. Global diversity, population stratification, and selection of human copy number variation

    PubMed Central

    Sudmant, Peter H.; Mallick, Swapan; Nelson, Bradley J.; Hormozdiari, Fereydoun; Krumm, Niklas; Huddleston, John; Coe, Bradley P.; Baker, Carl; Nordenfelt, Susanne; Bamshad, Michael; Jorde, Lynn B.; Posukh, Olga L.; Sahakyan, Hovhannes; Watkins, W. Scott; Yepiskoposyan, Levon; Abdullah, M. Syafiq; Bravi, Claudio M.; Capelli, Cristian; Hervig, Tor; Wee, Joseph T. S.; Tyler-Smith, Chris; van Driem, George; Romero, Irene Gallego; Jha, Aashish R.; Karachanak-Yankova, Sena; Toncheva, Draga; Comas, David; Henn, Brenna; Kivisild, Toomas; Ruiz-Linares, Andres; Sajantila, Antti; Metspalu, Ene; Parik, Jüri; Villems, Richard; Starikovskaya, Elena B.; Ayodo, George; Beall, Cynthia M.; Di Rienzo, Anna; Hammer, Michael; Khusainova, Rita; Khusnutdinova, Elza; Klitz, William; Winkler, Cheryl; Labuda, Damian; Metspalu, Mait; Tishkoff, Sarah A.; Dryomov, Stanislav; Sukernik, Rem; Patterson, Nick; Reich, David; Eichler, Evan E.

    2015-01-01

    In order to explore the diversity and selective signatures of duplication and deletion human copy number variants (CNVs), we sequenced 236 individuals from 125 distinct human populations. We observed that duplications exhibit fundamentally different population genetic and selective signatures than deletions and are more likely to be stratified between human populations. Through reconstruction of the ancestral human genome, we identify megabases of DNA lost in different human lineages and pinpoint large duplications that introgressed from the extinct Denisova lineage now found at high frequency exclusively in Oceanic populations. We find that the proportion of CNV base pairs to single nucleotide variant base pairs is greater among non-Africans than it is among African populations, but we conclude that this difference is likely due to unique aspects of non-African population history as opposed to differences in CNV load. PMID:26249230

  9. Dynamics of Copy Number Variation in Host Races of the Pea Aphid

    PubMed Central

    Duvaux, Ludovic; Geissmann, Quentin; Gharbi, Karim; Zhou, Jing-Jiang; Ferrari, Julia; Smadja, Carole M.; Butlin, Roger K.

    2015-01-01

    Copy number variation (CNV) makes a major contribution to overall genetic variation and is suspected to play an important role in adaptation. However, aside from a few model species, the extent of CNV in natural populations has seldom been investigated. Here, we report on CNV in the pea aphid Acyrthosiphon pisum, a powerful system for studying the genetic architecture of host-plant adaptation and speciation thanks to multiple host races forming a continuum of genetic divergence. Recent studies have highlighted the potential importance of chemosensory genes, including the gustatory and olfactory receptor gene families (Gr and Or, respectively), in the process of host race formation. We used targeted resequencing to achieve a very high depth of coverage, and thereby revealed the extent of CNV of 434 genes, including 150 chemosensory genes, in 104 individuals distributed across eight host races of the pea aphid. We found that CNV was widespread in our global sample, with a significantly higher occurrence in multigene families, especially in Ors. We also observed a decrease in the gene probability of being completely duplicated or deleted (CDD) with increase in coding sequence length. Genes with CDD variants were usually more polymorphic for copy number, especially in the P450 gene family where toxin resistance may be related to gene dosage. We found that Gr were overrepresented among genes discriminating host races, as were CDD genes and pseudogenes. Our observations shed new light on CNV dynamics and are consistent with CNV playing a role in both local adaptation and speciation. PMID:25234705

  10. Exome copy number variation detection: Use of a pool of unrelated healthy tissue as reference sample.

    PubMed

    Wenric, Stephane; Sticca, Tiberio; Caberg, Jean-Hubert; Josse, Claire; Fasquelle, Corinne; Herens, Christian; Jamar, Mauricette; Max, Stéphanie; Gothot, André; Caers, Jo; Bours, Vincent

    2017-01-01

    An increasing number of bioinformatic tools designed to detect CNVs (copy number variants) in tumor samples based on paired exome data where a matched healthy tissue constitutes the reference have been published in the recent years. The idea of using a pool of unrelated healthy DNA as reference has previously been formulated but not thoroughly validated. As of today, the gold standard for CNV calling is still aCGH but there is an increasing interest in detecting CNVs by exome sequencing. We propose to design a metric allowing the comparison of two CNV profiles, independently of the technique used and assessed the validity of using a pool of unrelated healthy DNA instead of a matched healthy tissue as reference in exome-based CNV detection. We compared the CNV profiles obtained with three different approaches (aCGH, exome sequencing with a matched healthy tissue as reference, exome sequencing with a pool of eight unrelated healthy tissue as reference) on three multiple myeloma samples. We show that the usual analyses performed to compare CNV profiles (deletion/amplification ratios and CNV size distribution) lack in precision when confronted with low LRR values, as they only consider the binary status of each CNV. We show that the metric-based distance constitutes a more accurate comparison of two CNV profiles. Based on these analyses, we conclude that a reliable picture of CNV alterations in multiple myeloma samples can be obtained from whole-exome sequencing in the absence of a matched healthy sample. © 2016 WILEY PERIODICALS, INC.

  11. CNVinspector: a web-based tool for the interactive evaluation of copy number variations in single patients and in cohorts.

    PubMed

    Knierim, Ellen; Schwarz, Jana Marie; Schuelke, Markus; Seelow, Dominik

    2013-08-01

    Many genetic disorders are caused by copy number variations (CNVs) in the human genome. However, the large number of benign CNV polymorphisms makes it difficult to delineate causative variants for a certain disease phenotype. Hence, we set out to create software that accumulates and visualises locus-specific knowledge and enables clinicians to study their own CNVs in the context of known polymorphisms and disease variants. CNV data from healthy cohorts (Database of Genomic Variants) and from disease-related databases (DECIPHER) were integrated into a joint resource. Data are presented in an interactive web-based application that allows inspection, evaluation and filtering of CNVs in single individuals or in entire cohorts. CNVinspector provides simple interfaces to upload CNV data, compare them with own or published control data and visualise the results in graphical interfaces. Beyond choosing control data from different public studies, platforms and methods, dedicated filter options allow the detection of CNVs that are either enriched in patients or depleted in controls. Alternatively, a search can be restricted to those CNVs that appear in individuals of similar clinical phenotype. For each gene of interest within a CNV, we provide a link to NCBI, ENSEMBL and the GeneDistiller search engine to browse for potential disease-associated genes. With its user-friendly handling, the integration of control data and the filtering options, CNVinspector will facilitate the daily work of clinical geneticists and accelerate the delineation of new syndromes and gene functions. CNVinspector is freely accessible under http://www.cnvinspector.org.

  12. A Genome Wide Study of Copy Number Variation Associated with Nasopharyngeal Carcinoma in Malaysian Chinese Identifies CNVs at 11q14.3 and 6p21.3 as Candidate Loci.

    PubMed

    Low, Joyce Siew Yong; Chin, Yoon Ming; Mushiroda, Taisei; Kubo, Michiaki; Govindasamy, Gopala Krishnan; Pua, Kin Choo; Yap, Yoke Yeow; Yap, Lee Fah; Subramaniam, Selva Kumar; Ong, Cheng Ai; Tan, Tee Yong; Khoo, Alan Soo Beng; Ng, Ching Ching

    2016-01-01

    Nasopharyngeal carcinoma (NPC) is a neoplasm of the epithelial lining of the nasopharynx. Despite various reports linking genomic variants to NPC predisposition, very few reports were done on copy number variations (CNV). CNV is an inherent structural variation that has been found to be involved in cancer predisposition. A discovery cohort of Malaysian Chinese descent (NPC patients, n = 140; Healthy controls, n = 256) were genotyped using Illumina® HumanOmniExpress BeadChip. PennCNV and cnvPartition calling algorithms were applied for CNV calling. Taqman CNV assays and digital PCR were used to validate CNV calls and replicate candidate copy number variant region (CNVR) associations in a follow-up Malaysian Chinese (NPC cases, n = 465; and Healthy controls, n = 677) and Malay cohort (NPC cases, n = 114; Healthy controls, n = 124). Six putative CNVRs overlapping GRM5, MICA/HCP5/HCG26, LILRB3/LILRA6, DPY19L2, RNase3/RNase2 and GOLPH3 genes were jointly identified by PennCNV and cnvPartition. CNVs overlapping GRM5 and MICA/HCP5/HCG26 were subjected to further validation by Taqman CNV assays and digital PCR. Combined analysis in Malaysian Chinese cohort revealed a strong association at CNVR on chromosome 11q14.3 (Pcombined = 1.54x10-5; odds ratio (OR) = 7.27; 95% CI = 2.96-17.88) overlapping GRM5 and a suggestive association at CNVR on chromosome 6p21.3 (Pcombined = 1.29x10-3; OR = 4.21; 95% CI = 1.75-10.11) overlapping MICA/HCP5/HCG26 genes. Our results demonstrated the association of CNVs towards NPC susceptibility, implicating a possible role of CNVs in NPC development.

  13. A Genome Wide Study of Copy Number Variation Associated with Nasopharyngeal Carcinoma in Malaysian Chinese Identifies CNVs at 11q14.3 and 6p21.3 as Candidate Loci

    PubMed Central

    Low, Joyce Siew Yong; Chin, Yoon Ming; Mushiroda, Taisei; Kubo, Michiaki; Govindasamy, Gopala Krishnan; Pua, Kin Choo; Yap, Yoke Yeow; Yap, Lee Fah; Subramaniam, Selva Kumar; Ong, Cheng Ai; Tan, Tee Yong; Khoo, Alan Soo Beng; Ng, Ching Ching

    2016-01-01

    Background Nasopharyngeal carcinoma (NPC) is a neoplasm of the epithelial lining of the nasopharynx. Despite various reports linking genomic variants to NPC predisposition, very few reports were done on copy number variations (CNV). CNV is an inherent structural variation that has been found to be involved in cancer predisposition. Methods A discovery cohort of Malaysian Chinese descent (NPC patients, n = 140; Healthy controls, n = 256) were genotyped using Illumina® HumanOmniExpress BeadChip. PennCNV and cnvPartition calling algorithms were applied for CNV calling. Taqman CNV assays and digital PCR were used to validate CNV calls and replicate candidate copy number variant region (CNVR) associations in a follow-up Malaysian Chinese (NPC cases, n = 465; and Healthy controls, n = 677) and Malay cohort (NPC cases, n = 114; Healthy controls, n = 124). Results Six putative CNVRs overlapping GRM5, MICA/HCP5/HCG26, LILRB3/LILRA6, DPY19L2, RNase3/RNase2 and GOLPH3 genes were jointly identified by PennCNV and cnvPartition. CNVs overlapping GRM5 and MICA/HCP5/HCG26 were subjected to further validation by Taqman CNV assays and digital PCR. Combined analysis in Malaysian Chinese cohort revealed a strong association at CNVR on chromosome 11q14.3 (Pcombined = 1.54x10-5; odds ratio (OR) = 7.27; 95% CI = 2.96–17.88) overlapping GRM5 and a suggestive association at CNVR on chromosome 6p21.3 (Pcombined = 1.29x10-3; OR = 4.21; 95% CI = 1.75–10.11) overlapping MICA/HCP5/HCG26 genes. Conclusion Our results demonstrated the association of CNVs towards NPC susceptibility, implicating a possible role of CNVs in NPC development. PMID:26730743

  14. High-resolution analysis of copy number variants in adults with simple-to-moderate congenital heart disease.

    PubMed

    Zhao, Wei; Niu, Guannan; Shen, Botao; Zheng, Yang; Gong, Fangchao; Wang, Xianfu; Lee, Jiyun; Mulvihill, John J; Chen, Xiaohui; Li, Shibo

    2013-12-01

    As patients with congenital heart disease (CHD) increasingly survive to childbearing age, it becomes important to understand the genetic origins of CHD. In children, CHD is frequently caused by chromosomal imbalances. We searched for submicroscopic imbalances in adults with CHD focusing on simple-to-moderate phenotypes, without associated dysmorphic features, a group not previously examined. A total of 100 Han Chinese adults with a diverse range of isolated CHD and 65 ethnically matched controls were screened using whole-genome array comparative genomic hybridization. Forty-five large (>100 kb) rare copy number variants (CNVs) were identified in 36/100 patients. These variants were not listed in the Database of Genomic Variants nor found in controls. In three of these genomic imbalances (22q11.2, 18q23, 3q21.3), genes that play an important role in cardiac development were implicated, including CRKL, NFATC1, PLXNA1, the latter has not been associated with human CHD before. This study detected a 0.7 Mb 22q11.2 deletion, which marginally overlapped the common 3 Mb 22q11.2 deletion, in one patient with a perimembranous ventricular septal defect without any extracardiac manifestation. Furthermore, we detected a novel inherited aberration dup (16q23.1). Although a causal relationship with CHD remains to be established, this CNVs profile provides a spectrum of genomic imbalances in this condition, and improves the CNV-phenotype correlations. © 2013 Wiley Periodicals, Inc.

  15. Burden of potentially pathologic copy number variants is higher in children with isolated congenital heart disease and significantly impairs covariate-adjusted transplant-free survival.

    PubMed

    Kim, Daniel Seung; Kim, Jerry H; Burt, Amber A; Crosslin, David R; Burnham, Nancy; Kim, Cecilia E; McDonald-McGinn, Donna M; Zackai, Elaine H; Nicolson, Susan C; Spray, Thomas L; Stanaway, Ian B; Nickerson, Deborah A; Heagerty, Patrick J; Hakonarson, Hakon; Gaynor, J William; Jarvik, Gail P

    2016-04-01

    Copy number variants (CNVs) are duplications or deletions of genomic regions. Large CNVs are potentially pathogenic and are overrepresented in children with congenital heart disease (CHD). We sought to determine the frequency of large CNVs in children with isolated CHD, and to evaluate the relationship of these potentially pathogenic CNVs with transplant-free survival. These cases are derived from a prospective cohort of patients with nonsyndromic CHD (n = 422) identified before first surgery. Healthy pediatric controls (n = 500) were obtained from the electronic Medical Records and Genetic Epidemiology Network, and CNV frequency was contrasted for CHD cases and controls. CNVs were determined algorithmically; subsequently screened for >95% overlap between 2 methods, size (>300 kb), quality score, overlap with a gene, and novelty (absent from databases of known, benign CNVs); and separately validated by quantitative polymerase chain reaction. Survival likelihoods for cases were calculated using Cox proportional hazards modeling to evaluate the joint effect of CNV burden and known confounders on transplant-free survival. Children with nonsyndromic CHD had a higher burden of potentially pathogenic CNVs compared with pediatric controls (12.1% vs 5.0%; P = .00016). Presence of a CNV was associated with significantly decreased transplant-free survival after surgery (hazard ratio, 3.42; 95% confidence interval, 1.66-7.09; P = .00090) with confounder adjustment. We confirm that children with isolated CHD have a greater burden of rare/large CNVs. We report a novel finding that these CNVs are associated with an adjusted 2.55-fold increased risk of death or transplant. These data suggest that CNV burden is an important modifier of survival after surgery for CHD. Copyright © 2016 The American Association for Thoracic Surgery. Published by Elsevier Inc. All rights reserved.

  16. CNV-association meta-analysis in 191,161 European adults reveals new loci associated with anthropometric traits.

    PubMed

    Macé, Aurélien; Tuke, Marcus A; Deelen, Patrick; Kristiansson, Kati; Mattsson, Hannele; Nõukas, Margit; Sapkota, Yadav; Schick, Ursula; Porcu, Eleonora; Rüeger, Sina; McDaid, Aaron F; Porteous, David; Winkler, Thomas W; Salvi, Erika; Shrine, Nick; Liu, Xueping; Ang, Wei Q; Zhang, Weihua; Feitosa, Mary F; Venturini, Cristina; van der Most, Peter J; Rosengren, Anders; Wood, Andrew R; Beaumont, Robin N; Jones, Samuel E; Ruth, Katherine S; Yaghootkar, Hanieh; Tyrrell, Jessica; Havulinna, Aki S; Boers, Harmen; Mägi, Reedik; Kriebel, Jennifer; Müller-Nurasyid, Martina; Perola, Markus; Nieminen, Markku; Lokki, Marja-Liisa; Kähönen, Mika; Viikari, Jorma S; Geller, Frank; Lahti, Jari; Palotie, Aarno; Koponen, Päivikki; Lundqvist, Annamari; Rissanen, Harri; Bottinger, Erwin P; Afaq, Saima; Wojczynski, Mary K; Lenzini, Petra; Nolte, Ilja M; Sparsø, Thomas; Schupf, Nicole; Christensen, Kaare; Perls, Thomas T; Newman, Anne B; Werge, Thomas; Snieder, Harold; Spector, Timothy D; Chambers, John C; Koskinen, Seppo; Melbye, Mads; Raitakari, Olli T; Lehtimäki, Terho; Tobin, Martin D; Wain, Louise V; Sinisalo, Juha; Peters, Annette; Meitinger, Thomas; Martin, Nicholas G; Wray, Naomi R; Montgomery, Grant W; Medland, Sarah E; Swertz, Morris A; Vartiainen, Erkki; Borodulin, Katja; Männistö, Satu; Murray, Anna; Bochud, Murielle; Jacquemont, Sébastien; Rivadeneira, Fernando; Hansen, Thomas F; Oldehinkel, Albertine J; Mangino, Massimo; Province, Michael A; Deloukas, Panos; Kooner, Jaspal S; Freathy, Rachel M; Pennell, Craig; Feenstra, Bjarke; Strachan, David P; Lettre, Guillaume; Hirschhorn, Joel; Cusi, Daniele; Heid, Iris M; Hayward, Caroline; Männik, Katrin; Beckmann, Jacques S; Loos, Ruth J F; Nyholt, Dale R; Metspalu, Andres; Eriksson, Johan G; Weedon, Michael N; Salomaa, Veikko; Franke, Lude; Reymond, Alexandre; Frayling, Timothy M; Kutalik, Zoltán

    2017-09-29

    There are few examples of robust associations between rare copy number variants (CNVs) and complex continuous human traits. Here we present a large-scale CNV association meta-analysis on anthropometric traits in up to 191,161 adult samples from 26 cohorts. The study reveals five CNV associations at 1q21.1, 3q29, 7q11.23, 11p14.2, and 18q21.32 and confirms two known loci at 16p11.2 and 22q11.21, implicating at least one anthropometric trait. The discovered CNVs are recurrent and rare (0.01-0.2%), with large effects on height (>2.4 cm), weight (>5 kg), and body mass index (BMI) (>3.5 kg/m 2 ). Burden analysis shows a 0.41 cm decrease in height, a 0.003 increase in waist-to-hip ratio and increase in BMI by 0.14 kg/m 2 for each Mb of total deletion burden (P = 2.5 × 10 -10 , 6.0 × 10 -5 , and 2.9 × 10 -3 ). Our study provides evidence that the same genes (e.g., MC4R, FIBIN, and FMO5) harbor both common and rare variants affecting body size and that anthropometric traits share genetic loci with developmental and psychiatric disorders.Individual SNPs have small effects on anthropometric traits, yet the impact of CNVs has remained largely unknown. Here, Kutalik and co-workers perform a large-scale genome-wide meta-analysis of structural variation and find rare CNVs associated with height, weight and BMI with large effect sizes.

  17. Comparative Analysis of CNV Calling Algorithms: Literature Survey and a Case Study Using Bovine High-Density SNP Data.

    PubMed

    Xu, Lingyang; Hou, Yali; Bickhart, Derek M; Song, Jiuzhou; Liu, George E

    2013-06-25

    Copy number variations (CNVs) are gains and losses of genomic sequence between two individuals of a species when compared to a reference genome. The data from single nucleotide polymorphism (SNP) microarrays are now routinely used for genotyping, but they also can be utilized for copy number detection. Substantial progress has been made in array design and CNV calling algorithms and at least 10 comparison studies in humans have been published to assess them. In this review, we first survey the literature on existing microarray platforms and CNV calling algorithms. We then examine a number of CNV calling tools to evaluate their impacts using bovine high-density SNP data. Large incongruities in the results from different CNV calling tools highlight the need for standardizing array data collection, quality assessment and experimental validation. Only after careful experimental design and rigorous data filtering can the impacts of CNVs on both normal phenotypic variability and disease susceptibility be fully revealed.

  18. Kinesthetic but not visual imagery assists in normalizing the CNV in Parkinson's disease.

    PubMed

    Lim, Vanessa K; Polych, Melody A; Holländer, Antje; Byblow, Winston D; Kirk, Ian J; Hamm, Jeff P

    2006-10-01

    This study investigated whether kinesthetic and/or visual imagery could alter the contingent negative variation (CNV) for patients with Parkinson's disease (PD). The CNV was recorded in six patients with PD and seven controls before and after a 10min block of imagery. There were two types of imagery employed: kinesthetic and visual, which were evaluated on separate days. The global field power (GFP) of the late CNV did not change after the visual imagery for either group, nor was there a significant difference between the groups. In contrast, kinesthetic imagery resulted in significant group differences pre-, versus post-imagery GFPs, which was not present prior to performing the kinesthetic imagery task. In patients with PD, the CNV amplitudes post-, relative to pre-kinesthetic imagery, increased over the dorsolateral prefrontal regions and decreased in the ipsilateral parietal regions. There were no such changes in controls. A 10-min session of kinesthetic imagery enhanced the GFP amplitude of the late CNV for patients but not for controls. While the study needs to be replicated with a greater number of participants, the results suggest that kinesthetic imagery may be a promising tool for investigations into motor changes, and may potentially be employed therapeutically, in patients with Parkinson's disease.

  19. Digital Droplet PCR: CNV Analysis and Other Applications.

    PubMed

    Mazaika, Erica; Homsy, Jason

    2014-07-14

    Digital droplet PCR (ddPCR) is an assay that combines state-of-the-art microfluidics technology with TaqMan-based PCR to achieve precise target DNA quantification at high levels of sensitivity and specificity. Because quantification is achieved without the need for standard assays in an easy to interpret, unambiguous digital readout, ddPCR is far simpler, faster, and less error prone than real-time qPCR. The basic protocol can be modified with minor adjustments to suit a wide range of applications, such as CNV analysis, rare variant detection, SNP genotyping, and transcript quantification. This unit describes the ddPCR workflow in detail for the Bio-Rad QX100 system, but the theory and data interpretation are generalizable to any ddPCR system. Copyright © 2014 John Wiley & Sons, Inc.

  20. Copy number variation signature to predict human ancestry

    PubMed Central

    2012-01-01

    Background Copy number variations (CNVs) are genomic structural variants that are found in healthy populations and have been observed to be associated with disease susceptibility. Existing methods for CNV detection are often performed on a sample-by-sample basis, which is not ideal for large datasets where common CNVs must be estimated by comparing the frequency of CNVs in the individual samples. Here we describe a simple and novel approach to locate genome-wide CNVs common to a specific population, using human ancestry as the phenotype. Results We utilized our previously published Genome Alteration Detection Analysis (GADA) algorithm to identify common ancestry CNVs (caCNVs) and built a caCNV model to predict population structure. We identified a 73 caCNV signature using a training set of 225 healthy individuals from European, Asian, and African ancestry. The signature was validated on an independent test set of 300 individuals with similar ancestral background. The error rate in predicting ancestry in this test set was 2% using the 73 caCNV signature. Among the caCNVs identified, several were previously confirmed experimentally to vary by ancestry. Our signature also contains a caCNV region with a single microRNA (MIR270), which represents the first reported variation of microRNA by ancestry. Conclusions We developed a new methodology to identify common CNVs and demonstrated its performance by building a caCNV signature to predict human ancestry with high accuracy. The utility of our approach could be extended to large case–control studies to identify CNV signatures for other phenotypes such as disease susceptibility and drug response. PMID:23270563

  1. A genome-wide assessment of rare copy number variants in colorectal cancer.

    PubMed

    Li, Zhenli; Yu, Dan; Gan, Meifu; Shan, Qiaonan; Yin, Xiaoyang; Tang, Shunli; Zhang, Shuai; Shi, Yongyong; Zhu, Yimin; Lai, Maode; Zhang, Dandan

    2015-09-22

    Colorectal cancer (CRC) is a complex disease with an estimated heritability of approximately 35%. However, known CRC-related common single nucleotide polymorphisms (SNPs) can only explain ~0.65% of the heritability. This "missing heritability" may be explained partially by rare copy number variants (CNVs). In this study, we performed a genome-wide scan using Illumina Human-Omni Express BeadChip, 694 sporadic CRC cases and 1641 controls were eventually included in our analysis after quality control. The global burden analysis revealed a 1.53-fold excess of rare CNVs in CRC cases compared with controls (P < 1 × 10(-6)), and the difference being more pronounced for genic rare CNVs and CNVs overlapped with coding regions (1.65-fold and 1.84-fold, respectively, both P < 1 × 10(-6)). Interestingly, both the cases in the lowest and middle tertile of age carried a higher burden of rare CNVs comparing to the highest tertile. Furthermore, 639 CNV-disrupted genes exclusive to CRC cases were found to be significantly enriched in gene ontology (GO) terms concerning nucleosome assembly and olfactory receptor activity. Our study was the first to evaluate the burden of rare CNVs in sporadic CRC and suggested that rare CNVs contributed to the missing heritability of CRC.

  2. Rare Copy Number Variants Are a Common Cause of Short Stature

    PubMed Central

    Zahnleiter, Diana; Uebe, Steffen; Ekici, Arif B.; Hoyer, Juliane; Wiesener, Antje; Wieczorek, Dagmar; Kunstmann, Erdmute; Reis, André; Doerr, Helmuth-Guenther; Rauch, Anita; Thiel, Christian T.

    2013-01-01

    Human growth has an estimated heritability of about 80%–90%. Nevertheless, the underlying cause of shortness of stature remains unknown in the majority of individuals. Genome-wide association studies (GWAS) showed that both common single nucleotide polymorphisms and copy number variants (CNVs) contribute to height variation under a polygenic model, although explaining only a small fraction of overall genetic variability in the general population. Under the hypothesis that severe forms of growth retardation might also be caused by major gene effects, we searched for rare CNVs in 200 families, 92 sporadic and 108 familial, with idiopathic short stature compared to 820 control individuals. Although similar in number, patients had overall significantly larger CNVs (p-value<1×10−7). In a gene-based analysis of all non-polymorphic CNVs>50 kb for gene function, tissue expression, and murine knock-out phenotypes, we identified 10 duplications and 10 deletions ranging in size from 109 kb to 14 Mb, of which 7 were de novo (p<0.03) and 13 inherited from the likewise affected parent but absent in controls. Patients with these likely disease causing 20 CNVs were smaller than the remaining group (p<0.01). Eleven (55%) of these CNVs either overlapped with known microaberration syndromes associated with short stature or contained GWAS loci for height. Haploinsufficiency (HI) score and further expression profiling suggested dosage sensitivity of major growth-related genes at these loci. Overall 10% of patients carried a disease-causing CNV indicating that, like in neurodevelopmental disorders, rare CNVs are a frequent cause of severe growth retardation. PMID:23516380

  3. Increasing the Yield in Targeted Next-Generation Sequencing by Implicating CNV Analysis, Non-Coding Exons and the Overall Variant Load: The Example of Retinal Dystrophies

    PubMed Central

    Eisenberger, Tobias; Neuhaus, Christine; Khan, Arif O.; Decker, Christian; Preising, Markus N.; Friedburg, Christoph; Bieg, Anika; Gliem, Martin; Issa, Peter Charbel; Holz, Frank G.; Baig, Shahid M.; Hellenbroich, Yorck; Galvez, Alberto; Platzer, Konrad; Wollnik, Bernd; Laddach, Nadja; Ghaffari, Saeed Reza; Rafati, Maryam; Botzenhart, Elke; Tinschert, Sigrid; Börger, Doris; Bohring, Axel; Schreml, Julia; Körtge-Jung, Stefani; Schell-Apacik, Chayim; Bakur, Khadijah; Al-Aama, Jumana Y.; Neuhann, Teresa; Herkenrath, Peter; Nürnberg, Gudrun; Nürnberg, Peter; Davis, John S.; Gal, Andreas; Bergmann, Carsten; Lorenz, Birgit; Bolz, Hanno J.

    2013-01-01

    Retinitis pigmentosa (RP) and Leber congenital amaurosis (LCA) are major causes of blindness. They result from mutations in many genes which has long hampered comprehensive genetic analysis. Recently, targeted next-generation sequencing (NGS) has proven useful to overcome this limitation. To uncover “hidden mutations” such as copy number variations (CNVs) and mutations in non-coding regions, we extended the use of NGS data by quantitative readout for the exons of 55 RP and LCA genes in 126 patients, and by including non-coding 5′ exons. We detected several causative CNVs which were key to the diagnosis in hitherto unsolved constellations, e.g. hemizygous point mutations in consanguineous families, and CNVs complemented apparently monoallelic recessive alleles. Mutations of non-coding exon 1 of EYS revealed its contribution to disease. In view of the high carrier frequency for retinal disease gene mutations in the general population, we considered the overall variant load in each patient to assess if a mutation was causative or reflected accidental carriership in patients with mutations in several genes or with single recessive alleles. For example, truncating mutations in RP1, a gene implicated in both recessive and dominant RP, were causative in biallelic constellations, unrelated to disease when heterozygous on a biallelic mutation background of another gene, or even non-pathogenic if close to the C-terminus. Patients with mutations in several loci were common, but without evidence for di- or oligogenic inheritance. Although the number of targeted genes was low compared to previous studies, the mutation detection rate was highest (70%) which likely results from completeness and depth of coverage, and quantitative data analysis. CNV analysis should routinely be applied in targeted NGS, and mutations in non-coding exons give reason to systematically include 5′-UTRs in disease gene or exome panels. Consideration of all variants is indispensable because even

  4. Assessing genome-wide copy number variation in the Han Chinese population.

    PubMed

    Lu, Jianqi; Lou, Haiyi; Fu, Ruiqing; Lu, Dongsheng; Zhang, Feng; Wu, Zhendong; Zhang, Xi; Li, Changhua; Fang, Baijun; Pu, Fangfang; Wei, Jingning; Wei, Qian; Zhang, Chao; Wang, Xiaoji; Lu, Yan; Yan, Shi; Yang, Yajun; Jin, Li; Xu, Shuhua

    2017-10-01

    Copy number variation (CNV) is a valuable source of genetic diversity in the human genome and a well-recognised cause of various genetic diseases. However, CNVs have been considerably under-represented in population-based studies, particularly the Han Chinese which is the largest ethnic group in the world. To build a representative CNV map for the Han Chinese population. We conducted a genome-wide CNV study involving 451 male Han Chinese samples from 11 geographical regions encompassing 28 dialect groups, representing a less-biased panel compared with the currently available data. We detected CNVs by using 4.2M NimbleGen comparative genomic hybridisation array and whole-genome deep sequencing of 51 samples to optimise the filtering conditions in CNV discovery. A comprehensive Han Chinese CNV map was built based on a set of high-quality variants (positive predictive value >0.8, with sizes ranging from 369 bp to 4.16 Mb and a median of 5907 bp). The map consists of 4012 CNV regions (CNVRs), and more than half are novel to the 30 East Asian CNV Project and the 1000 Genomes Project Phase 3. We further identified 81 CNVRs specific to regional groups, which was indicative of the subpopulation structure within the Han Chinese population. Our data are complementary to public data sources, and the CNV map may facilitate in the identification of pathogenic CNVs and further biomedical research studies involving the Han Chinese population. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2017. All rights reserved. No commercial use is permitted unless otherwise expressly granted.

  5. Effects of EPSPS Copy Number Variation (CNV) and Glyphosate Application on the Aromatic and Branched Chain Amino Acid Synthesis Pathways in Amaranthus palmeri

    PubMed Central

    Fernández-Escalada, Manuel; Zulet-González, Ainhoa; Gil-Monreal, Miriam; Zabalza, Ana; Ravet, Karl; Gaines, Todd; Royuela, Mercedes

    2017-01-01

    A key enzyme of the shikimate pathway, 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS; EC 2.5.1.19), is the known target of the widely used herbicide glyphosate. Glyphosate resistance in Amaranthus palmeri, one of the most troublesome weeds in agriculture, has evolved through increased EPSPS gene copy number. The aim of this work was to study the pleiotropic effects of (i) EPSPS increased transcript abundance due to gene copy number variation (CNV) and of (ii) glyphosate application on the aromatic amino acid (AAA) and branched chain amino acid (BCAA) synthesis pathways. Hydroponically grown glyphosate sensitive (GS) and glyphosate resistant (GR) plants were treated with glyphosate 3 days after treatment. In absence of glyphosate treatment, high EPSPS gene copy number had only a subtle effect on transcriptional regulation of AAA and BCAA pathway genes. In contrast, glyphosate treatment provoked a general accumulation of the transcripts corresponding to genes of the AAA pathway leading to synthesis of chorismate in both GS and GR. After chorismate, anthranilate synthase transcript abundance was higher while chorismate mutase transcription showed a small decrease in GR and remained stable in GS, suggesting a regulatory branch point in the pathway that favors synthesis toward tryptophan over phenylalanine and tyrosine after glyphosate treatment. This was confirmed by studying enzyme activities in vitro and amino acid analysis. Importantly, this upregulation was glyphosate dose dependent and was observed similarly in both GS and GR populations. Glyphosate treatment also had a slight effect on the expression of BCAA genes but no general effect on the pathway could be observed. Taken together, our observations suggest that the high CNV of EPSPS in A. palmeri GR populations has no major pleiotropic effect on the expression of AAA biosynthetic genes, even in response to glyphosate treatment. This finding supports the idea that the fitness cost associated with EPSPS CNV

  6. Detection of copy number variations in epilepsy using exome data.

    PubMed

    Tsuchida, N; Nakashima, M; Kato, M; Heyman, E; Inui, T; Haginoya, K; Watanabe, S; Chiyonobu, T; Morimoto, M; Ohta, M; Kumakura, A; Kubota, M; Kumagai, Y; Hamano, S-I; Lourenco, C M; Yahaya, N A; Ch'ng, G-S; Ngu, L-H; Fattal-Valevski, A; Weisz Hubshman, M; Orenstein, N; Marom, D; Cohen, L; Goldberg-Stern, H; Uchiyama, Y; Imagawa, E; Mizuguchi, T; Takata, A; Miyake, N; Nakajima, H; Saitsu, H; Miyatake, S; Matsumoto, N

    2018-03-01

    Epilepsies are common neurological disorders and genetic factors contribute to their pathogenesis. Copy number variations (CNVs) are increasingly recognized as an important etiology of many human diseases including epilepsy. Whole-exome sequencing (WES) is becoming a standard tool for detecting pathogenic mutations and has recently been applied to detecting CNVs. Here, we analyzed 294 families with epilepsy using WES, and focused on 168 families with no causative single nucleotide variants in known epilepsy-associated genes to further validate CNVs using 2 different CNV detection tools using WES data. We confirmed 18 pathogenic CNVs, and 2 deletions and 2 duplications at chr15q11.2 of clinically unknown significance. Of note, we were able to identify small CNVs less than 10 kb in size, which might be difficult to detect by conventional microarray. We revealed 2 cases with pathogenic CNVs that one of the 2 CNV detection tools failed to find, suggesting that using different CNV tools is recommended to increase diagnostic yield. Considering a relatively high discovery rate of CNVs (18 out of 168 families, 10.7%) and successful detection of CNV with <10 kb in size, CNV detection by WES may be able to surrogate, or at least complement, conventional microarray analysis. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  7. Obesity, starch digestion and amylase: association between copy number variants at human salivary (AMY1) and pancreatic (AMY2) amylase genes

    PubMed Central

    Carpenter, Danielle; Dhar, Sugandha; Mitchell, Laura M.; Fu, Beiyuan; Tyson, Jess; Shwan, Nzar A.A.; Yang, Fengtang; Thomas, Mark G.; Armour, John A.L.

    2015-01-01

    The human salivary amylase genes display extensive copy number variation (CNV), and recent work has implicated this variation in adaptation to starch-rich diets, and in association with body mass index. In this work, we use paralogue ratio tests, microsatellite analysis, read depth and fibre-FISH to demonstrate that human amylase CNV is not a smooth continuum, but is instead partitioned into distinct haplotype classes. There is a fundamental structural distinction between haplotypes containing odd or even numbers of AMY1 gene units, in turn coupled to CNV in pancreatic amylase genes AMY2A and AMY2B. Most haplotypes have one copy each of AMY2A and AMY2B and contain an odd number of copies of AMY1; consequently, most individuals have an even total number of AMY1. In contrast, haplotypes carrying an even number of AMY1 genes have rearrangements leading to CNVs of AMY2A/AMY2B. Read-depth and experimental data show that different populations harbour different proportions of these basic haplotype classes. In Europeans, the copy numbers of AMY1 and AMY2A are correlated, so that phenotypic associations caused by variation in pancreatic amylase copy number could be detected indirectly as weak association with AMY1 copy number. We show that the quantitative polymerase chain reaction (qPCR) assay previously applied to the high-throughput measurement of AMY1 copy number is less accurate than the measures we use and that qPCR data in other studies have been further compromised by systematic miscalibration. Our results uncover new patterns in human amylase variation and imply a potential role for AMY2 CNV in functional associations. PMID:25788522

  8. Obesity, starch digestion and amylase: association between copy number variants at human salivary (AMY1) and pancreatic (AMY2) amylase genes.

    PubMed

    Carpenter, Danielle; Dhar, Sugandha; Mitchell, Laura M; Fu, Beiyuan; Tyson, Jess; Shwan, Nzar A A; Yang, Fengtang; Thomas, Mark G; Armour, John A L

    2015-06-15

    The human salivary amylase genes display extensive copy number variation (CNV), and recent work has implicated this variation in adaptation to starch-rich diets, and in association with body mass index. In this work, we use paralogue ratio tests, microsatellite analysis, read depth and fibre-FISH to demonstrate that human amylase CNV is not a smooth continuum, but is instead partitioned into distinct haplotype classes. There is a fundamental structural distinction between haplotypes containing odd or even numbers of AMY1 gene units, in turn coupled to CNV in pancreatic amylase genes AMY2A and AMY2B. Most haplotypes have one copy each of AMY2A and AMY2B and contain an odd number of copies of AMY1; consequently, most individuals have an even total number of AMY1. In contrast, haplotypes carrying an even number of AMY1 genes have rearrangements leading to CNVs of AMY2A/AMY2B. Read-depth and experimental data show that different populations harbour different proportions of these basic haplotype classes. In Europeans, the copy numbers of AMY1 and AMY2A are correlated, so that phenotypic associations caused by variation in pancreatic amylase copy number could be detected indirectly as weak association with AMY1 copy number. We show that the quantitative polymerase chain reaction (qPCR) assay previously applied to the high-throughput measurement of AMY1 copy number is less accurate than the measures we use and that qPCR data in other studies have been further compromised by systematic miscalibration. Our results uncover new patterns in human amylase variation and imply a potential role for AMY2 CNV in functional associations. © The Author 2015. Published by Oxford University Press.

  9. Double Hits in Schizophrenia.

    PubMed

    Vorstman, Jacob A S; Olde Loohuis, Loes M; Kahn, René S; Ophoff, Roel A

    2018-05-14

    The co-occurrence of a Copy Number Variant (CNV) and a functional variant on the other allele may be a relevant genetic mechanism in schizophrenia. We hypothesized that the cumulative burden of such double hits - in particular those composed of a deletion and a coding single nucleotide variation (SNV) - is increased in patients with schizophrenia.We combined CNV data with coding variants data in 795 patients with schizophrenia and 474 controls. To limit false CNV-detection, only CNVs called only by two algorithms we included. CNV-affected genes were subsequently examined for coding SNVs, which we termed "CNV-SNVs". Correcting for total queried sequence, we assessed the CNV-SNV-burden and the combined predicted deleterious effect. We estimated p-values by permutation of the phenotype.We detected 105 CNV-SNVs; 67 in duplicated and 38 in deleted genic sequence. While the difference in CNV-SNVs rates was not significant, the combined deleteriousness inferred by CNV-SNVs in deleted sequence was almost fourfold higher in cases compared to controls (nominal p = 0.009). This effect may be driven by a higher number of CNV-SNVs and/or by a higher degree of predicted deleteriousness of CNV-SNVs. No such effect was observed for duplications.We provide early evidence that deletions co-occurring with a functional variant may be relevant, albeit of modest impact, for the genetic etiology of schizophrenia. Large-scale consortium studies are required to validate our findings. Sequence-based analyses would provide the best resolution for detection of CNVs as well as coding variants genome-wide.

  10. Identification of Small Exonic CNV from Whole-Exome Sequence Data and Application to Autism Spectrum Disorder

    PubMed Central

    Poultney, Christopher S.; Goldberg, Arthur P.; Drapeau, Elodie; Kou, Yan; Harony-Nicolas, Hala; Kajiwara, Yuji; De Rubeis, Silvia; Durand, Simon; Stevens, Christine; Rehnström, Karola; Palotie, Aarno; Daly, Mark J.; Ma’ayan, Avi; Fromer, Menachem; Buxbaum, Joseph D.

    2013-01-01

    Copy number variation (CNV) is an important determinant of human diversity and plays important roles in susceptibility to disease. Most studies of CNV carried out to date have made use of chromosome microarray and have had a lower size limit for detection of about 30 kilobases (kb). With the emergence of whole-exome sequencing studies, we asked whether such data could be used to reliably call rare exonic CNV in the size range of 1–30 kilobases (kb), making use of the eXome Hidden Markov Model (XHMM) program. By using both transmission information and validation by molecular methods, we confirmed that small CNV encompassing as few as three exons can be reliably called from whole-exome data. We applied this approach to an autism case-control sample (n = 811, mean per-target read depth = 161) and observed a significant increase in the burden of rare (MAF ≤1%) 1–30 kb CNV, 1–30 kb deletions, and 1–10 kb deletions in ASD. CNV in the 1–30 kb range frequently hit just a single gene, and we were therefore able to carry out enrichment and pathway analyses, where we observed enrichment for disruption of genes in cytoskeletal and autophagy pathways in ASD. In summary, our results showed that XHMM provided an effective means to assess small exonic CNV from whole-exome data, indicated that rare 1–30 kb exonic deletions could contribute to risk in up to 7% of individuals with ASD, and implicated a candidate pathway in developmental delay syndromes. PMID:24094742

  11. A statistical approach to detection of copy number variations in PCR-enriched targeted sequencing data.

    PubMed

    Demidov, German; Simakova, Tamara; Vnuchkova, Julia; Bragin, Anton

    2016-10-22

    Multiplex polymerase chain reaction (PCR) is a common enrichment technique for targeted massive parallel sequencing (MPS) protocols. MPS is widely used in biomedical research and clinical diagnostics as the fast and accurate tool for the detection of short genetic variations. However, identification of larger variations such as structure variants and copy number variations (CNV) is still being a challenge for targeted MPS. Some approaches and tools for structural variants detection were proposed, but they have limitations and often require datasets of certain type, size and expected number of amplicons affected by CNVs. In the paper, we describe novel algorithm for high-resolution germinal CNV detection in the PCR-enriched targeted sequencing data and present accompanying tool. We have developed a machine learning algorithm for the detection of large duplications and deletions in the targeted sequencing data generated with PCR-based enrichment step. We have performed verification studies and established the algorithm's sensitivity and specificity. We have compared developed tool with other available methods applicable for the described data and revealed its higher performance. We showed that our method has high specificity and sensitivity for high-resolution copy number detection in targeted sequencing data using large cohort of samples.

  12. A high-resolution cattle CNV map by population-scale genome sequencing

    USDA-ARS?s Scientific Manuscript database

    Copy Number Variations (CNVs) are common genomic structural variations that have been linked to human diseases and phenotypic traits. Prior studies in cattle have produced low-resolution CNV maps. We constructed a draft, high-resolution map of cattle CNVs based on whole genome sequencing data from 7...

  13. Sex bias in copy number variation of olfactory receptor gene family depends on ethnicity.

    PubMed

    Shadravan, Farideh

    2013-01-01

    Gender plays a pivotal role in the human genetic identity and is also manifested in many genetic disorders particularly mental retardation. In this study its effect on copy number variation (CNV), known to cause genetic disorders was explored. As the olfactory receptor (OR) repertoire comprises the largest human gene family, it was selected for this study, which was carried out within and between three populations, derived from 150 individuals from the 1000 Genome Project. Analysis of 3872 CNVs detected among 791 OR loci, in which 307 loci showed CNV, revealed the following novel findings: Sex bias in CNV was significantly more prevalent in uncommon than common CNV variants of OR pseudogenes, in which the male genome showed more CNVs; and in one-copy number loss compared to complete deletion of OR pseudogenes; both findings implying a more recent evolutionary role for gender. Sex bias in copy number gain was also detected. Another novel finding was that the observed sex bias was largely dependent on ethnicity and was in general absent in East Asians. Using a CNV public database for sick children (International Standard Cytogenomic Array Consortium) the application of these findings for improving clinical molecular diagnostics is discussed by showing an example of sex bias in CNV among kids with autism. Additional clinical relevance is discussed, as the most polymorphic CNV-enriched OR cluster in the human genome, located on chr 15q11.2, is found near the Prader-Willi syndrome/Angelman syndrome bi-directionally imprinted region associated with two well-known mental retardation syndromes. As olfaction represents the primitive cognition in most mammals, arguably in competition with the development of a larger brain, the extensive retention of OR pseudogenes in females of this study, might point to a parent-of-origin indirect regulatory role for OR pseudogenes in the embryonic development of human brain. Thus any perturbation in the temporal regulation of olfactory

  14. Identification of small exonic CNV from whole-exome sequence data and application to autism spectrum disorder.

    PubMed

    Poultney, Christopher S; Goldberg, Arthur P; Drapeau, Elodie; Kou, Yan; Harony-Nicolas, Hala; Kajiwara, Yuji; De Rubeis, Silvia; Durand, Simon; Stevens, Christine; Rehnström, Karola; Palotie, Aarno; Daly, Mark J; Ma'ayan, Avi; Fromer, Menachem; Buxbaum, Joseph D

    2013-10-03

    Copy number variation (CNV) is an important determinant of human diversity and plays important roles in susceptibility to disease. Most studies of CNV carried out to date have made use of chromosome microarray and have had a lower size limit for detection of about 30 kilobases (kb). With the emergence of whole-exome sequencing studies, we asked whether such data could be used to reliably call rare exonic CNV in the size range of 1-30 kilobases (kb), making use of the eXome Hidden Markov Model (XHMM) program. By using both transmission information and validation by molecular methods, we confirmed that small CNV encompassing as few as three exons can be reliably called from whole-exome data. We applied this approach to an autism case-control sample (n = 811, mean per-target read depth = 161) and observed a significant increase in the burden of rare (MAF ≤1%) 1-30 kb CNV, 1-30 kb deletions, and 1-10 kb deletions in ASD. CNV in the 1-30 kb range frequently hit just a single gene, and we were therefore able to carry out enrichment and pathway analyses, where we observed enrichment for disruption of genes in cytoskeletal and autophagy pathways in ASD. In summary, our results showed that XHMM provided an effective means to assess small exonic CNV from whole-exome data, indicated that rare 1-30 kb exonic deletions could contribute to risk in up to 7% of individuals with ASD, and implicated a candidate pathway in developmental delay syndromes. Copyright © 2013 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.

  15. Copy number variants implicate cardiac function and development pathways in earthquake-induced stress cardiomyopathy.

    PubMed

    Lacey, Cameron J; Doudney, Kit; Bridgman, Paul G; George, Peter M; Mulder, Roger T; Zarifeh, Julie J; Kimber, Bridget; Cadzow, Murray J; Black, Michael A; Merriman, Tony R; Lehnert, Klaus; Bickley, Vivienne M; Pearson, John F; Cameron, Vicky A; Kennedy, Martin A

    2018-05-15

    The pathophysiology of stress cardiomyopathy (SCM), also known as takotsubo syndrome, is poorly understood. SCM usually occurs sporadically, often in association with a stressful event, but clusters of cases are reported after major natural disasters. There is some evidence that this is a familial condition. We have examined three possible models for an underlying genetic predisposition to SCM. Our primary study cohort consists of 28 women who suffered SCM as a result of two devastating earthquakes that struck the city of Christchurch, New Zealand, in 2010 and 2011. To seek possible underlying genetic factors we carried out exome analysis, genotyping array analysis, and array comparative genomic hybridization on these subjects. The most striking finding was the observation of a markedly elevated rate of rare, heterogeneous copy number variants (CNV) of uncertain clinical significance (in 12/28 subjects). Several of these CNVs impacted on genes of cardiac relevance including RBFOX1, GPC5, KCNRG, CHODL, and GPBP1L1. There is no physical overlap between the CNVs, and the genes they impact do not appear to be functionally related. The recognition that SCM predisposition may be associated with a high rate of rare CNVs offers a novel perspective on this enigmatic condition.

  16. Novel origins of copy number variation in the dog genome

    PubMed Central

    2012-01-01

    Background Copy number variants (CNVs) account for substantial variation between genomes and are a major source of normal and pathogenic phenotypic differences. The dog is an ideal model to investigate mutational mechanisms that generate CNVs as its genome lacks a functional ortholog of the PRDM9 gene implicated in recombination and CNV formation in humans. Here we comprehensively assay CNVs using high-density array comparative genomic hybridization in 50 dogs from 17 dog breeds and 3 gray wolves. Results We use a stringent new method to identify a total of 430 high-confidence CNV loci, which range in size from 9 kb to 1.6 Mb and span 26.4 Mb, or 1.08%, of the assayed dog genome, overlapping 413 annotated genes. Of CNVs observed in each breed, 98% are also observed in multiple breeds. CNVs predicted to disrupt gene function are significantly less common than expected by chance. We identify a significant overrepresentation of peaks of GC content, previously shown to be enriched in dog recombination hotspots, in the vicinity of CNV breakpoints. Conclusions A number of the CNVs identified by this study are candidates for generating breed-specific phenotypes. Purifying selection seems to be a major factor shaping structural variation in the dog genome, suggesting that many CNVs are deleterious. Localized peaks of GC content appear to be novel sites of CNV formation in the dog genome by non-allelic homologous recombination, potentially activated by the loss of PRDM9. These sequence features may have driven genome instability and chromosomal rearrangements throughout canid evolution. PMID:22916802

  17. Increased CNV-Region deletions in mild cognitive impairment (MCI) and Alzheimer's disease (AD) subjects in the ADNI sample

    PubMed Central

    Guffanti, Guia; Torri, Federica; Rasmussen, Jerod; Clark, Andrew P.; Lakatos, Anita; Turner, Jessica A.; Fallon, James H.; Saykin, Andrew J.; Weiner, Michael; Vawter, Marquis P.; Knowles, James A.; Potkin, Steven G.; Macciardi, Fabio

    2014-01-01

    We investigated the genome-wide distribution of CNVs in the Alzheimer's disease (AD) Neuroimaging Initiative (ADNI) sample (146 with AD, 313 with Mild Cognitive Impairment (MCI), and 181 controls). Comparison of single CNVs between cases (MCI and AD) and controls shows overrepresentation of large heterozygous deletions in cases (p-value < 0.0001). The analysis of CNV-Regions identifies 44 copy number variable loci of heterozygous deletions, with more CNV-Regions among affected than controls (p = 0.005). Seven of the 44 CNV-Regions are nominally significant for association with cognitive impairment. We validated and confirmed our main findings with genome re-sequencing of selected patients and controls. The functional pathway analysis of the genes putatively affected by deletions of CNV-Regions reveals enrichment of genes implicated in axonal guidance, cell–cell adhesion, neuronal morphogenesis and differentiation. Our findings support the role of CNVs in AD, and suggest an association between large deletions and the development of cognitive impairment PMID:23583670

  18. Screening for common copy-number variants in cancer genes.

    PubMed

    Tyson, Jess; Majerus, Tamsin M O; Walker, Susan; Armour, John A L

    2010-12-01

    For most cases of colorectal cancer that arise without a family history of the disease, it is proposed that an appreciable heritable component of predisposition is the result of contributions from many loci. Although progress has been made in identifying single nucleotide variants associated with colorectal cancer risk, the involvement of low-penetrance copy number variants is relatively unexplored. We have used multiplex amplifiable probe hybridization (MAPH) in a fourfold multiplex (QuadMAPH), positioned at an average resolution of one probe per 2 kb, to screen a total of 1.56 Mb of genomic DNA for copy number variants around the genes APC, AXIN1, BRCA1, BRCA2, CTNNB1, HRAS, MLH1, MSH2, and TP53. Two deletion events were detected, one upstream of MLH1 in a control individual and the other in APC in a colorectal cancer patient, but these do not seem to correspond to copy number polymorphisms with measurably high population frequencies. In summary, by means of our QuadMAPH assay, copy number measurement data were of sufficient resolution and accuracy to detect any copy number variants with high probability. However, this study has demonstrated a very low incidence of deletion and duplication variants within intronic and flanking regions of these nine genes, in both control individuals and colorectal cancer patients. Copyright © 2010 Elsevier Inc. All rights reserved.

  19. Rare copy number variants in a population-based investigation of hypoplastic right heart syndrome.

    PubMed

    Dimopoulos, Aggeliki; Sicko, Robert J; Kay, Denise M; Rigler, Shannon L; Druschel, Charlotte M; Caggana, Michele; Browne, Marilyn L; Fan, Ruzong; Romitti, Paul A; Brody, Lawrence C; Mills, James L

    2017-01-20

    Hypoplastic right heart syndrome (HRHS) is a rare congenital defect characterized by underdevelopment of the right heart structures commonly accompanied by an atrial septal defect. Familial HRHS reports suggest genetic factor involvement. We examined the role of copy number variants (CNVs) in HRHS. We genotyped 32 HRHS cases identified from all New York State live births (1998-2005) using Illumina HumanOmni2.5 microarrays. CNVs were called with PennCNV and prioritized if they were ≥20 Kb, contained ≥10 SNPs and had minimal overlap with CNVs from in-house controls, the Database of Genomic Variants, HapMap3, and Childrens Hospital of Philadelphia database. We identified 28 CNVs in 17 cases; several encompassed genes important for right heart development. One case had a 2p16-2p23 duplication spanning LBH, a limb and heart development transcription factor. Lbh mis-expression results in right ventricular hypoplasia and pulmonary valve defects. This duplication also encompassed SOS1, a factor associated with pulmonary valve stenosis in Noonan syndrome. Sos1 -/- mice display thin and poorly trabeculated ventricles. In another case, we identified a 1.5 Mb deletion associated with Williams-Beuren syndrome, a disorder that includes valvular malformations. A third case had a 24 Kb deletion upstream of the TGFβ ligand ITGB8. Embryos genetically null for Itgb8, and its intracellular interactant Band 4.1B, display lethal cardiac phenotypes. To our knowledge, this is the first study of CNVs in HRHS. We identified several rare CNVs that overlap genes related to right ventricular wall and valve development, suggesting that genetics plays a role in HRHS and providing clues for further investigation. Birth Defects Research 109:16-26, 2017. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.

  20. Rare Copy Number Variants in a Population Based Investigation of Hypoplastic Right Heart Syndrome

    PubMed Central

    Dimopoulos, Aggeliki; Sicko, Robert J.; Kay, Denise M.; Rigler, Shannon L.; Druschel, Charlotte M.; Caggana, Michele; Browne, Marilyn L.; Fan, Ruzong; Romitti, Paul A.; Brody, Lawrence C.; Mills, James L.

    2016-01-01

    Background Hypoplastic right heart syndrome (HRHS) is a rare congenital defect characterized by underdevelopment of the right heart structures commonly accompanied by an atrial septal defect. Familial HRHS reports suggest genetic factor involvement. We examined the role of copy number variants (CNVs) in HRHS. Methods We genotyped 32 HRHS cases identified from all New York State live births (1998–2005) using Illumina HumanOmni2.5 microarrays. CNVs were called with PennCNV and prioritized if they were ≥20Kb, contained ≥10 SNPs and had minimal overlap with CNVs from in-house controls, the Database of Genomic Variants, HapMap3 and CHOP database. Results We identified 28 CNVs in 17 cases; several encompassed genes important for right heart development. One case had a 2p16–2p23 duplication spanning LBH, a limb and heart development transcription factor. Lbh mis-expression results in right ventricular hypoplasia and pulmonary valve defects. This duplication also encompassed SOS1, a factor associated with pulmonary valve stenosis in Noonan syndrome. Sos1−/− mice display thin and poorly trabeculated ventricles. In another case, we identified a 1.5Mb deletion associated with Williams Beuren syndrome, a disorder that includes valvular malformations. A third case had a 24Kb deletion upstream of the TGFβ ligand ITGB8. Embryos genetically null for Itgb8, and its intracellular interactant Band 4.1B, display lethal cardiac phenotypes. Conclusions To our knowledge, this is the first study of CNVs in HRHS. We identified several rare CNVs that overlap genes related to right ventricular wall and valve development, suggesting that genetics plays a role in HRHS and providing clues for further investigation. PMID:28009100

  1. Differences in aggressive behavior and DNA copy number variants between BALB/cJ and BALB/cByJ substrains.

    PubMed

    Velez, Lady; Sokoloff, Greta; Miczek, Klaus A; Palmer, Abraham A; Dulawa, Stephanie C

    2010-03-01

    Some BALB/c substrains exhibit different levels of aggression. We compared aggression levels between male BALB/cJ and BALB/cByJ substrains using the resident intruder paradigm. These substrains were also assessed in other tests of emotionality and information processing including the open field, forced swim, fear conditioning, and prepulse inhibition tests. We also evaluated single nucleotide polymorphisms (SNPs) previously reported between these BALB/c substrains. Finally, we compared BALB/cJ and BALB/cByJ mice for genomic deletions or duplications, collectively termed copy number variants (CNVs), to identify candidate genes that might underlie the observed behavioral differences. BALB/cJ mice showed substantially higher aggression levels than BALB/cByJ mice; however, only minor differences in other behaviors were observed. None of the previously reported SNPs were verified. Eleven CNV regions were identified between the two BALB/c substrains. Our findings identify a robust difference in aggressive behavior between BALB/cJ and BALB/cByJ substrains, which could be the result of the identified CNVs.

  2. GenomeCAT: a versatile tool for the analysis and integrative visualization of DNA copy number variants.

    PubMed

    Tebel, Katrin; Boldt, Vivien; Steininger, Anne; Port, Matthias; Ebert, Grit; Ullmann, Reinhard

    2017-01-06

    The analysis of DNA copy number variants (CNV) has increasing impact in the field of genetic diagnostics and research. However, the interpretation of CNV data derived from high resolution array CGH or NGS platforms is complicated by the considerable variability of the human genome. Therefore, tools for multidimensional data analysis and comparison of patient cohorts are needed to assist in the discrimination of clinically relevant CNVs from others. We developed GenomeCAT, a standalone Java application for the analysis and integrative visualization of CNVs. GenomeCAT is composed of three modules dedicated to the inspection of single cases, comparative analysis of multidimensional data and group comparisons aiming at the identification of recurrent aberrations in patients sharing the same phenotype, respectively. Its flexible import options ease the comparative analysis of own results derived from microarray or NGS platforms with data from literature or public depositories. Multidimensional data obtained from different experiment types can be merged into a common data matrix to enable common visualization and analysis. All results are stored in the integrated MySQL database, but can also be exported as tab delimited files for further statistical calculations in external programs. GenomeCAT offers a broad spectrum of visualization and analysis tools that assist in the evaluation of CNVs in the context of other experiment data and annotations. The use of GenomeCAT does not require any specialized computer skills. The various R packages implemented for data analysis are fully integrated into GenomeCATs graphical user interface and the installation process is supported by a wizard. The flexibility in terms of data import and export in combination with the ability to create a common data matrix makes the program also well suited as an interface between genomic data from heterogeneous sources and external software tools. Due to the modular architecture the functionality of

  3. CNV detection method optimized for high-resolution arrayCGH by normality test.

    PubMed

    Ahn, Jaegyoon; Yoon, Youngmi; Park, Chihyun; Park, Sanghyun

    2012-04-01

    High-resolution arrayCGH platform makes it possible to detect small gains and losses which previously could not be measured. However, current CNV detection tools fitted to early low-resolution data are not applicable to larger high-resolution data. When CNV detection tools are applied to high-resolution data, they suffer from high false-positives, which increases validation cost. Existing CNV detection tools also require optimal parameter values. In most cases, obtaining these values is a difficult task. This study developed a CNV detection algorithm that is optimized for high-resolution arrayCGH data. This tool operates up to 1500 times faster than existing tools on a high-resolution arrayCGH of whole human chromosomes which has 42 million probes whose average length is 50 bases, while preserving false positive/negative rates. The algorithm also uses a normality test, thereby removing the need for optimal parameters. To our knowledge, this is the first formulation for CNV detecting problems that results in a near-linear empirical overall complexity for real high-resolution data. Copyright © 2012 Elsevier Ltd. All rights reserved.

  4. Prospective chromosome analysis of 3429 amniocentesis samples in China using copy number variation sequencing.

    PubMed

    Wang, Jing; Chen, Lin; Zhou, Cong; Wang, Li; Xie, Hanbin; Xiao, Yuanyuan; Zhu, Hongmei; Hu, Ting; Zhang, Zhu; Zhu, Qian; Liu, Zhiying; Liu, Shanlin; Wang, He; Xu, Mengnan; Ren, Zhilin; Yu, Fuli; Cram, David S; Liu, Hongqian

    2018-05-28

    Next generation sequencing (NGS) is emerging as a viable alternative to chromosome microarray analysis for the diagnosis of chromosome disease syndromes. One NGS methodology, copy number variation sequencing (CNV-Seq), has been shown to deliver high reliability, accuracy and reproducibility for detection of fetal CNVs in prenatal samples. However, its clinical utility as a first tier diagnostic method has yet to be demonstrated in a large cohort of pregnant women referred for fetal chromosome testing. To evaluate CNV-Seq as a first tier diagnostic method for detection of fetal chromosome anomalies in a general population of pregnant women with high-risk prenatal indications. Prospective analysis of 3429 pregnant women referred for amniocentesis and fetal chromosome testing for different risk indications, including advanced maternal age (AMA), high-risk maternal serum screening (HR-MSS), and positivity for an ultrasound soft marker (USM). Amniocentesis was performed by standard procedures. Amniocyte DNA was analyzed by CNV-Seq with a chromosome resolution of 0.1 Mb. Fetal chromosome anomalies including whole chromosome aneuploidy and segmental imbalances were independently confirmed by gold standard cytogenetic and molecular methods and their pathogenicity determined following guidelines of the American College of Medical Genetics for sequence variants. Clear interpretable CNV-Seq results were obtained for all 3429 amniocentesis samples. CNV-Seq identified 3293 (96%) samples with a normal molecular karyotype and 136 samples (4%) with an altered molecular karyotype. A total of 146 fetal chromosome anomalies were detected, comprising 46 whole chromosome aneuploidies (pathogenic), 29 submicroscopic microdeletions/microduplications with known or suspected associations with chromosome disease syndromes (pathogenic), 22 other microdeletions/microduplications (likely pathogenic) and 49 variants of uncertain significance (VUS). Overall, the cumulative frequency of

  5. G-CNV: A GPU-Based Tool for Preparing Data to Detect CNVs with Read-Depth Methods.

    PubMed

    Manconi, Andrea; Manca, Emanuele; Moscatelli, Marco; Gnocchi, Matteo; Orro, Alessandro; Armano, Giuliano; Milanesi, Luciano

    2015-01-01

    Copy number variations (CNVs) are the most prevalent types of structural variations (SVs) in the human genome and are involved in a wide range of common human diseases. Different computational methods have been devised to detect this type of SVs and to study how they are implicated in human diseases. Recently, computational methods based on high-throughput sequencing (HTS) are increasingly used. The majority of these methods focus on mapping short-read sequences generated from a donor against a reference genome to detect signatures distinctive of CNVs. In particular, read-depth based methods detect CNVs by analyzing genomic regions with significantly different read-depth from the other ones. The pipeline analysis of these methods consists of four main stages: (i) data preparation, (ii) data normalization, (iii) CNV regions identification, and (iv) copy number estimation. However, available tools do not support most of the operations required at the first two stages of this pipeline. Typically, they start the analysis by building the read-depth signal from pre-processed alignments. Therefore, third-party tools must be used to perform most of the preliminary operations required to build the read-depth signal. These data-intensive operations can be efficiently parallelized on graphics processing units (GPUs). In this article, we present G-CNV, a GPU-based tool devised to perform the common operations required at the first two stages of the analysis pipeline. G-CNV is able to filter low-quality read sequences, to mask low-quality nucleotides, to remove adapter sequences, to remove duplicated read sequences, to map the short-reads, to resolve multiple mapping ambiguities, to build the read-depth signal, and to normalize it. G-CNV can be efficiently used as a third-party tool able to prepare data for the subsequent read-depth signal generation and analysis. Moreover, it can also be integrated in CNV detection tools to generate read-depth signals.

  6. The importance of copy number variation in congenital heart disease

    PubMed Central

    Costain, Gregory; Silversides, Candice K; Bassett, Anne S

    2016-01-01

    Congenital heart disease (CHD) is the most common class of major malformations in humans. The historical association with large chromosomal abnormalities foreshadowed the role of submicroscopic rare copy number variations (CNVs) as important genetic causes of CHD. Recent studies have provided robust evidence for these structural variants as genome-wide contributors to all forms of CHD, including CHD that appears isolated without extra-cardiac features. Overall, a CNV-related molecular diagnosis can be made in up to one in eight patients with CHD. These include de novo and inherited variants at established (chromosome 22q11.2), emerging (chromosome 1q21.1), and novel loci across the genome. Variable expression of rare CNVs provides support for the notion of a genetic spectrum of CHD that crosses traditional anatomic classification boundaries. Clinical genetic testing using genome-wide technologies (e.g., chromosomal microarray analysis) is increasingly employed in prenatal, paediatric and adult settings. CNV discoveries in CHD have translated to changes to clinical management, prognostication and genetic counselling. The convergence of findings at individual gene and at pathway levels is shedding light on the mechanisms that govern human cardiac morphogenesis. These clinical and research advances are helping to inform whole-genome sequencing, the next logical step in delineating the genetic architecture of CHD. PMID:28706735

  7. The Role of Copy Number Variation in Susceptibility to Amyotrophic Lateral Sclerosis: Genome-Wide Association Study and Comparison with Published Loci

    PubMed Central

    Wain, Louise V.; Pedroso, Inti; Landers, John E.; Breen, Gerome; Shaw, Christopher E.; Leigh, P. Nigel; Brown, Robert H.

    2009-01-01

    Background The genetic contribution to sporadic amyotrophic lateral sclerosis (ALS) has not been fully elucidated. There are increasing efforts to characterise the role of copy number variants (CNVs) in human diseases; two previous studies concluded that CNVs may influence risk of sporadic ALS, with multiple rare CNVs more important than common CNVs. A little-explored issue surrounding genome-wide CNV association studies is that of post-calling filtering and merging of raw CNV calls. We undertook simulations to define filter thresholds and considered optimal ways of merging overlapping CNV calls for association testing, taking into consideration possibly overlapping or nested, but distinct, CNVs and boundary estimation uncertainty. Methodology and Principal Findings In this study we screened Illumina 300K SNP genotyping data from 730 ALS cases and 789 controls for copy number variation. Following quality control filters using thresholds defined by simulation, a total of 11321 CNV calls were made across 575 cases and 621 controls. Using region-based and gene-based association analyses, we identified several loci showing nominally significant association. However, the choice of criteria for combining calls for association testing has an impact on the ranking of the results by their significance. Several loci which were previously reported as being associated with ALS were identified here. However, of another 15 genes previously reported as exhibiting ALS-specific copy number variation, only four exhibited copy number variation in this study. Potentially interesting novel loci, including EEF1D, a translation elongation factor involved in the delivery of aminoacyl tRNAs to the ribosome (a process which has previously been implicated in genetic studies of spinal muscular atrophy) were identified but must be treated with caution due to concerns surrounding genomic location and platform suitability. Conclusions and Significance Interpretation of CNV association findings

  8. VCS: Tool for Visualizing Copy Number Variation and Single Nucleotide Polymorphism.

    PubMed

    Kim, HyoYoung; Sung, Samsun; Cho, Seoae; Kim, Tae-Hun; Seo, Kangseok; Kim, Heebal

    2014-12-01

    Copy number variation (CNV) or single nucleotide phlyorphism (SNP) is useful genetic resource to aid in understanding complex phenotypes or deseases susceptibility. Although thousands of CNVs and SNPs are currently avaliable in the public databases, they are somewhat difficult to use for analyses without visualization tools. We developed a web-based tool called the VCS (visualization of CNV or SNP) to visualize the CNV or SNP detected. The VCS tool can assist to easily interpret a biological meaning from the numerical value of CNV and SNP. The VCS provides six visualization tools: i) the enrichment of genome contents in CNV; ii) the physical distribution of CNV or SNP on chromosomes; iii) the distribution of log2 ratio of CNVs with criteria of interested; iv) the number of CNV or SNP per binning unit; v) the distribution of homozygosity of SNP genotype; and vi) cytomap of genes within CNV or SNP region.

  9. Ethnic differentiation of copy number variation on chromosome 16p12.3 for association with obesity phenotypes in European and Chinese populations.

    PubMed

    Yang, T-L; Guo, Y; Li, S M; Li, S K; Tian, Q; Liu, Y-J; Deng, H-W

    2013-02-01

    Genomic copy number variations (CNVs) have been strongly implicated as important genetic factors for obesity. A recent genome-wide association study identified a novel variant, rs12444979, which is in high linkage disequilibrium with CNV 16p12.3, for association with obesity in Europeans. The aim of this study was to directly examine the relationship between the CNV 16p12.3 and obesity phenotypes, including body mass index (BMI) and body fat mass. Subjects were a multi-ethnic sample, including 2286 unrelated subjects from a European population and 1627 unrelated Han subjects from a Chinese population. Body fat mass was measured using dual energy X-ray absorptiometry. Using Affymetrix Genome-Wide Human SNP Array 6.0, we directly detected CNV 16p12.3, with the deletion frequency of 27.26 and 0.8% in the European and Chinese populations, respectively. We confirmed the significant association between this CNV and obesity (BMI: P=1.38 × 10(-2); body fat mass: P=2.13 × 10(-3)) in the European population. Less copy numbers were associated with lower BMI and body fat mass, and the effect size was estimated to be 0.62 (BMI) and 1.41 (body fat mass), respectively. However, for the Chinese population, we did not observe significant association signal, and the frequencies of this deletion CNV are quite different between the European and Chinese populations (P<0.001). Our findings first suggest that CNV 16p12.3 might be ethnic specific and cause ethnic phenotypic diversity, which may provide some new clues into the understanding of the genetic architecture of obesity.

  10. Population Structure Shapes Copy Number Variation in Malaria Parasites.

    PubMed

    Cheeseman, Ian H; Miller, Becky; Tan, John C; Tan, Asako; Nair, Shalini; Nkhoma, Standwell C; De Donato, Marcos; Rodulfo, Hectorina; Dondorp, Arjen; Branch, Oralee H; Mesia, Lastenia Ruiz; Newton, Paul; Mayxay, Mayfong; Amambua-Ngwa, Alfred; Conway, David J; Nosten, François; Ferdig, Michael T; Anderson, Tim J C

    2016-03-01

    If copy number variants (CNVs) are predominantly deleterious, we would expect them to be more efficiently purged from populations with a large effective population size (Ne) than from populations with a small Ne. Malaria parasites (Plasmodium falciparum) provide an excellent organism to examine this prediction, because this protozoan shows a broad spectrum of population structures within a single species, with large, stable, outbred populations in Africa, small unstable inbred populations in South America and with intermediate population characteristics in South East Asia. We characterized 122 single-clone parasites, without prior laboratory culture, from malaria-infected patients in seven countries in Africa, South East Asia and South America using a high-density single-nucleotide polymorphism/CNV microarray. We scored 134 high-confidence CNVs across the parasite exome, including 33 deletions and 102 amplifications, which ranged in size from <500 bp to 59 kb, as well as 10,107 flanking, biallelic single-nucleotide polymorphisms. Overall, CNVs were rare, small, and skewed toward low frequency variants, consistent with the deleterious model. Relative to African and South East Asian populations, CNVs were significantly more common in South America, showed significantly less skew in allele frequencies, and were significantly larger. On this background of low frequency CNV, we also identified several high-frequency CNVs under putative positive selection using an FST outlier analysis. These included known adaptive CNVs containing rh2b and pfmdr1, and several other CNVs (e.g., DNA helicase and three conserved proteins) that require further investigation. Our data are consistent with a significant impact of genetic structure on CNV burden in an important human pathogen. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  11. Accurate and exact CNV identification from targeted high-throughput sequence data.

    PubMed

    Nord, Alex S; Lee, Ming; King, Mary-Claire; Walsh, Tom

    2011-04-12

    Massively parallel sequencing of barcoded DNA samples significantly increases screening efficiency for clinically important genes. Short read aligners are well suited to single nucleotide and indel detection. However, methods for CNV detection from targeted enrichment are lacking. We present a method combining coverage with map information for the identification of deletions and duplications in targeted sequence data. Sequencing data is first scanned for gains and losses using a comparison of normalized coverage data between samples. CNV calls are confirmed by testing for a signature of sequences that span the CNV breakpoint. With our method, CNVs can be identified regardless of whether breakpoints are within regions targeted for sequencing. For CNVs where at least one breakpoint is within targeted sequence, exact CNV breakpoints can be identified. In a test data set of 96 subjects sequenced across ~1 Mb genomic sequence using multiplexing technology, our method detected mutations as small as 31 bp, predicted quantitative copy count, and had a low false-positive rate. Application of this method allows for identification of gains and losses in targeted sequence data, providing comprehensive mutation screening when combined with a short read aligner.

  12. The genetic effect of copy number variations on the risk of alcoholism in a Korean population.

    PubMed

    Bae, Joon Seol; Jung, Myung Hun; Lee, Boung Chul; Cheong, Hyun Sub; Park, Byung Lae; Kim, Lyoung Hyo; Kim, Jeong-Hyun; Pasaje, Charisse Flerida A; Lee, Jin Sol; Jung, Kyoung Hwa; Chai, Young Gyu; Shin, Hyoung Doo; Choi, Ihn-Geun

    2012-01-01

    Alcoholism, a chronic behavioral disorder characterized by excessive alcohol consumption, has been a leading cause of morbidity and premature death. This condition is believed to be influenced by genetic factors. As copy number variation (CNV) has been recently discovered in human genome, genomic diversity of human genome is more frequent than previously thought. Many studies have reported evidences that CNV is associated with the development of complex diseases. In this study, we hypothesized that CNV can predict the risk of alcoholism. Using the Illumina HumanHap660W-Quad BeadChip (∼660 k markers), genome-wide genotyping was performed to obtain signal and allelic intensities from 116 alcoholic cases and 1,022 healthy controls (total n = 1,138) in a Korean population. To identify alcoholism-associated CNV regions, we performed a genome-wide association analysis, using multivariate logistic regression model controlling for age and gender. We identified a total of 255,732 individual CNVs and 3,261 CNV regions (1,067 common CNV regions, frequency > 1%) in this study. Results from multivariate logistic regression showed that the chr20:61195302-61195978 regions were significantly associated with the risk of alcoholism after multiple corrections (p = 5.02E-05, p(corr) = 0.04). Most of the identified variations in this study overlapped with the previously reported CNVs in the Database of Genomic Variants (95.3%). The identified CNVs, which encompassed 3,226 functional genes, were significantly enriched in the cellular part, in the membrane-bound organelle, in the cell part, in developmental processes, in cell communication, in neurological system process, in sensory perception of smell and chemical stimulus, and in olfactory receptor activity. This is the first genome-wide association study to investigate the relationship between common CNV and alcoholism. Our results suggest that the newly identified CNV regions may contribute to the development of alcoholism

  13. Low copy number of the salivary amylase gene predisposes to obesity.

    PubMed

    Falchi, Mario; El-Sayed Moustafa, Julia Sarah; Takousis, Petros; Pesce, Francesco; Bonnefond, Amélie; Andersson-Assarsson, Johanna C; Sudmant, Peter H; Dorajoo, Rajkumar; Al-Shafai, Mashael Nedham; Bottolo, Leonardo; Ozdemir, Erdal; So, Hon-Cheong; Davies, Robert W; Patrice, Alexandre; Dent, Robert; Mangino, Massimo; Hysi, Pirro G; Dechaume, Aurélie; Huyvaert, Marlène; Skinner, Jane; Pigeyre, Marie; Caiazzo, Robert; Raverdy, Violeta; Vaillant, Emmanuel; Field, Sarah; Balkau, Beverley; Marre, Michel; Visvikis-Siest, Sophie; Weill, Jacques; Poulain-Godefroy, Odile; Jacobson, Peter; Sjostrom, Lars; Hammond, Christopher J; Deloukas, Panos; Sham, Pak Chung; McPherson, Ruth; Lee, Jeannette; Tai, E Shyong; Sladek, Robert; Carlsson, Lena M S; Walley, Andrew; Eichler, Evan E; Pattou, Francois; Spector, Timothy D; Froguel, Philippe

    2014-05-01

    Common multi-allelic copy number variants (CNVs) appear enriched for phenotypic associations compared to their biallelic counterparts. Here we investigated the influence of gene dosage effects on adiposity through a CNV association study of gene expression levels in adipose tissue. We identified significant association of a multi-allelic CNV encompassing the salivary amylase gene (AMY1) with body mass index (BMI) and obesity, and we replicated this finding in 6,200 subjects. Increased AMY1 copy number was positively associated with both amylase gene expression (P = 2.31 × 10(-14)) and serum enzyme levels (P < 2.20 × 10(-16)), whereas reduced AMY1 copy number was associated with increased BMI (change in BMI per estimated copy = -0.15 (0.02) kg/m(2); P = 6.93 × 10(-10)) and obesity risk (odds ratio (OR) per estimated copy = 1.19, 95% confidence interval (CI) = 1.13-1.26; P = 1.46 × 10(-10)). The OR value of 1.19 per copy of AMY1 translates into about an eightfold difference in risk of obesity between subjects in the top (copy number > 9) and bottom (copy number < 4) 10% of the copy number distribution. Our study provides a first genetic link between carbohydrate metabolism and BMI and demonstrates the power of integrated genomic approaches beyond genome-wide association studies.

  14. Recurrent Rearrangements of Human Amylase Genes Create Multiple Independent CNV Series.

    PubMed

    Shwan, Nzar A A; Louzada, Sandra; Yang, Fengtang; Armour, John A L

    2017-05-01

    The human amylase gene cluster includes the human salivary (AMY1) and pancreatic amylase genes (AMY2A and AMY2B), and is a highly variable and dynamic region of the genome. Copy number variation (CNV) of AMY1 has been implicated in human dietary adaptation, and in population association with obesity, but neither of these findings has been independently replicated. Despite these functional implications, the structural genomic basis of CNV has only been defined in detail very recently. In this work, we use high-resolution analysis of copy number, and analysis of segregation in trios, to define new, independent allelic series of amylase CNVs in sub-Saharan Africans, including a series of higher-order expansions of a unit consisting of one copy each of AMY1, AMY2A, and AMY2B. We use fiber-FISH (fluorescence in situ hybridization) to define unexpected complexity in the accompanying rearrangements. These findings demonstrate recurrent involvement of the amylase gene region in genomic instability, involving at least five independent rearrangements of the pancreatic amylase genes (AMY2A and AMY2B). Structural features shared by fundamentally distinct lineages strongly suggest that the common ancestral state for the human amylase cluster contained more than one, and probably three, copies of AMY1. © 2017 WILEY PERIODICALS, INC.

  15. A Copy Number Variant at the KITLG Locus Likely Confers Risk for Canine Squamous Cell Carcinoma of the Digit

    PubMed Central

    Karyadi, Danielle M.; Karlins, Eric; Decker, Brennan; vonHoldt, Bridgett M.; Carpintero-Ramirez, Gretchen; Parker, Heidi G.; Wayne, Robert K.; Ostrander, Elaine A.

    2013-01-01

    The domestic dog is a robust model for studying the genetics of complex disease susceptibility. The strategies used to develop and propagate modern breeds have resulted in an elevated risk for specific diseases in particular breeds. One example is that of Standard Poodles (STPOs), who have increased risk for squamous cell carcinoma of the digit (SCCD), a locally aggressive cancer that causes lytic bone lesions, sometimes with multiple toe recurrence. However, only STPOs of dark coat color are at high risk; light colored STPOs are almost entirely unaffected, suggesting that interactions between multiple pathways are necessary for oncogenesis. We performed a genome-wide association study (GWAS) on STPOs, comparing 31 SCCD cases to 34 unrelated black STPO controls. The peak SNP on canine chromosome 15 was statistically significant at the genome-wide level (Praw = 1.60×10−7; Pgenome = 0.0066). Additional mapping resolved the region to the KIT Ligand (KITLG) locus. Comparison of STPO cases to other at-risk breeds narrowed the locus to a 144.9-Kb region. Haplotype mapping among 84 STPO cases identified a minimal region of 28.3 Kb. A copy number variant (CNV) containing predicted enhancer elements was found to be strongly associated with SCCD in STPOs (P = 1.72×10−8). Light colored STPOs carry the CNV risk alleles at the same frequency as black STPOs, but are not susceptible to SCCD. A GWAS comparing 24 black and 24 light colored STPOs highlighted only the MC1R locus as significantly different between the two datasets, suggesting that a compensatory mutation within the MC1R locus likely protects light colored STPOs from disease. Our findings highlight a role for KITLG in SCCD susceptibility, as well as demonstrate that interactions between the KITLG and MC1R loci are potentially required for SCCD oncogenesis. These findings highlight how studies of breed-limited diseases are useful for disentangling multigene disorders. PMID:23555311

  16. Similar genomic proportions of copy number variation within gray wolves and modern dog breeds inferred from whole genome sequencing.

    PubMed

    Serres-Armero, Aitor; Povolotskaya, Inna S; Quilez, Javier; Ramirez, Oscar; Santpere, Gabriel; Kuderna, Lukas F K; Hernandez-Rodriguez, Jessica; Fernandez-Callejo, Marcos; Gomez-Sanchez, Daniel; Freedman, Adam H; Fan, Zhenxin; Novembre, John; Navarro, Arcadi; Boyko, Adam; Wayne, Robert; Vilà, Carles; Lorente-Galdos, Belen; Marques-Bonet, Tomas

    2017-12-19

    Whole genome re-sequencing data from dogs and wolves are now commonly used to study how natural and artificial selection have shaped the patterns of genetic diversity. Single nucleotide polymorphisms, microsatellites and variants in mitochondrial DNA have been interrogated for links to specific phenotypes or signals of domestication. However, copy number variation (CNV), despite its increasingly recognized importance as a contributor to phenotypic diversity, has not been extensively explored in canids. Here, we develop a new accurate probabilistic framework to create fine-scale genomic maps of segmental duplications (SDs), compare patterns of CNV across groups and investigate their role in the evolution of the domestic dog by using information from 34 canine genomes. Our analyses show that duplicated regions are enriched in genes and hence likely possess functional importance. We identify 86 loci with large CNV differences between dogs and wolves, enriched in genes responsible for sensory perception, immune response, metabolic processes, etc. In striking contrast to the observed loss of nucleotide diversity in domestic dogs following the population bottlenecks that occurred during domestication and breed creation, we find a similar proportion of CNV loci in dogs and wolves, suggesting that other dynamics are acting to particularly select for CNVs with potentially functional impacts. This work is the first comparison of genome wide CNV patterns in domestic and wild canids using whole-genome sequencing data and our findings contribute to study the impact of novel kinds of genetic changes on the evolution of the domestic dog.

  17. Genome-wide copy number analysis reveals candidate gene loci that confer susceptibility to high-grade prostate cancer.

    PubMed

    Poniah, Prevathe; Mohd Zain, Shamsul; Abdul Razack, Azad Hassan; Kuppusamy, Shanggar; Karuppayah, Shankar; Sian Eng, Hooi; Mohamed, Zahurin

    2017-09-01

    Two key issues in prostate cancer (PCa) that demand attention currently are the need for a more precise and minimally invasive screening test owing to the inaccuracy of prostate-specific antigen and differential diagnosis to distinguish advanced vs. indolent cancers. This continues to pose a tremendous challenge in diagnosis and prognosis of PCa and could potentially lead to overdiagnosis and overtreatment complications. Copy number variations (CNVs) in the human genome have been linked to various carcinomas including PCa. Detection of these variants may improve clinical treatment as well as an understanding of the pathobiology underlying this complex disease. To this end, we undertook a pilot genome-wide CNV analysis approach in 36 subjects (18 patients with high-grade PCa and 18 controls that were matched by age and ethnicity) in search of more accurate biomarkers that could potentially explain susceptibility toward high-grade PCa. We conducted this study using the array comparative genomic hybridization technique. Array results were validated in 92 independent samples (46 high-grade PCa, 23 benign prostatic hyperplasia, and 23 healthy controls) using polymerase chain reaction-based copy number counting method. A total of 314 CNV regions were found to be unique to PCa subjects in this cohort (P<0.05). A log 2 ratio-based copy number analysis revealed 5 putative rare or novel CNV loci or both associated with susceptibility to PCa. The CNV gain regions were 1q21.3, 15q15, 7p12.1, and a novel CNV in PCa 12q23.1, harboring ARNT, THBS1, SLC5A8, and DDC genes that are crucial in the p53 and cancer pathways. A CNV loss and deletion event was observed at 8p11.21, which contains the SFRP1 gene from the Wnt signaling pathway. Cross-comparison analysis with genes associated to PCa revealed significant CNVs involved in biological processes that elicit cancer pathogenesis via cytokine production and endothelial cell proliferation. In conclusion, we postulated that the CNVs

  18. X-chromosome tiling path array detection of copy number variants in patients with chromosome X-linked mental retardation

    PubMed Central

    Madrigal, I; Rodríguez-Revenga, L; Armengol, L; González, E; Rodriguez, B; Badenas, C; Sánchez, A; Martínez, F; Guitart, M; Fernández, I; Arranz, JA; Tejada, MI; Pérez-Jurado, LA; Estivill, X; Milà, M

    2007-01-01

    Background Aproximately 5–10% of cases of mental retardation in males are due to copy number variations (CNV) on the X chromosome. Novel technologies, such as array comparative genomic hybridization (aCGH), may help to uncover cryptic rearrangements in X-linked mental retardation (XLMR) patients. We have constructed an X-chromosome tiling path array using bacterial artificial chromosomes (BACs) and validated it using samples with cytogenetically defined copy number changes. We have studied 54 patients with idiopathic mental retardation and 20 controls subjects. Results Known genomic aberrations were reliably detected on the array and eight novel submicroscopic imbalances, likely causative for the mental retardation (MR) phenotype, were detected. Putatively pathogenic rearrangements included three deletions and five duplications (ranging between 82 kb to one Mb), all but two affecting genes previously known to be responsible for XLMR. Additionally, we describe different CNV regions with significant different frequencies in XLMR and control subjects (44% vs. 20%). Conclusion This tiling path array of the human X chromosome has proven successful for the detection and characterization of known rearrangements and novel CNVs in XLMR patients. PMID:18047645

  19. Identifying Potential Regions of Copy Number Variation for Bipolar Disorder

    PubMed Central

    Chen, Yi-Hsuan; Lu, Ru-Band; Hung, Hung; Kuo, Po-Hsiu

    2014-01-01

    Bipolar disorder is a complex psychiatric disorder with high heritability, but its genetic determinants are still largely unknown. Copy number variation (CNV) is one of the sources to explain part of the heritability. However, it is a challenge to estimate discrete values of the copy numbers using continuous signals calling from a set of markers, and to simultaneously perform association testing between CNVs and phenotypic outcomes. The goal of the present study is to perform a series of data filtering and analysis procedures using a DNA pooling strategy to identify potential CNV regions that are related to bipolar disorder. A total of 200 normal controls and 200 clinically diagnosed bipolar patients were recruited in this study, and were randomly divided into eight control and eight case pools. Genome-wide genotyping was employed using Illumina Human Omni1-Quad array with approximately one million markers for CNV calling. We aimed at setting a series of criteria to filter out the signal noise of marker data and to reduce the chance of false-positive findings for CNV regions. We first defined CNV regions for each pool. Potential CNV regions were reported based on the different patterns of CNV status between cases and controls. Genes that were mapped into the potential CNV regions were examined with association testing, Gene Ontology enrichment analysis, and checked with existing literature for their associations with bipolar disorder. We reported several CNV regions that are related to bipolar disorder. Two CNV regions on chromosome 11 and 22 showed significant signal differences between cases and controls (p < 0.05). Another five CNV regions on chromosome 6, 9, and 19 were overlapped with results in previous CNV studies. Experimental validation of two CNV regions lent some support to our reported findings. Further experimental and replication studies could be designed for these selected regions. PMID:27605030

  20. Genetic Variants Identified from Epilepsy of Unknown Etiology in Chinese Children by Targeted Exome Sequencing

    PubMed Central

    Wang, Yimin; Du, Xiaonan; Bin, Rao; Yu, Shanshan; Xia, Zhezhi; Zheng, Guo; Zhong, Jianmin; Zhang, Yunjian; Jiang, Yong-hui; Wang, Yi

    2017-01-01

    Genetic factors play a major role in the etiology of epilepsy disorders. Recent genomics studies using next generation sequencing (NGS) technique have identified a large number of genetic variants including copy number (CNV) and single nucleotide variant (SNV) in a small set of genes from individuals with epilepsy. These discoveries have contributed significantly to evaluate the etiology of epilepsy in clinic and lay the foundation to develop molecular specific treatment. However, the molecular basis for a majority of epilepsy patients remains elusive, and furthermore, most of these studies have been conducted in Caucasian children. Here we conducted a targeted exome-sequencing of 63 trios of Chinese epilepsy families using a custom-designed NGS panel that covers 412 known and candidate genes for epilepsy. We identified pathogenic and likely pathogenic variants in 15 of 63 (23.8%) families in known epilepsy genes including SCN1A, CDKL5, STXBP1, CHD2, SCN3A, SCN9A, TSC2, MBD5, POLG and EFHC1. More importantly, we identified likely pathologic variants in several novel candidate genes such as GABRE, MYH1, and CLCN6. Our results provide the evidence supporting the application of custom-designed NGS panel in clinic and indicate a conserved genetic susceptibility for epilepsy between Chinese and Caucasian children. PMID:28074849

  1. The effect of modifying response and performance feedback parameters on the CNV in humans

    NASA Technical Reports Server (NTRS)

    Otto, D. A.; Leifer, L. J.

    1972-01-01

    The effect on the CNV of sustained and delayed motor response with the dominant and nondominant hand in the presence and absence of visual performance feedback, was studied in 15 male adults. Monopolar scalp recordings were obtained at Fz, Cz, Pz, and bilaterally over the motor hand area. Results indicated that the magnitude of the CNV was greater in the delayed than sustained response task, greater in the presence than absence of feedback, and greater over the motor hand area contralateral to movement. Frontal CNV habituated in the sustained, but not the delayed response task, suggested that frontal negative variations in the former case signify an orienting response to novelty or uncertainty. The absence of habituation in the delay condition was interpreted in terms of the motor inhibitory function of frontal association cortex. Performance feedback appeared to enhance CNV indirectly by increasing the motivation of subjects. A multiprocess conception of CNV was proposed in which vortex-negative slow potentials reflect a multiplicity of psychophysiological processes occurring at a variety of cortical and subcortical locations in the brain preparatory to a motor or mental action.

  2. Copy number variations at the Prader-Willi syndrome region on chromosome 15 and associations with obesity in whites.

    PubMed

    Chen, Yuan; Liu, Yong-Jun; Pei, Yu-Fang; Yang, Tie-Lin; Deng, Fei-Yan; Liu, Xiao-Gang; Li, Ding-You; Deng, Hong-Wen

    2011-06-01

    Obesity is a serious health problem with strong genetic determination. Copy number variation (CNV) is a common type of genomic variant associated with some complex human diseases. However, it is not clear how CNVs contribute to the etiology of obesity. In this study, we examined 1,000 unrelated US whites to search for CNVs that may predispose to obesity. We focused our analyses on the Prader-Willi syndrome (PWS) critical region (chromosome 15q11-q13), because the PWS region is a hotspot for CNV generation and obesity is one of the major clinical manifestations for chromosome abnormalities at this region. We constructed a map containing 39 CNVs at the PWS critical region with CNV occurrence rates higher than 1%. Among them, three CNVs were significantly associated with body fat mass (P < 0.05), with a higher copy number (CN) associated with an increase of 5.08-9.77 kg in body fat mass. These three CNVs are close to two known PWS genes, NDN (necdin homolog) and C15orf2 (chromosome 15 open reading frame 2), and partially overlap with another obesity gene PWRN1 (Prader-Willi region nonprotein-coding RNA 1). Interestingly, our recently published whole genome association scan study using the same sample by examining single-nucleotide polymorphisms (SNPs) did not find any significant associations at these CNV regions, suggesting the importance of examining both CNVs and SNPs for better understanding of genetic basis of obesity. Further studies are warranted to validate these CNVs and their importance to obesity.

  3. CNV analysis in Tourette syndrome implicates large genomic rearrangements in COL8A1 and NRXN1.

    PubMed

    Nag, Abhishek; Bochukova, Elena G; Kremeyer, Barbara; Campbell, Desmond D; Muller, Heike; Valencia-Duarte, Ana V; Cardona, Julio; Rivas, Isabel C; Mesa, Sandra C; Cuartas, Mauricio; Garcia, Jharley; Bedoya, Gabriel; Cornejo, William; Herrera, Luis D; Romero, Roxana; Fournier, Eduardo; Reus, Victor I; Lowe, Thomas L; Farooqi, I Sadaf; Mathews, Carol A; McGrath, Lauren M; Yu, Dongmei; Cook, Ed; Wang, Kai; Scharf, Jeremiah M; Pauls, David L; Freimer, Nelson B; Plagnol, Vincent; Ruiz-Linares, Andrés

    2013-01-01

    Tourette syndrome (TS) is a neuropsychiatric disorder with a strong genetic component. However, the genetic architecture of TS remains uncertain. Copy number variation (CNV) has been shown to contribute to the genetic make-up of several neurodevelopmental conditions, including schizophrenia and autism. Here we describe CNV calls using SNP chip genotype data from an initial sample of 210 TS cases and 285 controls ascertained in two Latin American populations. After extensive quality control, we found that cases (N = 179) have a significant excess (P = 0.006) of large CNV (>500 kb) calls compared to controls (N = 234). Amongst 24 large CNVs seen only in the cases, we observed four duplications of the COL8A1 gene region. We also found two cases with ∼400 kb deletions involving NRXN1, a gene previously implicated in neurodevelopmental disorders, including TS. Follow-up using multiplex ligation-dependent probe amplification (and including 53 more TS cases) validated the CNV calls and identified additional patients with rearrangements in COL8A1 and NRXN1, but none in controls. Examination of available parents indicates that two out of three NRXN1 deletions detected in the TS cases are de-novo mutations. Our results are consistent with the proposal that rare CNVs play a role in TS aetiology and suggest a possible role for rearrangements in the COL8A1 and NRXN1 gene regions.

  4. CNV Analysis in Tourette Syndrome Implicates Large Genomic Rearrangements in COL8A1 and NRXN1

    PubMed Central

    Nag, Abhishek; Bochukova, Elena G.; Kremeyer, Barbara; Campbell, Desmond D.; Muller, Heike; Valencia-Duarte, Ana V.; Cardona, Julio; Rivas, Isabel C.; Mesa, Sandra C.; Cuartas, Mauricio; Garcia, Jharley; Bedoya, Gabriel; Cornejo, William; Herrera, Luis D.; Romero, Roxana; Fournier, Eduardo; Reus, Victor I.; Lowe, Thomas L.; Farooqi, I. Sadaf; Mathews, Carol A.; McGrath, Lauren M.; Yu, Dongmei; Cook, Ed; Wang, Kai; Scharf, Jeremiah M.; Pauls, David L.; Freimer, Nelson B.; Plagnol, Vincent; Ruiz-Linares, Andrés

    2013-01-01

    Tourette syndrome (TS) is a neuropsychiatric disorder with a strong genetic component. However, the genetic architecture of TS remains uncertain. Copy number variation (CNV) has been shown to contribute to the genetic make-up of several neurodevelopmental conditions, including schizophrenia and autism. Here we describe CNV calls using SNP chip genotype data from an initial sample of 210 TS cases and 285 controls ascertained in two Latin American populations. After extensive quality control, we found that cases (N = 179) have a significant excess (P = 0.006) of large CNV (>500 kb) calls compared to controls (N = 234). Amongst 24 large CNVs seen only in the cases, we observed four duplications of the COL8A1 gene region. We also found two cases with ∼400kb deletions involving NRXN1, a gene previously implicated in neurodevelopmental disorders, including TS. Follow-up using multiplex ligation-dependent probe amplification (and including 53 more TS cases) validated the CNV calls and identified additional patients with rearrangements in COL8A1 and NRXN1, but none in controls. Examination of available parents indicates that two out of three NRXN1 deletions detected in the TS cases are de-novo mutations. Our results are consistent with the proposal that rare CNVs play a role in TS aetiology and suggest a possible role for rearrangements in the COL8A1 and NRXN1 gene regions. PMID:23533600

  5. A bayesian analysis for identifying DNA copy number variations using a compound poisson process.

    PubMed

    Chen, Jie; Yiğiter, Ayten; Wang, Yu-Ping; Deng, Hong-Wen

    2010-01-01

    To study chromosomal aberrations that may lead to cancer formation or genetic diseases, the array-based Comparative Genomic Hybridization (aCGH) technique is often used for detecting DNA copy number variants (CNVs). Various methods have been developed for gaining CNVs information based on aCGH data. However, most of these methods make use of the log-intensity ratios in aCGH data without taking advantage of other information such as the DNA probe (e.g., biomarker) positions/distances contained in the data. Motivated by the specific features of aCGH data, we developed a novel method that takes into account the estimation of a change point or locus of the CNV in aCGH data with its associated biomarker position on the chromosome using a compound Poisson process. We used a Bayesian approach to derive the posterior probability for the estimation of the CNV locus. To detect loci of multiple CNVs in the data, a sliding window process combined with our derived Bayesian posterior probability was proposed. To evaluate the performance of the method in the estimation of the CNV locus, we first performed simulation studies. Finally, we applied our approach to real data from aCGH experiments, demonstrating its applicability.

  6. Functional effects of CCL3L1 copy number.

    PubMed

    Carpenter, D; McIntosh, R S; Pleass, R J; Armour, J A L

    2012-07-01

    Copy number variation (CNV) is becoming increasingly important as a feature of human variation in disease susceptibility studies. However, the consequences of CNV are not so well understood. Here, we present data exploring the functional consequences of CNV of CCL3L1 in 55 independent UK samples with no known clinical phenotypes. The copy number of CCL3L1 was determined by the paralogue ratio test, and expression levels of macrophage inflammatory protein-1α (MIP-1α) and mRNA from stimulated monocytes were measured and analysed. The data show no statistically significant association of MIP-1α protein levels with copy number. However, there was a significant correlation between copy number and CCL3L1:CCL3 mRNA ratio. The data also provide evidence that expression of CCL3 predominates in both protein and mRNA, and therefore the observed variation of CCL3 is potentially more important biologically than that of CNV of CCL3L1.

  7. Rare copy number variants and congenital heart defects in the 22q11.2 deletion syndrome.

    PubMed

    Mlynarski, Elisabeth E; Xie, Michael; Taylor, Deanne; Sheridan, Molly B; Guo, Tingwei; Racedo, Silvia E; McDonald-McGinn, Donna M; Chow, Eva W C; Vorstman, Jacob; Swillen, Ann; Devriendt, Koen; Breckpot, Jeroen; Digilio, Maria Cristina; Marino, Bruno; Dallapiccola, Bruno; Philip, Nicole; Simon, Tony J; Roberts, Amy E; Piotrowicz, Małgorzata; Bearden, Carrie E; Eliez, Stephan; Gothelf, Doron; Coleman, Karlene; Kates, Wendy R; Devoto, Marcella; Zackai, Elaine; Heine-Suñer, Damian; Goldmuntz, Elizabeth; Bassett, Anne S; Morrow, Bernice E; Emanuel, Beverly S

    2016-03-01

    The 22q11.2 deletion syndrome (22q11DS; velocardiofacial/DiGeorge syndrome; VCFS/DGS; MIM #192430; 188400) is the most common microdeletion syndrome. The phenotypic presentation of 22q11DS is highly variable; approximately 60-75 % of 22q11DS patients have been reported to have a congenital heart defect (CHD), mostly of the conotruncal type, and/or aortic arch defect. The etiology of the cardiac phenotypic variability is not currently known for the majority of patients. We hypothesized that rare copy number variants (CNVs) outside the 22q11.2 deleted region may modify the risk of being born with a CHD in this sensitized population. Rare CNV analysis was performed using Affymetrix SNP Array 6.0 data from 946 22q11DS subjects with CHDs (n = 607) or with normal cardiac anatomy (n = 339). Although there was no significant difference in the overall burden of rare CNVs, an overabundance of CNVs affecting cardiac-related genes was detected in 22q11DS individuals with CHDs. When the rare CNVs were examined with regard to gene interactions, specific cardiac networks, such as Wnt signaling, appear to be overrepresented in 22q11DS CHD cases but not 22q11DS controls with a normal heart. Collectively, these data suggest that CNVs outside the 22q11.2 region may contain genes that modify risk for CHDs in some 22q11DS patients.

  8. CoNVaQ: a web tool for copy number variation-based association studies.

    PubMed

    Larsen, Simon Jonas; do Canto, Luisa Matos; Rogatto, Silvia Regina; Baumbach, Jan

    2018-05-18

    Copy number variations (CNVs) are large segments of the genome that are duplicated or deleted. Structural variations in the genome have been linked to many complex diseases. Similar to how genome-wide association studies (GWAS) have helped discover single-nucleotide polymorphisms linked to disease phenotypes, the extension of GWAS to CNVs has aided the discovery of structural variants associated with human traits and diseases. We present CoNVaQ, an easy-to-use web-based tool for CNV-based association studies. The web service allows users to upload two sets of CNV segments and search for genomic regions where the occurrence of CNVs is significantly associated with the phenotype. CoNVaQ provides two models: a simple statistical model using Fisher's exact test and a novel query-based model matching regions to user-defined queries. For each region, the method computes a global q-value statistic by repeated permutation of samples among the populations. We demonstrate our platform by using it to analyze a data set of HPV-positive and HPV-negative penile cancer patients. CoNVaQ provides a simple workflow for performing CNV-based association studies. It is made available as a web platform in order to provide a user-friendly workflow for biologists and clinicians to carry out CNV data analysis without installing any software. Through the web interface, users are also able to analyze their results to find overrepresented GO terms and pathways. In addition, our method is also available as a package for the R programming language. CoNVaQ is available at https://convaq.compbio.sdu.dk .

  9. Genome-wide screening identifies a KCNIP1 copy number variant as a genetic predictor for atrial fibrillation

    PubMed Central

    Tsai, Chia-Ti; Hsieh, Chia-Shan; Chang, Sheng-Nan; Chuang, Eric Y.; Ueng, Kwo-Chang; Tsai, Chin-Feng; Lin, Tsung-Hsien; Wu, Cho-Kai; Lee, Jen-Kuang; Lin, Lian-Yu; Wang, Yi-Chih; Yu, Chih-Chieh; Lai, Ling-Ping; Tseng, Chuen-Den; Hwang, Juey-Jen; Chiang, Fu-Tien; Lin, Jiunn-Lee

    2016-01-01

    Atrial fibrillation (AF) is the most common sustained cardiac arrhythmia. Previous genome-wide association studies had identified single-nucleotide polymorphisms in several genomic regions to be associated with AF. In human genome, copy number variations (CNVs) are known to contribute to disease susceptibility. Using a genome-wide multistage approach to identify AF susceptibility CNVs, we here show a common 4,470-bp diallelic CNV in the first intron of potassium interacting channel 1 gene (KCNIP1) is strongly associated with AF in Taiwanese populations (odds ratio=2.27 for insertion allele; P=6.23 × 10−24). KCNIP1 insertion is associated with higher KCNIP1 mRNA expression. KCNIP1-encoded protein potassium interacting channel 1 (KCHIP1) is physically associated with potassium Kv channels and modulates atrial transient outward current in cardiac myocytes. Overexpression of KCNIP1 results in inducible AF in zebrafish. In conclusions, a common CNV in KCNIP1 gene is a genetic predictor of AF risk possibly pointing to a functional pathway. PMID:26831368

  10. Copy Number Variation of KIR Genes Influences HIV-1 Control

    PubMed Central

    Shianna, Kevin V.; Feng, Sheng; Urban, Thomas J.; Ge, Dongliang; De Luca, Andrea; Martinez-Picado, Javier; Wolinsky, Steven M.; Martinson, Jeremy J.; Jamieson, Beth D.; Bream, Jay H.; Martin, Maureen P.; Borrow, Persephone; Letvin, Norman L.; McMichael, Andrew J.; Haynes, Barton F.; Telenti, Amalio; Carrington, Mary; Goldstein, David B.; Alter, Galit

    2011-01-01

    A genome-wide screen for large structural variants showed that a copy number variant (CNV) in the region encoding killer cell immunoglobulin-like receptors (KIR) associates with HIV-1 control as measured by plasma viral load at set point in individuals of European ancestry. This CNV encompasses the KIR3DL1-KIR3DS1 locus, encoding receptors that interact with specific HLA-Bw4 molecules to regulate the activation of lymphocyte subsets including natural killer (NK) cells. We quantified the number of copies of KIR3DS1 and KIR3DL1 in a large HIV-1 positive cohort, and showed that an increase in KIR3DS1 count associates with a lower viral set point if its putative ligand is present (p = 0.00028), as does an increase in KIR3DL1 count in the presence of KIR3DS1 and appropriate ligands for both receptors (p = 0.0015). We further provide functional data that demonstrate that NK cells from individuals with multiple copies of KIR3DL1, in the presence of KIR3DS1 and the appropriate ligands, inhibit HIV-1 replication more robustly, and associated with a significant expansion in the frequency of KIR3DS1+, but not KIR3DL1+, NK cells in their peripheral blood. Our results suggest that the relative amounts of these activating and inhibitory KIR play a role in regulating the peripheral expansion of highly antiviral KIR3DS1+ NK cells, which may determine differences in HIV-1 control following infection. PMID:22140359

  11. Time-Resolved Influences of Functional DAT1 and COMT Variants on Visual Perception and Post-Processing

    PubMed Central

    Bender, Stephan; Rellum, Thomas; Freitag, Christine; Resch, Franz; Rietschel, Marcella; Treutlein, Jens; Jennen-Steinmetz, Christine; Brandeis, Daniel; Banaschewski, Tobias; Laucht, Manfred

    2012-01-01

    Background Dopamine plays an important role in orienting and the regulation of selective attention to relevant stimulus characteristics. Thus, we examined the influences of functional variants related to dopamine inactivation in the dopamine transporter (DAT1) and catechol-O-methyltransferase genes (COMT) on the time-course of visual processing in a contingent negative variation (CNV) task. Methods 64-channel EEG recordings were obtained from 195 healthy adolescents of a community-based sample during a continuous performance task (A-X version). Early and late CNV as well as preceding visual evoked potential components were assessed. Results Significant additive main effects of DAT1 and COMT on the occipito-temporal early CNV were observed. In addition, there was a trend towards an interaction between the two polymorphisms. Source analysis showed early CNV generators in the ventral visual stream and in frontal regions. There was a strong negative correlation between occipito-temporal visual post-processing and the frontal early CNV component. The early CNV time interval 500–1000 ms after the visual cue was specifically affected while the preceding visual perception stages were not influenced. Conclusions Late visual potentials allow the genomic imaging of dopamine inactivation effects on visual post-processing. The same specific time-interval has been found to be affected by DAT1 and COMT during motor post-processing but not motor preparation. We propose the hypothesis that similar dopaminergic mechanisms modulate working memory encoding in both the visual and motor and perhaps other systems. PMID:22844499

  12. Time-resolved influences of functional DAT1 and COMT variants on visual perception and post-processing.

    PubMed

    Bender, Stephan; Rellum, Thomas; Freitag, Christine; Resch, Franz; Rietschel, Marcella; Treutlein, Jens; Jennen-Steinmetz, Christine; Brandeis, Daniel; Banaschewski, Tobias; Laucht, Manfred

    2012-01-01

    Dopamine plays an important role in orienting and the regulation of selective attention to relevant stimulus characteristics. Thus, we examined the influences of functional variants related to dopamine inactivation in the dopamine transporter (DAT1) and catechol-O-methyltransferase genes (COMT) on the time-course of visual processing in a contingent negative variation (CNV) task. 64-channel EEG recordings were obtained from 195 healthy adolescents of a community-based sample during a continuous performance task (A-X version). Early and late CNV as well as preceding visual evoked potential components were assessed. Significant additive main effects of DAT1 and COMT on the occipito-temporal early CNV were observed. In addition, there was a trend towards an interaction between the two polymorphisms. Source analysis showed early CNV generators in the ventral visual stream and in frontal regions. There was a strong negative correlation between occipito-temporal visual post-processing and the frontal early CNV component. The early CNV time interval 500-1000 ms after the visual cue was specifically affected while the preceding visual perception stages were not influenced. Late visual potentials allow the genomic imaging of dopamine inactivation effects on visual post-processing. The same specific time-interval has been found to be affected by DAT1 and COMT during motor post-processing but not motor preparation. We propose the hypothesis that similar dopaminergic mechanisms modulate working memory encoding in both the visual and motor and perhaps other systems.

  13. Whole exome sequencing is necessary to clarify ID/DD cases with de novo copy number variants of uncertain significance: Two proof-of-concept examples.

    PubMed

    Giorgio, Elisa; Ciolfi, Andrea; Biamino, Elisa; Caputo, Viviana; Di Gregorio, Eleonora; Belligni, Elga Fabia; Calcia, Alessandro; Gaidolfi, Elena; Bruselles, Alessandro; Mancini, Cecilia; Cavalieri, Simona; Molinatto, Cristina; Cirillo Silengo, Margherita; Ferrero, Giovanni Battista; Tartaglia, Marco; Brusco, Alfredo

    2016-07-01

    Whole exome sequencing (WES) is a powerful tool to identify clinically undefined forms of intellectual disability/developmental delay (ID/DD), especially in consanguineous families. Here we report the genetic definition of two sporadic cases, with syndromic ID/DD for whom array-Comparative Genomic Hybridization (aCGH) identified a de novo copy number variant (CNV) of uncertain significance. The phenotypes included microcephaly with brachycephaly and a distinctive facies in one proband, and hypotonia in the legs and mild ataxia in the other. WES allowed identification of a functionally relevant homozygous variant affecting a known disease gene for rare syndromic ID/DD in each proband, that is, c.1423C>T (p.Arg377*) in the Trafficking Protein Particle Complex 9 (TRAPPC9), and c.154T>C (p.Cys52Arg) in the Very Low Density Lipoprotein Receptor (VLDLR). Four mutations affecting TRAPPC9 have been previously reported, and the present finding further depicts this syndromic form of ID, which includes microcephaly with brachycephaly, corpus callosum hypoplasia, facial dysmorphism, and overweight. VLDLR-associated cerebellar hypoplasia (VLDLR-CH) is characterized by non-progressive congenital ataxia and moderate-to-profound intellectual disability. The c.154T>C (p.Cys52Arg) mutation was associated with a very mild form of ataxia, mild intellectual disability, and cerebellar hypoplasia without cortical gyri simplification. In conclusion, we report two novel cases with rare causes of autosomal recessive ID, which document how interpreting de novo array-CGH variants represents a challenge in consanguineous families; as such, clinical WES should be considered in diagnostic testing. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.

  14. Low α-defensin gene copy number increases the risk for IgA nephropathy and renal dysfunction.

    PubMed

    Ai, Zhen; Li, Ming; Liu, Wenting; Foo, Jia-Nee; Mansouri, Omniah; Yin, Peiran; Zhou, Qian; Tang, Xueqing; Dong, Xiuqing; Feng, Shaozhen; Xu, Ricong; Zhong, Zhong; Chen, Jian; Wan, Jianxin; Lou, Tanqi; Yu, Jianwen; Zhou, Qin; Fan, Jinjin; Mao, Haiping; Gale, Daniel; Barratt, Jonathan; Armour, John A L; Liu, Jianjun; Yu, Xueqing

    2016-06-29

    Although a major source of genetic variation, copy number variations (CNVs) and their involvement in disease development have not been well studied. Immunoglobulin A nephropathy (IgAN) is the most common primary glomerulonephritis worldwide. We performed association analysis of the DEFA1A3 CNV locus in two independent IgAN cohorts of southern Chinese Han (total of 1189 cases and 1187 controls). We discovered three independent copy number associations within the locus: DEFA1A3 [P = 3.99 × 10(-9); odds ratio (OR), 0.88], DEFA3 (P = 6.55 × 10(-5); OR, 0.82), and a noncoding deletion variant (211bp) (P = 3.50 × 10(-16); OR, 0.75) (OR per copy, fixed-effects meta-analysis). While showing strong association with an increased risk for IgAN (P = 9.56 × 10(-20)), low total copy numbers of the three variants also showed significant association with renal dysfunction in patients with IgAN (P = 0.03; hazards ratio, 3.69; after controlling for the effects of known prognostic factors) and also with increased serum IgA1 (P = 0.02) and galactose-deficient IgA1 (P = 0.03). For replication, we confirmed the associations of DEFA1A3 (P = 4.42 × 10(-4); OR, 0.82) and DEFA3 copy numbers (P = 4.30 × 10(-3); OR, 0.74) with IgAN in a Caucasian cohort (531 cases and 198 controls) and found the 211bp variant to be much rarer in Caucasians. We also observed an association of the 211bp copy number with membranous nephropathy (P = 1.11 × 10(-7); OR, 0.74; in 493 Chinese cases and 500 matched controls), but not with diabetic kidney disease (in 806 Chinese cases and 786 matched controls). By explaining 4.96% of disease risk and influencing renal dysfunction in patients with IgAN, the DEFA1A3 CNV locus may be a potential therapeutic target for developing treatments for this disease. Copyright © 2016, American Association for the Advancement of Science.

  15. CNV amplitude as a neural correlate for stuttering frequency: A case report of acquired stuttering.

    PubMed

    Vanhoutte, Sarah; Van Borsel, John; Cosyns, Marjan; Batens, Katja; van Mierlo, Pieter; Hemelsoet, Dimitri; Van Roost, Dirk; Corthals, Paul; De Letter, Miet; Santens, Patrick

    2014-11-01

    A neural hallmark of developmental stuttering is abnormal articulatory programming. One of the neurophysiological substrates of articulatory preparation is the contingent negative variation (CNV). Unfortunately, CNV tasks are rarely performed in persons who stutter and mainly focus on the effect of task variation rather than on interindividual variation in stutter related variables. However, variations in motor programming seem to be related to variation in stuttering frequency. The current study presents a case report of acquired stuttering following stroke and stroke related surgery in the left superior temporal gyrus. A speech related CNV task was administered at four points in time with differences in stuttering severity and frequency. Unexpectedly, CNV amplitudes at electrode sites approximating bilateral motor and left inferior frontal gyrus appeared to be inversely proportional to stuttering frequency. The higher the stuttering frequency, the lower the activity for articulatory preparation. Thus, the amount of disturbance in motor programming seems to determine stuttering frequency. At right frontal electrodes, a relative increase in CNV amplitude was seen at the test session with most severe stuttering. Right frontal overactivation is cautiously suggested to be a compensation strategy. In conclusion, late CNV amplitude elicited by a relatively simple speech task seems to be able to provide an objective, neural correlate of stuttering frequency. The present case report supports the hypothesis that motor preparation has an important role in stuttering. Copyright © 2014 Elsevier Ltd. All rights reserved.

  16. Use of next-generation sequencing to detect LDLR gene copy number variation in familial hypercholesterolemia[S

    PubMed Central

    Iacocca, Michael A.; Wang, Jian; Dron, Jacqueline S.; Robinson, John F.; McIntyre, Adam D.; Cao, Henian

    2017-01-01

    Familial hypercholesterolemia (FH) is a heritable condition of severely elevated LDL cholesterol, caused predominantly by autosomal codominant mutations in the LDL receptor gene (LDLR). In providing a molecular diagnosis for FH, the current procedure often includes targeted next-generation sequencing (NGS) panels for the detection of small-scale DNA variants, followed by multiplex ligation-dependent probe amplification (MLPA) in LDLR for the detection of whole-exon copy number variants (CNVs). The latter is essential because ∼10% of FH cases are attributed to CNVs in LDLR; accounting for them decreases false negative findings. Here, we determined the potential of replacing MLPA with bioinformatic analysis applied to NGS data, which uses depth-of-coverage analysis as its principal method to identify whole-exon CNV events. In analysis of 388 FH patient samples, there was 100% concordance in LDLR CNV detection between these two methods: 38 reported CNVs identified by MLPA were also successfully detected by our NGS method, while 350 samples negative for CNVs by MLPA were also negative by NGS. This result suggests that MLPA can be removed from the routine diagnostic screening for FH, significantly reducing associated costs, resources, and analysis time, while promoting more widespread assessment of this important class of mutations across diagnostic laboratories. PMID:28874442

  17. Developing a Risk-scoring Model for Ankylosing Spondylitis Based on a Combination of HLA-B27, Single-nucleotide Polymorphism, and Copy Number Variant Markers.

    PubMed

    Jung, Seung-Hyun; Cho, Sung-Min; Yim, Seon-Hee; Kim, So-Hee; Park, Hyeon-Chun; Cho, Mi-La; Shim, Seung-Cheol; Kim, Tae-Hwan; Park, Sung-Hwan; Chung, Yeun-Jun

    2016-12-01

    To develop a genotype-based ankylosing spondylitis (AS) risk prediction model that is more sensitive and specific than HLA-B27 typing. To develop the AS genetic risk scoring (AS-GRS) model, 648 individuals (285 cases and 363 controls) were examined for 5 copy number variants (CNV), 7 single-nucleotide polymorphisms (SNP), and an HLA-B27 marker by TaqMan assays. The AS-GRS model was developed using logistic regression and validated with a larger independent set (576 cases and 680 controls). Through logistic regression, we built the AS-GRS model consisting of 5 genetic components: HLA-B27, 3 CNV (1q32.2, 13q13.1, and 16p13.3), and 1 SNP (rs10865331). All significant associations of genetic factors in the model were replicated in the independent validation set. The discriminative ability of the AS-GRS model measured by the area under the curve was excellent: 0.976 (95% CI 0.96-0.99) in the model construction set and 0.951 (95% CI 0.94-0.96) in the validation set. The AS-GRS model showed higher specificity and accuracy than the HLA-B27-only model when the sensitivity was set to over 94%. When we categorized the individuals into quartiles based on the AS-GRS scores, OR of the 4 groups (low, intermediate-1, intermediate-2, and high risk) showed an increasing trend with the AS-GRS scores (r 2 = 0.950) and the highest risk group showed a 494× higher risk of AS than the lowest risk group (95% CI 237.3-1029.1). Our AS-GRS could be used to identify individuals at high risk for AS before major symptoms appear, which may improve the prognosis for them through early treatment.

  18. Novel genes involved in severe early-onset obesity revealed by rare copy number and sequence variants

    PubMed Central

    Flores, Raquel; González, Juan R.; Argente, Jesús; Pérez-Jurado, Luis A.

    2017-01-01

    Obesity is a multifactorial disorder with high heritability (50–75%), which is probably higher in early-onset and severe cases. Although rare monogenic forms and several genes and regions of susceptibility, including copy number variants (CNVs), have been described, the genetic causes underlying the disease still remain largely unknown. We searched for rare CNVs (>100kb in size, altering genes and present in <1/2000 population controls) in 157 Spanish children with non-syndromic early-onset obesity (EOO: body mass index >3 standard deviations above the mean at <3 years of age) using SNP array molecular karyotypes. We then performed case control studies (480 EOO cases/480 non-obese controls) with the validated CNVs and rare sequence variants (RSVs) detected by targeted resequencing of selected CNV genes (n = 14), and also studied the inheritance patterns in available first-degree relatives. A higher burden of gain-type CNVs was detected in EOO cases versus controls (OR = 1.71, p-value = 0.0358). In addition to a gain of the NPY gene in a familial case with EOO and attention deficit hyperactivity disorder, likely pathogenic CNVs included gains of glutamate receptors (GRIK1, GRM7) and the X-linked gastrin-peptide receptor (GRPR), all inherited from obese parents. Putatively functional RSVs absent in controls were also identified in EOO cases at NPY, GRIK1 and GRPR. A patient with a heterozygous deletion disrupting two contiguous and related genes, SLCO4C1 and SLCO6A1, also had a missense RSV at SLCO4C1 on the other allele, suggestive of a recessive model. The genes identified showed a clear enrichment of shared co-expression partners with known genes strongly related to obesity, reinforcing their role in the pathophysiology of the disease. Our data reveal a higher burden of rare CNVs and RSVs in several related genes in patients with EOO compared to controls, and implicate NPY, GRPR, two glutamate receptors and SLCO4C1 in highly penetrant forms of familial obesity

  19. Novel genes involved in severe early-onset obesity revealed by rare copy number and sequence variants.

    PubMed

    Serra-Juhé, Clara; Martos-Moreno, Gabriel Á; Bou de Pieri, Francesc; Flores, Raquel; González, Juan R; Rodríguez-Santiago, Benjamín; Argente, Jesús; Pérez-Jurado, Luis A

    2017-05-01

    Obesity is a multifactorial disorder with high heritability (50-75%), which is probably higher in early-onset and severe cases. Although rare monogenic forms and several genes and regions of susceptibility, including copy number variants (CNVs), have been described, the genetic causes underlying the disease still remain largely unknown. We searched for rare CNVs (>100kb in size, altering genes and present in <1/2000 population controls) in 157 Spanish children with non-syndromic early-onset obesity (EOO: body mass index >3 standard deviations above the mean at <3 years of age) using SNP array molecular karyotypes. We then performed case control studies (480 EOO cases/480 non-obese controls) with the validated CNVs and rare sequence variants (RSVs) detected by targeted resequencing of selected CNV genes (n = 14), and also studied the inheritance patterns in available first-degree relatives. A higher burden of gain-type CNVs was detected in EOO cases versus controls (OR = 1.71, p-value = 0.0358). In addition to a gain of the NPY gene in a familial case with EOO and attention deficit hyperactivity disorder, likely pathogenic CNVs included gains of glutamate receptors (GRIK1, GRM7) and the X-linked gastrin-peptide receptor (GRPR), all inherited from obese parents. Putatively functional RSVs absent in controls were also identified in EOO cases at NPY, GRIK1 and GRPR. A patient with a heterozygous deletion disrupting two contiguous and related genes, SLCO4C1 and SLCO6A1, also had a missense RSV at SLCO4C1 on the other allele, suggestive of a recessive model. The genes identified showed a clear enrichment of shared co-expression partners with known genes strongly related to obesity, reinforcing their role in the pathophysiology of the disease. Our data reveal a higher burden of rare CNVs and RSVs in several related genes in patients with EOO compared to controls, and implicate NPY, GRPR, two glutamate receptors and SLCO4C1 in highly penetrant forms of familial obesity.

  20. Genome wide analysis reveals single nucleotide polymorphisms associated with fatness and putative novel copy number variants in three pig breeds

    PubMed Central

    2013-01-01

    Background Obesity, excess fat tissue in the body, can underlie a variety of medical complaints including heart disease, stroke and cancer. The pig is an excellent model organism for the study of various human disorders, including obesity, as well as being the foremost agricultural species. In order to identify genetic variants associated with fatness, we used a selective genomic approach sampling DNA from animals at the extreme ends of the fat and lean spectrum using estimated breeding values derived from a total population size of over 70,000 animals. DNA from 3 breeds (Sire Line Large White, Duroc and a white Pietrain composite line (Titan)) was used to interrogate the Illumina Porcine SNP60 Genotyping Beadchip in order to identify significant associations in terms of single nucleotide polymorphisms (SNPs) and copy number variants (CNVs). Results By sampling animals at each end of the fat/lean EBV (estimate breeding value) spectrum the whole population could be assessed using less than 300 animals, without losing statistical power. Indeed, several significant SNPs (at the 5% genome wide significance level) were discovered, 4 of these linked to genes with ontologies that had previously been correlated with fatness (NTS, FABP6, SST and NR3C2). Quantitative analysis of the data identified putative CNV regions containing genes whose ontology suggested fatness related functions (MCHR1, PPARα, SLC5A1 and SLC5A4). Conclusions Selective genotyping of EBVs at either end of the phenotypic spectrum proved to be a cost effective means of identifying SNPs and CNVs associated with fatness and with estimated major effects in a large population of animals. PMID:24225222

  1. Analysis of genome-wide copy number variations in Chinese indigenous and western pig breeds by 60 K SNP genotyping arrays.

    PubMed

    Wang, Yanan; Tang, Zhonglin; Sun, Yaqi; Wang, Hongyang; Wang, Chao; Yu, Shaobo; Liu, Jing; Zhang, Yu; Fan, Bin; Li, Kui; Liu, Bang

    2014-01-01

    Copy number variations (CNVs) represent a substantial source of structural variants in mammals and contribute to both normal phenotypic variability and disease susceptibility. Although low-resolution CNV maps are produced in many domestic animals, and several reports have been published about the CNVs of porcine genome, the differences between Chinese and western pigs still remain to be elucidated. In this study, we used Porcine SNP60 BeadChip and PennCNV algorithm to perform a genome-wide CNV detection in 302 individuals from six Chinese indigenous breeds (Tongcheng, Laiwu, Luchuan, Bama, Wuzhishan and Ningxiang pigs), three western breeds (Yorkshire, Landrace and Duroc) and one hybrid (Tongcheng×Duroc). A total of 348 CNV Regions (CNVRs) across genome were identified, covering 150.49 Mb of the pig genome or 6.14% of the autosomal genome sequence. In these CNVRs, 213 CNVRs were found to exist only in the six Chinese indigenous breeds, and 60 CNVRs only in the three western breeds. The characters of CNVs in four Chinese normal size breeds (Luchuan, Tongcheng and Laiwu pigs) and two minipig breeds (Bama and Wuzhishan pigs) were also analyzed in this study. Functional annotation suggested that these CNVRs possess a great variety of molecular function and may play important roles in phenotypic and production traits between Chinese and western breeds. Our results are important complementary to the CNV map in pig genome, which provide new information about the diversity of Chinese and western pig breeds, and facilitate further research on porcine genome CNVs.

  2. Analysis of Genome-Wide Copy Number Variations in Chinese Indigenous and Western Pig Breeds by 60 K SNP Genotyping Arrays

    PubMed Central

    Sun, Yaqi; Wang, Hongyang; Wang, Chao; Yu, Shaobo; Liu, Jing; Zhang, Yu; Fan, Bin; Li, Kui; Liu, Bang

    2014-01-01

    Copy number variations (CNVs) represent a substantial source of structural variants in mammals and contribute to both normal phenotypic variability and disease susceptibility. Although low-resolution CNV maps are produced in many domestic animals, and several reports have been published about the CNVs of porcine genome, the differences between Chinese and western pigs still remain to be elucidated. In this study, we used Porcine SNP60 BeadChip and PennCNV algorithm to perform a genome-wide CNV detection in 302 individuals from six Chinese indigenous breeds (Tongcheng, Laiwu, Luchuan, Bama, Wuzhishan and Ningxiang pigs), three western breeds (Yorkshire, Landrace and Duroc) and one hybrid (Tongcheng×Duroc). A total of 348 CNV Regions (CNVRs) across genome were identified, covering 150.49 Mb of the pig genome or 6.14% of the autosomal genome sequence. In these CNVRs, 213 CNVRs were found to exist only in the six Chinese indigenous breeds, and 60 CNVRs only in the three western breeds. The characters of CNVs in four Chinese normal size breeds (Luchuan, Tongcheng and Laiwu pigs) and two minipig breeds (Bama and Wuzhishan pigs) were also analyzed in this study. Functional annotation suggested that these CNVRs possess a great variety of molecular function and may play important roles in phenotypic and production traits between Chinese and western breeds. Our results are important complementary to the CNV map in pig genome, which provide new information about the diversity of Chinese and western pig breeds, and facilitate further research on porcine genome CNVs. PMID:25198154

  3. PSE-HMM: genome-wide CNV detection from NGS data using an HMM with Position-Specific Emission probabilities.

    PubMed

    Malekpour, Seyed Amir; Pezeshk, Hamid; Sadeghi, Mehdi

    2016-11-03

    Copy Number Variation (CNV) is envisaged to be a major source of large structural variations in the human genome. In recent years, many studies apply Next Generation Sequencing (NGS) data for the CNV detection. However, still there is a necessity to invent more accurate computational tools. In this study, mate pair NGS data are used for the CNV detection in a Hidden Markov Model (HMM). The proposed HMM has position specific emission probabilities, i.e. a Gaussian mixture distribution. Each component in the Gaussian mixture distribution captures a different type of aberration that is observed in the mate pairs, after being mapped to the reference genome. These aberrations may include any increase (decrease) in the insertion size or change in the direction of mate pairs that are mapped to the reference genome. This HMM with Position-Specific Emission probabilities (PSE-HMM) is utilized for the genome-wide detection of deletions and tandem duplications. The performance of PSE-HMM is evaluated on a simulated dataset and also on a real data of a Yoruban HapMap individual, NA18507. PSE-HMM is effective in taking observation dependencies into account and reaches a high accuracy in detecting genome-wide CNVs. MATLAB programs are available at http://bs.ipm.ir/softwares/PSE-HMM/ .

  4. Allele-specific copy-number discovery from whole-genome and whole-exome sequencing

    PubMed Central

    Wang, WeiBo; Wang, Wei; Sun, Wei; Crowley, James J.; Szatkiewicz, Jin P.

    2015-01-01

    Copy-number variants (CNVs) are a major form of genetic variation and a risk factor for various human diseases, so it is crucial to accurately detect and characterize them. It is conceivable that allele-specific reads from high-throughput sequencing data could be leveraged to both enhance CNV detection and produce allele-specific copy number (ASCN) calls. Although statistical methods have been developed to detect CNVs using whole-genome sequence (WGS) and/or whole-exome sequence (WES) data, information from allele-specific read counts has not yet been adequately exploited. In this paper, we develop an integrated method, called AS-GENSENG, which incorporates allele-specific read counts in CNV detection and estimates ASCN using either WGS or WES data. To evaluate the performance of AS-GENSENG, we conducted extensive simulations, generated empirical data using existing WGS and WES data sets and validated predicted CNVs using an independent methodology. We conclude that AS-GENSENG not only predicts accurate ASCN calls but also improves the accuracy of total copy number calls, owing to its unique ability to exploit information from both total and allele-specific read counts while accounting for various experimental biases in sequence data. Our novel, user-friendly and computationally efficient method and a complete analytic protocol is freely available at https://sourceforge.net/projects/asgenseng/. PMID:25883151

  5. Copy Number Variation in the Horse Genome

    PubMed Central

    Ghosh, Sharmila; Qu, Zhipeng; Das, Pranab J.; Fang, Erica; Juras, Rytis; Cothran, E. Gus; McDonell, Sue; Kenney, Daniel G.; Lear, Teri L.; Adelson, David L.; Chowdhary, Bhanu P.; Raudsepp, Terje

    2014-01-01

    We constructed a 400K WG tiling oligoarray for the horse and applied it for the discovery of copy number variations (CNVs) in 38 normal horses of 16 diverse breeds, and the Przewalski horse. Probes on the array represented 18,763 autosomal and X-linked genes, and intergenic, sub-telomeric and chrY sequences. We identified 258 CNV regions (CNVRs) across all autosomes, chrX and chrUn, but not in chrY. CNVs comprised 1.3% of the horse genome with chr12 being most enriched. American Miniature horses had the highest and American Quarter Horses the lowest number of CNVs in relation to Thoroughbred reference. The Przewalski horse was similar to native ponies and draft breeds. The majority of CNVRs involved genes, while 20% were located in intergenic regions. Similar to previous studies in horses and other mammals, molecular functions of CNV-associated genes were predominantly in sensory perception, immunity and reproduction. The findings were integrated with previous studies to generate a composite genome-wide dataset of 1476 CNVRs. Of these, 301 CNVRs were shared between studies, while 1174 were novel and require further validation. Integrated data revealed that to date, 41 out of over 400 breeds of the domestic horse have been analyzed for CNVs, of which 11 new breeds were added in this study. Finally, the composite CNV dataset was applied in a pilot study for the discovery of CNVs in 6 horses with XY disorders of sexual development. A homozygous deletion involving AKR1C gene cluster in chr29 in two affected horses was considered possibly causative because of the known role of AKR1C genes in testicular androgen synthesis and sexual development. While the findings improve and integrate the knowledge of CNVs in horses, they also show that for effective discovery of variants of biomedical importance, more breeds and individuals need to be analyzed using comparable methodological approaches. PMID:25340504

  6. Use of next-generation sequencing to detect LDLR gene copy number variation in familial hypercholesterolemia.

    PubMed

    Iacocca, Michael A; Wang, Jian; Dron, Jacqueline S; Robinson, John F; McIntyre, Adam D; Cao, Henian; Hegele, Robert A

    2017-11-01

    Familial hypercholesterolemia (FH) is a heritable condition of severely elevated LDL cholesterol, caused predominantly by autosomal codominant mutations in the LDL receptor gene ( LDLR ). In providing a molecular diagnosis for FH, the current procedure often includes targeted next-generation sequencing (NGS) panels for the detection of small-scale DNA variants, followed by multiplex ligation-dependent probe amplification (MLPA) in LDLR for the detection of whole-exon copy number variants (CNVs). The latter is essential because ∼10% of FH cases are attributed to CNVs in LDLR ; accounting for them decreases false negative findings. Here, we determined the potential of replacing MLPA with bioinformatic analysis applied to NGS data, which uses depth-of-coverage analysis as its principal method to identify whole-exon CNV events. In analysis of 388 FH patient samples, there was 100% concordance in LDLR CNV detection between these two methods: 38 reported CNVs identified by MLPA were also successfully detected by our NGS method, while 350 samples negative for CNVs by MLPA were also negative by NGS. This result suggests that MLPA can be removed from the routine diagnostic screening for FH, significantly reducing associated costs, resources, and analysis time, while promoting more widespread assessment of this important class of mutations across diagnostic laboratories. Copyright © 2017 by the American Society for Biochemistry and Molecular Biology, Inc.

  7. Complex and multi-allelic copy number variation in human disease

    PubMed Central

    McCarroll, Steven A.

    2015-01-01

    Hundreds of copy number variants are complex and multi-allelic, in that they have many structural alleles and have rearranged multiple times in the ancestors who contributed chromosomes to current humans. Not only are the relationships of these multi-allelic CNVs (mCNVs) to phenotypes generally unknown, but many mCNVs have not yet been described at the basic levels—alleles, allele frequencies, structural features—that support genetic investigation. To date, most reported disease associations to these variants have been ascertained through candidate gene studies. However, only a few associations have reached the level of acceptance defined by durable replications in many cohorts. This likely stems from longstanding challenges in making precise molecular measurements of the alleles individuals have at these loci. However, approaches for mCNV analysis are improving quickly, and some of the unique characteristics of mCNVs may assist future association studies. Their various structural alleles are likely to have different magnitudes of effect, creating a natural allelic series of growing phenotypic impact and giving investigators a set of natural predictions and testable hypotheses about the extent to which each allele of an mCNV predisposes to a phenotype. Also, mCNVs’ low-to-modest correlation to individual single-nucleotide polymorphisms (SNPs) may make it easier to distinguish between mCNVs and nearby SNPs as the drivers of an association signal, and perhaps, make it possible to preliminarily screen candidate loci, or the entire genome, for the many mCNV–disease relationships that remain to be discovered. PMID:26163405

  8. A Novel Center Star Multiple Sequence Alignment Algorithm Based on Affine Gap Penalty and K-Band

    NASA Astrophysics Data System (ADS)

    Zou, Quan; Shan, Xiao; Jiang, Yi

    Multiple sequence alignment is one of the most important topics in computational biology, but it cannot deal with the large data so far. As the development of copy-number variant(CNV) and Single Nucleotide Polymorphisms(SNP) research, many researchers want to align numbers of similar sequences for detecting CNV and SNP. In this paper, we propose a novel multiple sequence alignment algorithm based on affine gap penalty and k-band. It can align more quickly and accurately, that will be helpful for mining CNV and SNP. Experiments prove the performance of our algorithm.

  9. Data-driven approach to detect common copy-number variations and frequency profiles in a population-based Korean cohort.

    PubMed

    Moon, Sanghoon; Kim, Young Jin; Hong, Chang Bum; Kim, Dong-Joon; Lee, Jong-Young; Kim, Bong-Jo

    2011-11-01

    To date, hundreds of thousands of copy-number variation (CNV) data have been reported using various platforms. The proportion of Asians in these data is, however, relatively small as compared with that of other ethnic groups, such as Caucasians and Yorubas. Because of limitations in platform resolution and the high noise level in signal intensity, in most CNV studies (particularly those using single nucleotide polymorphism arrays), the average number of CNVs in an individual is less than the number of known CNVs. In this study, we ascertained reliable, common CNV regions (CNVRs) and identified actual frequency rates in the Korean population to provide more CNV information. We performed two-stage analyses for detecting structural variations with two platforms. We discovered 576 common CNVRs (88 CNV segments on average in an individual), and 87% (501 of 576) of these CNVRs overlapped by ≥1 bp with previously validated CNV events. Interestingly, from the frequency analysis of CNV profiles, 52 of 576 CNVRs had a frequency rate of <1% in the 8842 individuals. Compared with other common CNV studies, this study found six common CNVRs that were not reported in previous CNV studies. In conclusion, we propose the data-driven detection approach to discover common CNVRs including those of unreported in the previous Korean CNV study while minimizing false positives. Through our approach, we successfully discovered more common CNVRs than previous Korean CNV study and conducted frequency analysis. These results will be a valuable resource for the effective level of CNVs in the Korean population.

  10. Contingent negative variation (CNV) associated with sensorimotor timing error correction.

    PubMed

    Jang, Joonyong; Jones, Myles; Milne, Elizabeth; Wilson, Daniel; Lee, Kwang-Hyuk

    2016-02-15

    Detection and subsequent correction of sensorimotor timing errors are fundamental to adaptive behavior. Using scalp-recorded event-related potentials (ERPs), we sought to find ERP components that are predictive of error correction performance during rhythmic movements. Healthy right-handed participants were asked to synchronize their finger taps to a regular tone sequence (every 600 ms), while EEG data were continuously recorded. Data from 15 participants were analyzed. Occasional irregularities were built into stimulus presentation timing: 90 ms before (advances: negative shift) or after (delays: positive shift) the expected time point. A tapping condition alternated with a listening condition in which identical stimulus sequence was presented but participants did not tap. Behavioral error correction was observed immediately following a shift, with a degree of over-correction with positive shifts. Our stimulus-locked ERP data analysis revealed, 1) increased auditory N1 amplitude for the positive shift condition and decreased auditory N1 modulation for the negative shift condition; and 2) a second enhanced negativity (N2) in the tapping positive condition, compared with the tapping negative condition. In response-locked epochs, we observed a CNV (contingent negative variation)-like negativity with earlier latency in the tapping negative condition compared with the tapping positive condition. This CNV-like negativity peaked at around the onset of subsequent tapping, with the earlier the peak, the better the error correction performance with the negative shifts while the later the peak, the better the error correction performance with the positive shifts. This study showed that the CNV-like negativity was associated with the error correction performance during our sensorimotor synchronization study. Auditory N1 and N2 were differentially involved in negative vs. positive error correction. However, we did not find evidence for their involvement in behavioral error

  11. Environmental change drives accelerated adaptation through stimulated copy number variation

    PubMed Central

    Hull, Ryan M.; Cruz, Cristina; Jack, Carmen V.

    2017-01-01

    Copy number variation (CNV) is rife in eukaryotic genomes and has been implicated in many human disorders, particularly cancer, in which CNV promotes both tumorigenesis and chemotherapy resistance. CNVs are considered random mutations but often arise through replication defects; transcription can interfere with replication fork progression and stability, leading to increased mutation rates at highly transcribed loci. Here we investigate whether inducible promoters can stimulate CNV to yield reproducible, environment-specific genetic changes. We propose a general mechanism for environmentally-stimulated CNV and validate this mechanism for the emergence of copper resistance in budding yeast. By analysing a large cohort of individual cells, we directly demonstrate that CNV of the copper-resistance gene CUP1 is stimulated by environmental copper. CNV stimulation accelerates the formation of novel alleles conferring enhanced copper resistance, such that copper exposure actively drives adaptation to copper-rich environments. Furthermore, quantification of CNV in individual cells reveals remarkable allele selectivity in the rate at which specific environments stimulate CNV. We define the key mechanistic elements underlying this selectivity, demonstrating that CNV is regulated by both promoter activity and acetylation of histone H3 lysine 56 (H3K56ac) and that H3K56ac is required for CUP1 CNV and efficient copper adaptation. Stimulated CNV is not limited to high-copy CUP1 repeat arrays, as we find that H3K56ac also regulates CNV in 3 copy arrays of CUP1 or SFA1 genes. The impact of transcription on DNA damage is well understood, but our research reveals that this apparently problematic association forms a pathway by which mutations can be directed to particular loci in particular environments and furthermore that this mutagenic process can be regulated through histone acetylation. Stimulated CNV therefore represents an unanticipated and remarkably controllable pathway

  12. Prediction of Response to Therapy and Clinical Outcome through a Pilot Study of Complete Genetic Assessment of Ovarian Cancer

    DTIC Science & Technology

    2015-12-01

    Oncology program supported by this grant consented patients to 11-104. OncoPanel is a cancer genomic assay that detects somatic mutations, copy number...KMT2D, EP300, FANCD2 Sertoli Leydig cell DICER1 Copy number variants: In addition, 219 patients were analyzed for copy-number variations ( CNV ) in...OncoPanel genes. >12,000 total CNV were reported in the cohort (Figure 2). Single- copy deletions (n=5558) and copy-number gains (low amplification) (n

  13. Caged Naloxone: Synthesis, Characterization, and Stability of 3- O-(4,5-Dimethoxy-2-nitrophenyl)carboxymethyl Naloxone (CNV-NLX).

    PubMed

    Lewin, Anita H; Fix, Scott E; Zhong, Desong; Mayer, Louise D; Burgess, Jason P; Mascarella, S Wayne; Reddy, P Anantha; Seltzman, Herbert H; Carroll, F Ivy

    2018-03-21

    The photolabile analogue of the broad-spectrum opioid antagonist naloxone, 3- O-(4,5-dimethoxy-2-nitrophenyl)carboxymethyl naloxone (also referred to as "caged naloxone", 3- O-(α-carboxy-6-nitroveratryl)naloxone, CNV-NLX), has been found to be a valuable biochemical probe. While the synthesis of CNV-NLX is simple, its characterization is complicated by the fact that it is produced as a mixture of α R,5 R,9 R,13 S,14 S and α S,5 R,9 R,13 S,14 S diastereomers. Using long-range and heteronuclear NMR correlations, the 1 H NMR and 13 C NMR resonances of both diastereomers have been fully assigned, confirming the structures. Monitoring of solutions of CNV-NLX in saline buffer, in methanol, and in DMSO has shown CNV-NLX to be stable for over a week under fluorescent laboratory lights at room temperature. Exposure of such solutions to λ 365 nm from a hand-held UV lamp led to the formation of naloxone and CNV-related breakdown products.

  14. A remark on copy number variation detection methods.

    PubMed

    Li, Shuo; Dou, Xialiang; Gao, Ruiqi; Ge, Xinzhou; Qian, Minping; Wan, Lin

    2018-01-01

    Copy number variations (CNVs) are gain and loss of DNA sequence of a genome. High throughput platforms such as microarrays and next generation sequencing technologies (NGS) have been applied for genome wide copy number losses. Although progress has been made in both approaches, the accuracy and consistency of CNV calling from the two platforms remain in dispute. In this study, we perform a deep analysis on copy number losses on 254 human DNA samples, which have both SNP microarray data and NGS data publicly available from Hapmap Project and 1000 Genomes Project respectively. We show that the copy number losses reported from Hapmap Project and 1000 Genome Project only have < 30% overlap, while these reports are required to have cross-platform (e.g. PCR, microarray and high-throughput sequencing) experimental supporting by their corresponding projects, even though state-of-art calling methods were employed. On the other hand, copy number losses are found directly from HapMap microarray data by an accurate algorithm, i.e. CNVhac, almost all of which have lower read mapping depth in NGS data; furthermore, 88% of which can be supported by the sequences with breakpoint in NGS data. Our results suggest the ability of microarray calling CNVs and the possible introduction of false negatives from the unessential requirement of the additional cross-platform supporting. The inconsistency of CNV reports from Hapmap Project and 1000 Genomes Project might result from the inadequate information containing in microarray data, the inconsistent detection criteria, or the filtration effect of cross-platform supporting. The statistical test on CNVs called from CNVhac show that the microarray data can offer reliable CNV reports, and majority of CNV candidates can be confirmed by raw sequences. Therefore, the CNV candidates given by a good caller could be highly reliable without cross-platform supporting, so additional experimental information should be applied in need instead of

  15. Allele-specific copy-number discovery from whole-genome and whole-exome sequencing.

    PubMed

    Wang, WeiBo; Wang, Wei; Sun, Wei; Crowley, James J; Szatkiewicz, Jin P

    2015-08-18

    Copy-number variants (CNVs) are a major form of genetic variation and a risk factor for various human diseases, so it is crucial to accurately detect and characterize them. It is conceivable that allele-specific reads from high-throughput sequencing data could be leveraged to both enhance CNV detection and produce allele-specific copy number (ASCN) calls. Although statistical methods have been developed to detect CNVs using whole-genome sequence (WGS) and/or whole-exome sequence (WES) data, information from allele-specific read counts has not yet been adequately exploited. In this paper, we develop an integrated method, called AS-GENSENG, which incorporates allele-specific read counts in CNV detection and estimates ASCN using either WGS or WES data. To evaluate the performance of AS-GENSENG, we conducted extensive simulations, generated empirical data using existing WGS and WES data sets and validated predicted CNVs using an independent methodology. We conclude that AS-GENSENG not only predicts accurate ASCN calls but also improves the accuracy of total copy number calls, owing to its unique ability to exploit information from both total and allele-specific read counts while accounting for various experimental biases in sequence data. Our novel, user-friendly and computationally efficient method and a complete analytic protocol is freely available at https://sourceforge.net/projects/asgenseng/. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  16. Genetic association study identifies a functional CNV in the WWOX gene contributes to the risk of intracranial aneurysms.

    PubMed

    Fan, Jin; Sun, Wen; Lin, Min; Yu, Ke; Wang, Jian; Duan, Dan; Zheng, Bo; Yang, Zhenghui; Wang, Qingsong

    2016-03-29

    Intracranial aneurysms (IAs) accounts for 85% of hemorrhagic stroke. Genetic factors have been known to play an important role in the development of IAs. A functional CNV (CNV-67048) of human WW domain-containing oxidoreductase (WWOX), which has been identified as a tumor suppressor gene in multiple cancers, was identified to be associated with gliomas risk previously. Here, we hypothesized that the CNV-67048 could also affect susceptibility of IAs. Based on a two-stage, case- control study with a total of 976 patients of IAs and 1,200 matched healthy controls, we found the effect size for per copy deletion was 1.35 (95% CI = 1.16-1.57; Ptrend = 1.18 × 10-4). Compared with the individuals having no deletion, significantly higher risk of IAs was detected for both subjects carrying 1 copy deletion (OR = 1.24, 95% CI = 1.02-1.52) and subjects carrying 2 copy deletion (OR = 1.77, 95% CI = 1.24-2.53). Real-time PCR was used to confirm the abnormal expression of WWOX in tissues of IA patients and influence of genotypes of CNV-67048. The expression level of WWOX in IA tissues was significantly lower than that in corresponding normal tissues (P = 0.004), and the deletion genotypes of CNV-67048 have lower WWOX mRNA levels in both tumor tissues and border tissues (P < 0.01). Our data suggests that the deletion genotypes of CNV-67048 in WWOX predispose their carriers to IAs, which might be a genetic biomarker to predict risk of IAs in Chinese.

  17. Bamgineer: Introduction of simulated allele-specific copy number variants into exome and targeted sequence data sets.

    PubMed

    Samadian, Soroush; Bruce, Jeff P; Pugh, Trevor J

    2018-03-01

    Somatic copy number variations (CNVs) play a crucial role in development of many human cancers. The broad availability of next-generation sequencing data has enabled the development of algorithms to computationally infer CNV profiles from a variety of data types including exome and targeted sequence data; currently the most prevalent types of cancer genomics data. However, systemic evaluation and comparison of these tools remains challenging due to a lack of ground truth reference sets. To address this need, we have developed Bamgineer, a tool written in Python to introduce user-defined haplotype-phased allele-specific copy number events into an existing Binary Alignment Mapping (BAM) file, with a focus on targeted and exome sequencing experiments. As input, this tool requires a read alignment file (BAM format), lists of non-overlapping genome coordinates for introduction of gains and losses (bed file), and an optional file defining known haplotypes (vcf format). To improve runtime performance, Bamgineer introduces the desired CNVs in parallel using queuing and parallel processing on a local machine or on a high-performance computing cluster. As proof-of-principle, we applied Bamgineer to a single high-coverage (mean: 220X) exome sequence file from a blood sample to simulate copy number profiles of 3 exemplar tumors from each of 10 tumor types at 5 tumor cellularity levels (20-100%, 150 BAM files in total). To demonstrate feasibility beyond exome data, we introduced read alignments to a targeted 5-gene cell-free DNA sequencing library to simulate EGFR amplifications at frequencies consistent with circulating tumor DNA (10, 1, 0.1 and 0.01%) while retaining the multimodal insert size distribution of the original data. We expect Bamgineer to be of use for development and systematic benchmarking of CNV calling algorithms by users using locally-generated data for a variety of applications. The source code is freely available at http://github.com/pughlab/bamgineer.

  18. A comprehensive profile of DNA copy number variations in a Korean population: identification of copy number invariant regions among Koreans.

    PubMed

    Jeon, Jae Pil; Shim, Sung Mi; Jung, Jong Sun; Nam, Hye Young; Lee, Hye Jin; Oh, Berm Seok; Kim, Kuchan; Kim, Hyung Lae; Han, Bok Ghee

    2009-09-30

    To examine copy number variations among the Korean population, we compared individual genomes with the Korean reference genome assembly using the publicly available Korean HapMap SNP 50 k chip data from 90 individuals. Korean individuals exhibited 123 copy number variation regions (CNVRs) covering 27.2 mb, equivalent to 1.0% of the genome in the copy number variation (CNV) analysis using the combined criteria of P value (P<0.01) and standard deviation of copy numbers (SD>or= 0.25) among study subjects. In contrast, when compared to the Affymetrix reference genome assembly from multiple ethnic groups, considerably more CNVRs (n=643) were detected in larger proportions (5.0%) of the genome covering 135.1 mb even by more stringent criteria (P<0.001 and SD>or=0.25), reflecting ethnic diversity of structural variations between Korean and other populations. Some CNVRs were validated by the quantitative multiplex PCR of short fluorescent fragment (QMPSF) method, and then copy number invariant regions were detected among the study subjects. These copy number invariant regions would be used as good internal controls for further CNV studies. Lastly, we demonstrated that the CNV information could stratify even a single ethnic population with a proper reference genome assembly from multiple heterogeneous populations.

  19. Trial-by-trial fluctuations in CNV amplitude reflect anticipatory adjustment of response caution.

    PubMed

    Boehm, Udo; van Maanen, Leendert; Forstmann, Birte; van Rijn, Hedderik

    2014-08-01

    The contingent negative variation, a slow cortical potential, occurs when humans are warned by a stimulus about an upcoming task. The cognitive processes that give rise to this EEG potential are not yet well understood. To explain these processes, we adopt a recently developed theoretical framework from the area of perceptual decision-making. This framework assumes that the basal ganglia control the tradeoff between fast and accurate decision-making in the cortex. It suggests that an increase in cortical excitability serves to lower response caution, which results in faster but more error prone responding. We propose that the CNV reflects this increased cortical excitability. To test this hypothesis, we conducted an EEG experiment in which participants performed the random dot motion task either under speed or under accuracy stress. Our results show that trial-by-trial fluctuations in participants' response speed as well as model-based estimates of response caution correlated with single-trial CNV amplitude under conditions of speed but not accuracy stress. We conclude that the CNV might reflect adjustments of response caution, which serves to enhance quick decision-making. Copyright © 2014 Elsevier Inc. All rights reserved.

  20. Distribution, functional impact, and origin mechanisms of copy number variation in the barley genome

    PubMed Central

    2013-01-01

    Background There is growing evidence for the prevalence of copy number variation (CNV) and its role in phenotypic variation in many eukaryotic species. Here we use array comparative genomic hybridization to explore the extent of this type of structural variation in domesticated barley cultivars and wild barleys. Results A collection of 14 barley genotypes including eight cultivars and six wild barleys were used for comparative genomic hybridization. CNV affects 14.9% of all the sequences that were assessed. Higher levels of CNV diversity are present in the wild accessions relative to cultivated barley. CNVs are enriched near the ends of all chromosomes except 4H, which exhibits the lowest frequency of CNVs. CNV affects 9.5% of the coding sequences represented on the array and the genes affected by CNV are enriched for sequences annotated as disease-resistance proteins and protein kinases. Sequence-based comparisons of CNV between cultivars Barke and Morex provided evidence that DNA repair mechanisms of double-strand breaks via single-stranded annealing and synthesis-dependent strand annealing play an important role in the origin of CNV in barley. Conclusions We present the first catalog of CNVs in a diploid Triticeae species, which opens the door for future genome diversity research in a tribe that comprises the economically important cereal species wheat, barley, and rye. Our findings constitute a valuable resource for the identification of CNV affecting genes of agronomic importance. We also identify potential mechanisms that can generate variation in copy number in plant genomes. PMID:23758725

  1. A map of copy number variations in Chinese populations.

    PubMed

    Lou, Haiyi; Li, Shilin; Yang, Yajun; Kang, Longli; Zhang, Xin; Jin, Wenfei; Wu, Bailin; Jin, Li; Xu, Shuhua

    2011-01-01

    It has been shown that the human genome contains extensive copy number variations (CNVs). Investigating the medical and evolutionary impacts of CNVs requires the knowledge of locations, sizes and frequency distribution of them within and between populations. However, CNV study of Chinese minorities, which harbor the majority of genetic diversity of Chinese populations, has been underrepresented considering the same efforts in other populations. Here we constructed, to our knowledge, a first CNV map in seven Chinese populations representing the major linguistic groups in China with 1,440 CNV regions identified using Affymetrix SNP 6.0 Array. Considerable differences in distributions of CNV regions between populations and substantial population structures were observed. We showed that ∼35% of CNV regions identified in minority ethnic groups are not shared by Han Chinese population, indicating that the contribution of the minorities to genetic architecture of Chinese population could not be ignored. We further identified highly differentiated CNV regions between populations. For example, a common deletion in Dong and Zhuang (44.4% and 50%), which overlaps two keratin-associated protein genes contributing to the structure of hair fibers, was not observed in Han Chinese. Interestingly, the most differentiated CNV deletion between HapMap CEU and YRI containing CCL3L1 gene reported in previous studies was also the highest differentiated regions between Tibetan and other populations. Besides, by jointly analyzing CNVs and SNPs, we found a CNV region containing gene CTDSPL were in almost perfect linkage disequilibrium between flanking SNPs in Tibetan while not in other populations except HapMap CHD. Furthermore, we found the SNP taggability of CNVs in Chinese populations was much lower than that in European populations. Our results suggest the necessity of a full characterization of CNVs in Chinese populations, and the CNV map we constructed serves as a useful resource in

  2. A Map of Copy Number Variations in Chinese Populations

    PubMed Central

    Yang, Yajun; Kang, Longli; Zhang, Xin; Jin, Wenfei; Wu, Bailin; Jin, Li; Xu, Shuhua

    2011-01-01

    It has been shown that the human genome contains extensive copy number variations (CNVs). Investigating the medical and evolutionary impacts of CNVs requires the knowledge of locations, sizes and frequency distribution of them within and between populations. However, CNV study of Chinese minorities, which harbor the majority of genetic diversity of Chinese populations, has been underrepresented considering the same efforts in other populations. Here we constructed, to our knowledge, a first CNV map in seven Chinese populations representing the major linguistic groups in China with 1,440 CNV regions identified using Affymetrix SNP 6.0 Array. Considerable differences in distributions of CNV regions between populations and substantial population structures were observed. We showed that ∼35% of CNV regions identified in minority ethnic groups are not shared by Han Chinese population, indicating that the contribution of the minorities to genetic architecture of Chinese population could not be ignored. We further identified highly differentiated CNV regions between populations. For example, a common deletion in Dong and Zhuang (44.4% and 50%), which overlaps two keratin-associated protein genes contributing to the structure of hair fibers, was not observed in Han Chinese. Interestingly, the most differentiated CNV deletion between HapMap CEU and YRI containing CCL3L1 gene reported in previous studies was also the highest differentiated regions between Tibetan and other populations. Besides, by jointly analyzing CNVs and SNPs, we found a CNV region containing gene CTDSPL were in almost perfect linkage disequilibrium between flanking SNPs in Tibetan while not in other populations except HapMap CHD. Furthermore, we found the SNP taggability of CNVs in Chinese populations was much lower than that in European populations. Our results suggest the necessity of a full characterization of CNVs in Chinese populations, and the CNV map we constructed serves as a useful resource in

  3. Copy Number Variation in Obsessive-Compulsive Disorder and Tourette Syndrome: A Cross-Disorder Study

    PubMed Central

    McGrath, Lauren M.; Yu, Dongmei; Marshall, Christian; Davis, Lea K.; Thiruvahindrapuram, Bhooma; Li, Bingbin; Cappi, Carolina; Gerber, Gloria; Wolf, Aaron; Schroeder, Frederick A.; Osiecki, Lisa; O’Dushlaine, Colm; Kirby, Andrew; Illmann, Cornelia; Haddad, Stephen; Gallagher, Patience; Fagerness, Jesen A.; Barr, Cathy L.; Bellodi, Laura; Benarroch, Fortu; Bienvenu, O. Joseph; Black, Donald W.; Bloch, Michael H.; Bruun, Ruth D.; Budman, Cathy L.; Camarena, Beatriz; Cath, Danielle C.; Cavallini, Maria C.; Chouinard, Sylvain; Coric, Vladimir; Cullen, Bernadette; Delorme, Richard; Denys, Damiaan; Derks, Eske M.; Dion, Yves; Rosário, Maria C.; Eapen, Valsama; Evans, Patrick; Falkai, Peter; Fernandez, Thomas; Garrido, Helena; Geller, Daniel; Grabe, Hans J.; Grados, Marco A.; Greenberg, Benjamin D.; Gross-Tsur, Varda; Grünblatt, Edna; Heiman, Gary A.; Hemmings, Sian M.J.; Herrera, Luis D.; Hounie, Ana G.; Jankovic, Joseph; Kennedy, James L; King, Robert A.; Kurlan, Roger; Lanzagorta, Nuria; Leboyer, Marion; Leckman, James F.; Lennertz, Leonhard; Lochner, Christine; Lowe, Thomas L.; Lyon, Gholson J.; Macciardi, Fabio; Maier, Wolfgang; McCracken, James T.; McMahon, William; Murphy, Dennis L.; Naarden, Allan L; Neale, Benjamin M; Nurmi, Erika; Pakstis, Andrew J.; Pato, Michele T.; Pato, Carlos N.; Piacentini, John; Pittenger, Christopher; Pollak, Yehuda; Reus, Victor I.; Richter, Margaret A.; Riddle, Mark; Robertson, Mary M.; Rosenberg, David; Rouleau, Guy A.; Ruhrmann, Stephan; Sampaio, Aline S.; Samuels, Jack; Sandor, Paul; Sheppard, Brooke; Singer, Harvey S.; Smit, Jan H.; Stein, Dan J.; Tischfield, Jay A.; Vallada, Homero; Veenstra-VanderWeele, Jeremy; Walitza, Susanne; Wang, Ying; Wendland, Jens R.; Shugart, Yin Yao; Miguel, Euripedes C.; Nicolini, Humberto; Oostra, Ben A.; Moessner, Rainald; Wagner, Michael; Ruiz-Linares, Andres; Heutink, Peter; Nestadt, Gerald; Freimer, Nelson; Petryshen, Tracey; Posthuma, Danielle; Jenike, Michael A.; Cox, Nancy J.; Hanna, Gregory L.; Brentani, Helena; Scherer, Stephen W.; Arnold, Paul D.; Stewart, S. Evelyn; Mathews, Carol A.; Knowles, James A.; Cook, Edwin H.; Pauls, David L.; Wang, Kai; Scharf, Jeremiah M.

    2014-01-01

    Objective Obsessive-compulsive disorder (OCD) and Tourette syndrome (TS) are heritable, neurodevelopmental disorders with a partially shared genetic etiology. This study represents the first genome-wide investigation of large (>500kb), rare (<1%) copy number variants (CNVs) in OCD and the largest genome-wide CNV analysis in TS to date. Method The primary analyses utilized a cross-disorder design for 2,699 patients (1,613 ascertained for OCD, 1,086 ascertained for TS) and 1,789 controls. Parental data facilitated a de novo analysis in 348 OCD trios. Results Although no global CNV burden was detected in the cross-disorder analysis or in secondary, disease-specific analyses, there was a 3.3-fold increased burden of large deletions previously associated with other neurodevelopmental disorders (p=.09). Half of these neurodevelopmental deletions were located in a single locus, 16p13.11 (5 patient deletions: 0 control deletions, p=0.08 in current study, p=0.025 compared to published controls). Three 16p13.11 deletions were confirmed de novo, providing further support to the etiological significance of this region. The overall OCD de novo rate was 1.4%, which is intermediate between published rates in controls (0.7%) and in autism or schizophrenia (2–4%). Conclusion Several converging lines of evidence implicate 16p13.11 deletions in OCD, with weaker evidence for a role in TS. The trend toward increased overall neurodevelopmental CNV burden in TS and OCD suggests that deletions previously associated with other neurodevelopmental disorders may also contribute to these phenotypes. PMID:25062598

  4. Contingent negative variation (CNV) and erotic preference in self-declared homosexuals and in child sex offenders.

    PubMed

    Howard, R C; Longmore, F J; Mason, P A; Martin, J L

    1994-10-01

    Contingent negative variation (CNV) was recorded bilaterally from central electrodes using a "match/mismatch" paradigm in (Study 1) samples of heterosexual men (N = 6), gay men (N = 10) and lesbian women (N = 14) and (Study 2) in samples of child sex offenders (N = 34) and heterosexual control men (N = 19). Sexual orientation was assessed using the Multidimensional Scale of Sexuality (MSS) and the Human Sexuality Questionnaire (HSQ). Separate CNV averages were formed for each condition of stimulation: for Study 1, slides of adult male and female nudes; for Study 2, slides of child, pubescent and adult male and female nudes. Penile plethysmographic (PPG) data were also obtained from 15 of the child sex offender sample while they viewed stimuli of the same categories as were used in the CNV recording. On the basis of their PPG responses to children, child sex offenders were classified as either "pedophiles" or "non-pedophiles". In Study 1 significant Group x Sex (of slide) and Group x Electrode interactions indicated that: (i) heterosexual men (but neither homosexual group) showed significantly larger CNVs to female than to male slides; (ii) both homosexual groups showed significantly asymmetrical (R > L) CNVs. In Study 2, controls showed significantly greater CNVs to adult females than to both adult males and female children. Child sex offenders showed no significant differences in CNV to male and female slides for any age. "Non-pedophiles" showed significantly larger CNVs to female adults than to female children, but "pedophiles" did not. It is concluded that CNV has promise as a measure of both deviant and non-deviant sexual preference.

  5. Copy number variation plays an important role in clinical epilepsy

    PubMed Central

    Olson, Heather; Shen, Yiping; Avallone, Jennifer; Sheidley, Beth R.; Pinsky, Rebecca; Bergin, Ann M.; Berry, Gerard T.; Duffy, Frank H.; Eksioglu, Yaman; Harris, David J.; Hisama, Fuki M.; Ho, Eugenia; Irons, Mira; Jacobsen, Christina M.; James, Philip; Kothare, Sanjeev; Khwaja, Omar; Lipton, Jonathan; Loddenkemper, Tobias; Markowitz, Jennifer; Maski, Kiran; Megerian, J. Thomas; Neilan, Edward; Raffalli, Peter C.; Robbins, Michael; Roberts, Amy; Roe, Eugene; Rollins, Caitlin; Sahin, Mustafa; Sarco, Dean; Schonwald, Alison; Smith, Sharon E.; Soul, Janet; Stoler, Joan M.; Takeoka, Masanori; Tan, Wen-Han; Torres, Alcy R.; Tsai, Peter; Urion, David K.; Weissman, Laura; Wolff, Robert; Wu, Bai-Lin; Miller, David T.; Poduri, Annapurna

    2015-01-01

    Objective To evaluate the role of copy number abnormalities detectable by chromosomal microarray (CMA) testing in patients with epilepsy at a tertiary care center. Methods We identified patients with ICD-9 codes for epilepsy or seizures and clinical CMA testing performed between October 2006 and February 2011 at Boston Children’s Hospital. We reviewed medical records and included patients meeting criteria for epilepsy. We phenotypically characterized patients with epilepsy-associated abnormalities on CMA. Results Of 973 patients who had CMA and ICD-9 codes for epilepsy or seizures, 805 patients satisfied criteria for epilepsy. We observed 437 copy number variants (CNVs) in 323 patients (1–4 per patient), including 185 (42%) deletions and 252 (58%) duplications. Forty (9%) were confirmed de novo, 186 (43%) were inherited, and parental data were unavailable for 211 (48%). Excluding full chromosome trisomies, CNV size ranged from 18 kb to 142 Mb, and 34% were over 500 kb. In at least 40 cases (5%), the epilepsy phenotype was explained by a CNV, including 29 patients with epilepsy-associated syndromes and 11 with likely disease-associated CNVs involving epilepsy genes or “hotspots.” We observed numerous recurrent CNVs including 10 involving loss or gain of Xp22.31, a region described in patients with and without epilepsy. Interpretation Copy number abnormalities play an important role in patients with epilepsy. Given that the diagnostic yield of CMA for epilepsy patients is similar to the yield in autism spectrum disorders and in prenatal diagnosis, for which published guidelines recommend testing with CMA, we recommend the implementation of CMA in the evaluation of unexplained epilepsy. PMID:24811917

  6. Increased Frequency of De Novo Copy Number Variations in Congenital Heart Disease by Integrative Analysis of SNP Array and Exome Sequence Data

    PubMed Central

    Rodriguez-Murillo, Laura; Fromer, Menachem; Mazaika, Erica; Vardarajan, Badri; Italia, Michael; Leipzig, Jeremy; DePalma, Steven R.; Golhar, Ryan; Sanders, Stephan J.; Yamrom, Boris; Ronemus, Michael; Iossifov, Ivan; Willsey, A. Jeremy; State, Matthew W.; Kaltman, Jonathan R.; White, Peter S.; Shen, Yufeng; Warburton, Dorothy; Brueckner, Martina; Seidman, Christine; Goldmuntz, Elizabeth; Gelb, Bruce D.; Lifton, Richard; Seidman, Jonathan; Hakonarson, Hakon; Chung, Wendy K.

    2014-01-01

    Rationale Congenital heart disease (CHD) is among the most common birth defects. Most cases are of unknown etiology. Objective To determine the contribution of de novo copy number variants (CNVs) in the etiology of sporadic CHD. Methods and Results We studied 538 CHD trios using genome-wide dense single nucleotide polymorphism (SNP) arrays and/or whole exome sequencing (WES). Results were experimentally validated using digital droplet PCR. We compared validated CNVs in CHD cases to CNVs in 1,301 healthy control trios. The two complementary high-resolution technologies identified 63 validated de novo CNVs in 51 CHD cases. A significant increase in CNV burden was observed when comparing CHD trios with healthy trios, using either SNP array (p=7x10−5, Odds Ratio (OR)=4.6) or WES data (p=6x10−4, OR=3.5) and remained after removing 16% of de novo CNV loci previously reported as pathogenic (p=0.02, OR=2.7). We observed recurrent de novo CNVs on 15q11.2 encompassing CYFIP1, NIPA1, and NIPA2 and single de novo CNVs encompassing DUSP1, JUN, JUP, MED15, MED9, PTPRE SREBF1, TOP2A, and ZEB2, genes that interact with established CHD proteins NKX2-5 and GATA4. Integrating de novo variants in WES and CNV data suggests that ETS1 is the pathogenic gene altered by 11q24.2-q25 deletions in Jacobsen syndrome and that CTBP2 is the pathogenic gene in 10q sub-telomeric deletions. Conclusions We demonstrate a significantly increased frequency of rare de novo CNVs in CHD patients compared with healthy controls and suggest several novel genetic loci for CHD. PMID:25205790

  7. Validation of copy number variants associated with prostate cancer risk and prognosis.

    PubMed

    Blackburn, August; Wilson, Desiree; Gelfond, Jonathan; Yao, Li; Hernandez, Javier; Thompson, Ian M; Leach, Robin J; Lehman, Donna M

    2014-01-01

    Two recent studies have reported novel heritable copy number variants on chromosomes 2p, 15q, and 12q to be associated with prostate cancer (PCa) risk in non-Hispanic Caucasians. The goal of this study was to determine whether these findings could be independently confirmed in the Caucasian population from the South Texas area. The study subjects consisted of participants of the San Antonio Biomarkers of Risk for PCa cohort and additional cases ascertained in the same metropolitan area. We genotyped all 7 of the reported copy number variants using real-time quantitative polymerase chain reaction in 1,536 (317 cases and 1,219 controls) non-Hispanic Caucasian men, and additionally, we genotyped 632 (191 cases and 441 controls) Hispanic Caucasian men for one of these variants, a deletion on 2p24.3. Association of the deletion on 2p24.3 with overall PCa risk did not meet our significance criteria but was consistent with previous reports (odds ratio, 1.40; 95% confidence interval 0.99-2.00; P = 0.06). Among Hispanic Caucasians, this deletion is much less prevalent (minor allele frequencies of 0.059 and 0.024 in non-Hispanic and Hispanic Caucasians, respectively) and did not show evidence of association with risk for PCa. Interestingly, among non-Hispanic Caucasians, carrying a homozygous deletion of 2p24.3 was significantly associated with high-grade PCa as defined by Gleason score sum ≥8 (odds ratio, 27.99; 95% confidence interval 1.99-392.6; P = 0.007 [the Fisher exact test]). The remaining 6 copy number variable regions either were not polymorphic in our cohort of non-Hispanic Caucasians or showed no evidence of association. Our findings are consistent with the reported observation that a heritable deletion on 2p24.3 is associated with PCa risk in non-Hispanic Caucasians. Additionally, our observations indicate that the 2p24.3 variant is associated with risk for high-grade PCa in a recessive manner. We were unable to replicate any association with PCa for the

  8. Distribution and Functionality of Copy Number Variation across European Cattle Populations.

    PubMed

    Upadhyay, Maulik; da Silva, Vinicus H; Megens, Hendrik-Jan; Visker, Marleen H P W; Ajmone-Marsan, Paolo; Bâlteanu, Valentin A; Dunner, Susana; Garcia, Jose F; Ginja, Catarina; Kantanen, Juha; Groenen, Martien A M; Crooijmans, Richard P M A

    2017-01-01

    Copy number variation (CNV), which is characterized by large-scale losses or gains of DNA fragments, contributes significantly to genetic and phenotypic variation. Assessing CNV across different European cattle populations might reveal genetic changes responsible for phenotypic differences, which have accumulated throughout the domestication history of cattle as consequences of evolutionary forces that act upon them. To explore pattern of CNVs across European cattle, we genotyped 149 individuals, that represent different European regions, using the Illumina Bovine HD Genotyping array. A total of 9,944 autosomal CNVs were identified in 149 samples using a Hidden Markov Model (HMM) as employed in PennCNV. Animals originating from several breeds of British Isles, and Balkan and Italian regions, on average, displayed higher abundance of CNV counts than Dutch or Alpine animals. A total of 923 CNV regions (CNVRs) were identified by aggregating CNVs overlapping in at least two animals. The hierarchical clustering of CNVRs indicated low differentiation and sharing of high-frequency CNVRs between European cattle populations. Various CNVRs identified in the present study overlapped with olfactory receptor genes and genes related to immune system. In addition, we also detected a CNV overlapping the Kit gene in English longhorn cattle which has previously been associated with color-sidedness. To conclude, we provide a comprehensive overview of CNV distribution in genome of European cattle. Our results indicate an important role of purifying selection and genomic drift in shaping CNV diversity that exists between different European cattle populations.

  9. Distribution and Functionality of Copy Number Variation across European Cattle Populations

    PubMed Central

    Upadhyay, Maulik; da Silva, Vinicus H.; Megens, Hendrik-Jan; Visker, Marleen H. P. W.; Ajmone-Marsan, Paolo; Bâlteanu, Valentin A.; Dunner, Susana; Garcia, Jose F.; Ginja, Catarina; Kantanen, Juha; Groenen, Martien A. M.; Crooijmans, Richard P. M. A.

    2017-01-01

    Copy number variation (CNV), which is characterized by large-scale losses or gains of DNA fragments, contributes significantly to genetic and phenotypic variation. Assessing CNV across different European cattle populations might reveal genetic changes responsible for phenotypic differences, which have accumulated throughout the domestication history of cattle as consequences of evolutionary forces that act upon them. To explore pattern of CNVs across European cattle, we genotyped 149 individuals, that represent different European regions, using the Illumina Bovine HD Genotyping array. A total of 9,944 autosomal CNVs were identified in 149 samples using a Hidden Markov Model (HMM) as employed in PennCNV. Animals originating from several breeds of British Isles, and Balkan and Italian regions, on average, displayed higher abundance of CNV counts than Dutch or Alpine animals. A total of 923 CNV regions (CNVRs) were identified by aggregating CNVs overlapping in at least two animals. The hierarchical clustering of CNVRs indicated low differentiation and sharing of high-frequency CNVRs between European cattle populations. Various CNVRs identified in the present study overlapped with olfactory receptor genes and genes related to immune system. In addition, we also detected a CNV overlapping the Kit gene in English longhorn cattle which has previously been associated with color-sidedness. To conclude, we provide a comprehensive overview of CNV distribution in genome of European cattle. Our results indicate an important role of purifying selection and genomic drift in shaping CNV diversity that exists between different European cattle populations. PMID:28878807

  10. Analysis of copy number variations in Holstein-Friesian cow genomes based on whole-genome sequence data.

    PubMed

    Mielczarek, M; Frąszczak, M; Giannico, R; Minozzi, G; Williams, John L; Wojdak-Maksymiec, K; Szyda, J

    2017-07-01

    Thirty-two whole genome DNA sequences of cows were analyzed to evaluate inter-individual variability in the distribution and length of copy number variations (CNV) and to functionally annotate CNV breakpoints. The total number of deletions per individual varied between 9,731 and 15,051, whereas the number of duplications was between 1,694 and 5,187. Most of the deletions (81%) and duplications (86%) were unique to a single cow. No relation between the pattern of variant sharing and a family relationship or disease status was found. The animal-averaged length of deletions was from 5,234 to 9,145 bp and the average length of duplications was between 7,254 and 8,843 bp. Highly significant inter-individual variation in length and number of CNV was detected for both deletions and duplications. The majority of deletion and duplication breakpoints were located in intergenic regions and introns, whereas fewer were identified in noncoding transcripts and splice regions. Only 1.35 and 0.79% of the deletion and duplication breakpoints were observed within coding regions. A gene with the highest number of deletion breakpoints codes for protein kinase cGMP-dependent type I, whereas the T-cell receptor α constant gene had the most duplication breakpoints. The functional annotation of genes with the largest incidence of deletion/duplication breakpoints identified 87/112 Kyoto Encyclopedia of Genes and Genomes pathways, but none of the pathways were significantly enriched or depleted with breakpoints. The analysis of Gene Ontology (GO) terms revealed that a cluster with the highest enrichment score among genes with many deletion breakpoints was represented by GO terms related to ion transport, whereas the GO term cluster mostly enriched among the genes with many duplication breakpoints was related to binding of macromolecules. Furthermore, when considering the number of deletion breakpoints per gene functional category, no significant differences were observed between the

  11. Analysis of copy number variations at 15 schizophrenia-associated loci.

    PubMed

    Rees, Elliott; Walters, James T R; Georgieva, Lyudmila; Isles, Anthony R; Chambert, Kimberly D; Richards, Alexander L; Mahoney-Davies, Gerwyn; Legge, Sophie E; Moran, Jennifer L; McCarroll, Steven A; O'Donovan, Michael C; Owen, Michael J; Kirov, George

    2014-02-01

    A number of copy number variants (CNVs) have been suggested as susceptibility factors for schizophrenia. For some of these the data remain equivocal, and the frequency in individuals with schizophrenia is uncertain. To determine the contribution of CNVs at 15 schizophrenia-associated loci (a) using a large new data-set of patients with schizophrenia (n = 6882) and controls (n = 6316), and (b) combining our results with those from previous studies. We used Illumina microarrays to analyse our data. Analyses were restricted to 520 766 probes common to all arrays used in the different data-sets. We found higher rates in participants with schizophrenia than in controls for 13 of the 15 previously implicated CNVs. Six were nominally significantly associated (P<0.05) in this new data-set: deletions at 1q21.1, NRXN1, 15q11.2 and 22q11.2 and duplications at 16p11.2 and the Angelman/Prader-Willi Syndrome (AS/PWS) region. All eight AS/PWS duplications in patients were of maternal origin. When combined with published data, 11 of the 15 loci showed highly significant evidence for association with schizophrenia (P<4.1×10(-4)). We strengthen the support for the majority of the previously implicated CNVs in schizophrenia. About 2.5% of patients with schizophrenia and 0.9% of controls carry a large, detectable CNV at one of these loci. Routine CNV screening may be clinically appropriate given the high rate of known deleterious mutations in the disorder and the comorbidity associated with these heritable mutations.

  12. A large-scale survey of genetic copy number variations among Han Chinese residing in Taiwan

    PubMed Central

    Lin, Chien-Hsing; Li, Ling-Hui; Ho, Sheng-Feng; Chuang, Tzu-Po; Wu, Jer-Yuarn; Chen, Yuan-Tsong; Fann, Cathy SJ

    2008-01-01

    Background Copy number variations (CNVs) have recently been recognized as important structural variations in the human genome. CNVs can affect gene expression and thus may contribute to phenotypic differences. The copy number inferring tool (CNIT) is an effective hidden Markov model-based algorithm for estimating allele-specific copy number and predicting chromosomal alterations from single nucleotide polymorphism microarrays. The CNIT algorithm, which was constructed using data from 270 HapMap multi-ethnic individuals, was applied to identify CNVs from 300 unrelated Han Chinese individuals in Taiwan. Results Using stringent selection criteria, 230 regions with variable copy numbers were identified in the Han Chinese population; 133 (57.83%) had been reported previously, 64 displayed greater than 1% CNV allele frequency. The average size of the CNV regions was 322 kb (ranging from 1.48 kb to 5.68 Mb) and covered a total of 2.47% of the human genome. A total of 196 of the CNV regions were simple deletions and 27 were simple amplifications. There were 449 genes and 5 microRNAs within these CNV regions; some of these genes are known to be associated with diseases. Conclusion The identified CNVs are characteristic of the Han Chinese population and should be considered when genetic studies are conducted. The CNV distribution in the human genome is still poorly characterized, and there is much diversity among different ethnic populations. PMID:19108714

  13. Copy number variation identification and analysis of the chicken genome using a 60K SNP BeadChip.

    PubMed

    Rao, Y S; Li, J; Zhang, R; Lin, X R; Xu, J G; Xie, L; Xu, Z Q; Wang, L; Gan, J K; Xie, X J; He, J; Zhang, X Q

    2016-08-01

    Copy number variation (CNV) is an important source of genetic variation in organisms and a main factor that affects phenotypic variation. A comprehensive study of chicken CNV can provide valuable information on genetic diversity and facilitate future analyses of associations between CNV and economically important traits in chickens. In the present study, an F2 full-sib chicken population (554 individuals), established from a cross between Xinghua and White Recessive Rock chickens, was used to explore CNV in the chicken genome. Genotyping was performed using a chicken 60K SNP BeadChip. A total of 1,875 CNV were detected with the PennCNV algorithm, and the average number of CNV was 3.42 per individual. The CNV were distributed across 383 independent CNV regions (CNVR) and covered 41 megabases (3.97%) of the chicken genome. Seven CNVR in 108 individuals were validated by quantitative real-time PCR, and 81 of these individuals (75%) also were detected with the PennCNV algorithm. In total, 274 CNVR (71.54%) identified in the current study were previously reported. Of these, 147 (38.38%) were reported in at least 2 studies. Additionally, 109 of the CNVR (28.46%) discovered here are novel. A total of 709 genes within or overlapping with the CNVR was retrieved. Out of the 2,742 quantitative trait loci (QTL) collected in the chicken QTL database, 43 QTL had confidence intervals overlapping with the CNVR, and 32 CNVR encompassed one or more functional genes. The functional genes located in the CNVR are likely to be the QTG that are associated with underlying economic traits. This study considerably expands our insight into the structural variation in the genome of chickens and provides an important resource for genomic variation, especially for genomic structural variation related to economic traits in chickens. © 2016 Poultry Science Association Inc.

  14. Traditional karyotyping vs copy number variation sequencing for detection of chromosomal abnormalities associated with spontaneous miscarriage.

    PubMed

    Liu, S; Song, L; Cram, D S; Xiong, L; Wang, K; Wu, R; Liu, J; Deng, K; Jia, B; Zhong, M; Yang, F

    2015-10-01

    To compare the performance of traditional G-banding karyotyping with that of copy number variation sequencing (CNV-Seq) for detection of chromosomal abnormalities associated with miscarriage. Products of conception (POC) were collected from spontaneous miscarriages. Chromosomal abnormalities were detected using high-resolution G-banding karyotyping and CNV sequencing. Quantitative fluorescent polymerase chain reaction analysis of maternal and POC DNA for short tandem repeat (STR) markers was used to both monitor maternal cell contamination and confirm the chromosomal status and sex of the miscarriage tissue. A total of 64 samples of POC, comprising 16 with an abnormal and 48 with a normal karyotype, were selected and coded for analysis by CNV-Seq. CNV-Seq results were concordant for 14 (87.5%) of the 16 gross chromosomal abnormalities identified by karyotyping, including 11 autosomal trisomies and three sex chromosomal aneuploidies (45,X). Of the two discordant results, a 69,XXX polyploidy was missed by CNV-Seq, although supporting STR marker analysis confirmed the triploidy. In contrast, CNV-Seq identified a sample with 45,X karyotype as a 45,X/46,XY mosaic. In the remaining 48 samples of POC with a normal karyotype, CNV-Seq detected a 2.58-Mb 22q deletion associated with DiGeorge syndrome and nine different smaller CNVs of no apparent clinical significance. CNV-Seq used in parallel with STR profiling is a reliable and accurate alternative to karyotyping for identifying chromosome copy number abnormalities associated with spontaneous miscarriage. Copyright © 2015 ISUOG. Published by John Wiley & Sons Ltd.

  15. Improving detection of copy-number variation by simultaneous bias correction and read-depth segmentation.

    PubMed

    Szatkiewicz, Jin P; Wang, WeiBo; Sullivan, Patrick F; Wang, Wei; Sun, Wei

    2013-02-01

    Structural variation is an important class of genetic variation in mammals. High-throughput sequencing (HTS) technologies promise to revolutionize copy-number variation (CNV) detection but present substantial analytic challenges. Converging evidence suggests that multiple types of CNV-informative data (e.g. read-depth, read-pair, split-read) need be considered, and that sophisticated methods are needed for more accurate CNV detection. We observed that various sources of experimental biases in HTS confound read-depth estimation, and note that bias correction has not been adequately addressed by existing methods. We present a novel read-depth-based method, GENSENG, which uses a hidden Markov model and negative binomial regression framework to identify regions of discrete copy-number changes while simultaneously accounting for the effects of multiple confounders. Based on extensive calibration using multiple HTS data sets, we conclude that our method outperforms existing read-depth-based CNV detection algorithms. The concept of simultaneous bias correction and CNV detection can serve as a basis for combining read-depth with other types of information such as read-pair or split-read in a single analysis. A user-friendly and computationally efficient implementation of our method is freely available.

  16. Copy number variation at the 7q11.23 segmental duplications is a susceptibility factor for the Williams-Beuren syndrome deletion

    PubMed Central

    Cuscó, Ivon; Corominas, Roser; Bayés, Mònica; Flores, Raquel; Rivera-Brugués, Núria; Campuzano, Victoria; Pérez-Jurado, Luis A.

    2008-01-01

    Large copy number variants (CNVs) have been recently found as structural polymorphisms of the human genome of still unknown biological significance. CNVs are significantly enriched in regions with segmental duplications or low-copy repeats (LCRs). Williams-Beuren syndrome (WBS) is a neurodevelopmental disorder caused by a heterozygous deletion of contiguous genes at 7q11.23 mediated by nonallelic homologous recombination (NAHR) between large flanking LCRs and facilitated by a structural variant of the region, a ∼2-Mb paracentric inversion present in 20%–25% of WBS-transmitting progenitors. We now report that eight out of 180 (4.44%) WBS-transmitting progenitors are carriers of a CNV, displaying a chromosome with large deletion of LCRs. The prevalence of this CNV among control individuals and non-transmitting progenitors is much lower (1%, n = 600), thus indicating that it is a predisposing factor for the WBS deletion (odds ratio 4.6-fold, P = 0.002). LCR duplications were found in 2.22% of WBS-transmitting progenitors but also in 1.16% of controls, which implies a non–statistically significant increase in WBS-transmitting progenitors. We have characterized the organization and breakpoints of these CNVs, encompassing ∼100–300 kb of genomic DNA and containing several pseudogenes but no functional genes. Additional structural variants of the region have also been defined, all generated by NAHR between different blocks of segmental duplications. Our data further illustrate the highly dynamic structure of regions rich in segmental duplications, such as the WBS locus, and indicate that large CNVs can act as susceptibility alleles for disease-associated genomic rearrangements in the progeny. PMID:18292220

  17. Modified screening and ranking algorithm for copy number variation detection.

    PubMed

    Xiao, Feifei; Min, Xiaoyi; Zhang, Heping

    2015-05-01

    Copy number variation (CNV) is a type of structural variation, usually defined as genomic segments that are 1 kb or larger, which present variable copy numbers when compared with a reference genome. The screening and ranking algorithm (SaRa) was recently proposed as an efficient approach for multiple change-points detection, which can be applied to CNV detection. However, some practical issues arise from application of SaRa to single nucleotide polymorphism data. In this study, we propose a modified SaRa on CNV detection to address these issues. First, we use the quantile normalization on the original intensities to guarantee that the normal mean model-based SaRa is a robust method. Second, a novel normal mixture model coupled with a modified Bayesian information criterion is proposed for candidate change-point selection and further clustering the potential CNV segments to copy number states. Simulations revealed that the modified SaRa became a robust method for identifying change-points and achieved better performance than the circular binary segmentation (CBS) method. By applying the modified SaRa to real data from the HapMap project, we illustrated its performance on detecting CNV segments. In conclusion, our modified SaRa method improves SaRa theoretically and numerically, for identifying CNVs with high-throughput genotyping data. The modSaRa package is implemented in R program and freely available at http://c2s2.yale.edu/software/modSaRa. Supplementary data are available at Bioinformatics online. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  18. Genome-wide patterns of copy number variation in the Chinese yak genome.

    PubMed

    Zhang, Xiao; Wang, Kun; Wang, Lizhong; Yang, Yongzhi; Ni, Zhengqiang; Xie, Xiuyue; Shao, Xuemin; Han, Jin; Wan, Dongshi; Qiu, Qiang

    2016-05-20

    Copy number variation (CNV) represents an important source of genetic divergence that can produce drastic phenotypic differences and may therefore be subject to selection during domestication and environmental adaptation. To investigate the evolutionary dynamics of CNV in the yak genome, we used a read depth approach to detect CNV based on genome resequencing data from 14 wild and 65 domestic yaks and determined CNV regions related to domestication and adaptations to high-altitude. We identified 2,634 CNV regions (CNVRs) comprising a total of 153 megabases (5.7 % of the yak genome) and 3,879 overlapping annotated genes. Comparison between domestic and wild yak populations identified 121 potentially selected CNVRs, harboring genes related to neuronal development, reproduction, nutrition and energy metabolism. In addition, we found 85 CNVRs that are significantly different between domestic yak living in high- and low-altitude areas, including three genes related to hypoxia response and six related to immune defense. This analysis shows that genic CNVs may play an important role in phenotypic changes during yak domestication and adaptation to life at high-altitude. We present the first refined CNV map for yak along with comprehensive genomic analysis of yak CNV. Our results provide new insights into the genetic basis of yak domestication and adaptation to living in a high-altitude environment, as well as a valuable genetic resource that will facilitate future CNV association studies of important traits in yak and other bovid species.

  19. Genome-wide common and rare variant analysis provides novel insights into clozapine-associated neutropenia.

    PubMed

    Legge, S E; Hamshere, M L; Ripke, S; Pardinas, A F; Goldstein, J I; Rees, E; Richards, A L; Leonenko, G; Jorskog, L F; Chambert, K D; Collier, D A; Genovese, G; Giegling, I; Holmans, P; Jonasdottir, A; Kirov, G; McCarroll, S A; MacCabe, J H; Mantripragada, K; Moran, J L; Neale, B M; Stefansson, H; Rujescu, D; Daly, M J; Sullivan, P F; Owen, M J; O'Donovan, M C; Walters, J T R

    2017-10-01

    The antipsychotic clozapine is uniquely effective in the management of schizophrenia; however, its use is limited by its potential to induce agranulocytosis. The causes of this, and of its precursor neutropenia, are largely unknown, although genetic factors have an important role. We sought risk alleles for clozapine-associated neutropenia in a sample of 66 cases and 5583 clozapine-treated controls, through a genome-wide association study (GWAS), imputed human leukocyte antigen (HLA) alleles, exome array and copy-number variation (CNV) analyses. We then combined associated variants in a meta-analysis with data from the Clozapine-Induced Agranulocytosis Consortium (up to 163 cases and 7970 controls). In the largest combined sample to date, we identified a novel association with rs149104283 (odds ratio (OR)=4.32, P=1.79 × 10 -8 ), intronic to transcripts of SLCO1B3 and SLCO1B7, members of a family of hepatic transporter genes previously implicated in adverse drug reactions including simvastatin-induced myopathy and docetaxel-induced neutropenia. Exome array analysis identified gene-wide associations of uncommon non-synonymous variants within UBAP2 and STARD9. We additionally provide independent replication of a previously identified variant in HLA-DQB1 (OR=15.6, P=0.015, positive predictive value=35.1%). These results implicate biological pathways through which clozapine may act to cause this serious adverse effect.

  20. Genome-wide common and rare variant analysis provides novel insights into clozapine-associated neutropenia

    PubMed Central

    Legge, S E; Hamshere, M L; Ripke, S; Pardinas, A F; Goldstein, J I; Rees, E; Richards, A L; Leonenko, G; Jorskog, L F; Goldstein, Jacqueline I; Jarskog, L Fredrik; Hilliard, Chris; Alfirevic, Ana; Duncan, Laramie; Fourches, Denis; Huang, Hailiang; Lek, Monkol; Neale, Benjamin M; Ripke, Stephan; Shianna, Kevin; Szatkiewicz, Jin P; Tropsha, Alexander; van den Oord, Edwin JCG; Cascorbi, Ingolf; Dettling, Michael; Gazit, Ephraim; Goff, Donald C; Holden, Arthur L; Kelly, Deanna L; Malhotra, Anil K; Nielsen, Jimmi; Pirmohamed, Munir; Rujescu, Dan; Werge, Thomas; Levy, Deborah L; Josiassen, Richard C; Kennedy, James L; Lieberman, Jeffrey A; Daly, Mark J; Sullivan, Patrick F; Chambert, K D; Collier, D A; Genovese, G; Giegling, I; Holmans, P; Jonasdottir, A; Kirov, G; McCarroll, S A; MacCabe, J H; Mantripragada, K; Moran, J L; Neale, B M; Stefansson, H; Rujescu, D; Daly, M J; Sullivan, P F; Owen, M J; O'Donovan, M C; Walters, J T R

    2017-01-01

    The antipsychotic clozapine is uniquely effective in the management of schizophrenia; however, its use is limited by its potential to induce agranulocytosis. The causes of this, and of its precursor neutropenia, are largely unknown, although genetic factors have an important role. We sought risk alleles for clozapine-associated neutropenia in a sample of 66 cases and 5583 clozapine-treated controls, through a genome-wide association study (GWAS), imputed human leukocyte antigen (HLA) alleles, exome array and copy-number variation (CNV) analyses. We then combined associated variants in a meta-analysis with data from the Clozapine-Induced Agranulocytosis Consortium (up to 163 cases and 7970 controls). In the largest combined sample to date, we identified a novel association with rs149104283 (odds ratio (OR)=4.32, P=1.79 × 10−8), intronic to transcripts of SLCO1B3 and SLCO1B7, members of a family of hepatic transporter genes previously implicated in adverse drug reactions including simvastatin-induced myopathy and docetaxel-induced neutropenia. Exome array analysis identified gene-wide associations of uncommon non-synonymous variants within UBAP2 and STARD9. We additionally provide independent replication of a previously identified variant in HLA-DQB1 (OR=15.6, P=0.015, positive predictive value=35.1%). These results implicate biological pathways through which clozapine may act to cause this serious adverse effect. PMID:27400856

  1. A unique haplotype of RCCX copy number variation: from the clinics of congenital adrenal hyperplasia to evolutionary genetics

    PubMed Central

    Doleschall, Márton; Luczay, Andrea; Koncz, Klára; Hadzsiev, Kinga; Erhardt, Éva; Szilágyi, Ágnes; Doleschall, Zoltán; Németh, Krisztina; Török, Dóra; Prohászka, Zoltán; Gereben, Balázs; Fekete, György; Gláz, Edit; Igaz, Péter; Korbonits, Márta; Tóth, Miklós; Rácz, Károly; Patócs, Attila

    2017-01-01

    There is a difficulty in the molecular diagnosis of congenital adrenal hyperplasia (CAH) due to the c.955C>T (p.(Q319*), formerly Q318X, rs7755898) variant of the CYP21A2 gene. Therefore, a systematic assessment of the genetic and evolutionary relationships between c.955C>T, CYP21A2 haplotypes and the RCCX copy number variation (CNV) structures, which harbor CYP21A2, was performed. In total, 389 unrelated Hungarian individuals with European ancestry (164 healthy subjects, 125 patients with non-functioning adrenal incidentaloma and 100 patients with classical CAH) as well as 34 adrenocortical tumor specimens were studied using a set of experimental and bioinformatic methods. A unique, moderately frequent (2%) haplotypic RCCX CNV structure with three repeated segments, abbreviated to LBSASB, harboring a CYP21A2 with a c.955C>T variant in the 3′-segment, and a second CYP21A2 with a specific c.*12C>T (rs150697472) variant in the middle segment occurred in all c.955C>T carriers with normal steroid levels. The second CYP21A2 was free of CAH-causing mutations and produced mRNA in the adrenal gland, confirming its functionality and ability to rescue the carriers from CAH. Neither LBSASB nor c.*12C>T occurred in classical CAH patients. However, CAH-causing CYP21A2 haplotypes with c.955C>T could be derived from the 3′-segment of LBSASB after the loss of functional CYP21A2 from the middle segment. The c.*12C>T indicated a functional CYP21A2 and could distinguish between non-pathogenic and pathogenic genomic contexts of the c.955C>T variant in the studied European population. Therefore, c.*12C>T may be suitable as a marker to avoid this genetic confound and improve the diagnosis of CAH. PMID:28401898

  2. A unique haplotype of RCCX copy number variation: from the clinics of congenital adrenal hyperplasia to evolutionary genetics.

    PubMed

    Doleschall, Márton; Luczay, Andrea; Koncz, Klára; Hadzsiev, Kinga; Erhardt, Éva; Szilágyi, Ágnes; Doleschall, Zoltán; Németh, Krisztina; Török, Dóra; Prohászka, Zoltán; Gereben, Balázs; Fekete, György; Gláz, Edit; Igaz, Péter; Korbonits, Márta; Tóth, Miklós; Rácz, Károly; Patócs, Attila

    2017-06-01

    There is a difficulty in the molecular diagnosis of congenital adrenal hyperplasia (CAH) due to the c.955C>T (p.(Q319*), formerly Q318X, rs7755898) variant of the CYP21A2 gene. Therefore, a systematic assessment of the genetic and evolutionary relationships between c.955C>T, CYP21A2 haplotypes and the RCCX copy number variation (CNV) structures, which harbor CYP21A2, was performed. In total, 389 unrelated Hungarian individuals with European ancestry (164 healthy subjects, 125 patients with non-functioning adrenal incidentaloma and 100 patients with classical CAH) as well as 34 adrenocortical tumor specimens were studied using a set of experimental and bioinformatic methods. A unique, moderately frequent (2%) haplotypic RCCX CNV structure with three repeated segments, abbreviated to LBSASB, harboring a CYP21A2 with a c.955C>T variant in the 3'-segment, and a second CYP21A2 with a specific c.*12C>T (rs150697472) variant in the middle segment occurred in all c.955C>T carriers with normal steroid levels. The second CYP21A2 was free of CAH-causing mutations and produced mRNA in the adrenal gland, confirming its functionality and ability to rescue the carriers from CAH. Neither LBSASB nor c.*12C>T occurred in classical CAH patients. However, CAH-causing CYP21A2 haplotypes with c.955C>T could be derived from the 3'-segment of LBSASB after the loss of functional CYP21A2 from the middle segment. The c.*12C>T indicated a functional CYP21A2 and could distinguish between non-pathogenic and pathogenic genomic contexts of the c.955C>T variant in the studied European population. Therefore, c.*12C>T may be suitable as a marker to avoid this genetic confound and improve the diagnosis of CAH.

  3. Associations of GBP2 gene copy number variations with growth traits and transcriptional expression in Chinese cattle.

    PubMed

    Zhang, Gui-Min; Zheng, Li; He, Hua; Song, Cheng-Chuang; Zhang, Zi-Jing; Cao, Xiu-Kai; Lei, Chu-Zhao; Lan, Xian-Yong; Qi, Xing-Lei; Chen, Hong; Huang, Yong-Zhen

    2018-03-20

    Copy number variations (CNVs) recently have been recognized as another important genetic variability followed single nucleotide polymorphisms (SNPs). The guanylate binding protein 2 (GBP2) gene plays an important role in cell proliferation. This study was performed to determine the presence of GBP2 CNV (relative to Angus cattle) in 466 individuals representing six main cattle breeds from China, identify its relationship with growth, and explore the biological effects of gene expression. There were two CNV regions in the GBP2 gene, for three types, CNV1 loss type (relative to Angus cattle) was more frequent in XN than other breeds, and CNV2 loss type (relative to Angus cattle) was more frequent in XN and CDM than other breeds. Though the GBP2 gene copy number presented no correlation with the transcriptional expression of JX (P > .05), but the transcriptional expression in heart is higher than other tissues, and the copy number in muscles and fat of JX is higher than others breeds. Statistical analysis revealed that the GBP2 gene CNV1 and CNV2 were significantly associated with growth traits (P < .05). In conclusion, this research established the correlations between CNVs of GBP2 gene and growth traits in different cattle breeds, and our results suggested that the CNVs in GBP2 gene may be considered markers for the molecular breeding of Chinese beef cattle. Copyright © 2018. Published by Elsevier B.V.

  4. Analysis of structural diversity in wolf-like canids reveals post-domestication variants.

    PubMed

    Ramirez, Oscar; Olalde, Iñigo; Berglund, Jonas; Lorente-Galdos, Belen; Hernandez-Rodriguez, Jessica; Quilez, Javier; Webster, Matthew T; Wayne, Robert K; Lalueza-Fox, Carles; Vilà, Carles; Marques-Bonet, Tomas

    2014-06-12

    Although a variety of genetic changes have been implicated in causing phenotypic differences among dogs, the role of copy number variants (CNVs) and their impact on phenotypic variation is still poorly understood. Further, very limited knowledge exists on structural variation in the gray wolf, the ancestor of the dog, or other closely related wild canids. Documenting CNVs variation in wild canids is essential to identify ancestral states and variation that may have appeared after domestication. In this work, we genotyped 1,611 dog CNVs in 23 wolf-like canids (4 purebred dogs, one dingo, 15 gray wolves, one red wolf, one coyote and one golden jackal) to identify CNVs that may have arisen after domestication. We have found an increase in GC-rich regions close to the breakpoints and around 1 kb away from them suggesting that some common motifs might be associated with the formation of CNVs. Among the CNV regions that showed the largest differentiation between dogs and wild canids we found 12 genes, nine of which are related to two known functions associated with dog domestication; growth (PDE4D, CRTC3 and NEB) and neurological function (PDE4D, EML5, ZNF500, SLC6A11, ELAVL2, RGS7 and CTSB). Our results provide insight into the evolution of structural variation in canines, where recombination is not regulated by PRDM9 due to the inactivation of this gene. We also identified genes within the most differentiated CNV regions between dogs and wolves, which could reflect selection during the domestication process.

  5. A genome-wide association study of copy number variations with umbilical hernia in swine.

    PubMed

    Long, Yi; Su, Ying; Ai, Huashui; Zhang, Zhiyan; Yang, Bin; Ruan, Guorong; Xiao, Shijun; Liao, Xinjun; Ren, Jun; Huang, Lusheng; Ding, Nengshui

    2016-06-01

    Umbilical hernia (UH) is one of the most common congenital defects in pigs, leading to considerable economic loss and serious animal welfare problems. To test whether copy number variations (CNVs) contribute to pig UH, we performed a case-control genome-wide CNV association study on 905 pigs from the Duroc, Landrace and Yorkshire breeds using the Porcine SNP60 BeadChip and penncnv algorithm. We first constructed a genomic map comprising 6193 CNVs that pertain to 737 CNV regions. Then, we identified eight CNVs significantly associated with the risk for UH in the three pig breeds. Six of seven significantly associated CNVs were validated using quantitative real-time PCR. Notably, a rare CNV (CNV14:13030843-13059455) encompassing the NUGGC gene was strongly associated with UH (permutation-corrected P = 0.0015) in Duroc pigs. This CNV occurred exclusively in seven Duroc UH-affected individuals. SNPs surrounding the CNV did not show association signals, indicating that rare CNVs may play an important role in complex pig diseases such as UH. The NUGGC gene has been implicated in human omphalocele and inguinal hernia. Our finding supports that CNVs, including the NUGGC CNV, contribute to the pathogenesis of pig UH. © 2016 Stichting International Foundation for Animal Genetics.

  6. Contemporary evolution of resistance at the major insecticide target site gene Ace-1 by mutation and copy number variation in the malaria mosquito Anopheles gambiae

    PubMed Central

    Weetman, David; Mitchell, Sara N; Wilding, Craig S; Birks, Daniel P; Yawson, Alexander E; Essandoh, John; Mawejje, Henry D; Djogbenou, Luc S; Steen, Keith; Rippon, Emily J; Clarkson, Christopher S; Field, Stuart G; Rigden, Daniel J; Donnelly, Martin J

    2015-01-01

    Functionally constrained genes are ideal insecticide targets because disruption is often fatal, and resistance mutations are typically costly. Synaptic acetylcholinesterase (AChE) is an essential neurotransmission enzyme targeted by insecticides used increasingly in malaria control. In Anopheles and Culex mosquitoes, a glycine–serine substitution at codon 119 of the Ace-1 gene confers both resistance and fitness costs, especially for 119S/S homozygotes. G119S in Anopheles gambiae from Accra (Ghana) is strongly associated with resistance, and, despite expectations of cost, resistant 119S alleles are increasing significantly in frequency. Sequencing of Accra females detected only a single Ace-1 119S haplotype, whereas 119G diversity was high overall but very low at non-synonymous sites, evidence of strong purifying selection driven by functional constraint. Flanking microsatellites showed reduced diversity, elevated linkage disequilibrium and high differentiation of 119S, relative to 119G homozygotes across up to two megabases of the genome. Yet these signals of selection were inconsistent and sometimes weak tens of kilobases from Ace-1. This unexpected finding is attributable to apparently ubiquitous amplification of 119S alleles as part of a large copy number variant (CNV) far exceeding the size of the Ace-1 gene, whereas 119G alleles were unduplicated. Ace-1 CNV was detectable in archived samples collected when the 119S allele was rare in Ghana. Multicopy amplification of resistant alleles has not been observed previously and is likely to underpin the recent increase in 119S frequency. The large CNV compromised localization of the strong selective sweep around Ace-1, emphasizing the need to integrate CNV analysis into genome scans for selection. PMID:25865270

  7. Dopamine Inactivation Efficacy Related to Functional DAT1 and COMT Variants Influences Motor Response Evaluation

    PubMed Central

    Bender, Stephan; Rellum, Thomas; Freitag, Christine; Resch, Franz; Rietschel, Marcella; Treutlein, Jens; Jennen-Steinmetz, Christine; Brandeis, Daniel; Banaschewski, Tobias; Laucht, Manfred

    2012-01-01

    Background Dopamine plays an important role in orienting, response anticipation and movement evaluation. Thus, we examined the influence of functional variants related to dopamine inactivation in the dopamine transporter (DAT1) and catechol-O-methyltransferase genes (COMT) on the time-course of motor processing in a contingent negative variation (CNV) task. Methods 64-channel EEG recordings were obtained from 195 healthy adolescents of a community-based sample during a continuous performance task (A-X version). Early and late CNV as well as motor postimperative negative variation were assessed. Adolescents were genotyped for the COMT Val158Met and two DAT1 polymorphisms (variable number tandem repeats in the 3′-untranslated region and in intron 8). Results The results revealed a significant interaction between COMT and DAT1, indicating that COMT exerted stronger effects on lateralized motor post-processing (centro-parietal motor postimperative negative variation) in homozygous carriers of a DAT1 haplotype increasing DAT1 expression. Source analysis showed that the time interval 500–1000 ms after the motor response was specifically affected in contrast to preceding movement anticipation and programming stages, which were not altered. Conclusions Motor slow negative waves allow the genomic imaging of dopamine inactivation effects on cortical motor post-processing during response evaluation. This is the first report to point towards epistatic effects in the motor system during response evaluation, i.e. during the post-processing of an already executed movement rather than during movement programming. PMID:22649558

  8. Comparative studies of copy number variation detection methods for next-generation sequencing technologies.

    PubMed

    Duan, Junbo; Zhang, Ji-Gang; Deng, Hong-Wen; Wang, Yu-Ping

    2013-01-01

    Copy number variation (CNV) has played an important role in studies of susceptibility or resistance to complex diseases. Traditional methods such as fluorescence in situ hybridization (FISH) and array comparative genomic hybridization (aCGH) suffer from low resolution of genomic regions. Following the emergence of next generation sequencing (NGS) technologies, CNV detection methods based on the short read data have recently been developed. However, due to the relatively young age of the procedures, their performance is not fully understood. To help investigators choose suitable methods to detect CNVs, comparative studies are needed. We compared six publicly available CNV detection methods: CNV-seq, FREEC, readDepth, CNVnator, SegSeq and event-wise testing (EWT). They are evaluated both on simulated and real data with different experiment settings. The receiver operating characteristic (ROC) curve is employed to demonstrate the detection performance in terms of sensitivity and specificity, box plot is employed to compare their performances in terms of breakpoint and copy number estimation, Venn diagram is employed to show the consistency among these methods, and F-score is employed to show the overlapping quality of detected CNVs. The computational demands are also studied. The results of our work provide a comprehensive evaluation on the performances of the selected CNV detection methods, which will help biological investigators choose the best possible method.

  9. Copy Number Variations in Tilapia Genomes.

    PubMed

    Li, Bi Jun; Li, Hong Lian; Meng, Zining; Zhang, Yong; Lin, Haoran; Yue, Gen Hua; Xia, Jun Hong

    2017-02-01

    Discovering the nature and pattern of genome variation is fundamental in understanding phenotypic diversity among populations. Although several millions of single nucleotide polymorphisms (SNPs) have been discovered in tilapia, the genome-wide characterization of larger structural variants, such as copy number variation (CNV) regions has not been carried out yet. We conducted a genome-wide scan for CNVs in 47 individuals from three tilapia populations. Based on 254 Gb of high-quality paired-end sequencing reads, we identified 4642 distinct high-confidence CNVs. These CNVs account for 1.9% (12.411 Mb) of the used Nile tilapia reference genome. A total of 1100 predicted CNVs were found overlapping with exon regions of protein genes. Further association analysis based on linear model regression found 85 CNVs ranging between 300 and 27,000 base pairs significantly associated to population types (R 2  > 0.9 and P > 0.001). Our study sheds first insights on genome-wide CNVs in tilapia. These CNVs among and within tilapia populations may have functional effects on phenotypes and specific adaptation to particular environments.

  10. Small Deletion Variants Have Stable Breakpoints Commonly Associated with Alu Elements

    PubMed Central

    Coin, Lachlan J. M.; Steinfeld, Israel; Yakhini, Zohar; Sladek, Rob; Froguel, Philippe; Blakemore, Alexandra I. F.

    2008-01-01

    Copy number variants (CNVs) contribute significantly to human genomic variation, with over 5000 loci reported, covering more than 18% of the euchromatic human genome. Little is known, however, about the origin and stability of variants of different size and complexity. We investigated the breakpoints of 20 small, common deletions, representing a subset of those originally identified by array CGH, using Agilent microarrays, in 50 healthy French Caucasian subjects. By sequencing PCR products amplified using primers designed to span the deleted regions, we determined the exact size and genomic position of the deletions in all affected samples. For each deletion studied, all individuals carrying the deletion share identical upstream and downstream breakpoints at the sequence level, suggesting that the deletion event occurred just once and later became common in the population. This is supported by linkage disequilibrium (LD) analysis, which has revealed that most of the deletions studied are in moderate to strong LD with surrounding SNPs, and have conserved long-range haplotypes. Analysis of the sequences flanking the deletion breakpoints revealed an enrichment of microhomology at the breakpoint junctions. More significantly, we found an enrichment of Alu repeat elements, the overwhelming majority of which intersected deletion breakpoints at their poly-A tails. We found no enrichment of LINE elements or segmental duplications, in contrast to other reports. Sequence analysis revealed enrichment of a conserved motif in the sequences surrounding the deletion breakpoints, although whether this motif has any mechanistic role in the formation of some deletions has yet to be determined. Considered together with existing information on more complex inherited variant regions, and reports of de novo variants associated with autism, these data support the presence of different subgroups of CNV in the genome which may have originated through different mechanisms. PMID:18769679

  11. Plasmodium copy number variation scan: gene copy numbers evaluation in haploid genomes.

    PubMed

    Beghain, Johann; Langlois, Anne-Claire; Legrand, Eric; Grange, Laura; Khim, Nimol; Witkowski, Benoit; Duru, Valentine; Ma, Laurence; Bouchier, Christiane; Ménard, Didier; Paul, Richard E; Ariey, Frédéric

    2016-04-12

    In eukaryotic genomes, deletion or amplification rates have been estimated to be a thousand more frequent than single nucleotide variation. In Plasmodium falciparum, relatively few transcription factors have been identified, and the regulation of transcription is seemingly largely influenced by gene amplification events. Thus copy number variation (CNV) is a major mechanism enabling parasite genomes to adapt to new environmental changes. Currently, the detection of CNVs is based on quantitative PCR (qPCR), which is significantly limited by the relatively small number of genes that can be analysed at any one time. Technological advances that facilitate whole-genome sequencing, such as next generation sequencing (NGS) enable deeper analyses of the genomic variation to be performed. Because the characteristics of Plasmodium CNVs need special consideration in algorithms and strategies for which classical CNV detection programs are not suited a dedicated algorithm to detect CNVs across the entire exome of P. falciparum was developed. This algorithm is based on a custom read depth strategy through NGS data and called PlasmoCNVScan. The analysis of CNV identification on three genes known to have different levels of amplification and which are located either in the nuclear, apicoplast or mitochondrial genomes is presented. The results are correlated with the qPCR experiments, usually used for identification of locus specific amplification/deletion. This tool will facilitate the study of P. falciparum genomic adaptation in response to ecological changes: drug pressure, decreased transmission, reduction of the parasite population size (transition to pre-elimination endemic area).

  12. Single Color Multiplexed ddPCR Copy Number Measurements and Single Nucleotide Variant Genotyping.

    PubMed

    Wood-Bouwens, Christina M; Ji, Hanlee P

    2018-01-01

    Droplet digital PCR (ddPCR) allows for accurate quantification of genetic events such as copy number variation and single nucleotide variants. Probe-based assays represent the current "gold-standard" for detection and quantification of these genetic events. Here, we introduce a cost-effective single color ddPCR assay that allows for single genome resolution quantification of copy number and single nucleotide variation.

  13. Lack of association of rs3798220 with small apolipoprotein(a) isoforms and high lipoprotein(a) levels in East and Southeast Asians.

    PubMed

    Khalifa, Mahmoud; Noureen, Asma; Ertelthalner, Kathrin; Bandegi, Ahmad Reza; Delport, Rhena; Firdaus, Wance J J; Geethanjali, Finney S; Luthra, Kalpana; Makemaharn, Orawan; Pang, Richard W C; Salem, Abdel-Halim; Sasaki, Jun; Schiefenhoevel, Wulf; Lingenhel, Arno; Kronenberg, Florian; Utermann, Gerd; Schmidt, Konrad

    2015-10-01

    The variant allele of rs3798220 in the apolipoprotein(a) gene (LPA) is used to assess the risk for coronary artery disease (CAD) in Europeans, where it is associated with short alleles of the Kringle IV-2 (KIV-2) copy number variation (CNV) and high lipoprotein(a) (Lp(a)) concentrations. No association of rs3798220 with CAD was detected in a GWAS of East Asians. Our study investigated the association of rs3798220 with Lp(a) concentrations and KIV-2 CNV size in non-European populations to explain the missing association of the variant with CAD in Asians. We screened three populations from Africa and seven from Asia by TaqMan Assay for rs3798220 and determined KIV-2 CNV sizes of LPA alleles by pulsed-field gel electrophoresis (PFGE). Additionally, CAD cases from India were analysed. To investigate the phylogenetic origin of rs3798220, 40 LPA alleles from Chinese individuals were separated by PFGE and haplotyped for further SNPs. The variant was not found in Africans. Allele frequencies in East and Southeast Asians ranged from 2.9% to 11.6%, and were very low (0.15%) in CAD cases and controls from India. The variant was neither associated with short KIV-2 CNV alleles nor elevated Lp(a) concentrations in Asians. Our study shows that rs3798220 is no marker for short KIV-2 CNV alleles and high Lp(a) in East and Southeast Asians, although the haplotype background is shared with Europeans. It appears unlikely that this SNP confers atherogenic potential on its own. Furthermore, this SNP does not explain Lp(a) attributed risk for CAD in Asian Indians. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.

  14. Genome wide analysis of rare copy number variations in alcohol abuse or dependence.

    PubMed

    Rodríguez-López, Julio; Flórez, Gerardo; Blanco, Vanessa; Pereiro, César; Fernández, José Manuel; Fariñas, Emilio; Estévez, Valentín; Gómez-Trigo, Jesús; Gurriarán, Xaquín; Calvo, Raquel; Sáiz, Pilar; Vázquez, Fernando Lino; Arrojo, Manuel; Costas, Javier

    2018-06-02

    Genetics plays an important role in alcohol abuse/dependence. Its heritability has been estimated as 45-65%. Rare copy number variations (CNVs) have been confirmed as relevant genetic factors in other neuropsychiatric disorders, such as autism spectrum disorders, schizophrenia, epilepsy, or Tourette syndrome. In the present study, we analyzed the role of rare CNVs affecting exons of coding genes in a sample from Northwest Spain genotyped using the Illumina Infinium PsychArray Beadchip. After rigorous genotyping quality control procedure, 712 patients with alcohol abuse or dependence and 804 controls were used for CNV detection. CNV calling was performed using PennCNV and cnvPartition, and analyses were restricted to CNVs of at least 100 kb and including at least 10 single nucleotide polymorphisms. Logistic regression was used to test for the effect of CNV as well as number of genes affected by CNVs on case/control status, after adjustment for demographic and experimental covariates. We have found an excess of deletions (p = 0.008) and genes affected by deletions (p = 0.017) in cases. This effect was restricted to the 14.8% of affected genes that are intolerant to loss-of-function mutations (gene count p = 0.009). The importance of this subset of genes is emerging in other psychiatric disorders of neurodevelopmental origin, suggesting that disturbance in neurodevelopment mediated by genetic alterations may be a risk factor for alcohol use disorder. Copyright © 2018 Elsevier Ltd. All rights reserved.

  15. Anaconda: AN automated pipeline for somatic COpy Number variation Detection and Annotation from tumor exome sequencing data.

    PubMed

    Gao, Jianing; Wan, Changlin; Zhang, Huan; Li, Ao; Zang, Qiguang; Ban, Rongjun; Ali, Asim; Yu, Zhenghua; Shi, Qinghua; Jiang, Xiaohua; Zhang, Yuanwei

    2017-10-03

    Copy number variations (CNVs) are the main genetic structural variations in cancer genome. Detecting CNVs in genetic exome region is efficient and cost-effective in identifying cancer associated genes. Many tools had been developed accordingly and yet these tools lack of reliability because of high false negative rate, which is intrinsically caused by genome exonic bias. To provide an alternative option, here, we report Anaconda, a comprehensive pipeline that allows flexible integration of multiple CNV-calling methods and systematic annotation of CNVs in analyzing WES data. Just by one command, Anaconda can generate CNV detection result by up to four CNV detecting tools. Associated with comprehensive annotation analysis of genes involved in shared CNV regions, Anaconda is able to deliver a more reliable and useful report in assistance with CNV-associate cancer researches. Anaconda package and manual can be freely accessed at http://mcg.ustc.edu.cn/bsc/ANACONDA/ .

  16. A Likelihood-Based Framework for Association Analysis of Allele-Specific Copy Numbers.

    PubMed

    Hu, Y J; Lin, D Y; Sun, W; Zeng, D

    2014-10-01

    Copy number variants (CNVs) and single nucleotide polymorphisms (SNPs) co-exist throughout the human genome and jointly contribute to phenotypic variations. Thus, it is desirable to consider both types of variants, as characterized by allele-specific copy numbers (ASCNs), in association studies of complex human diseases. Current SNP genotyping technologies capture the CNV and SNP information simultaneously via fluorescent intensity measurements. The common practice of calling ASCNs from the intensity measurements and then using the ASCN calls in downstream association analysis has important limitations. First, the association tests are prone to false-positive findings when differential measurement errors between cases and controls arise from differences in DNA quality or handling. Second, the uncertainties in the ASCN calls are ignored. We present a general framework for the integrated analysis of CNVs and SNPs, including the analysis of total copy numbers as a special case. Our approach combines the ASCN calling and the association analysis into a single step while allowing for differential measurement errors. We construct likelihood functions that properly account for case-control sampling and measurement errors. We establish the asymptotic properties of the maximum likelihood estimators and develop EM algorithms to implement the corresponding inference procedures. The advantages of the proposed methods over the existing ones are demonstrated through realistic simulation studies and an application to a genome-wide association study of schizophrenia. Extensions to next-generation sequencing data are discussed.

  17. tRNA gene copy number variation in humans

    PubMed Central

    Iben, James R.; Maraia, Richard J.

    2014-01-01

    The human tRNAome consists of more than 500 interspersed tRNA genes comprising 51 anticodon families of largely unequal copy number. We examined tRNA gene copy number variation (tgCNV) in six individuals; two kindreds of two parents and a child, using high coverage whole genome sequence data. Such differences may be important because translation of some mRNAs is sensitive to the relative amounts of tRNAs and because tRNA competition determines translational efficiency vs. fidelity and production of native vs. misfolded proteins. We identified several tRNA gene clusters with CNV, which in some cases were part of larger iterations. In addition there was an isolated tRNALysCUU gene that was absent as a homozygous deletion in one of the parents. When assessed by semiquantitative PCR in 98 DNA samples representing a wide variety of ethnicities, this allele was found deleted in hetero- or homozygosity in all groups at ~50% frequency. This is the first report of copy number variation of human tRNA genes. We conclude that tgCNV exists at significant levels among individual humans and discuss the results in terms of genetic diversity and prior genome wide association studies (GWAS) that suggest the importance of the ratio of tRNALys isoacceptors in Type-2 diabetes. PMID:24342656

  18. Association of TLR7 and TSHR copy number variation with Graves’ disease and Graves’ ophthalmopathy in Chinese population in Taiwan

    PubMed Central

    2014-01-01

    Background Graves’ disease (GD) and Graves’ ophthalmopathy (GO) are autoimmune disorders, which might be influenced by genetic factors. Copy number variation (CNV) is an important source of genomic diversity in humans, and influences disease susceptibility. This study investigated the association between CNV in the TSHR and TLR7 genes and the development of GD and GO in a Chinese population in Taiwan. Methods For this case-control study, sample from 196 healthy controls and 484 GD patients, including 203 patients with GO were studied. CNV was detected by real-time polymerase chain reaction (PCR) using TaqMan™ probes and the relative copy number (CN) was estimated by using the comparative Ct method. Results The differences in the distribution of TSHR CNV in healthy controls and GD patients were statistically significant (p value = 0.01). However, the difference in the distribution of TSHR CNV in the control group and the GO group was not statistically significant (p value = 0.06). For TLR7 CNV, the results were not significantly different when we compared the distribution in healthy controls and GD patients and in healthy controls and GO patients (p values for Fisher’s exact test were 0.13 and 0.09, respectively). However, a lower than normal CNV for TLR7 (CNV < 2 for female and CNV < 1 for male) was found to have a protective effect against the development of GD (odds ratio (OR) = 0.24; 95% confidence interval (CI), 0.07-0.75) after adjusting for age and gender. Conclusions These results suggested that TSHR and TLR7 CNV might be associated with susceptibility to GD. PMID:24517461

  19. BHD-associated kidney cancer exhibits unique molecular characteristics and a wide variety of variants in chromatin remodeling genes.

    PubMed

    Hasumi, Hisashi; Furuya, Mitsuko; Tatsuno, Kenji; Yamamoto, Shogo; Baba, Masaya; Hasumi, Yukiko; Isono, Yasuhiro; Suzuki, Kae; Jikuya, Ryosuke; Otake, Shinji; Muraoka, Kentaro; Osaka, Kimito; Hayashi, Narihiko; Makiyama, Kazuhide; Miyoshi, Yasuhide; Kondo, Keiichi; Nakaigawa, Noboru; Kawahara, Takashi; Izumi, Koji; Teranishi, Junichi; Yumura, Yasushi; Uemura, Hiroji; Nagashima, Yoji; Metwalli, Adam R; Schmidt, Laura S; Aburatani, Hiroyuki; Linehan, W Marston; Yao, Masahiro

    2018-05-14

    Birt-Hogg-Dubé (BHD) syndrome is a hereditary kidney cancer syndrome, which predisposes patients to develop kidney cancer, cutaneous fibrofolliculomas and pulmonary cysts. The responsible gene FLCN is a tumor suppressor for kidney cancer which plays an important role in energy homeostasis through the regulation of mitochondrial oxidative metabolism. However, the process by which FLCN-deficiency leads to renal tumorigenesis is unclear. In order to clarify molecular pathogenesis of BHD-associated kidney cancer, we conducted whole-exome sequencing analysis using next-generation sequencing technology as well as metabolite analysis using LC/MS and GC/MS. Whole-exome sequencing analysis of BHD-associated kidney cancer revealed that copy number variations (CNV) of BHD-associated kidney cancer are considerably different from those already reported in sporadic cases. In somatic variant analysis, very few variants were commonly observed in BHD-associated kidney cancer; however, variants in chromatin remodeling genes were frequently observed in BHD-associated kidney cancer (17/29 tumors, 59%). Metabolite analysis of BHD-associated kidney cancer revealed metabolic reprogramming towards upregulated redox regulation which may neutralize reactive oxygen species potentially produced from mitochondria with increased respiratory capacity under FLCN-deficiency. BHD-associated kidney cancer displays unique molecular characteristics which are completely different from sporadic kidney cancer, providing mechanistic insight into tumorigenesis under FLCN-deficiency as well as a foundation for development of novel therapeutics for kidney cancer.

  20. Copy number variation as a genetic basis for heterotaxy and heterotaxy-spectrum congenital heart defects.

    PubMed

    Cowan, Jason R; Tariq, Muhammad; Shaw, Chad; Rao, Mitchell; Belmont, John W; Lalani, Seema R; Smolarek, Teresa A; Ware, Stephanie M

    2016-12-19

    Genomic disorders and rare copy number abnormalities are identified in 15-25% of patients with syndromic conditions, but their prevalence in individuals with isolated birth defects is less clear. A spectrum of congenital heart defects (CHDs) is seen in heterotaxy, a highly heritable and genetically heterogeneous multiple congenital anomaly syndrome resulting from failure to properly establish left-right (L-R) organ asymmetry during early embryonic development. To identify novel genetic causes of heterotaxy, we analysed copy number variants (CNVs) in 225 patients with heterotaxy and heterotaxy-spectrum CHDs using array-based genotyping methods. Clinically relevant CNVs were identified in approximately 20% of patients and encompassed both known and putative heterotaxy genes. Patients were carefully phenotyped, revealing a significant association of abdominal situs inversus with pathogenic or likely pathogenic CNVs, while d-transposition of the great arteries was more frequently associated with common CNVs. Identified cytogenetic abnormalities ranged from large unbalanced translocations to smaller, kilobase-scale CNVs, including a rare, single exon deletion in ZIC3, a gene known to cause X-linked heterotaxy. Morpholino loss-of-function experiments in Xenopus support a role for one of these novel candidates, the platelet isoform of phosphofructokinase-1 (PFKP) in heterotaxy. Collectively, our results confirm a high CNV yield for array-based testing in patients with heterotaxy, and support use of CNV analysis for identification of novel biological processes relevant to human laterality.This article is part of the themed issue 'Provocative questions in left-right asymmetry'. © 2016 The Author(s).

  1. Rare Copy Number Variation in Treatment-Resistant Major Depressive Disorder

    PubMed Central

    O’Dushlaine, Colm; Ripke, Stephan; Ruderfer, Douglas M.; Hamilton, Steven P.; Fava, Maurizio; Iosifescu, Dan V.; Kohane, Isaac S.; Churchill, Susanne E.; Castro, Victor M.; Clements, Caitlin C.; Blumenthal, Sarah R.; Murphy, Shawn N.; Smoller, Jordan W.; Perlis, Roy H.

    2014-01-01

    Background While antidepressant treatment response appears to be partially heritable, no consistent genetic associations have been identified. Large, rare copy number variants (CNVs) play a role in other neuropsychiatric diseases, so we assessed their association with treatment-resistant depression (TRD). Methods We analyzed data from two genome-wide association studies comprising 1263 Caucasian patients with major depressive disorder. One was drawn from a large health system by applying natural language processing to electronic health records (i2b2 cohort). The second consisted of a multicenter study of sequential antidepressant treatments, Sequenced Treatment Alternatives to Relieve Depression. The Birdsuite package was used to identify rare deletions and duplications. Individuals without symptomatic remission, despite two antidepressant treatment trials, were contrasted with those who remitted with a first treatment trial. Results CNV data were derived for 778 subjects in the i2b2 cohort, including 300 subjects (37%) with TRD, and 485 subjects in Sequenced Treatment Alternatives to Relieve Depression cohort, including 152 (31%) with TRD. CNV burden analyses identified modest enrichment of duplications in cases (empirical p = .04 for duplications of 100–200 kilobase) and a particular deletion region spanning gene PABPC4L (empirical p = .02, 6 cases: 0 controls). Pathway analysis suggested enrichment of CNVs intersecting genes regulating actin cytoskeleton. However, none of these associations survived genome-wide correction. Conclusions Contribution of rare CNVs to TRD appears to be modest, individually or in aggregate. The electronic health record-based methodology demonstrated here should facilitate collection of larger TRD cohorts necessary to further characterize these effects. PMID:24529801

  2. High-resolution copy number variation analysis of schizophrenia in Japan.

    PubMed

    Kushima, I; Aleksic, B; Nakatochi, M; Shimamura, T; Shiino, T; Yoshimi, A; Kimura, H; Takasaki, Y; Wang, C; Xing, J; Ishizuka, K; Oya-Ito, T; Nakamura, Y; Arioka, Y; Maeda, T; Yamamoto, M; Yoshida, M; Noma, H; Hamada, S; Morikawa, M; Uno, Y; Okada, T; Iidaka, T; Iritani, S; Yamamoto, T; Miyashita, M; Kobori, A; Arai, M; Itokawa, M; Cheng, M-C; Chuang, Y-A; Chen, C-H; Suzuki, M; Takahashi, T; Hashimoto, R; Yamamori, H; Yasuda, Y; Watanabe, Y; Nunokawa, A; Someya, T; Ikeda, M; Toyota, T; Yoshikawa, T; Numata, S; Ohmori, T; Kunimoto, S; Mori, D; Iwata, N; Ozaki, N

    2017-03-01

    Recent schizophrenia (SCZ) studies have reported an increased burden of de novo copy number variants (CNVs) and identified specific high-risk CNVs, although with variable phenotype expressivity. However, the pathogenesis of SCZ has not been fully elucidated. Using array comparative genomic hybridization, we performed a high-resolution genome-wide CNV analysis on a mainly (92%) Japanese population (1699 SCZ cases and 824 controls) and identified 7066 rare CNVs, 70.0% of which were small (<100 kb). Clinically significant CNVs were significantly more frequent in cases than in controls (odds ratio=3.04, P=9.3 × 10 -9 , 9.0% of cases). We confirmed a significant association of X-chromosome aneuploidies with SCZ and identified 11 de novo CNVs (e.g., MBD5 deletion) in cases. In patients with clinically significant CNVs, 41.7% had a history of congenital/developmental phenotypes, and the rate of treatment resistance was significantly higher (odds ratio=2.79, P=0.0036). We found more severe clinical manifestations in patients with two clinically significant CNVs. Gene set analysis replicated previous findings (e.g., synapse, calcium signaling) and identified novel biological pathways including oxidative stress response, genomic integrity, kinase and small GTPase signaling. Furthermore, involvement of multiple SCZ candidate genes and biological pathways in the pathogenesis of SCZ was suggested in established SCZ-associated CNV loci. Our study shows the high genetic heterogeneity of SCZ and its clinical features and raises the possibility that genomic instability is involved in its pathogenesis, which may be related to the increased burden of de novo CNVs and variable expressivity of CNVs.

  3. Association between sequence variants in panicle development genes and the number of spikelets per panicle in rice.

    PubMed

    Jang, Su; Lee, Yunjoo; Lee, Gileung; Seo, Jeonghwan; Lee, Dongryung; Yu, Yoye; Chin, Joong Hyoun; Koh, Hee-Jong

    2018-01-15

    Balancing panicle-related traits such as panicle length and the numbers of primary and secondary branches per panicle, is key to improving the number of spikelets per panicle in rice. Identifying genetic information contributes to a broader understanding of the roles of gene and provides candidate alleles for use as DNA markers. Discovering relations between panicle-related traits and sequence variants allows opportunity for molecular application in rice breeding to improve the number of spikelets per panicle. In total, 142 polymorphic sites, which constructed 58 haplotypes, were detected in coding regions of ten panicle development gene and 35 sequence variants in six genes were significantly associated with panicle-related traits. Rice cultivars were clustered according to their sequence variant profiles. One of the four resultant clusters, which contained only indica and tong-il varieties, exhibited the largest average number of favorable alleles and highest average number of spikelets per panicle, suggesting that the favorable allele combination found in this cluster was beneficial in increasing the number of spikelets per panicle. Favorable alleles identified in this study can be used to develop functional markers for rice breeding programs. Furthermore, stacking several favorable alleles has the potential to substantially improve the number of spikelets per panicle in rice.

  4. Fine mapping of copy number variations on two cattle genome assemblies using high density SNP array

    USDA-ARS?s Scientific Manuscript database

    Btau_4.0 and UMD3.1 are two distinct cattle reference genome assemblies. In our previous study using the low density BovineSNP50 array, we reported a copy number variation (CNV) analysis on Btau_4.0 with 521 animals of 21 cattle breeds, yielding 682 CNV regions with a total length of 139.8 megabases...

  5. Evaluation of copy number variation detection for a SNP array platform

    PubMed Central

    2014-01-01

    Background Copy Number Variations (CNVs) are usually inferred from Single Nucleotide Polymorphism (SNP) arrays by use of some software packages based on given algorithms. However, there is no clear understanding of the performance of these software packages; it is therefore difficult to select one or several software packages for CNV detection based on the SNP array platform. We selected four publicly available software packages designed for CNV calling from an Affymetrix SNP array, including Birdsuite, dChip, Genotyping Console (GTC) and PennCNV. The publicly available dataset generated by Array-based Comparative Genomic Hybridization (CGH), with a resolution of 24 million probes per sample, was considered to be the “gold standard”. Compared with the CGH-based dataset, the success rate, average stability rate, sensitivity, consistence and reproducibility of these four software packages were assessed compared with the “gold standard”. Specially, we also compared the efficiency of detecting CNVs simultaneously by two, three and all of the software packages with that by a single software package. Results Simply from the quantity of the detected CNVs, Birdsuite detected the most while GTC detected the least. We found that Birdsuite and dChip had obvious detecting bias. And GTC seemed to be inferior because of the least amount of CNVs it detected. Thereafter we investigated the detection consistency produced by one certain software package and the rest three software suits. We found that the consistency of dChip was the lowest while GTC was the highest. Compared with the CNVs detecting result of CGH, in the matching group, GTC called the most matching CNVs, PennCNV-Affy ranked second. In the non-overlapping group, GTC called the least CNVs. With regards to the reproducibility of CNV calling, larger CNVs were usually replicated better. PennCNV-Affy shows the best consistency while Birdsuite shows the poorest. Conclusion We found that PennCNV outperformed the

  6. Population-genetic nature of copy number variations in the human genome.

    PubMed

    Kato, Mamoru; Kawaguchi, Takahisa; Ishikawa, Shumpei; Umeda, Takayoshi; Nakamichi, Reiichiro; Shapero, Michael H; Jones, Keith W; Nakamura, Yusuke; Aburatani, Hiroyuki; Tsunoda, Tatsuhiko

    2010-03-01

    Copy number variations (CNVs) are universal genetic variations, and their association with disease has been increasingly recognized. We designed high-density microarrays for CNVs, and detected 3000-4000 CNVs (4-6% of the genomic sequence) per population that included CNVs previously missed because of smaller sizes and residing in segmental duplications. The patterns of CNVs across individuals were surprisingly simple at the kilo-base scale, suggesting the applicability of a simple genetic analysis for these genetic loci. We utilized the probabilistic theory to determine integer copy numbers of CNVs and employed a recently developed phasing tool to estimate the population frequencies of integer copy number alleles and CNV-SNP haplotypes. The results showed a tendency toward a lower frequency of CNV alleles and that most of our CNVs were explained only by zero-, one- and two-copy alleles. Using the estimated population frequencies, we found several CNV regions with exceptionally high population differentiation. Investigation of CNV-SNP linkage disequilibrium (LD) for 500-900 bi- and multi-allelic CNVs per population revealed that previous conflicting reports on bi-allelic LD were unexpectedly consistent and explained by an LD increase correlated with deletion-allele frequencies. Typically, the bi-allelic LD was lower than SNP-SNP LD, whereas the multi-allelic LD was somewhat stronger than the bi-allelic LD. After further investigation of tag SNPs for CNVs, we conclude that the customary tagging strategy for disease association studies can be applicable for common deletion CNVs, but direct interrogation is needed for other types of CNVs.

  7. Contemporary evolution of resistance at the major insecticide target site gene Ace-1 by mutation and copy number variation in the malaria mosquito Anopheles gambiae.

    PubMed

    Weetman, David; Mitchell, Sara N; Wilding, Craig S; Birks, Daniel P; Yawson, Alexander E; Essandoh, John; Mawejje, Henry D; Djogbenou, Luc S; Steen, Keith; Rippon, Emily J; Clarkson, Christopher S; Field, Stuart G; Rigden, Daniel J; Donnelly, Martin J

    2015-06-01

    Functionally constrained genes are ideal insecticide targets because disruption is often fatal, and resistance mutations are typically costly. Synaptic acetylcholinesterase (AChE) is an essential neurotransmission enzyme targeted by insecticides used increasingly in malaria control. In Anopheles and Culex mosquitoes, a glycine-serine substitution at codon 119 of the Ace-1 gene confers both resistance and fitness costs, especially for 119S/S homozygotes. G119S in Anopheles gambiae from Accra (Ghana) is strongly associated with resistance, and, despite expectations of cost, resistant 119S alleles are increasing significantly in frequency. Sequencing of Accra females detected only a single Ace-1 119S haplotype, whereas 119G diversity was high overall but very low at non-synonymous sites, evidence of strong purifying selection driven by functional constraint. Flanking microsatellites showed reduced diversity, elevated linkage disequilibrium and high differentiation of 119S, relative to 119G homozygotes across up to two megabases of the genome. Yet these signals of selection were inconsistent and sometimes weak tens of kilobases from Ace-1. This unexpected finding is attributable to apparently ubiquitous amplification of 119S alleles as part of a large copy number variant (CNV) far exceeding the size of the Ace-1 gene, whereas 119G alleles were unduplicated. Ace-1 CNV was detectable in archived samples collected when the 119S allele was rare in Ghana. Multicopy amplification of resistant alleles has not been observed previously and is likely to underpin the recent increase in 119S frequency. The large CNV compromised localization of the strong selective sweep around Ace-1, emphasizing the need to integrate CNV analysis into genome scans for selection. © 2015 The Authors. Molecular Ecology published by John Wiley & Sons Ltd.

  8. Diversity and population-genetic properties of copy number variations and multicopy genes in cattle

    PubMed Central

    Bickhart, Derek M.; Xu, Lingyang; Hutchison, Jana L.; Cole, John B.; Null, Daniel J.; Schroeder, Steven G.; Song, Jiuzhou; Garcia, Jose Fernando; Sonstegard, Tad S.; Van Tassell, Curtis P.; Schnabel, Robert D.; Taylor, Jeremy F.; Lewin, Harris A.; Liu, George E.

    2016-01-01

    The diversity and population genetics of copy number variation (CNV) in domesticated animals are not well understood. In this study, we analysed 75 genomes of major taurine and indicine cattle breeds (including Angus, Brahman, Gir, Holstein, Jersey, Limousin, Nelore, and Romagnola), sequenced to 11-fold coverage to identify 1,853 non-redundant CNV regions. Supported by high validation rates in array comparative genomic hybridization (CGH) and qPCR experiments, these CNV regions accounted for 3.1% (87.5 Mb) of the cattle reference genome, representing a significant increase over previous estimates of the area of the genome that is copy number variable (∼2%). Further population genetics and evolutionary genomics analyses based on these CNVs revealed the population structures of the cattle taurine and indicine breeds and uncovered potential diversely selected CNVs near important functional genes, including AOX1, ASZ1, GAT, GLYAT, and KRTAP9-1. Additionally, 121 CNV gene regions were found to be either breed specific or differentially variable across breeds, such as RICTOR in dairy breeds and PNPLA3 in beef breeds. In contrast, clusters of the PRP and PAG genes were found to be duplicated in all sequenced animals, suggesting that subfunctionalization, neofunctionalization, or overdominance play roles in diversifying those fertility-related genes. These CNV results provide a new glimpse into the diverse selection histories of cattle breeds and a basis for correlating structural variation with complex traits in the future. PMID:27085184

  9. Novel Population Specific Autosomal Copy Number Variation and Its Functional Analysis amongst Negritos from Peninsular Malaysia

    PubMed Central

    Mokhtar, Siti Shuhada; Marshall, Christian R.; Phipps, Maude E.; Thiruvahindrapuram, Bhooma; Lionel, Anath C.; Scherer, Stephen W.; Peng, Hoh Boon

    2014-01-01

    Copy number variation (CNV) has been recognized as a major contributor to human genome diversity. It plays an important role in determining phenotypes and has been associated with a number of common and complex diseases. However CNV data from diverse populations is still limited. Here we report the first investigation of CNV in the indigenous populations from Peninsular Malaysia. We genotyped 34 Negrito genomes from Peninsular Malaysia using the Affymetrix SNP 6.0 microarray and identified 48 putative novel CNVs, consisting of 24 gains and 24 losses, of which 5 were identified in at least 2 unrelated samples. These CNVs appear unique to the Negrito population and were absent in the DGV, HapMap3 and Singapore Genome Variation Project (SGVP) datasets. Analysis of gene ontology revealed that genes within these CNVs were enriched in the immune system (GO:0002376), response to stimulus mechanisms (GO:0050896), the metabolic pathways (GO:0001852), as well as regulation of transcription (GO:0006355). Copy number gains in CNV regions (CNVRs) enriched with genes were significantly higher than the losses (P value <0.001). In view of the small population size, relative isolation and semi-nomadic lifestyles of this community, we speculate that these CNVs may be attributed to recent local adaptation of Negritos from Peninsular Malaysia. PMID:24956385

  10. Novel population specific autosomal copy number variation and its functional analysis amongst Negritos from Peninsular Malaysia.

    PubMed

    Mokhtar, Siti Shuhada; Marshall, Christian R; Phipps, Maude E; Thiruvahindrapuram, Bhooma; Lionel, Anath C; Scherer, Stephen W; Peng, Hoh Boon

    2014-01-01

    Copy number variation (CNV) has been recognized as a major contributor to human genome diversity. It plays an important role in determining phenotypes and has been associated with a number of common and complex diseases. However CNV data from diverse populations is still limited. Here we report the first investigation of CNV in the indigenous populations from Peninsular Malaysia. We genotyped 34 Negrito genomes from Peninsular Malaysia using the Affymetrix SNP 6.0 microarray and identified 48 putative novel CNVs, consisting of 24 gains and 24 losses, of which 5 were identified in at least 2 unrelated samples. These CNVs appear unique to the Negrito population and were absent in the DGV, HapMap3 and Singapore Genome Variation Project (SGVP) datasets. Analysis of gene ontology revealed that genes within these CNVs were enriched in the immune system (GO:0002376), response to stimulus mechanisms (GO:0050896), the metabolic pathways (GO:0001852), as well as regulation of transcription (GO:0006355). Copy number gains in CNV regions (CNVRs) enriched with genes were significantly higher than the losses (P value <0.001). In view of the small population size, relative isolation and semi-nomadic lifestyles of this community, we speculate that these CNVs may be attributed to recent local adaptation of Negritos from Peninsular Malaysia.

  11. The Role of mGluR Copy Number Variation in Genetic and Environmental Forms of Syndromic Autism Spectrum Disorder.

    PubMed

    Wenger, Tara L; Kao, Charlly; McDonald-McGinn, Donna M; Zackai, Elaine H; Bailey, Alice; Schultz, Robert T; Morrow, Bernice E; Emanuel, Beverly S; Hakonarson, Hakon

    2016-01-19

    While abnormal signaling mediated through metabotropic glutamate receptor 5 (mGluR5) is involved in the pathophysiology of Autism Spectrum Disorder (ASD), Fragile X Syndrome and Tuberous Sclerosis, the role of other mGluRs and their associated signaling network genes in syndromic ASD is unknown. This study sought to determine whether mGluR Copy Number Variants (CNV's) were overrepresented in children with syndromic ASD and if mGluR "second hit" confers additional risk for ASD in 22q11.2 Deletion Syndrome (22q11DS). To determine whether mGluR network CNV'S are enriched in syndromic ASD, we examined microarrays from children with ASD (n = 539). Patient categorization (syndromic vs nonsyndromic) was done via blinded medical chart review in mGluR positive and randomly selected mGluR negative cases. 11.5% of ASD had mGluR CNV's vs. 3.2% in controls (p < 0.001). Syndromic ASD was more prevalent in children with mGluR CNVs (74% vs 16%, p < 0.001). A comparison cohort with 22q11DS (n = 25 with ASD, n = 50 without ASD), all haploinsufficient for mGluR network gene RANBP1, were evaluated for "second mGluR hits". 20% with 22q11.2DS + ASD had "second hits" in mGluR network genes vs 2% in 22q11.2DS-ASD (p < 0.014). We propose that altered RANBP1 expression may provide a mechanistic link for several seemingly unrelated genetic and environmental forms of ASD.

  12. Detection and quantitation of chromosomal mosaicism in human blastocysts using copy number variation sequencing.

    PubMed

    Ruttanajit, Tida; Chanchamroen, Sujin; Cram, David S; Sawakwongpra, Kritchakorn; Suksalak, Wanwisa; Leng, Xue; Fan, Junmei; Wang, Li; Yao, Yuanqing; Quangkananurug, Wiwat

    2016-02-01

    Currently, our understanding of the nature and reproductive potential of blastocysts associated with trophectoderm (TE) lineage chromosomal mosaicism is limited. The objective of this study was to first validate copy number variation sequencing (CNV-Seq) for measuring the level of mosaicism and second, examine the nature and level of mosaicism in TE biopsies of patient's blastocysts. TE biopy samples were analysed by array comparative genomic hybridization (CGH) and CNV-Seq to discriminate between euploid, aneuploid and mosaic blastocysts. Using artificial models of TE mosaicism for five different chromosomes, CNV-Seq accurately and reproducibly quantitated mosaicism at levels of 50% and 20%. In a comparative 24-chromosome study of 49 blastocysts by array CGH and CNV-Seq, 43 blastocysts (87.8%) had a concordant diagnosis and 6 blastocysts (12.2%) were discordant. The discordance was attributed to low to medium levels of chromosomal mosaicism (30-70%) not detected by array CGH. In an expanded study of 399 blastocysts using CNV-Seq as the sole diagnostic method, the proportion of diploid-aneuploid mosaics (34, 8.5%) was significantly higher than aneuploid mosaics (18, 4.5%) (p < 0.02). Mosaicism is a significant chromosomal abnormality associated with the TE lineage of human blastocysts that can be reliably and accurately detected by CNV-Seq. © 2015 John Wiley & Sons, Ltd.

  13. The use of population-scale sequencing to identify CNVs impacting productive traits in different cattle breeds

    USDA-ARS?s Scientific Manuscript database

    Individualized copy number variation (CNV) maps have highlighted the need for population surveys of cattle to detect rare and common variants. While SNP and comparative genomic hybridization (CGH) arrays have provided preliminary data, next-generation sequence (NGS) data analysis offers an increased...

  14. Population sequencing reveals breed and sub-species specific CNVs in cattle

    USDA-ARS?s Scientific Manuscript database

    Individualized copy number variation (CNV) maps have highlighted the need for population surveys of cattle to detect rare and common variants. While SNP and comparative genomic hybridization (CGH) arrays have provided preliminary data, next-generation sequence (NGS) data analysis offers an increased...

  15. Population sequencing reveals breed and sub-species specific CNVs in cattle

    USDA-ARS?s Scientific Manuscript database

    Individualized copy number variation (CNV) maps have highlighted the need for population surveys of cattle to detect the rare and common variants. While SNP and comparative genomic hybridization (CGH) arrays have provided preliminary data, next-generation sequence (NGS) data analysis offers an incre...

  16. Comprehensive genomic analysis of patients with disorders of cerebral cortical development.

    PubMed

    Wiszniewski, Wojciech; Gawlinski, Pawel; Gambin, Tomasz; Bekiesinska-Figatowska, Monika; Obersztyn, Ewa; Antczak-Marach, Dorota; Akdemir, Zeynep Hande Coban; Harel, Tamar; Karaca, Ender; Jurek, Marta; Sobecka, Katarzyna; Nowakowska, Beata; Kruk, Malgorzata; Terczynska, Iwona; Goszczanska-Ciuchta, Alicja; Rudzka-Dybala, Mariola; Jamroz, Ewa; Pyrkosz, Antoni; Jakubiuk-Tomaszuk, Anna; Iwanowski, Piotr; Gieruszczak-Bialek, Dorota; Piotrowicz, Malgorzata; Sasiadek, Maria; Kochanowska, Iwona; Gurda, Barbara; Steinborn, Barbara; Dawidziuk, Mateusz; Castaneda, Jennifer; Wlasienko, Pawel; Bezniakow, Natalia; Jhangiani, Shalini N; Hoffman-Zacharska, Dorota; Bal, Jerzy; Szczepanik, Elzbieta; Boerwinkle, Eric; Gibbs, Richard A; Lupski, James R

    2018-04-30

    Malformations of cortical development (MCDs) manifest with structural brain anomalies that lead to neurologic sequelae, including epilepsy, cerebral palsy, developmental delay, and intellectual disability. To investigate the underlying genetic architecture of patients with disorders of cerebral cortical development, a cohort of 54 patients demonstrating neuroradiologic signs of MCDs was investigated. Individual genomes were interrogated for single-nucleotide variants (SNV) and copy number variants (CNV) with whole-exome sequencing and chromosomal microarray studies. Variation affecting known MCDs-associated genes was found in 16/54 cases, including 11 patients with SNV, 2 patients with CNV, and 3 patients with both CNV and SNV, at distinct loci. Diagnostic pathogenic SNV and potentially damaging variants of unknown significance (VUS) were identified in two groups of seven individuals each. We demonstrated that de novo variants are important among patients with MCDs as they were identified in 10/16 individuals with a molecular diagnosis. Three patients showed changes in known MCDs genes  and a clinical phenotype beyond the usual characteristics observed, i.e., phenotypic expansion, for a particular known disease gene clinical entity. We also discovered 2 likely candidate genes, CDH4, and ASTN1, with human and animal studies supporting their roles in brain development, and 5 potential candidate genes. Our findings emphasize genetic heterogeneity of MCDs disorders and postulate potential novel candidate genes involved in cerebral cortical development.

  17. A computational method for detecting copy number variations using scale-space filtering

    PubMed Central

    2013-01-01

    Background As next-generation sequencing technology made rapid and cost-effective sequencing available, the importance of computational approaches in finding and analyzing copy number variations (CNVs) has been amplified. Furthermore, most genome projects need to accurately analyze sequences with fairly low-coverage read data. It is urgently needed to develop a method to detect the exact types and locations of CNVs from low coverage read data. Results Here, we propose a new CNV detection method, CNV_SS, which uses scale-space filtering. The scale-space filtering is evaluated by applying to the read coverage data the Gaussian convolution for various scales according to a given scaling parameter. Next, by differentiating twice and finding zero-crossing points, inflection points of scale-space filtered read coverage data are calculated per scale. Then, the types and the exact locations of CNVs are obtained by analyzing the finger print map, the contours of zero-crossing points for various scales. Conclusions The performance of CNV_SS showed that FNR and FPR stay in the range of 1.27% to 2.43% and 1.14% to 2.44%, respectively, even at a relatively low coverage (0.5x ≤C ≤2x). CNV_SS gave also much more effective results than the conventional methods in the evaluation of FNR, at 3.82% at least and 76.97% at most even when the coverage level of read data is low. CNV_SS source code is freely available from http://dblab.hallym.ac.kr/CNV SS/. PMID:23418726

  18. Truncating Variants in NAA15 Are Associated with Variable Levels of Intellectual Disability, Autism Spectrum Disorder, and Congenital Anomalies.

    PubMed

    Cheng, Hanyin; Dharmadhikari, Avinash V; Varland, Sylvia; Ma, Ning; Domingo, Deepti; Kleyner, Robert; Rope, Alan F; Yoon, Margaret; Stray-Pedersen, Asbjørg; Posey, Jennifer E; Crews, Sarah R; Eldomery, Mohammad K; Akdemir, Zeynep Coban; Lewis, Andrea M; Sutton, Vernon R; Rosenfeld, Jill A; Conboy, Erin; Agre, Katherine; Xia, Fan; Walkiewicz, Magdalena; Longoni, Mauro; High, Frances A; van Slegtenhorst, Marjon A; Mancini, Grazia M S; Finnila, Candice R; van Haeringen, Arie; den Hollander, Nicolette; Ruivenkamp, Claudia; Naidu, Sakkubai; Mahida, Sonal; Palmer, Elizabeth E; Murray, Lucinda; Lim, Derek; Jayakar, Parul; Parker, Michael J; Giusto, Stefania; Stracuzzi, Emanuela; Romano, Corrado; Beighley, Jennifer S; Bernier, Raphael A; Küry, Sébastien; Nizon, Mathilde; Corbett, Mark A; Shaw, Marie; Gardner, Alison; Barnett, Christopher; Armstrong, Ruth; Kassahn, Karin S; Van Dijck, Anke; Vandeweyer, Geert; Kleefstra, Tjitske; Schieving, Jolanda; Jongmans, Marjolijn J; de Vries, Bert B A; Pfundt, Rolph; Kerr, Bronwyn; Rojas, Samantha K; Boycott, Kym M; Person, Richard; Willaert, Rebecca; Eichler, Evan E; Kooy, R Frank; Yang, Yaping; Wu, Joseph C; Lupski, James R; Arnesen, Thomas; Cooper, Gregory M; Chung, Wendy K; Gecz, Jozef; Stessman, Holly A F; Meng, Linyan; Lyon, Gholson J

    2018-05-03

    N-alpha-acetylation is a common co-translational protein modification that is essential for normal cell function in humans. We previously identified the genetic basis of an X-linked infantile lethal Mendelian disorder involving a c.109T>C (p.Ser37Pro) missense variant in NAA10, which encodes the catalytic subunit of the N-terminal acetyltransferase A (NatA) complex. The auxiliary subunit of the NatA complex, NAA15, is the dimeric binding partner for NAA10. Through a genotype-first approach with whole-exome or genome sequencing (WES/WGS) and targeted sequencing analysis, we identified and phenotypically characterized 38 individuals from 33 unrelated families with 25 different de novo or inherited, dominantly acting likely gene disrupting (LGD) variants in NAA15. Clinical features of affected individuals with LGD variants in NAA15 include variable levels of intellectual disability, delayed speech and motor milestones, and autism spectrum disorder. Additionally, mild craniofacial dysmorphology, congenital cardiac anomalies, and seizures are present in some subjects. RNA analysis in cell lines from two individuals showed degradation of the transcripts with LGD variants, probably as a result of nonsense-mediated decay. Functional assays in yeast confirmed a deleterious effect for two of the LGD variants in NAA15. Further supporting a mechanism of haploinsufficiency, individuals with copy-number variant (CNV) deletions involving NAA15 and surrounding genes can present with mild intellectual disability, mild dysmorphic features, motor delays, and decreased growth. We propose that defects in NatA-mediated N-terminal acetylation (NTA) lead to variable levels of neurodevelopmental disorders in humans, supporting the importance of the NatA complex in normal human development. Copyright © 2018 American Society of Human Genetics. All rights reserved.

  19. Genome-wide patterns of copy number variation in the diversified chicken genomes using next-generation sequencing.

    PubMed

    Yi, Guoqiang; Qu, Lujiang; Liu, Jianfeng; Yan, Yiyuan; Xu, Guiyun; Yang, Ning

    2014-11-07

    Copy number variation (CNV) is important and widespread in the genome, and is a major cause of disease and phenotypic diversity. Herein, we performed a genome-wide CNV analysis in 12 diversified chicken genomes based on whole genome sequencing. A total of 8,840 CNV regions (CNVRs) covering 98.2 Mb and representing 9.4% of the chicken genome were identified, ranging in size from 1.1 to 268.8 kb with an average of 11.1 kb. Sequencing-based predictions were confirmed at a high validation rate by two independent approaches, including array comparative genomic hybridization (aCGH) and quantitative PCR (qPCR). The Pearson's correlation coefficients between sequencing and aCGH results ranged from 0.435 to 0.755, and qPCR experiments revealed a positive validation rate of 91.71% and a false negative rate of 22.43%. In total, 2,214 (25.0%) predicted CNVRs span 2,216 (36.4%) RefSeq genes associated with specific biological functions. Besides two previously reported copy number variable genes EDN3 and PRLR, we also found some promising genes with potential in phenotypic variation. Two genes, FZD6 and LIMS1, related to disease susceptibility/resistance are covered by CNVRs. The highly duplicated SOCS2 may lead to higher bone mineral density. Entire or partial duplication of some genes like POPDC3 may have great economic importance in poultry breeding. Our results based on extensive genetic diversity provide a more refined chicken CNV map and genome-wide gene copy number estimates, and warrant future CNV association studies for important traits in chickens.

  20. Accurately Assessing the Risk of Schizophrenia Conferred by Rare Copy-Number Variation Affecting Genes with Brain Function

    PubMed Central

    Raychaudhuri, Soumya; Korn, Joshua M.; McCarroll, Steven A.; Altshuler, David; Sklar, Pamela; Purcell, Shaun; Daly, Mark J.

    2010-01-01

    Investigators have linked rare copy number variation (CNVs) to neuropsychiatric diseases, such as schizophrenia. One hypothesis is that CNV events cause disease by affecting genes with specific brain functions. Under these circumstances, we expect that CNV events in cases should impact brain-function genes more frequently than those events in controls. Previous publications have applied “pathway” analyses to genes within neuropsychiatric case CNVs to show enrichment for brain-functions. While such analyses have been suggestive, they often have not rigorously compared the rates of CNVs impacting genes with brain function in cases to controls, and therefore do not address important confounders such as the large size of brain genes and overall differences in rates and sizes of CNVs. To demonstrate the potential impact of confounders, we genotyped rare CNV events in 2,415 unaffected controls with Affymetrix 6.0; we then applied standard pathway analyses using four sets of brain-function genes and observed an apparently highly significant enrichment for each set. The enrichment is simply driven by the large size of brain-function genes. Instead, we propose a case-control statistical test, cnv-enrichment-test, to compare the rate of CNVs impacting specific gene sets in cases versus controls. With simulations, we demonstrate that cnv-enrichment-test is robust to case-control differences in CNV size, CNV rate, and systematic differences in gene size. Finally, we apply cnv-enrichment-test to rare CNV events published by the International Schizophrenia Consortium (ISC). This approach reveals nominal evidence of case-association in neuronal-activity and the learning gene sets, but not the other two examined gene sets. The neuronal-activity genes have been associated in a separate set of schizophrenia cases and controls; however, testing in independent samples is necessary to definitively confirm this association. Our method is implemented in the PLINK software package

  1. Arabidopsis thaliana population analysis reveals high plasticity of the genomic region spanning MSH2, AT3G18530 and AT3G18535 genes and provides evidence for NAHR-driven recurrent CNV events occurring in this location.

    PubMed

    Zmienko, Agnieszka; Samelak-Czajka, Anna; Kozlowski, Piotr; Szymanska, Maja; Figlerowicz, Marek

    2016-11-08

    Intraspecies copy number variations (CNVs), defined as unbalanced structural variations of specific genomic loci, ≥1 kb in size, are present in the genomes of animals and plants. A growing number of examples indicate that CNVs may have functional significance and contribute to phenotypic diversity. In the model plant Arabidopsis thaliana at least several hundred protein-coding genes might display CNV; however, locus-specific genotyping studies in this plant have not been conducted. We analyzed the natural CNVs in the region overlapping MSH2 gene that encodes the DNA mismatch repair protein, and AT3G18530 and AT3G18535 genes that encode poorly characterized proteins. By applying multiplex ligation-dependent probe amplification and droplet digital PCR we genotyped those genes in 189 A. thaliana accessions. We found that AT3G18530 and AT3G18535 were duplicated (2-14 times) in 20 and deleted in 101 accessions. MSH2 was duplicated in 12 accessions (up to 12-14 copies) but never deleted. In all but one case, the MSH2 duplications were associated with those of AT3G18530 and AT3G18535. Considering the structure of the CNVs, we distinguished 5 genotypes for this region, determined their frequency and geographical distribution. We defined the CNV breakpoints in 35 accessions with AT3G18530 and AT3G18535 deletions and tandem duplications and showed that they were reciprocal events, resulting from non-allelic homologous recombination between 99 %-identical sequences flanking these genes. The widespread geographical distribution of the deletions supported by the SNP and linkage disequilibrium analyses of the genomic sequence confirmed the recurrent nature of this CNV. We characterized in detail for the first time the complex multiallelic CNV in Arabidopsis genome. The region encoding MSH2, AT3G18530 and AT3G18535 genes shows enormous variation of copy numbers among natural ecotypes, being a remarkable example of high Arabidopsis genome plasticity. We provided the molecular

  2. Distribution of Disease-Associated Copy Number Variants across Distinct Disorders of Cognitive Development

    ERIC Educational Resources Information Center

    Pescosolido, Matthew F.; Gamsiz, Ece D.; Nagpal, Shailender; Morrow, Eric M.

    2013-01-01

    Objective: The purpose of the present study was to discover the extent to which distinct "DSM" disorders share large, highly recurrent copy number variants (CNVs) as susceptibility factors. We also sought to identify gene mechanisms common to groups of diagnoses and/or specific to a given diagnosis based on associations with CNVs. Method:…

  3. Genomic characteristics of cattle copy number variations

    USDA-ARS?s Scientific Manuscript database

    We performed a systematic analysis of cattle copy number variations (CNVs) using the Bovine HapMap SNP genotyping data, including 539 animals of 21 modern cattle breeds and 6 outgroups. After correcting genomic waves and considering the trio information, we identified 682 candidate CNV regions (CNVR...

  4. Sonic Hedgehog mutations are not a common cause of congenital hypopituitarism in the absence of complex midline cerebral defects.

    PubMed

    Paulo, Sabrina Soares; Fernandes-Rosa, Fábio L; Turatti, Wendy; Coeli-Lacchini, Fernanda Borchers; Martinelli, Carlos E; Nakiri, Guilherme S; Moreira, Ayrton C; Santos, Antônio C; de Castro, Margaret; Antonini, Sonir R

    2015-04-01

    Sonic Hedgehog (SHH) and GLI2, an obligatory mediator of SHH signal transduction, are holoprosencephaly (HPE)-associated genes essential in pituitary formation. GLI2 variants have been found in patients with congenital hypopituitarism without complex midline cerebral defects (MCD). However, data on the occurrence of SHH mutations in these patients are limited. We screened for SHH and GLI2 mutations or copy number variations (CNV) in patients with congenital hypopituitarism without MCD or with variable degrees of MCD. Detailed data on clinical, laboratory and neuroimaging findings of 115 patients presenting with congenital hypopituitarism without MCD, septo-optic dysplasia or HPE were analysed. The SHH and GLI2 genes were directly sequenced, and the presence of gene CNV was analysed by multiplex ligation-dependent probe amplification (MLPA). Anterior pituitary deficiency was found in 74% and 53% of patients with SOD or HPE, respectively. Diabetes insipidus was common in patients with HPE (47%) but infrequent in patients with congenital hypopituitarism or SOD (7% and 8%, respectively). A single heterozygous nonsense SHH mutation (p.Tyr175Ter) was found in a patient presenting with hypopituitarism and alobar HPE. No other SHH mutations or CNV were found. Nine GLI2 variations (8 missense and 1 frameshift) including a homozygous and a compound heterozygous variation were found in patients with congenital hypopituitarism or SOD, but not in HPE patients. No GLI2 CNV were found. SHH mutations or copy number variations are not a common cause of congenital hypopituitarism in patients without complex midline cerebral defects. GLI2 variants are found in some patients with congenital hypopituitarism without complex midline cerebral defects or septo-optic dysplasia. However, functional analyses of these variants are needed to strengthen genotype-phenotype relationship. © 2014 John Wiley & Sons Ltd.

  5. A nCounter CNV Assay to Detect HER2 Amplification: A Correlation Study with Immunohistochemistry and In Situ Hybridization in Advanced Gastric Cancer.

    PubMed

    Ahn, Soomin; Hong, Mineui; Van Vrancken, Michael; Lyou, You Jeong; Kim, Seung Tae; Park, Se Hoon; Kang, Won Ki; Park, Young Suk; Jung, Sin-Ho; Woo, Minah; Lee, Jeeyun; Kim, Kyoung-Mee

    2016-08-01

    Screening amplified genes for targeted therapy with high-throughput technology is very important. The NanoString nCounter system allows multiplexed digital quantification of target molecules through the use of color-coded barcodes with the great advantage that formalin-fixed, paraffin-embedded (FFPE) tissue can be utilized. We tested nCounter custom copy number variation (CNV) panels in 220 gastric cancer samples and evaluated the utility of this method as a screening tool for the detection of CNV using HER2. For the validation of results, we compared the nCounter results with immunohistochemistry (IHC), and we further performed in situ hybridization (ISH) in discrepant cases. The average HER2 gene copy numbers (CNs) by nCounter were 17.25, 2.0 and 2.61 for the HER2 IHC positive (3+), equivocal (2+), and negative cases, respectively. Out of the 16 IHC 3+ cases, 13 (81.3 %) were reported as HER2 CN gain (≥4). Gastric cancers with homogeneous HER2 overexpression or high tumor purity showed HER2 CN ≥10. Among the 192 cases with HER2 IHC negative and without HER2 gene amplification, 29 showed a HER2 CN ≥4 with the nCounter assay. The nCounter assay had a concordance rate of 83.4 % (kappa value, 0.35), a sensitivity of 66.7 %, a specificity of 85.2 %, a negative predictive value of 96 %, and a positive predictive value of 32.6 % compared with HER2 IHC/ISH results. Fresh frozen (FF) samples revealed a higher concordance rate (91.5 %, kappa value, 0.59) than FFPE samples (78.5 %, kappa value 0.27) and showed a high specificity (97.2 %). The nCounter CNV assay is a reliable and practical method to detect high CN variations. Given the intra-tumoral HER2 heterogeneity and normal cell contamination, additional IHC and/or FISH is necessary and needs caution in interpretation, especially in FFPE tissue samples.

  6. Automated design of paralogue ratio test assays for the accurate and rapid typing of copy number variation

    PubMed Central

    Veal, Colin D.; Xu, Hang; Reekie, Katherine; Free, Robert; Hardwick, Robert J.; McVey, David; Brookes, Anthony J.; Hollox, Edward J.; Talbot, Christopher J.

    2013-01-01

    Motivation: Genomic copy number variation (CNV) can influence susceptibility to common diseases. High-throughput measurement of gene copy number on large numbers of samples is a challenging, yet critical, stage in confirming observations from sequencing or array Comparative Genome Hybridization (CGH). The paralogue ratio test (PRT) is a simple, cost-effective method of accurately determining copy number by quantifying the amplification ratio between a target and reference amplicon. PRT has been successfully applied to several studies analyzing common CNV. However, its use has not been widespread because of difficulties in assay design. Results: We present PRTPrimer (www.prtprimer.org) software for automated PRT assay design. In addition to stand-alone software, the web site includes a database of pre-designed assays for the human genome at an average spacing of 6 kb and a web interface for custom assay design. Other reference genomes can also be analyzed through local installation of the software. The usefulness of PRTPrimer was tested within known CNV, and showed reproducible quantification. This software and database provide assays that can rapidly genotype CNV, cost-effectively, on a large number of samples and will enable the widespread adoption of PRT. Availability: PRTPrimer is available in two forms: a Perl script (version 5.14 and higher) that can be run from the command line on Linux systems and as a service on the PRTPrimer web site (www.prtprimer.org). Contact: cjt14@le.ac.uk Supplementary Information: Supplementary data are available at Bioinformatics online. PMID:23742985

  7. High mutation rates explain low population genetic divergence at copy-number-variable loci in Homo sapiens.

    PubMed

    Hu, Xin-Sheng; Yeh, Francis C; Hu, Yang; Deng, Li-Ting; Ennos, Richard A; Chen, Xiaoyang

    2017-02-22

    Copy-number-variable (CNV) loci differ from single nucleotide polymorphic (SNP) sites in size, mutation rate, and mechanisms of maintenance in natural populations. It is therefore hypothesized that population genetic divergence at CNV loci will differ from that found at SNP sites. Here, we test this hypothesis by analysing 856 CNV loci from the genomes of 1184 healthy individuals from 11 HapMap populations with a wide range of ancestry. The results show that population genetic divergence at the CNV loci is generally more than three times lower than at genome-wide SNP sites. Populations generally exhibit very small genetic divergence (G st  = 0.05 ± 0.049). The smallest divergence is among African populations (G st  = 0.0081 ± 0.0025), with increased divergence among non-African populations (G st  = 0.0217 ± 0.0109) and then among African and non-African populations (G st  = 0.0324 ± 0.0064). Genetic diversity is high in African populations (~0.13), low in Asian populations (~0.11), and intermediate in the remaining 11 populations. Few significant linkage disequilibria (LDs) occur between the genome-wide CNV loci. Patterns of gametic and zygotic LDs indicate the absence of epistasis among CNV loci. Mutation rate is about twice as large as the migration rate in the non-African populations, suggesting that the high mutation rates play dominant roles in producing the low population genetic divergence at CNV loci.

  8. Tempered mlo broad-spectrum resistance to barley powdery mildew in an Ethiopian landrace

    PubMed Central

    Ge, Xintian; Deng, Weiwei; Lee, Zheng Zhou; Lopez-Ruiz, Francisco J.; Schweizer, Patrick; Ellwood, Simon R.

    2016-01-01

    Recessive mutations in the Mlo gene confer broad spectrum resistance in barley (Hordeum vulgare) to powdery mildew (Blumeria graminis f. sp. hordei), a widespread and damaging disease. However, all alleles discovered to date also display deleterious pleiotropic effects, including the naturally occurring mlo-11 mutant which is widely deployed in Europe. Recessive resistance was discovered in Eth295, an Ethiopian landrace, which was developmentally controlled and quantitative without spontaneous cell wall appositions or extensive necrosis and loss of photosynthetic tissue. This resistance is determined by two copies of the mlo-11 repeat units, that occur upstream to the wild-type Mlo gene, compared to 11–12 in commonly grown cultivars and was designated mlo-11 (cnv2). mlo-11 repeat unit copy number-dependent DNA methylation corresponded with cytological and macroscopic phenotypic differences between copy number variants. Sequence data indicated mlo-11 (cnv2) formed via recombination between progenitor mlo-11 repeat units and the 3′ end of an adjacent stowaway MITE containing region. mlo-11 (cnv2) is the only example of a moderated mlo variant discovered to date and may have arisen by natural selection against the deleterious effects of the progenitor mlo-11 repeat unit configuration. PMID:27404990

  9. Chromosomal abnormalities and copy number variations in fetal left-sided congenital heart defects.

    PubMed

    Jansen, Fenna A R; Hoffer, Mariette J V; van Velzen, Christine L; Plati, Stephani Klingeman; Rijlaarsdam, Marry E B; Clur, Sally-Ann B; Blom, Nico A; Pajkrt, Eva; Bhola, Shama L; Knegt, Alida C; de Boer, Marion A; Haak, Monique C

    2016-02-01

    To demonstrate the spectrum of copy number variants (CNVs) in fetuses with isolated left-sided congenital heart defects (CHDs), and analyse genetic content. Between 2003 and 2012, 200 fetuses were identified with left-sided CHD. Exclusion criteria were chromosomal rearrangements, 22q11.2 microdeletion and/or extra-cardiac malformations (n = 64). We included cases with additional minor anomalies (n = 39), such as single umbilical artery. In 54 of 136 eligible cases, stored material was available for array analysis. CNVs were categorized as either (likely) benign, (likely) pathogenic or of unknown significance. In 18 of the 54 isolated left-sided CHDs we found 28 rare CNVs (prevalence 33%, average 1.6 CNV per person, size 10.6 kb-2.2 Mb). Our interpretation yielded clinically significant CNVs in two of 54 cases (4%) and variants of unknown significance in three other cases (6%). In left-sided CHDs that appear isolated, with normal chromosome analysis and 22q11.2 FISH analysis, array analysis detects clinically significant CNVs. When counselling parents of a fetus with a left-sided CHD it must be taken into consideration that aside from the cardiac characteristics, the presence of extra-cardiac malformations and chromosomal abnormalities influence the treatment plan and prognosis. © 2015 John Wiley & Sons, Ltd.

  10. An intergenic risk locus containing an enhancer deletion in 2q35 modulates breast cancer risk by deregulating IGFBP5 expression

    PubMed Central

    Wyszynski, Asaf; Hong, Chi-Chen; Lam, Kristin; Michailidou, Kyriaki; Lytle, Christian; Yao, Song; Zhang, Yali; Bolla, Manjeet K.; Wang, Qin; Dennis, Joe; Hopper, John L.; Southey, Melissa C.; Schmidt, Marjanka K.; Broeks, Annegien; Muir, Kenneth; Lophatananon, Artitaya; Fasching, Peter A.; Beckmann, Matthias W.; Peto, Julian; dos-Santos-Silva, Isabel; Sawyer, Elinor J.; Tomlinson, Ian; Burwinkel, Barbara; Marme, Frederik; Guénel, Pascal; Truong, Thérèse; Bojesen, Stig E.; Nordestgaard, Børge G.; González-Neira, Anna; Benitez, Javier; Neuhausen, Susan L.; Brenner, Hermann; Dieffenbach, Aida Karina; Meindl, Alfons; Schmutzler, Rita K.; Brauch, Hiltrud; Nevanlinna, Heli; Khan, Sofia; Matsuo, Keitaro; Ito, Hidemi; Dörk, Thilo; Bogdanova, Natalia V.; Lindblom, Annika; Margolin, Sara; Mannermaa, Arto; Kosma, Veli-Matti; Wu, Anna H.; Van Den Berg, David; Lambrechts, Diether; Wildiers, Hans; Chang-Claude, Jenny; Rudolph, Anja; Radice, Paolo; Peterlongo, Paolo; Couch, Fergus J.; Olson, Janet E.; Giles, Graham G.; Milne, Roger L.; Haiman, Christopher A.; Henderson, Brian E.; Dumont, Martine; Teo, Soo Hwang; Wong, Tien Y.; Kristensen, Vessela; Zheng, Wei; Long, Jirong; Winqvist, Robert; Pylkäs, Katri; Andrulis, Irene L.; Knight, Julia A.; Devilee, Peter; Seynaeve, Caroline; García-Closas, Montserrat; Figueroa, Jonine; Klevebring, Daniel; Czene, Kamila; Hooning, Maartje J.; van den Ouweland, Ans M.W.; Darabi, Hatef; Shu, Xiao-Ou; Gao, Yu-Tang; Cox, Angela; Blot, William; Signorello, Lisa B.; Shah, Mitul; Kang, Daehee; Choi, Ji-Yeob; Hartman, Mikael; Miao, Hui; Hamann, Ute; Jakubowska, Anna; Lubinski, Jan; Sangrajrang, Suleeporn; McKay, James; Toland, Amanda E.; Yannoukakos, Drakoulis; Shen, Chen-Yang; Wu, Pei-Ei; Swerdlow, Anthony; Orr, Nick; Simard, Jacques; Pharoah, Paul D.P.; Dunning, Alison M.; Chenevix-Trench, Georgia; Hall, Per; Bandera, Elisa; Amos, Chris; Ambrosone, Christine; Easton, Douglas F.; Cole, Michael D.

    2016-01-01

    Breast cancer is the most diagnosed malignancy and the second leading cause of cancer mortality in females. Previous association studies have identified variants on 2q35 associated with the risk of breast cancer. To identify functional susceptibility loci for breast cancer, we interrogated the 2q35 gene desert for chromatin architecture and functional variation correlated with gene expression. We report a novel intergenic breast cancer risk locus containing an enhancer copy number variation (enCNV; deletion) located approximately 400Kb upstream to IGFBP5, which overlaps an intergenic ERα-bound enhancer that loops to the IGFBP5 promoter. The enCNV is correlated with modified ERα binding and monoallelic-repression of IGFBP5 following oestrogen treatment. We investigated the association of enCNV genotype with breast cancer in 1,182 cases and 1,362 controls, and replicate our findings in an independent set of 62,533 cases and 60,966 controls from 41 case control studies and 11 GWAS. We report a dose-dependent inverse association of 2q35 enCNV genotype (percopy OR = 0.68 95%CI 0.55–0.83, P = 0.0002; replication OR = 0.77 95% CI 0.73-0.82, P = 2.1 × 10−19) and identify 13 additional linked variants (r2 > 0.8) in the 20Kb linkage block containing the enCNV (P = 3.2 × 10−15 − 5.6 × 10−17). These associations were independent of previously reported 2q35 variants, rs13387042/rs4442975 and rs16857609, and were stronger for ER-positive than ER-negative disease. Together, these results suggest that 2q35 breast cancer risk loci may be mediating their effect through IGFBP5. PMID:27402876

  11. Elucidating the genetic architecture of familial schizophrenia using rare copy number variant and linkage scans.

    PubMed

    Xu, Bin; Woodroffe, Abigail; Rodriguez-Murillo, Laura; Roos, J Louw; van Rensburg, Elizabeth J; Abecasis, Gonçalo R; Gogos, Joseph A; Karayiorgou, Maria

    2009-09-29

    To elucidate the genetic architecture of familial schizophrenia we combine linkage analysis with studies of fine-level chromosomal variation in families recruited from the Afrikaner population in South Africa. We demonstrate that individually rare inherited copy number variants (CNVs) are more frequent in cases with familial schizophrenia as compared to unaffected controls and affect almost exclusively genic regions. Interestingly, we find that while the prevalence of rare structural variants is similar in familial and sporadic cases, the type of variants is markedly different. In addition, using a high-density linkage scan with a panel of nearly 2,000 markers, we identify a region on chromosome 13q34 that shows genome-wide significant linkage to schizophrenia and show that in the families not linked to this locus, there is evidence for linkage to chromosome 1p36. No causative CNVs were identified in either locus. Overall, our results from approaches designed to detect risk variants with relatively low frequency and high penetrance in a well-defined and relatively homogeneous population, provide strong empirical evidence supporting the notion that multiple genetic variants, including individually rare ones, that affect many different genes contribute to the genetic risk of familial schizophrenia. They also highlight differences in the genetic architecture of the familial and sporadic forms of the disease.

  12. TNFRSF10C copy number variation is associated with metastatic colorectal cancer

    PubMed Central

    Tanenbaum, Daniel G.; Hall, William A.; Colbert, Lauren E.; Bastien, Amanda J.; Brat, Daniel J.; Kong, Jun; Kim, Sungjin; Dwivedi, Bhakti; Kowalski, Jeanne; Landry, Jerome C.

    2016-01-01

    Background Genetic markers for distant metastatic disease in patients with colorectal cancer (CRC) are not well defined. Identification of genetic alterations associated with metastatic CRC could help to guide systemic and local treatment strategies. We evaluated the association of tumor necrosis factor receptor superfamily member 10C (TNFRSF10C) copy number variation (CNV) with distant metastatic disease in patients with CRC using The Cancer Genome Atlas (TCGA). Methods Genetic sequencing data and clinical characteristics were obtained from TCGA for all available patients with CRC. There were 515 CRC patient samples with CNV and clinical outcome data, including a subset of 144 rectal adenocarcinoma patient samples. Using the TCGA CRC dataset, CNV of TNFRSF10C was evaluated for association with distant metastatic disease (M1 vs. M0). Multivariate logistic regression analysis with odds ratio (OR) using a 95% confidence interval (CI) was performed adjusting for age, T stage, N stage, adjuvant chemotherapy, gender, microsatellite instability (MSI), location, and surgical margin status. Results TNFRSF10C CNV in patients with CRC was associated with distant metastatic disease [OR 4.81 (95% CI, 2.13–10.85) P<0.001] and positive lymph nodes [OR 18.83 (95% CI, 8.42–42.09)]; P<0.001) but not MSI (OR P=0.799). On multivariate analysis, after adjusting for pathologic T stage, N stage, adjuvant chemotherapy, gender, and MSI, TNFRSF10C CNV remained significantly associated with distant metastatic disease (OR P=0.018). Subset analysis revealed that TNFRSF10C CNV was also significantly associated with distant metastatic disease in patients with rectal adenocarcinoma (OR P=0.016). Conclusions TNFRSF10C CNV in patients with CRC is associated with distant metastatic disease. With further validation, such genetic profiles could be used clinically to support optimal systemic treatment strategies versus more aggressive local therapies in patients with CRC, including radiation

  13. Copy number variation of GATA4 and NKX2-5 in Chinese fetuses with congenital heart disease.

    PubMed

    Liu, Zhen; Wang, Jing; Liu, Shanling; Deng, Ying; Liu, Hongqian; Li, Nana; Li, Shengli; Chen, Xinlin; Lin, Yuan; Wang, He; Zhu, Jun

    2015-04-01

    Congenital heart disease (CHD) is one of the most common birth defects in newborns. The etiology of CHD has remained largely unknown, but it is assumed to result from the combined effects of genetic and environmental factors. Recent investigations have detected potentially pathogenic copy number variations (CNV) in a proportion of patients with CHD. The present case-control study evaluated whether CNV in the GATA4 and NKX2-5 genes contribute to the pathogenesis of CHD in Chinese fetuses (n = 117), by comparing them with non-CHD control subjects (n = 100). Multiplex ligation-dependent probe amplification with the P311A probe mixture was used to detect CNV. The normalized signals were within the normal range for all exons in all CHD patients and non-CHD control subjects. Of the 117 CHD patients, three had a deletion of 22q11, and two had a duplication of 22q11. There was no evidence of a role for NKX2-5 and GATA4 CNV in fetal CHD; therefore, these CNV may not be common in fetal CHD in China. © 2014 Japan Pediatric Society.

  14. CNV discovery for milk composition traits in dairy cattle using whole genome resequencing.

    PubMed

    Gao, Yahui; Jiang, Jianping; Yang, Shaohua; Hou, Yali; Liu, George E; Zhang, Shengli; Zhang, Qin; Sun, Dongxiao

    2017-03-29

    Copy number variations (CNVs) are important and widely distributed in the genome. CNV detection opens a new avenue for exploring genes associated with complex traits in humans, animals and plants. Herein, we present a genome-wide assessment of CNVs that are potentially associated with milk composition traits in dairy cattle. In this study, CNVs were detected based on whole genome re-sequencing data of eight Holstein bulls from four half- and/or full-sib families, with extremely high and low estimated breeding values (EBVs) of milk protein percentage and fat percentage. The range of coverage depth per individual was 8.2-11.9×. Using CNVnator, we identified a total of 14,821 CNVs, including 5025 duplications and 9796 deletions. Among them, 487 differential CNV regions (CNVRs) comprising ~8.23 Mb of the cattle genome were observed between the high and low groups. Annotation of these differential CNVRs were performed based on the cattle genome reference assembly (UMD3.1) and totally 235 functional genes were found within the CNVRs. By Gene Ontology and KEGG pathway analyses, we found that genes were significantly enriched for specific biological functions related to protein and lipid metabolism, insulin/IGF pathway-protein kinase B signaling cascade, prolactin signaling pathway and AMPK signaling pathways. These genes included INS, IGF2, FOXO3, TH, SCD5, GALNT18, GALNT16, ART3, SNCA and WNT7A, implying their potential association with milk protein and fat traits. In addition, 95 CNVRs were overlapped with 75 known QTLs that are associated with milk protein and fat traits of dairy cattle (Cattle QTLdb). In conclusion, based on NGS of 8 Holstein bulls with extremely high and low EBVs for milk PP and FP, we identified a total of 14,821 CNVs, 487 differential CNVRs between groups, and 10 genes, which were suggested as promising candidate genes for milk protein and fat traits.

  15. Elucidating the genetic architecture of familial schizophrenia using rare copy number variant and linkage scans

    PubMed Central

    Xu, Bin; Woodroffe, Abigail; Rodriguez-Murillo, Laura; Roos, J. Louw; van Rensburg, Elizabeth J.; Abecasis, Gonçalo R.; Gogos, Joseph A.; Karayiorgou, Maria

    2009-01-01

    To elucidate the genetic architecture of familial schizophrenia we combine linkage analysis with studies of fine-level chromosomal variation in families recruited from the Afrikaner population in South Africa. We demonstrate that individually rare inherited copy number variants (CNVs) are more frequent in cases with familial schizophrenia as compared to unaffected controls and affect almost exclusively genic regions. Interestingly, we find that while the prevalence of rare structural variants is similar in familial and sporadic cases, the type of variants is markedly different. In addition, using a high-density linkage scan with a panel of nearly 2,000 markers, we identify a region on chromosome 13q34 that shows genome-wide significant linkage to schizophrenia and show that in the families not linked to this locus, there is evidence for linkage to chromosome 1p36. No causative CNVs were identified in either locus. Overall, our results from approaches designed to detect risk variants with relatively low frequency and high penetrance in a well-defined and relatively homogeneous population, provide strong empirical evidence supporting the notion that multiple genetic variants, including individually rare ones, that affect many different genes contribute to the genetic risk of familial schizophrenia. They also highlight differences in the genetic architecture of the familial and sporadic forms of the disease. PMID:19805367

  16. An evaluation of copy number variation detection tools for cancer using whole exome sequencing data.

    PubMed

    Zare, Fatima; Dow, Michelle; Monteleone, Nicholas; Hosny, Abdelrahman; Nabavi, Sheida

    2017-05-31

    Recently copy number variation (CNV) has gained considerable interest as a type of genomic/genetic variation that plays an important role in disease susceptibility. Advances in sequencing technology have created an opportunity for detecting CNVs more accurately. Recently whole exome sequencing (WES) has become primary strategy for sequencing patient samples and study their genomics aberrations. However, compared to whole genome sequencing, WES introduces more biases and noise that make CNV detection very challenging. Additionally, tumors' complexity makes the detection of cancer specific CNVs even more difficult. Although many CNV detection tools have been developed since introducing NGS data, there are few tools for somatic CNV detection for WES data in cancer. In this study, we evaluated the performance of the most recent and commonly used CNV detection tools for WES data in cancer to address their limitations and provide guidelines for developing new ones. We focused on the tools that have been designed or have the ability to detect cancer somatic aberrations. We compared the performance of the tools in terms of sensitivity and false discovery rate (FDR) using real data and simulated data. Comparative analysis of the results of the tools showed that there is a low consensus among the tools in calling CNVs. Using real data, tools show moderate sensitivity (~50% - ~80%), fair specificity (~70% - ~94%) and poor FDRs (~27% - ~60%). Also, using simulated data we observed that increasing the coverage more than 10× in exonic regions does not improve the detection power of the tools significantly. The limited performance of the current CNV detection tools for WES data in cancer indicates the need for developing more efficient and precise CNV detection methods. Due to the complexity of tumors and high level of noise and biases in WES data, employing advanced novel segmentation, normalization and de-noising techniques that are designed specifically for cancer data is

  17. Genomic amplification of the caprine EDNRA locus might lead to a dose dependent loss of pigmentation

    PubMed Central

    Menzi, Fiona; Keller, Irene; Reber, Irene; Beck, Julia; Brenig, Bertram; Schütz, Ekkehard; Leeb, Tosso; Drögemüller, Cord

    2016-01-01

    The South African Boer goat displays a characteristic white spotting phenotype, in which the pigment is limited to the head. Exploiting the existing phenotype variation within the breed, we mapped the locus causing this white spotting phenotype to chromosome 17 by genome wide association. Subsequent whole genome sequencing identified a 1 Mb copy number variant (CNV) harboring 5 genes including EDNRA. The analysis of 358 Boer goats revealed 3 alleles with one, two, and three copies of this CNV. The copy number is correlated with the degree of white spotting in goats. We propose a hypothesis that ectopic overexpression of a mutant EDNRA scavenges EDN3 required for EDNRB signaling and normal melanocyte development and thus likely lead to an absence of melanocytes in the non-pigmented body areas of Boer goats. Our findings demonstrate the value of domestic animals as reservoir of unique mutants and for identifying a precisely defined functional CNV. PMID:27329507

  18. Aluminum tolerance is associated with higher MATE1 gene copy-number in maize

    USDA-ARS?s Scientific Manuscript database

    Genome structure variation, including copy-number (CNV) and presence/absence variation (PAV), comprise a large extent of maize genetic diversity but their effect on phenotypes remains largely unexplored. Here we describe how copy-number variation in a major aluminum (Al) tolerance locus contributes ...

  19. Copy Number Variants in Obesity-Related Syndromes: Review and Perspectives on Novel Molecular Approaches

    PubMed Central

    Koiffmann, Celia Priszkulnik

    2012-01-01

    In recent decades, obesity has reached epidemic proportions worldwide and became a major concern in public health. Despite heritability estimates of 40 to 70% and the long-recognized genetic basis of obesity in a number of rare cases, the list of common obesity susceptibility variants by the currently published genome-wide association studies (GWASs) only explain a small proportion of the individual variation in risk of obesity. It was not until very recently that GWASs of copy number variants (CNVs) in individuals with extreme phenotypes reported a number of large and rare CNVs conferring high risk to obesity, and specifically deletions on chromosome 16p11.2. In this paper, we comment on the recent advances in the field of genetics of obesity with an emphasis on the genes and genomic regions implicated in highly penetrant forms of obesity associated with developmental disorders. Array genomic hybridization in this patient population has afforded discovery opportunities for CNVs that have not previously been detectable. This information can be used to generate new diagnostic arrays and sequencing platforms, which will likely enhance detection of known genetic conditions with the potential to elucidate new disease genes and ultimately help in developing a next-generation sequencing protocol relevant to clinical practice. PMID:23316347

  20. Duplication of an upstream silencer of FZP increases grain yield in rice.

    PubMed

    Bai, Xufeng; Huang, Yong; Hu, Yong; Liu, Haiyang; Zhang, Bo; Smaczniak, Cezary; Hu, Gang; Han, Zhongmin; Xing, Yongzhong

    2017-11-01

    Transcriptional silencer and copy number variants (CNVs) are associated with gene expression. However, their roles in generating phenotypes have not been well studied. Here we identified a rice quantitative trait locus, SGDP7 (Small Grain and Dense Panicle 7). SGDP7 is identical to FZP (FRIZZY PANICLE), which represses the formation of axillary meristems. The causal mutation of SGDP7 is an 18-bp fragment, named CNV-18bp, which was inserted ~5.3 kb upstream of FZP and resulted in a tandem duplication in the cultivar Chuan 7. The CNV-18bp duplication repressed FZP expression, prolonged the panicle branching period and increased grain yield by more than 15% through substantially increasing the number of spikelets per panicle (SPP) and slightly decreasing the 1,000-grain weight (TGW). The transcription repressor OsBZR1 binds the CGTG motifs in CNV-18bp and thereby represses FZP expression, indicating that CNV-18bp is the upstream silencer of FZP. These findings showed that the silencer CNVs coordinate a trade-off between SPP and TGW by fine-tuning FZP expression, and balancing the trade-off could enhance yield potential.

  1. A novel method for sex determination by detecting the number of X chromosomes.

    PubMed

    Nakanishi, Hiroaki; Shojo, Hideki; Ohmori, Takeshi; Hara, Masaaki; Takada, Aya; Adachi, Noboru; Saito, Kazuyuki

    2015-01-01

    A novel method for sex determination, based on the detection of the number of X chromosomes, was established. Current methods, based on the detection of the Y chromosome, can directly identify an unknown sample as male, but female gender is determined indirectly, by not detecting the Y chromosome. Thus, a direct determination of female gender is important because the quality (e.g., fragmentation and amelogenin-Y null allele) of the Y chromosome DNA may lead to a false result. Thus, we developed a novel sex determination method by analyzing the number of X chromosomes using a copy number variation (CNV) detection technique (the comparative Ct method). In this study, we designed a primer set using the amelogenin-X gene without the CNV region as the target to determine the X chromosome copy number, to exclude the influence of the CNV region from the comparative Ct value. The number of X chromosomes was determined statistically using the CopyCaller software with real-time PCR. All DNA samples from participants (20 males, 20 females) were evaluated correctly using this method with 1-ng template DNA. A minimum of 0.2-ng template DNA was found to be necessary for accurate sex determination with this method. When using ultraviolet-irradiated template DNA, as mock forensic samples, the sex of the samples could not be determined by short tandem repeat (STR) analysis but was correctly determined using our method. Thus, we successfully developed a method of sex determination based on the number of X chromosomes. Our novel method will be useful in forensic practice for sex determination.

  2. Copy number variation in metabolic phenotypes.

    PubMed

    Lanktree, M; Hegele, R A

    2008-01-01

    Despite successes in identifying genetic contributors to common metabolic phenotypes, only part of the heritable component of these traits has thus far been explained. Copy number variation (CNV) is likely to be responsible for some of the unexplained variation. As observed with single nucleotide changes, it is probable that both rare and common CNVs will contribute to susceptibility to metabolic disease. For instance, CNVs in the LDLR gene underlie a substantial portion of disease in patients with heterozygous familial hypercholesterolemia. As well, a common CNV in LPA encoding apolipoprotein(a) is the primary determinant of plasma lipoprotein(a) concentrations, a risk factor for atherosclerosis. Recent efforts to map CNVs in control populations have defined their size, frequency and distribution. Many of the identified CNVs overlap genes with important functions in metabolic pathways. The overlap of CNVs that were found in control datasets with functional candidate genes or genes with previous evidence of association with metabolic syndrome presents an important subset for future CNV association studies. Finally, we describe an approach to search for CNVs in a rare high-penetrance metabolic disorder, namely lipodystrophy. As methods to identify CNVs increase in precision and accuracy, the prospect of identifying their role in both rare Mendelian and common complex metabolic phenotypes will become a reality. Copyright 2009 S. Karger AG, Basel.

  3. Comparative analyses across cattle genders and breeds reveal the pitfalls caused by false positive and lineage-differential copy number variations.

    PubMed

    Zhou, Yang; Utsunomiya, Yuri T; Xu, Lingyang; Hay, El Hamidi Abdel; Bickhart, Derek M; Sonstegard, Tad S; Van Tassell, Curtis P; Garcia, Jose Fernando; Liu, George E

    2016-07-06

    We compared CNV region (CNVR) results derived from 1,682 Nellore cattle with equivalent results derived from our previous analysis of Bovine HapMap samples. By comparing CNV segment frequencies between different genders and groups, we identified 9 frequent, false positive CNVRs with a total length of 0.8 Mbp that were likely caused by assembly errors. Although there was a paucity of lineage specific events, we did find one 54 kb deletion on chr5 significantly enriched in Nellore cattle. A few highly frequent CNVRs present in both datasets were detected within genomic regions containing olfactory receptor, ATP-binding cassette, and major histocompatibility complex genes. We further evaluated their impacts on downstream bioinformatics and CNV association analyses. Our results revealed pitfalls caused by false positive and lineage-differential copy number variations and will increase the accuracy of future CNV studies in both taurine and indicine cattle.

  4. Genomic and evolutionary characteristics of cattle copy number variations

    USDA-ARS?s Scientific Manuscript database

    We performed a systematic analysis of cattle copy number variations (CNVs) using the Bovine HapMap SNP genotyping data, including 539 animals of 21 modern cattle breeds and 6 outgroups. After correcting genomic waves and considering the trio information, we identified 682 candidate CNV regions (CNVR...

  5. A multifaceted computational report on the variants effect on KIR2DL3 and IFNL3 candidate gene in HCV clearance.

    PubMed

    Singh, Pratichi; Dass, J Febin Prabhu

    2016-10-01

    HCV infection causes acute and chronic liver diseases including, cirrhosis and hepatocellular carcinoma. Following HCV infection, spontaneous clearance occurs in approximately 20 % of the population dependant upon HCV genotype. In this study, functional and non-functional variant analysis was executed for the classical and the latest HCV clearance candidate genes namely, KIR2DL3 and IFNL3. Initially, the functional effects of non-synonymous SNPs were assigned on exposing to homology based tools, SIFT, PolyPhen-2 and PROVEAN. Further, UTR and splice sites variants were scanned for the gene expression and regulation changes. Subsequently, the haplotype and CNV were also identified. The mutation H77Y of KIR2DL3 and R157Q, H156Y, S63L, R157W, F179V, H128R, T101M, R180C, and F176I of IFNL3 results in conservation, RMSD, total energy, stability, and secondary structures revealed a negative impact on the structural fitness. UTRscan and the splice site result indicate functional change, which may affect gene regulation and expression. The graphical display of selected population shows alleles like rs270779, rs2296370, rs10423751, rs12982559, rs9797797, and rs35987710 of KIR2DL3 and rs12972991, rs12980275, rs4803217, rs8109886, and rs8099917 of IFNL3 are in high LD with a measure of [Formula: see text] broadcasting its protective effect in HCV clearance. Similarly, CNV report suggests major DNA fragment loss that could have a profound impact on the gene expression affecting the overall phenotype. This roundup report specifies the effect of NK cell receptor, KIR2DL3 and IFNL3 variants that can have a better prospect in GWAS and immunogenetic studies leading to better understanding of HCV clearance and progression.

  6. Population-genetic properties of differentiated copy number variations in cattle

    USDA-ARS?s Scientific Manuscript database

    Copy number variations (CNVs) have been shown to be both common in mammals and important for understanding the relationship between genotype and phenotype. However, CNV differentiation, selection and its population genetic properties are not well understood across diverse populations. We performed a...

  7. Copy number variants calling for single cell sequencing data by multi-constrained optimization.

    PubMed

    Xu, Bo; Cai, Hongmin; Zhang, Changsheng; Yang, Xi; Han, Guoqiang

    2016-08-01

    Variations in DNA copy number carry important information on genome evolution and regulation of DNA replication in cancer cells. The rapid development of single-cell sequencing technology allows one to explore gene expression heterogeneity among single-cells, thus providing important cancer cell evolution information. Single-cell DNA/RNA sequencing data usually have low genome coverage, which requires an extra step of amplification to accumulate enough samples. However, such amplification will introduce large bias and makes bioinformatics analysis challenging. Accurately modeling the distribution of sequencing data and effectively suppressing the bias influence is the key to success variations analysis. Recent advances demonstrate the technical noises by amplification are more likely to follow negative binomial distribution, a special case of Poisson distribution. Thus, we tackle the problem CNV detection by formulating it into a quadratic optimization problem involving two constraints, in which the underling signals are corrupted by Poisson distributed noises. By imposing the constraints of sparsity and smoothness, the reconstructed read depth signals from single-cell sequencing data are anticipated to fit the CNVs patterns more accurately. An efficient numerical solution based on the classical alternating direction minimization method (ADMM) is tailored to solve the proposed model. We demonstrate the advantages of the proposed method using both synthetic and empirical single-cell sequencing data. Our experimental results demonstrate that the proposed method achieves excellent performance and high promise of success with single-cell sequencing data. Crown Copyright © 2016. Published by Elsevier Ltd. All rights reserved.

  8. The Growing Importance of CNVs: New Insights for Detection and Clinical Interpretation

    PubMed Central

    Valsesia, Armand; Macé, Aurélien; Jacquemont, Sébastien; Beckmann, Jacques S.; Kutalik, Zoltán

    2013-01-01

    Differences between genomes can be due to single nucleotide variants, translocations, inversions, and copy number variants (CNVs, gain or loss of DNA). The latter can range from sub-microscopic events to complete chromosomal aneuploidies. Small CNVs are often benign but those larger than 500 kb are strongly associated with morbid consequences such as developmental disorders and cancer. Detecting CNVs within and between populations is essential to better understand the plasticity of our genome and to elucidate its possible contribution to disease. Hence there is a need for better-tailored and more robust tools for the detection and genome-wide analyses of CNVs. While a link between a given CNV and a disease may have often been established, the relative CNV contribution to disease progression and impact on drug response is not necessarily understood. In this review we discuss the progress, challenges, and limitations that occur at different stages of CNV analysis from the detection (using DNA microarrays and next-generation sequencing) and identification of recurrent CNVs to the association with phenotypes. We emphasize the importance of germline CNVs and propose strategies to aid clinicians to better interpret structural variations and assess their clinical implications. PMID:23750167

  9. Pathogenic copy number variants in patients with congenital hypopituitarism associated with complex phenotypes.

    PubMed

    Correa, Fernanda A; Jorge, Alexander Al; Nakaguma, Marilena; Canton, Ana Pm; Costa, Silvia S; Funari, Mariana F; Lerario, Antonio M; Franca, Marcela M; Carvalho, Luciani R; Krepischi, Ana Cv; Arnhold, Ivo Jp; Rosenberg, Carla; Mendonca, Berenice B

    2018-03-01

    The aetiology of congenital hypopituitarism (CH) is unknown in most patients. Rare copy number variants (CNVs) have been implicated as the cause of genetic syndromes with previously unknown aetiology. Our aim was to study the presence of CNVs and their pathogenicity in patients with idiopathic CH associated with complex phenotypes. We selected 39 patients with syndromic CH for array-based comparative genomic hybridization (aCGH). Patients with pathogenic CNVs were also evaluated by whole exome sequencing. Twenty rare CNVs were detected in 19 patients. Among the identified rare CNVs, six were classified as benign, eleven as variants of uncertain clinical significance (VUS) and four as pathogenic. The three patients with pathogenic CNVs had combined pituitary hormone deficiencies, and the associated complex phenotypes were intellectual disabilities: trichorhinophalangeal type I syndrome (TRPS1) and developmental delay/intellectual disability with cardiac malformation, respectively. Patient one has a de novo 1.6-Mb deletion located at chromosome 3q13.31q13.32, which overlaps with the region of the 3q13.31 deletion syndrome. Patient two has a 10.5-Mb de novo deletion at 8q23.1q24.11, encompassing the TRPS1 gene; his phenotype is compatible with TRPS1. Patient three carries a chromosome translocation t(2p24.3;4q35.1) resulting in two terminal alterations: a 2p25.3p24.3 duplication of 14.7 Mb and a 4-Mb deletion at 4q35.1q35.2. Copy number variants explained the phenotype in 8% of patients with hypopituitarism and additional complex phenotypes. This suggests that chromosomal alterations are an important contributor to syndromic hypopituitarism. © 2017 John Wiley & Sons Ltd.

  10. Complement component 4 copy number variation and CYP21A2 genotype associations in patients with congenital adrenal hyperplasia due to 21-hydroxylase deficiency.

    PubMed

    Chen, Wuyan; Xu, Zhi; Nishitani, Miki; Van Ryzin, Carol; McDonnell, Nazli B; Merke, Deborah P

    2012-12-01

    Congenital adrenal hyperplasia (CAH) due to 21-hydroxylase deficiency (21-OHD) is an autosomal recessive disorder of cortisol biosynthesis caused by CYP21A2 mutations. An increase in gene copy number variation (CNV) exists at the CYP21A2 locus. CNV of C4, a neighboring gene that encodes complement component 4, is associated with autoimmune disease susceptibility. In this study, we performed comprehensive genetic analysis of the RP-C4-CYP21-TNX (RCCX) region in 127 unrelated 21-OHD patients (100 classic, 27 nonclassic). C4 copy number was determined by Southern blot. C4 CNV and serum C4 levels were evaluated in relation to CYP21A2 mutations and relevant phenotypes. We found that the most common CYP21A2 mutation associated with the nonclassic form of CAH, V281L, was associated with high C4 copy number (p = 7.13 × 10(-16)). Large CYP21A2 deletion, a common mutation associated with the classic form of CAH, was associated with low C4 copy number (p = 1.61 × 10(-14)). Monomodular RCCX with a short C4 gene, a risk factor for autoimmune disease, was significantly less frequent in CAH patients compared to population estimates (2.8 vs. 10.6 %; p = 1.08 × 10(-4)). In conclusion, CAH patients have increased C4 CNV, with mutation-specific associations that may be protective for autoimmune disease. The study of CYP21A2 in relation to neighboring genes provides insight into the genetics of CNV hotspots, an important determinant of human health.

  11. Identification of copy number variation-driven genes for liver cancer via bioinformatics analysis.

    PubMed

    Lu, Xiaojie; Ye, Kun; Zou, Kailin; Chen, Jinlian

    2014-11-01

    To screen out copy number variation (CNV)-driven differentially expressed genes (DEGs) in liver cancer and advance our understanding of the pathogenesis, an integrated analysis of liver cancer-related CNV data from The Cancer Genome Atlas (TCGA) and gene expression data from EBI Array Express database were performed. The DEGs were identified by package limma based on the cut-off of |log2 (fold-change)|>0.585 and adjusted p-value<0.05. Using hg19 annotation information provided by UCSC, liver cancer-related CNVs were then screened out. TF-target gene interactions were also predicted with information from UCSC using DAVID online tools. As a result, 25 CNV-driven genes were obtained, including tripartite motif containing 28 (TRIM28) and RanBP-type and C3HC4-type zinc finger containing 1 (RBCK1). In the transcriptional regulatory network, 8 known cancer-related transcription factors (TFs) interacted with 21 CNV-driven genes, suggesting that the other 8 TFs may be involved in liver cancer. These genes may be potential biomarkers for early detection and prevention of liver cancer. These findings may improve our knowledge of the pathogenesis of liver cancer. Nevertheless, further experiments are still needed to confirm our findings.

  12. Rethinking the starch digestion hypothesis for AMY1 copy number variation in humans.

    PubMed

    Fernández, Catalina I; Wiley, Andrea S

    2017-08-01

    Alpha-amylase exists across taxonomic kingdoms with a deep evolutionary history of gene duplications that resulted in several α-amylase paralogs. Copy number variation (CNV) in the salivary α-amylase gene (AMY1) exists in many taxa, but among primates, humans appear to have higher average AMY1 copies than nonhuman primates. Additionally, AMY1 CNV in humans has been associated with starch content of diets, and one known function of α-amylase is its involvement in starch digestion. Thus high AMY1 CNV is considered to result from selection favoring more efficient starch digestion in the Homo lineage. Here, we present several lines of evidence that challenge the hypothesis that increased AMY1 CNV is an adaptation to starch consumption. We observe that α- amylase plays a very limited role in starch digestion, with additional steps required for starch digestion and glucose metabolism. Specifically, we note that α-amylase hydrolysis only produces a minute amount of free glucose with further enzymatic digestion and glucose absorption being rate-limiting steps for glucose availability. Indeed α-amylase is nonessential for starch digestion since sucrase-isomaltase and maltase-glucoamylase can hydrolyze whole starch granules while releasing glucose. While higher AMY1 CN and CNV among human populations may result from natural selection, existing evidence does not support starch digestion as the major selective force. We report that in humans α-amylase is expressed in several other tissues where it may have potential roles of evolutionary significance. © 2017 Wiley Periodicals, Inc.

  13. A Poisson hierarchical modelling approach to detecting copy number variation in sequence coverage data.

    PubMed

    Sepúlveda, Nuno; Campino, Susana G; Assefa, Samuel A; Sutherland, Colin J; Pain, Arnab; Clark, Taane G

    2013-02-26

    The advent of next generation sequencing technology has accelerated efforts to map and catalogue copy number variation (CNV) in genomes of important micro-organisms for public health. A typical analysis of the sequence data involves mapping reads onto a reference genome, calculating the respective coverage, and detecting regions with too-low or too-high coverage (deletions and amplifications, respectively). Current CNV detection methods rely on statistical assumptions (e.g., a Poisson model) that may not hold in general, or require fine-tuning the underlying algorithms to detect known hits. We propose a new CNV detection methodology based on two Poisson hierarchical models, the Poisson-Gamma and Poisson-Lognormal, with the advantage of being sufficiently flexible to describe different data patterns, whilst robust against deviations from the often assumed Poisson model. Using sequence coverage data of 7 Plasmodium falciparum malaria genomes (3D7 reference strain, HB3, DD2, 7G8, GB4, OX005, and OX006), we showed that empirical coverage distributions are intrinsically asymmetric and overdispersed in relation to the Poisson model. We also demonstrated a low baseline false positive rate for the proposed methodology using 3D7 resequencing data and simulation. When applied to the non-reference isolate data, our approach detected known CNV hits, including an amplification of the PfMDR1 locus in DD2 and a large deletion in the CLAG3.2 gene in GB4, and putative novel CNV regions. When compared to the recently available FREEC and cn.MOPS approaches, our findings were more concordant with putative hits from the highest quality array data for the 7G8 and GB4 isolates. In summary, the proposed methodology brings an increase in flexibility, robustness, accuracy and statistical rigour to CNV detection using sequence coverage data.

  14. A Poisson hierarchical modelling approach to detecting copy number variation in sequence coverage data

    PubMed Central

    2013-01-01

    Background The advent of next generation sequencing technology has accelerated efforts to map and catalogue copy number variation (CNV) in genomes of important micro-organisms for public health. A typical analysis of the sequence data involves mapping reads onto a reference genome, calculating the respective coverage, and detecting regions with too-low or too-high coverage (deletions and amplifications, respectively). Current CNV detection methods rely on statistical assumptions (e.g., a Poisson model) that may not hold in general, or require fine-tuning the underlying algorithms to detect known hits. We propose a new CNV detection methodology based on two Poisson hierarchical models, the Poisson-Gamma and Poisson-Lognormal, with the advantage of being sufficiently flexible to describe different data patterns, whilst robust against deviations from the often assumed Poisson model. Results Using sequence coverage data of 7 Plasmodium falciparum malaria genomes (3D7 reference strain, HB3, DD2, 7G8, GB4, OX005, and OX006), we showed that empirical coverage distributions are intrinsically asymmetric and overdispersed in relation to the Poisson model. We also demonstrated a low baseline false positive rate for the proposed methodology using 3D7 resequencing data and simulation. When applied to the non-reference isolate data, our approach detected known CNV hits, including an amplification of the PfMDR1 locus in DD2 and a large deletion in the CLAG3.2 gene in GB4, and putative novel CNV regions. When compared to the recently available FREEC and cn.MOPS approaches, our findings were more concordant with putative hits from the highest quality array data for the 7G8 and GB4 isolates. Conclusions In summary, the proposed methodology brings an increase in flexibility, robustness, accuracy and statistical rigour to CNV detection using sequence coverage data. PMID:23442253

  15. A randomized approach to speed up the analysis of large-scale read-count data in the application of CNV detection.

    PubMed

    Wang, WeiBo; Sun, Wei; Wang, Wei; Szatkiewicz, Jin

    2018-03-01

    The application of high-throughput sequencing in a broad range of quantitative genomic assays (e.g., DNA-seq, ChIP-seq) has created a high demand for the analysis of large-scale read-count data. Typically, the genome is divided into tiling windows and windowed read-count data is generated for the entire genome from which genomic signals are detected (e.g. copy number changes in DNA-seq, enrichment peaks in ChIP-seq). For accurate analysis of read-count data, many state-of-the-art statistical methods use generalized linear models (GLM) coupled with the negative-binomial (NB) distribution by leveraging its ability for simultaneous bias correction and signal detection. However, although statistically powerful, the GLM+NB method has a quadratic computational complexity and therefore suffers from slow running time when applied to large-scale windowed read-count data. In this study, we aimed to speed up substantially the GLM+NB method by using a randomized algorithm and we demonstrate here the utility of our approach in the application of detecting copy number variants (CNVs) using a real example. We propose an efficient estimator, the randomized GLM+NB coefficients estimator (RGE), for speeding up the GLM+NB method. RGE samples the read-count data and solves the estimation problem on a smaller scale. We first theoretically validated the consistency and the variance properties of RGE. We then applied RGE to GENSENG, a GLM+NB based method for detecting CNVs. We named the resulting method as "R-GENSENG". Based on extensive evaluation using both simulated and empirical data, we concluded that R-GENSENG is ten times faster than the original GENSENG while maintaining GENSENG's accuracy in CNV detection. Our results suggest that RGE strategy developed here could be applied to other GLM+NB based read-count analyses, i.e. ChIP-seq data analysis, to substantially improve their computational efficiency while preserving the analytic power.

  16. Copy number variation and missense mutations of the agouti signaling protein (ASIP) gene in goat breeds with different coat colors.

    PubMed

    Fontanesi, L; Beretti, F; Riggio, V; Gómez González, E; Dall'Olio, S; Davoli, R; Russo, V; Portolano, B

    2009-01-01

    In goats, classical genetic studies reported a large number of alleles at the Agouti locus with effects on coat color and pattern distribution. From these early studies, the dominant A(Wt) (white/tan) allele was suggested to cause the white color of the Saanen breed. Here, we sequenced the coding region of the goat ASIP gene in 6 goat breeds (Girgentana, Maltese, Derivata di Siria, Murciano-Granadina, Camosciata delle Alpi, and Saanen), with different coat colors and patterns. Five single nucleotide polymorphisms (SNPs) were identified, 3 of which caused missense mutations in conserved positions of the cysteine-rich carboxy-terminal domain of the protein (p.Ala96Gly, p.Cys126Gly, and p.Val128Gly). Allele and genotype frequencies suggested that these mutations are not associated or not completely associated with coat color in the investigated goat breeds. Moreover, genotyping and sequencing results, deviation from Hardy-Weinberg equilibrium, as well as allele copy number evaluation from semiquantitative fluorescent multiplex PCR, indicated the presence of copy number variation (CNV) in all investigated breeds. To confirm the presence of CNV and evaluate its extension, we applied a bovine-goat cross-species array comparative genome hybridization (aCGH) experiment using a custom tiling array based on bovine chromosome 13. aCGH results obtained for 8 goat DNA samples confirmed the presence of CNV affecting a region of less that 100 kb including the ASIP and AHCY genes. In Girgentana and Saanen breeds, this CNV might cause the A(Wt) allele, as already suggested for a similar structural mutation in sheep affecting the ASIP and AHCY genes, providing evidence for a recurrent interspecies CNV. However, other mechanisms may also be involved in determining coat color in these 2 breeds. Copyright 2009 S. Karger AG, Basel.

  17. An intergenic risk locus containing an enhancer deletion in 2q35 modulates breast cancer risk by deregulating IGFBP5 expression.

    PubMed

    Wyszynski, Asaf; Hong, Chi-Chen; Lam, Kristin; Michailidou, Kyriaki; Lytle, Christian; Yao, Song; Zhang, Yali; Bolla, Manjeet K; Wang, Qin; Dennis, Joe; Hopper, John L; Southey, Melissa C; Schmidt, Marjanka K; Broeks, Annegien; Muir, Kenneth; Lophatananon, Artitaya; Fasching, Peter A; Beckmann, Matthias W; Peto, Julian; Dos-Santos-Silva, Isabel; Sawyer, Elinor J; Tomlinson, Ian; Burwinkel, Barbara; Marme, Frederik; Guénel, Pascal; Truong, Thérèse; Bojesen, Stig E; Nordestgaard, Børge G; González-Neira, Anna; Benitez, Javier; Neuhausen, Susan L; Brenner, Hermann; Dieffenbach, Aida Karina; Meindl, Alfons; Schmutzler, Rita K; Brauch, Hiltrud; Nevanlinna, Heli; Khan, Sofia; Matsuo, Keitaro; Ito, Hidemi; Dörk, Thilo; Bogdanova, Natalia V; Lindblom, Annika; Margolin, Sara; Mannermaa, Arto; Kosma, Veli-Matti; Wu, Anna H; Van Den Berg, David; Lambrechts, Diether; Wildiers, Hans; Chang-Claude, Jenny; Rudolph, Anja; Radice, Paolo; Peterlongo, Paolo; Couch, Fergus J; Olson, Janet E; Giles, Graham G; Milne, Roger L; Haiman, Christopher A; Henderson, Brian E; Dumont, Martine; Teo, Soo Hwang; Wong, Tien Y; Kristensen, Vessela; Zheng, Wei; Long, Jirong; Winqvist, Robert; Pylkäs, Katri; Andrulis, Irene L; Knight, Julia A; Devilee, Peter; Seynaeve, Caroline; García-Closas, Montserrat; Figueroa, Jonine; Klevebring, Daniel; Czene, Kamila; Hooning, Maartje J; van den Ouweland, Ans M W; Darabi, Hatef; Shu, Xiao-Ou; Gao, Yu-Tang; Cox, Angela; Blot, William; Signorello, Lisa B; Shah, Mitul; Kang, Daehee; Choi, Ji-Yeob; Hartman, Mikael; Miao, Hui; Hamann, Ute; Jakubowska, Anna; Lubinski, Jan; Sangrajrang, Suleeporn; McKay, James; Toland, Amanda E; Yannoukakos, Drakoulis; Shen, Chen-Yang; Wu, Pei-Ei; Swerdlow, Anthony; Orr, Nick; Simard, Jacques; Pharoah, Paul D P; Dunning, Alison M; Chenevix-Trench, Georgia; Hall, Per; Bandera, Elisa; Amos, Chris; Ambrosone, Christine; Easton, Douglas F; Cole, Michael D

    2016-09-01

    Breast cancer is the most diagnosed malignancy and the second leading cause of cancer mortality in females. Previous association studies have identified variants on 2q35 associated with the risk of breast cancer. To identify functional susceptibility loci for breast cancer, we interrogated the 2q35 gene desert for chromatin architecture and functional variation correlated with gene expression. We report a novel intergenic breast cancer risk locus containing an enhancer copy number variation (enCNV; deletion) located approximately 400Kb upstream to IGFBP5, which overlaps an intergenic ERα-bound enhancer that loops to the IGFBP5 promoter. The enCNV is correlated with modified ERα binding and monoallelic-repression of IGFBP5 following oestrogen treatment. We investigated the association of enCNV genotype with breast cancer in 1,182 cases and 1,362 controls, and replicate our findings in an independent set of 62,533 cases and 60,966 controls from 41 case control studies and 11 GWAS. We report a dose-dependent inverse association of 2q35 enCNV genotype (percopy OR = 0.68 95%CI 0.55-0.83, P = 0.0002; replication OR = 0.77 95% CI 0.73-0.82, P = 2.1 × 10 -19 ) and identify 13 additional linked variants (r 2  >   0.8) in the 20Kb linkage block containing the enCNV (P = 3.2 × 10 -15 - 5.6 × 10 -17 ). These associations were independent of previously reported 2q35 variants, rs13387042/rs4442975 and rs16857609, and were stronger for ER-positive than ER-negative disease. Together, these results suggest that 2q35 breast cancer risk loci may be mediating their effect through IGFBP5. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  18. Rare De Novo Copy Number Variants in Patients with Congenital Pulmonary Atresia

    PubMed Central

    Xie, Li; Chen, Jin-Lan; Zhang, Wei-Zhi; Wang, Shou-Zheng; Zhao, Tian-Li; Huang, Can; Wang, Jian; Yang, Jin-Fu; Yang, Yi-Feng; Tan, Zhi-Ping

    2014-01-01

    Background Ongoing studies using genomic microarrays and next-generation sequencing have demonstrated that the genetic contributions to cardiovascular diseases have been significantly ignored in the past. The aim of this study was to identify rare copy number variants in individuals with congenital pulmonary atresia (PA). Methods and Results Based on the hypothesis that rare structural variants encompassing key genes play an important role in heart development in PA patients, we performed high-resolution genome-wide microarrays for copy number variations (CNVs) in 82 PA patient-parent trios and 189 controls with an Illumina SNP array platform. CNVs were identified in 17/82 patients (20.7%), and eight of these CNVs (9.8%) are considered potentially pathogenic. Five de novo CNVs occurred at two known congenital heart disease (CHD) loci (16p13.1 and 22q11.2). Two de novo CNVs that may affect folate and vitamin B12 metabolism were identified for the first time. A de novo 1-Mb deletion at 17p13.2 may represent a rare genomic disorder that involves mild intellectual disability and associated facial features. Conclusions Rare CNVs contribute to the pathogenesis of PA (9.8%), suggesting that the causes of PA are heterogeneous and pleiotropic. Together with previous data from animal models, our results might help identify a link between CHD and folate-mediated one-carbon metabolism (FOCM). With the accumulation of high-resolution SNP array data, these previously undescribed rare CNVs may help reveal critical gene(s) in CHD and may provide novel insights about CHD pathogenesis. PMID:24826987

  19. Rare de novo copy number variants in patients with congenital pulmonary atresia.

    PubMed

    Xie, Li; Chen, Jin-Lan; Zhang, Wei-Zhi; Wang, Shou-Zheng; Zhao, Tian-Li; Huang, Can; Wang, Jian; Yang, Jin-Fu; Yang, Yi-Feng; Tan, Zhi-Ping

    2014-01-01

    Ongoing studies using genomic microarrays and next-generation sequencing have demonstrated that the genetic contributions to cardiovascular diseases have been significantly ignored in the past. The aim of this study was to identify rare copy number variants in individuals with congenital pulmonary atresia (PA). Based on the hypothesis that rare structural variants encompassing key genes play an important role in heart development in PA patients, we performed high-resolution genome-wide microarrays for copy number variations (CNVs) in 82 PA patient-parent trios and 189 controls with an Illumina SNP array platform. CNVs were identified in 17/82 patients (20.7%), and eight of these CNVs (9.8%) are considered potentially pathogenic. Five de novo CNVs occurred at two known congenital heart disease (CHD) loci (16p13.1 and 22q11.2). Two de novo CNVs that may affect folate and vitamin B12 metabolism were identified for the first time. A de novo 1-Mb deletion at 17p13.2 may represent a rare genomic disorder that involves mild intellectual disability and associated facial features. Rare CNVs contribute to the pathogenesis of PA (9.8%), suggesting that the causes of PA are heterogeneous and pleiotropic. Together with previous data from animal models, our results might help identify a link between CHD and folate-mediated one-carbon metabolism (FOCM). With the accumulation of high-resolution SNP array data, these previously undescribed rare CNVs may help reveal critical gene(s) in CHD and may provide novel insights about CHD pathogenesis.

  20. Copy number variation detection in cattle reveals potential breed specific differences

    USDA-ARS?s Scientific Manuscript database

    Copy Number Variations (CNVs) are large, common deletions or duplications of genome sequence among individuals of a species that have been linked to diseases and phenotypic traits. For example, a CNV-generating, translocation mechanism encompassing the KIT gene is responsible for color sidedness in ...

  1. Homo sapiens exhibit a distinct pattern of CNV genes regulation: an important role of miRNAs and SNPs in expression plasticity.

    PubMed

    Dweep, Harsh; Kubikova, Nada; Gretz, Norbert; Voskarides, Konstantinos; Felekkis, Kyriacos

    2015-07-16

    Gene expression regulation is a complex and highly organized process involving a variety of genomic factors. It is widely accepted that differences in gene expression can contribute to the phenotypic variability between species, and that their interpretation can aid in the understanding of the physiologic variability. CNVs and miRNAs are two major players in the regulation of expression plasticity and may be responsible for the unique phenotypic characteristics observed in different lineages. We have previously demonstrated that a close interaction between these two genomic elements may have contributed to the regulation of gene expression during evolution. This work presents the molecular interactions between CNV and non CNV genes with miRNAs and other genomic elements in eight different species. A comprehensive analysis of these interactions indicates a unique nature of human CNV genes regulation as compared to other species. By using genes with short 3' UTR that abolish the "canonical" miRNA-dependent regulation, as a model, we demonstrate a distinct and tight regulation of human genes that might explain some of the unique features of human physiology. In addition, comparison of gene expression regulation between species indicated that there is a significant difference between humans and mice possibly questioning the effectiveness of the latest as experimental models of human diseases.

  2. A genome-wide detection of copy number variation using SNP genotyping arrays in Beijing-You chickens.

    PubMed

    Zhou, Wei; Liu, Ranran; Zhang, Jingjing; Zheng, Maiqing; Li, Peng; Chang, Guobin; Wen, Jie; Zhao, Guiping

    2014-10-01

    Copy number variation (CNV) has been recently examined in many species and is recognized as being a source of genetic variability, especially for disease-related phenotypes. In this study, the PennCNV software, a genome-wide CNV detection system based on the 60 K SNP BeadChip was used on a total sample size of 1,310 Beijing-You chickens (a Chinese local breed). After quality control, 137 high confidence CNVRs covering 27.31 Mb of the chicken genome and corresponding to 2.61 % of the whole chicken genome. Within these regions, 131 known genes or coding sequences were involved. Q-PCR was applied to verify some of the genes related to disease development. Results showed that copy number of genes such as, phosphatidylinositol-5-phosphate 4-kinase II alpha, PHD finger protein 14, RHACD8 (a CD8α- like messenger RNA), MHC B-G, zinc finger protein, sarcosine dehydrogenase and ficolin 2 varied between individual chickens, which also supports the reliability of chip-detection of the CNVs. As one source of genomic variation, CNVs may provide new insight into the relationship between the genome and phenotypic characteristics.

  3. Comparison of microfluidic digital PCR and conventional quantitative PCR for measuring copy number variation.

    PubMed

    Whale, Alexandra S; Huggett, Jim F; Cowen, Simon; Speirs, Valerie; Shaw, Jacqui; Ellison, Stephen; Foy, Carole A; Scott, Daniel J

    2012-06-01

    One of the benefits of Digital PCR (dPCR) is the potential for unparalleled precision enabling smaller fold change measurements. An example of an assessment that could benefit from such improved precision is the measurement of tumour-associated copy number variation (CNV) in the cell free DNA (cfDNA) fraction of patient blood plasma. To investigate the potential precision of dPCR and compare it with the established technique of quantitative PCR (qPCR), we used breast cancer cell lines to investigate HER2 gene amplification and modelled a range of different CNVs. We showed that, with equal experimental replication, dPCR could measure a smaller CNV than qPCR. As dPCR precision is directly dependent upon both the number of replicate measurements and the template concentration, we also developed a method to assist the design of dPCR experiments for measuring CNV. Using an existing model (based on Poisson and binomial distributions) to derive an expression for the variance inherent in dPCR, we produced a power calculation to define the experimental size required to reliably detect a given fold change at a given template concentration. This work will facilitate any future translation of dPCR to key diagnostic applications, such as cancer diagnostics and analysis of cfDNA.

  4. Predictive Genes in Adjacent Normal Tissue Are Preferentially Altered by sCNV during Tumorigenesis in Liver Cancer and May Rate Limiting

    PubMed Central

    Lamb, John R.; Zhang, Chunsheng; Xie, Tao; Wang, Kai; Zhang, Bin; Hao, Ke; Chudin, Eugene; Fraser, Hunter B.; Millstein, Joshua; Ferguson, Mark; Suver, Christine; Ivanovska, Irena; Scott, Martin; Philippar, Ulrike; Bansal, Dimple; Zhang, Zhan; Burchard, Julja; Smith, Ryan; Greenawalt, Danielle; Cleary, Michele; Derry, Jonathan; Loboda, Andrey; Watters, James; Poon, Ronnie T. P.; Fan, Sheung T.; Yeung, Chun; Lee, Nikki P. Y.; Guinney, Justin; Molony, Cliona; Emilsson, Valur; Buser-Doepner, Carolyn; Zhu, Jun; Friend, Stephen; Mao, Mao; Shaw, Peter M.; Dai, Hongyue; Luk, John M.; Schadt, Eric E.

    2011-01-01

    Background In hepatocellular carcinoma (HCC) genes predictive of survival have been found in both adjacent normal (AN) and tumor (TU) tissues. The relationships between these two sets of predictive genes and the general process of tumorigenesis and disease progression remains unclear. Methodology/Principal Findings Here we have investigated HCC tumorigenesis by comparing gene expression, DNA copy number variation and survival using ∼250 AN and TU samples representing, respectively, the pre-cancer state, and the result of tumorigenesis. Genes that participate in tumorigenesis were defined using a gene-gene correlation meta-analysis procedure that compared AN versus TU tissues. Genes predictive of survival in AN (AN-survival genes) were found to be enriched in the differential gene-gene correlation gene set indicating that they directly participate in the process of tumorigenesis. Additionally the AN-survival genes were mostly not predictive after tumorigenesis in TU tissue and this transition was associated with and could largely be explained by the effect of somatic DNA copy number variation (sCNV) in cis and in trans. The data was consistent with the variance of AN-survival genes being rate-limiting steps in tumorigenesis and this was confirmed using a treatment that promotes HCC tumorigenesis that selectively altered AN-survival genes and genes differentially correlated between AN and TU. Conclusions/Significance This suggests that the process of tumor evolution involves rate-limiting steps related to the background from which the tumor evolved where these were frequently predictive of clinical outcome. Additionally treatments that alter the likelihood of tumorigenesis occurring may act by altering AN-survival genes, suggesting that the process can be manipulated. Further sCNV explains a substantial fraction of tumor specific expression and may therefore be a causal driver of tumor evolution in HCC and perhaps many solid tumor types. PMID:21750698

  5. Association between copy number variation losses and alcohol dependence across African American and European American ethnic groups.

    PubMed

    Ulloa, Alvaro E; Chen, Jiayu; Vergara, Victor M; Calhoun, Vince; Liu, Jingyu

    2014-05-01

    Copy number variations (CNVs) are structural genetic mutations consisting of segmental gains or losses in DNA sequence. Although CNVs contribute substantially to genomic variation, few genetic and imaging studies report association of CNVs with alcohol dependence (AD). Our purpose is to find evidence of this association across ethnic populations and genders. This work is the first AD-CNV study across ethnic groups and the first to include the African American (AA) population. This study considers 2 CNV data sets, one for discovery (2,345 samples) and the other for validation (239 samples), both including subjects with AD and healthy controls of European and African ancestry. Our analysis assesses the association between AD and CNV losses across ethnic groups and gender by examining the effect of overall losses across the whole genome, collective losses within individual cytogenetic bands, and specific losses in CNV regions. Results from the discovery data set showed an association between CNV losses within 16q12.2 and AD diagnosis (p = 4.53 × 10(-3) ). An overlapping CNV region from the validation data set exhibited the same direction of effect with respect to AD (p = 0.051). This CNV region affects the genes CES1p1 and CES1, which are members of the carboxylesterase (CES) family. The enzyme encoded by CES1 is a major liver enzyme that typically catalyzes the decomposition of ester into alcohol and carboxylic acid and is involved in drug or xenobiotics, fatty acid, and cholesterol metabolisms. In addition, the most significantly associated CNV region was located at 9p21.2 (p = 1.9 × 10(-3) ) in our discovery data set. Although not observed in the validation data set, probably due to small sample size, this result might hold potential connection to AD given its connection with neuronal death. In contrast, we did not find any association between AD and the overall total losses or the collective losses within individual cytogenetic bands. Overall, our study provides

  6. Genomic Architecture of Aggression: Rare Copy Number Variants in Intermittent Explosive Disorder

    PubMed Central

    Vu, Tiffany H; Coccaro, Emil F; Eichler, Evan E; Girirajan, Santhosh

    2011-01-01

    Copy number variants (CNVs) are known to be associated with complex neuropsychiatric disorders (e.g., schizophrenia and autism) but have not been explored in the isolated features of aggressive behaviors such as intermittent explosive disorder (IED). IED is characterized by recurrent episodes of aggression in which individuals act impulsively and grossly out of proportion from the involved stressors. Previous studies have identified genetic variants in the serotonergic pathway that play a role in susceptibility to this behavior, but additional contributors have not been identified. Therefore, to further delineate possible genetic influences, we investigated CNVs in individuals diagnosed with IED and/or personality disorder (PD). We carried out array comparative genomic hybridization on 113 samples of individuals with isolated features of IED (n = 90) or PD (n = 23). We detected a recurrent 1.35-Mbp deletion on chromosome 1q21.1 in one IED subject and a novel ∼350-kbp deletion on chromosome 16q22.3q23.1 in another IED subject. While five recent reports have suggested the involvement of an ∼1.6-Mbp 15q13.3 deletion in individuals with behavioral problems, particularly aggression, we report an absence of such events in our study of individuals specifically selected for aggression. We did, however, detect a smaller ∼430-kbp 15q13.3 duplication containing CHRNA7 in one individual with PD. While these results suggest a possible role for rare CNVs in identifying genes underlying IED or PD, further studies on a large number of well-characterized individuals are necessary. © 2011 Wiley-Liss, Inc. PMID:21812102

  7. Copy number variation of human AMY1 is a minor contributor to variation in salivary amylase expression and activity.

    PubMed

    Carpenter, Danielle; Mitchell, Laura M; Armour, John A L

    2017-02-20

    Salivary amylase in humans is encoded by the copy variable gene AMY1 in the amylase gene cluster on chromosome 1. Although the role of salivary amylase is well established, the consequences of the copy number variation (CNV) at AMY1 on salivary amylase protein production are less well understood. The amylase gene cluster is highly structured with a fundamental difference between odd and even AMY1 copy number haplotypes. In this study, we aimed to explore, in samples from 119 unrelated individuals, not only the effects of AMY1 CNV on salivary amylase protein expression and amylase enzyme activity but also whether there is any evidence for underlying difference between the common haplotypes containing odd numbers of AMY1 and even copy number haplotypes. AMY1 copy number was significantly correlated with the variation observed in salivary amylase production (11.7% of variance, P < 0.0005) and enzyme activity (13.6% of variance, P < 0.0005) but did not explain the majority of observed variation between individuals. AMY1-odd and AMY1-even haplotypes showed a different relationship between copy number and expression levels, but the difference was not statistically significant (P = 0.052). Production of salivary amylase is correlated with AMY1 CNV, but the majority of interindividual variation comes from other sources. Long-range haplotype structure may affect expression, but this was not significant in our data.

  8. Single-Cell-Based Platform for Copy Number Variation Profiling through Digital Counting of Amplified Genomic DNA Fragments.

    PubMed

    Li, Chunmei; Yu, Zhilong; Fu, Yusi; Pang, Yuhong; Huang, Yanyi

    2017-04-26

    We develop a novel single-cell-based platform through digital counting of amplified genomic DNA fragments, named multifraction amplification (mfA), to detect the copy number variations (CNVs) in a single cell. Amplification is required to acquire genomic information from a single cell, while introducing unavoidable bias. Unlike prevalent methods that directly infer CNV profiles from the pattern of sequencing depth, our mfA platform denatures and separates the DNA molecules from a single cell into multiple fractions of a reaction mix before amplification. By examining the sequencing result of each fraction for a specific fragment and applying a segment-merge maximum likelihood algorithm to the calculation of copy number, we digitize the sequencing-depth-based CNV identification and thus provide a method that is less sensitive to the amplification bias. In this paper, we demonstrate a mfA platform through multiple displacement amplification (MDA) chemistry. When performing the mfA platform, the noise of MDA is reduced; therefore, the resolution of single-cell CNV identification can be improved to 100 kb. We can also determine the genomic region free of allelic drop-out with mfA platform, which is impossible for conventional single-cell amplification methods.

  9. Homo sapiens exhibit a distinct pattern of CNV genes regulation: an important role of miRNAs and SNPs in expression plasticity

    PubMed Central

    Dweep, Harsh; Kubikova, Nada; Gretz, Norbert; Voskarides, Konstantinos; Felekkis, Kyriacos

    2015-01-01

    Gene expression regulation is a complex and highly organized process involving a variety of genomic factors. It is widely accepted that differences in gene expression can contribute to the phenotypic variability between species, and that their interpretation can aid in the understanding of the physiologic variability. CNVs and miRNAs are two major players in the regulation of expression plasticity and may be responsible for the unique phenotypic characteristics observed in different lineages. We have previously demonstrated that a close interaction between these two genomic elements may have contributed to the regulation of gene expression during evolution. This work presents the molecular interactions between CNV and non CNV genes with miRNAs and other genomic elements in eight different species. A comprehensive analysis of these interactions indicates a unique nature of human CNV genes regulation as compared to other species. By using genes with short 3′ UTR that abolish the “canonical” miRNA-dependent regulation, as a model, we demonstrate a distinct and tight regulation of human genes that might explain some of the unique features of human physiology. In addition, comparison of gene expression regulation between species indicated that there is a significant difference between humans and mice possibly questioning the effectiveness of the latest as experimental models of human diseases. PMID:26178010

  10. Phenotypic Association Analyses With Copy Number Variation in Recurrent Depressive Disorder.

    PubMed

    Rucker, James J H; Tansey, Katherine E; Rivera, Margarita; Pinto, Dalila; Cohen-Woods, Sarah; Uher, Rudolf; Aitchison, Katherine J; Craddock, Nick; Owen, Michael J; Jones, Lisa; Jones, Ian; Korszun, Ania; Barnes, Michael R; Preisig, Martin; Mors, Ole; Maier, Wolfgang; Rice, John; Rietschel, Marcella; Holsboer, Florian; Farmer, Anne E; Craig, Ian W; Scherer, Stephen W; McGuffin, Peter; Breen, Gerome

    2016-02-15

    Defining the molecular genomic basis of the likelihood of developing depressive disorder is a considerable challenge. We previously associated rare, exonic deletion copy number variants (CNV) with recurrent depressive disorder (RDD). Sex chromosome abnormalities also have been observed to co-occur with RDD. In this reanalysis of our RDD dataset (N = 3106 cases; 459 screened control samples and 2699 population control samples), we further investigated the role of larger CNVs and chromosomal abnormalities in RDD and performed association analyses with clinical data derived from this dataset. We found an enrichment of Turner's syndrome among cases of depression compared with the frequency observed in a large population sample (N = 34,910) of live-born infants collected in Denmark (two-sided p = .023, odds ratio = 7.76 [95% confidence interval = 1.79-33.6]), a case of diploid/triploid mosaicism, and several cases of uniparental isodisomy. In contrast to our previous analysis, large deletion CNVs were no more frequent in cases than control samples, although deletion CNVs in cases contained more genes than control samples (two-sided p = .0002). After statistical correction for multiple comparisons, our data do not support a substantial role for CNVs in RDD, although (as has been observed in similar samples) occasional cases may harbor large variants with etiological significance. Genetic pleiotropy and sample heterogeneity suggest that very large sample sizes are required to study conclusively the role of genetic variation in mood disorders. Copyright © 2016 Society of Biological Psychiatry. Published by Elsevier Inc. All rights reserved.

  11. rDNA Copy Number Variants Are Frequent Passenger Mutations in Saccharomyces cerevisiae Deletion Collections and de Novo Transformants

    PubMed Central

    Kwan, Elizabeth X.; Wang, Xiaobin S.; Amemiya, Haley M.; Brewer, Bonita J.; Raghuraman, M. K.

    2016-01-01

    The Saccharomyces cerevisiae ribosomal DNA (rDNA) locus is known to exhibit greater instability relative to the rest of the genome. However, wild-type cells preferentially maintain a stable number of rDNA copies, suggesting underlying genetic control of the size of this locus. We performed a screen of a subset of the Yeast Knock-Out (YKO) single gene deletion collection to identify genetic regulators of this locus and to determine if rDNA copy number correlates with yeast replicative lifespan. While we found no correlation between replicative lifespan and rDNA size, we identified 64 candidate strains with significant rDNA copy number differences. However, in the process of validating candidate rDNA variants, we observed that independent isolates of our de novo gene deletion strains had unsolicited but significant changes in rDNA copy number. Moreover, we were not able to recapitulate rDNA phenotypes from the YKO yeast deletion collection. Instead, we found that the standard lithium acetate transformation protocol is a significant source of rDNA copy number variation, with lithium acetate exposure being the treatment causing variable rDNA copy number events after transformation. As the effects of variable rDNA copy number are being increasingly reported, our finding that rDNA is affected by lithium acetate exposure suggested that rDNA copy number variants may be influential passenger mutations in standard strain construction in S. cerevisiae. PMID:27449518

  12. rDNA Copy Number Variants Are Frequent Passenger Mutations in Saccharomyces cerevisiae Deletion Collections and de Novo Transformants.

    PubMed

    Kwan, Elizabeth X; Wang, Xiaobin S; Amemiya, Haley M; Brewer, Bonita J; Raghuraman, M K

    2016-09-08

    The Saccharomyces cerevisiae ribosomal DNA (rDNA) locus is known to exhibit greater instability relative to the rest of the genome. However, wild-type cells preferentially maintain a stable number of rDNA copies, suggesting underlying genetic control of the size of this locus. We performed a screen of a subset of the Yeast Knock-Out (YKO) single gene deletion collection to identify genetic regulators of this locus and to determine if rDNA copy number correlates with yeast replicative lifespan. While we found no correlation between replicative lifespan and rDNA size, we identified 64 candidate strains with significant rDNA copy number differences. However, in the process of validating candidate rDNA variants, we observed that independent isolates of our de novo gene deletion strains had unsolicited but significant changes in rDNA copy number. Moreover, we were not able to recapitulate rDNA phenotypes from the YKO yeast deletion collection. Instead, we found that the standard lithium acetate transformation protocol is a significant source of rDNA copy number variation, with lithium acetate exposure being the treatment causing variable rDNA copy number events after transformation. As the effects of variable rDNA copy number are being increasingly reported, our finding that rDNA is affected by lithium acetate exposure suggested that rDNA copy number variants may be influential passenger mutations in standard strain construction in S. cerevisiae. Copyright © 2016 Kwan et al.

  13. Detection of pathogenic copy number variants in children with idiopathic intellectual disability using 500 K SNP array genomic hybridization

    PubMed Central

    2009-01-01

    Background Array genomic hybridization is being used clinically to detect pathogenic copy number variants in children with intellectual disability and other birth defects. However, there is no agreement regarding the kind of array, the distribution of probes across the genome, or the resolution that is most appropriate for clinical use. Results We performed 500 K Affymetrix GeneChip® array genomic hybridization in 100 idiopathic intellectual disability trios, each comprised of a child with intellectual disability of unknown cause and both unaffected parents. We found pathogenic genomic imbalance in 16 of these 100 individuals with idiopathic intellectual disability. In comparison, we had found pathogenic genomic imbalance in 11 of 100 children with idiopathic intellectual disability in a previous cohort who had been studied by 100 K GeneChip® array genomic hybridization. Among 54 intellectual disability trios selected from the previous cohort who were re-tested with 500 K GeneChip® array genomic hybridization, we identified all 10 previously-detected pathogenic genomic alterations and at least one additional pathogenic copy number variant that had not been detected with 100 K GeneChip® array genomic hybridization. Many benign copy number variants, including one that was de novo, were also detected with 500 K array genomic hybridization, but it was possible to distinguish the benign and pathogenic copy number variants with confidence in all but 3 (1.9%) of the 154 intellectual disability trios studied. Conclusion Affymetrix GeneChip® 500 K array genomic hybridization detected pathogenic genomic imbalance in 10 of 10 patients with idiopathic developmental disability in whom 100 K GeneChip® array genomic hybridization had found genomic imbalance, 1 of 44 patients in whom 100 K GeneChip® array genomic hybridization had found no abnormality, and 16 of 100 patients who had not previously been tested. Effective clinical interpretation of these studies requires

  14. SULT1A1 copy number variation: ethnic distribution analysis in an Indian population.

    PubMed

    Almal, Suhani; Padh, Harish

    2017-11-01

    Cytosolic sulfotransferases (SULTs) are phase II detoxification enzymes involved in metabolism of numerous xenobiotics, drugs and endogenous compounds. Interindividual variation in sulfonation capacity is important for determining an individual's response to xenobiotics. SNPs in SULTs, mainly SULT1A1 have been associated with cancer risk and also with response to therapeutic agents. Copy number variation (CNVs) in SULT1A1 is found to be correlated with altered enzyme activity. This short report primarily focuses on CNV in SULT1A1 and its distribution among different ethnic populations around the globe. Frequency distribution of SULT1A1 copy number (CN) in 157 healthy Indian individuals was assessed using florescent-based quantitative PCR assay. A range of 1 to >4 copies, with a frequency of SULT1A1 CN =2 (64.9%) the highest, was observed in our (Indian) population. Upon comparative analysis of frequency distribution of SULT1A1 CN among diverse population groups, a statistically significant difference was observed between Indians (our data) and African-American (AA) (p = 0.0001) and South African (Tswana) (p < 0.0001) populations. Distribution of CNV in the Indian population was found to be similar to that in European-derived populations of American and Japanese. CNV of SULT1A1 varies significantly among world populations and may be one of the determinants of health and diseases.

  15. A method for generating new datasets based on copy number for cancer analysis.

    PubMed

    Kim, Shinuk; Kon, Mark; Kang, Hyunsik

    2015-01-01

    New data sources for the analysis of cancer data are rapidly supplementing the large number of gene-expression markers used for current methods of analysis. Significant among these new sources are copy number variation (CNV) datasets, which typically enumerate several hundred thousand CNVs distributed throughout the genome. Several useful algorithms allow systems-level analyses of such datasets. However, these rich data sources have not yet been analyzed as deeply as gene-expression data. To address this issue, the extensive toolsets used for analyzing expression data in cancerous and noncancerous tissue (e.g., gene set enrichment analysis and phenotype prediction) could be redirected to extract a great deal of predictive information from CNV data, in particular those derived from cancers. Here we present a software package capable of preprocessing standard Agilent copy number datasets into a form to which essentially all expression analysis tools can be applied. We illustrate the use of this toolset in predicting the survival time of patients with ovarian cancer or glioblastoma multiforme and also provide an analysis of gene- and pathway-level deletions in these two types of cancer.

  16. Identification of new risk factors for rolandic epilepsy: CNV at Xp22.31 and alterations at cholinergic synapses.

    PubMed

    Addis, Laura; Sproviero, William; Thomas, Sanjeev V; Caraballo, Roberto H; Newhouse, Stephen J; Gomez, Kumudini; Hughes, Elaine; Kinali, Maria; McCormick, David; Hannan, Siobhan; Cossu, Silvia; Taylor, Jacqueline; Akman, Cigdem I; Wolf, Steven M; Mandelbaum, David E; Gupta, Rajesh; van der Spek, Rick A; Pruna, Dario; Pal, Deb K

    2018-05-22

    Rolandic epilepsy (RE) is the most common genetic childhood epilepsy, consisting of focal, nocturnal seizures and frequent neurodevelopmental impairments in speech, language, literacy and attention. A complex genetic aetiology is presumed in most, with monogenic mutations in GRIN2A accounting for >5% of cases. To identify rare, causal CNV in patients with RE. We used high-density SNP arrays to analyse the presence of rare CNVs in 186 patients with RE from the UK, the USA, Sardinia, Argentina and Kerala, India. We identified 84 patients with one or more rare CNVs, and, within this group, 14 (7.5%) with recurrent risk factor CNVs and 15 (8.0%) with likely pathogenic CNVs. Nine patients carried recurrent hotspot CNVs including at 16p13.11 and 1p36, with the most striking finding that four individuals (three from Sardinia) carried a duplication, and one a deletion, at Xp22.31. Five patients with RE carried a rare CNV that disrupted genes associated with other epilepsies ( KCTD7 , ARHGEF15 , CACNA2D1, GRIN2A and ARHGEF4 ), and 17 cases carried CNVs that disrupted genes associated with other neurological conditions or that are involved in neuronal signalling/development. Network analysis of disrupted genes with high brain expression identified significant enrichment in pathways of the cholinergic synapse, guanine-exchange factor activation and the mammalian target of rapamycin. Our results provide a CNV profile of an ethnically diverse cohort of patients with RE, uncovering new areas of research focus, and emphasise the importance of studying non-western European populations in oligogenic disorders to uncover a full picture of risk variation. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2018. All rights reserved. No commercial use is permitted unless otherwise expressly granted.

  17. Copy-Number Variation of the Glucose Transporter Gene SLC2A3 and Congenital Heart Defects in the 22q11.2 Deletion Syndrome.

    PubMed

    Mlynarski, Elisabeth E; Sheridan, Molly B; Xie, Michael; Guo, Tingwei; Racedo, Silvia E; McDonald-McGinn, Donna M; Gai, Xiaowu; Chow, Eva W C; Vorstman, Jacob; Swillen, Ann; Devriendt, Koen; Breckpot, Jeroen; Digilio, Maria Cristina; Marino, Bruno; Dallapiccola, Bruno; Philip, Nicole; Simon, Tony J; Roberts, Amy E; Piotrowicz, Małgorzata; Bearden, Carrie E; Eliez, Stephan; Gothelf, Doron; Coleman, Karlene; Kates, Wendy R; Devoto, Marcella; Zackai, Elaine; Heine-Suñer, Damian; Shaikh, Tamim H; Bassett, Anne S; Goldmuntz, Elizabeth; Morrow, Bernice E; Emanuel, Beverly S

    2015-05-07

    The 22q11.2 deletion syndrome (22q11DS; velocardiofacial/DiGeorge syndrome; VCFS/DGS) is the most common microdeletion syndrome and the phenotypic presentation is highly variable. Approximately 65% of individuals with 22q11DS have a congenital heart defect (CHD), mostly of the conotruncal type, and/or an aortic arch defect. The etiology of this phenotypic variability is not currently known. We hypothesized that copy-number variants (CNVs) outside the 22q11.2 deleted region might increase the risk of being born with a CHD in this sensitized population. Genotyping with Affymetrix SNP Array 6.0 was performed on two groups of subjects with 22q11DS separated by time of ascertainment and processing. CNV analysis was completed on a total of 949 subjects (cohort 1, n = 562; cohort 2, n = 387), 603 with CHDs (cohort 1, n = 363; cohort 2, n = 240) and 346 with normal cardiac anatomy (cohort 1, n = 199; cohort 2, n = 147). Our analysis revealed that a duplication of SLC2A3 was the most frequent CNV identified in the first cohort. It was present in 18 subjects with CHDs and 1 subject without (p = 3.12 × 10(-3), two-tailed Fisher's exact test). In the second cohort, the SLC2A3 duplication was also significantly enriched in subjects with CHDs (p = 3.30 × 10(-2), two-tailed Fisher's exact test). The SLC2A3 duplication was the most frequent CNV detected and the only significant finding in our combined analysis (p = 2.68 × 10(-4), two-tailed Fisher's exact test), indicating that the SLC2A3 duplication might serve as a genetic modifier of CHDs and/or aortic arch anomalies in individuals with 22q11DS. Copyright © 2015 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.

  18. Type 2 diabetes mellitus disease risk genes identified by genome wide copy number variation scan in normal populations.

    PubMed

    Prabhanjan, Manasa; Suresh, Raviraj V; Murthy, Megha N; Ramachandra, Nallur B

    2016-03-01

    To identify the role of copy number variations (CNVs) on disease risk genes and its effect on disease phenotypes in type 2 diabetes mellitus (T2DM) in 12 random populations using high throughput arrays. CNV analysis was carried out on a total of 1715 individuals from 12 populations, from ArrayExpress Archive of the European Bioinformatics Institute along with our subjects using Affymetrix Genome Wide SNP 6.0 array. CNV effect on T2DM genes were analyzed using several bioinformatics tools and a molecular protein interaction network was constructed to identify the disease mechanism altered by the CNVs. Analysis showed 34.4% of the total population to be under CNV burden for T2DM, with 83 disease causal and associated genes being under CNV influence. Hotspots were identified on chromosomes 22, 12, 6, 19 and 11.Overlap studies with case cohorts revealed significant disease risk genes such as EGFR, E2F1, PPP1R3A, HLA and TSPAN8. CNVs play a significant role in predisposing T2DM in normal cohorts and contribute to the phenotypic effects. Thus, CNVs should be considered as one of the major contributors in predisposition of the disease. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

  19. Diversity and population-genetic properties of copy number variations and multicopy genes in cattle

    USDA-ARS?s Scientific Manuscript database

    The diversity and population-genetics of copy number variation (CNV) in domesticated animals are not well understood. In this study, we analyzed 75 genomes of major taurine and indicine cattle breeds (including Angus, Brahman, Gir, Holstein, Jersey, Limousin, Nelore, Romagnola), sequenced to 11-fold...

  20. A Common CYFIP1 Variant at the 15q11.2 Disease Locus Is Associated with Structural Variation at the Language-Related Left Supramarginal Gyrus

    PubMed Central

    Woo, Young Jae; Wang, Tao; Guadalupe, Tulio; Nebel, Rebecca A.; Vino, Arianna; Del Bene, Victor A.; Molholm, Sophie; Ross, Lars A.; Zwiers, Marcel P.; Fisher, Simon E.; Foxe, John J.; Abrahams, Brett S.

    2016-01-01

    Copy number variants (CNVs) at the Breakpoint 1 to Breakpoint 2 region at 15q11.2 (BP1-2) are associated with language-related difficulties and increased risk for developmental disorders in which language is compromised. Towards underlying mechanisms, we investigated relationships between single nucleotide polymorphisms (SNPs) across the region and quantitative measures of human brain structure obtained by magnetic resonance imaging of healthy subjects. We report an association between rs4778298, a common variant at CYFIP1, and inter-individual variation in surface area across the left supramarginal gyrus (lh.SMG), a cortical structure implicated in speech and language in independent discovery (n = 100) and validation cohorts (n = 2621). In silico analyses determined that this same variant, and others nearby, is also associated with differences in levels of CYFIP1 mRNA in human brain. One of these nearby polymorphisms is predicted to disrupt a consensus binding site for FOXP2, a transcription factor implicated in speech and language. Consistent with a model where FOXP2 regulates CYFIP1 levels and in turn influences lh.SMG surface area, analysis of publically available expression data identified a relationship between expression of FOXP2 and CYFIP1 mRNA in human brain. We propose that altered CYFIP1 dosage, through aberrant patterning of the lh.SMG, may contribute to language-related difficulties associated with BP1-2 CNVs. More generally, this approach may be useful in clarifying the contribution of individual genes at CNV risk loci. PMID:27351196

  1. Identification of copy number variants in whole-genome data using Reference Coverage Profiles

    PubMed Central

    Glusman, Gustavo; Severson, Alissa; Dhankani, Varsha; Robinson, Max; Farrah, Terry; Mauldin, Denise E.; Stittrich, Anna B.; Ament, Seth A.; Roach, Jared C.; Brunkow, Mary E.; Bodian, Dale L.; Vockley, Joseph G.; Shmulevich, Ilya; Niederhuber, John E.; Hood, Leroy

    2015-01-01

    The identification of DNA copy numbers from short-read sequencing data remains a challenge for both technical and algorithmic reasons. The raw data for these analyses are measured in tens to hundreds of gigabytes per genome; transmitting, storing, and analyzing such large files is cumbersome, particularly for methods that analyze several samples simultaneously. We developed a very efficient representation of depth of coverage (150–1000× compression) that enables such analyses. Current methods for analyzing variants in whole-genome sequencing (WGS) data frequently miss copy number variants (CNVs), particularly hemizygous deletions in the 1–100 kb range. To fill this gap, we developed a method to identify CNVs in individual genomes, based on comparison to joint profiles pre-computed from a large set of genomes. We analyzed depth of coverage in over 6000 high quality (>40×) genomes. The depth of coverage has strong sequence-specific fluctuations only partially explained by global parameters like %GC. To account for these fluctuations, we constructed multi-genome profiles representing the observed or inferred diploid depth of coverage at each position along the genome. These Reference Coverage Profiles (RCPs) take into account the diverse technologies and pipeline versions used. Normalization of the scaled coverage to the RCP followed by hidden Markov model (HMM) segmentation enables efficient detection of CNVs and large deletions in individual genomes. Use of pre-computed multi-genome coverage profiles improves our ability to analyze each individual genome. We make available RCPs and tools for performing these analyses on personal genomes. We expect the increased sensitivity and specificity for individual genome analysis to be critical for achieving clinical-grade genome interpretation. PMID:25741365

  2. [Advances in congenital vertebral malformation caused by genomic copy number variation].

    PubMed

    Liu, Zhenlei; Wu, Nan; Wu, Zhihong; Zuo, Yuzhi; Qiu, Guixing

    2016-04-01

    Congenital vertebral malformation (CVM) is a congenital vertebral structural deformity caused by abnormal somitogenesis during embryonic development, of which the reason lies in gene mutation or abnormal regulation of the genes that coordinate somitogenesis during embryonic period. ICVAS had proposed a new classification algorithm for CVM, which facilitated exploration for its genetic etiology. Genomic Copy Number Variation (CNV) is a kind of DNA mutation, which is important for human evolution, phenotype polymorphism and diseases. Series of advances have been made on genetic causes of CVM, especially on CVM caused by CNV. CNVs of chromosome 16p11.2, 10q24.31, 17p11.2, 20p11, 22q11.2 and a few other regions are associated with CVM, indicating that gene dosage may play important roles in the development of the spinal cord.

  3. MSMB gene variant alters the association between prostate cancer and number of sexual partners

    PubMed Central

    Stott-Miller, Marni; Wright, Jonathan L.; Stanford, Janet L.

    2014-01-01

    Background Recently, a genetic variant (rs10993994) in the MSMB gene associated with prostate cancer (PCa) risk was shown to correlate with reduced prostate secretory protein of 94 amino acids (PSP94) levels. Although the biological activity of PSP94 is unclear, one of its hypothesized functions is to protect prostatic cells from pathogens. Number of sexual partners and a history of sexually transmitted infections (STIs) have been positively associated with PCa risk, and these associations may be related to pathogen-induced chronic prostatic inflammation. Based on these observations, we investigated whether MSMB genotype modifies the PCa-sexual history association. Methods We estimated odds ratios (OR) and 95% confidence intervals (CI) for the association between number of sexual partners and PCa by fitting logistic regression models, stratified by MSMB genotype, and adjusted for age, family history of PCa, and PCa screening history among 1,239 incident cases and 1,232 controls. Results Compared with 1–4 female sexual partners, men with ≥15 such partners who carried the variant T allele of rs10993994 were at increased risk for PCa (OR=1.32; 95% CI, 1.03–1.71); no association was observed in men with the CC genotype (OR=1.03; 95% CI, 0.73–1.46; p=0.05 for interaction). Similar estimates were observed for total sexual partners (any T allele OR=1.37; 95% CI, 1.07–1.77; CC genotype OR=1.11; 95% CI, 0.79–1.55; p=0.06 for interaction). Conclusions The rs10993994 genotype in the MSMB gene modifies the association between number of sexual partners and PCa risk. These findings support a hypothesized biological mechanism whereby prostatic infection/inflammation may enhance risk of PCa. PMID:24037734

  4. Copy number variation of the APC gene is associated with regulation of bone mineral density☆

    PubMed Central

    Chew, Shelby; Dastani, Zari; Brown, Suzanne J.; Lewis, Joshua R.; Dudbridge, Frank; Soranzo, Nicole; Surdulescu, Gabriela L.; Richards, J. Brent; Spector, Tim D.; Wilson, Scott G.

    2012-01-01

    Introduction Genetic studies of osteoporosis have commonly examined SNPs in candidate genes or whole genome analyses, but insertions and deletions of DNA, collectively called copy number variations (CNVs), also comprise a large amount of the genetic variability between individuals. Previously, SNPs in the APC gene have been strongly associated with femoral neck and lumbar spine volumetric bone mineral density in older men. In addition, familial adenomatous polyposis patients carrying heterozygous mutations in the APC gene have been shown to have significantly higher mean bone mineral density than age- and sex-matched controls suggesting the importance of this gene in regulating bone mineral density. We examined CNV within the APC gene region to test for association with bone mineral density. Methods DNA was extracted from venous blood, genotyped using the Human Hap610 arrays and CNV determined from the fluorescence intensity data in 2070 Caucasian men and women aged 47.0 ± 13.0 (mean ± SD) years, to assess the effects of the CNV on bone mineral density at the forearm, spine and total hip sites. Results Data for covariate adjusted bone mineral density from subjects grouped by APC CNV genotype showed significant difference (P = 0.02–0.002). Subjects with a single copy loss of APC had a 7.95%, 13.10% and 13.36% increase in bone mineral density at the forearm, spine and total hip sites respectively, compared to subjects with two copies of the APC gene. Conclusions These data support previous findings of APC regulating bone mineral density and demonstrate that a novel CNV of the APC gene is significantly associated with bone mineral density in Caucasian men and women. PMID:22884971

  5. Associations of common copy number variants in glutathione S-transferase mu 1 and D-dopachrome tautomerase-like protein genes with risk of schizophrenia in a Japanese population.

    PubMed

    Nakamura, Toru; Ohnuma, Tohru; Hanzawa, Ryo; Takebayashi, Yuto; Takeda, Mayu; Nishimon, Shohei; Sannohe, Takahiro; Katsuta, Narimasa; Higashiyama, Ryoko; Shibata, Nobuto; Arai, Heii

    2015-10-01

    Oxidative-stress, genetic regions of interest (1p13 and 22q11), and common copy number variations (CNVs) may play roles in the pathophysiology of schizophrenia. In the present study, we confirmed associations between schizophrenia and the common CNVs in the glutathione (GSH)-related genes GSTT1, DDTL, and GSTM1 using quantitative real-time polymerase chain reaction analyses of 620 patients with schizophrenia and in 622 controls. No significant differences in GSTT1 copy number distributions were found between patient groups. However, frequencies of characterized CNVs and assumed gain alleles of DDTL and GSTM1 were significantly higher in patients with schizophrenia. In agreement with a previous report, the present data indicate that gains in the CNV alleles DDTL and GSTM1 are genetic risk factors in Japanese patients with schizophrenia, and suggest involvement of micro-inflammation and oxidative stress in the pathophysiology of schizophrenia. © 2015 Wiley Periodicals, Inc.

  6. Assessment of the role of copy-number variants in 150 patients with congenital heart defects.

    PubMed

    Derwińska, Katarzyna; Bartnik, Magdalena; Wiśniowiecka-Kowalnik, Barbara; Jagła, Mateusz; Rudziński, Andrzej; Pietrzyk, Jacek J; Kawalec, Wanda; Ziółkowska, Lidia; Kutkowska-Kaźmierczak, Anna; Gambin, Tomasz; Sykulski, Maciej; Shaw, Chad A; Gambin, Anna; Mazurczak, Tadeusz; Obersztyn, Ewa; Bocian, Ewa; Stankiewicz, Paweł

    2012-01-01

    Congenital heart defects are the most common group of major birth anomalies and one of the leading causes of infant deaths. Mendelian and chromosomal syndromes account for about 20% of congenital heart defects and in some cases are associated with other malformations, intellectual disability, and/or dysmorphic features. The remarkable conservation of genetic pathways regulating heart development in animals suggests that genetic factors can be responsible for a significantly higher percentage of cases. Assessment of the role of CNVs in the etiology of congenital heart defects using microarray studies. Genome-wide array comparative genomic hybridization, targeting genes known to play an important role in heart development or responsible for abnormal cardiac phenotype was used in the study on 150 patients. In addition, we have used multiplex ligation-dependent probe amplification specific for chromosome 22q11.2 region. We have identified 21 copy-number variants, including 13 known causative recurrent rearrangements (12 deletions 22q11.2 and one deletion 7q11.23), three potentially pathogenic duplications (5q14.2, 15q13.3, and 22q11.2), and five variants likely benign for cardiac anomalies. We suggest that abnormal copy-number of the ARRDC3 and KLF13 genes can be responsible for heart defects. Our study demonstrates that array comparative genomic hybridization enables detection of clinically significant chromosomal imbalances in patients with congenital heart defects.

  7. Pervasive gene content variation and copy number variation in maize and its undomesticated progenitor

    USDA-ARS?s Scientific Manuscript database

    Different individuals of the same species are generally thought to have very similar genomes. However, there is growing evidence that structural variation in the form of copy number variation (CNV) and presence-absence variation (PAV) can lead to variation in the genome content of individuals withi...

  8. Genome-wide high-resolution screening in Dupuytren's disease reveals common regions of DNA copy number alterations.

    PubMed

    Shih, Barbara B; Tassabehji, May; Watson, James S; McGrouther, Angus D; Bayat, Ardeshir

    2010-07-01

    Dupuytren's disease (DD) is a familial disorder with a high genetic susceptibility in white people; however, its etiopathogenesis remains unknown. Previous comparative genomic hybridization studies using lower-resolution, 44-k oligonucleotide-based arrays revealed no copy number variation (CNV) changes in DD. In this study, we used a higher-resolution genome-wide screening (next-generation microarrays) comprising 963,331 human sequences (3 kb spacing between probes) for whole genome DNA variation analysis. The objective was to detect cryptic chromosomal imbalances in DD. Agilent SurePrint G3 microarrays, one million format (Agilent Technologies, Santa Clara, CA), were used to detect CNV regions (CNVRs) in DNA extracted from nodules of 4 white men with DD (age, 69 +/- 4 y). Reference samples were from the DNA of 10 men who served as control patients. Copy number variations that were common to greater than 3 assessed DD individuals (p < .05) were selected as candidate loci for DD etiology. In addition, quantitative polymerase chain reactions (qPCR) assays were designed for selected CNVRs on DNA from 13 DD patients and 11 control patients. Independent t-tests and Fisher's exact tests were carried out for statistical analysis. Three novel CNVs previously unreported in the phenotypically normal population were detected in 3 DD cases, located at 10q22, 16p12.1, and 17p12. Nine polymorphic CNVRs potentially associated with DD were determined using our strategic selection criteria, locating to chromosomes 1q31, 6p21, 7p14, 8p11, 12p13, 14q11, 17q21 and 20p13. More than 3 of the DD cases tested had a CNVR located to a small region on 6p21 and 4 CNVRs within 6p21-22 of the human leukocyte antigen (HLA) genes. Three novel copy number alterations were observed in 3 unrelated patients with sporadic (no known family history) DD. Nine polymorphic CNVRs were found to be common among the DD cases. These variants might contain genes involved in DD formation, indicating that

  9. Autistic-like behavioral phenotypes in a mouse model with copy number variation of the CAPS2/CADPS2 gene.

    PubMed

    Sadakata, Tetsushi; Shinoda, Yo; Oka, Megumi; Sekine, Yukiko; Furuichi, Teiichi

    2013-01-04

    Ca²⁺-dependent activator protein for secretion 2 (CAPS2 or CADPS2) facilitates secretion and trafficking of dense-core vesicles. Recent genome-wide association studies of autism have identified several microdeletions due to copy number variation (CNV) in one of the chromosome 7q31.32 alleles on which the locus for CAPS2 is located in autistic patients. To evaluate the biological significance of reducing CAPS2 copy number, we analyzed CAPS2 heterozygous mice. Our present findings suggest that adequate levels of CAPS2 protein are critical for normal brain development and behavior, and that allelic changes due to CNV may contribute to autistic symptoms in combination with deficits in other autism-associated genes. Copyright © 2012 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.

  10. Copy Number Variation Affecting the Photoperiod-B1 and Vernalization-A1 Genes Is Associated with Altered Flowering Time in Wheat (Triticum aestivum)

    PubMed Central

    Isaac, Peter; Laurie, David A.

    2012-01-01

    The timing of flowering during the year is an important adaptive character affecting reproductive success in plants and is critical to crop yield. Flowering time has been extensively manipulated in crops such as wheat (Triticum aestivum L.) during domestication, and this enables them to grow productively in a wide range of environments. Several major genes controlling flowering time have been identified in wheat with mutant alleles having sequence changes such as insertions, deletions or point mutations. We investigated genetic variants in commercial varieties of wheat that regulate flowering by altering photoperiod response (Ppd-B1 alleles) or vernalization requirement (Vrn-A1 alleles) and for which no candidate mutation was found within the gene sequence. Genetic and genomic approaches showed that in both cases alleles conferring altered flowering time had an increased copy number of the gene and altered gene expression. Alleles with an increased copy number of Ppd-B1 confer an early flowering day neutral phenotype and have arisen independently at least twice. Plants with an increased copy number of Vrn-A1 have an increased requirement for vernalization so that longer periods of cold are required to potentiate flowering. The results suggest that copy number variation (CNV) plays a significant role in wheat adaptation. PMID:22457747

  11. Variants of cellobiohydrolases

    DOEpatents

    Bott, Richard R.; Foukaraki, Maria; Hommes, Ronaldus Wilhelmus; Kaper, Thijs; Kelemen, Bradley R.; Kralj, Slavko; Nikolaev, Igor; Sandgren, Mats; Van Lieshout, Johannes Franciscus Thomas; Van Stigt Thans, Sander

    2018-04-10

    Disclosed are a number of homologs and variants of Hypocrea jecorina Ce17A (formerly Trichoderma reesei cellobiohydrolase I or CBH1), nucleic acids encoding the same and methods for producing the same. The homologs and variant cellulases have the amino acid sequence of a glycosyl hydrolase of family 7A wherein one or more amino acid residues are substituted and/or deleted.

  12. Redundancies in "H" Index Variants and the Proposal of the Number of Top-Cited Papers as an Attractive Indicator

    ERIC Educational Resources Information Center

    Bornmann, Lutz

    2012-01-01

    Ruscio, Seaman, D'Oriano, Stremlo, and Mahalchik (this issue) evaluate 22 bibliometric indicators, including conventional measures, like the number of publications, the "h" index, and many "h" index variants. To assess the quality of the indicators, their well-justified criteria encompass conceptual, empirical, and practical…

  13. Genome Reduction Uncovers a Large Dispensable Genome and Adaptive Role for Copy Number Variation in Asexually Propagated Solanum tuberosum[OPEN

    PubMed Central

    Hardigan, Michael A.; Crisovan, Emily; Hamilton, John P.; Laimbeer, Parker; Leisner, Courtney P.; Manrique-Carpintero, Norma C.; Newton, Linsey; Pham, Gina M.; Vaillancourt, Brieanne; Zeng, Zixian; Jiang, Jiming

    2016-01-01

    Clonally reproducing plants have the potential to bear a significantly greater mutational load than sexually reproducing species. To investigate this possibility, we examined the breadth of genome-wide structural variation in a panel of monoploid/doubled monoploid clones generated from native populations of diploid potato (Solanum tuberosum), a highly heterozygous asexually propagated plant. As rare instances of purely homozygous clones, they provided an ideal set for determining the degree of structural variation tolerated by this species and deriving its minimal gene complement. Extensive copy number variation (CNV) was uncovered, impacting 219.8 Mb (30.2%) of the potato genome with nearly 30% of genes subject to at least partial duplication or deletion, revealing the highly heterogeneous nature of the potato genome. Dispensable genes (>7000) were associated with limited transcription and/or a recent evolutionary history, with lower deletion frequency observed in genes conserved across angiosperms. Association of CNV with plant adaptation was highlighted by enrichment in gene clusters encoding functions for environmental stress response, with gene duplication playing a part in species-specific expansions of stress-related gene families. This study revealed unique impacts of CNV in a species with asexual reproductive habits and how CNV may drive adaption through evolution of key stress pathways. PMID:26772996

  14. [Clinical value of genome-wide high resolution chromosomal microarray analysis in etiological study of fetuses with congenital heart defects].

    PubMed

    Wu, Xiaoli; Fu, Fang; Li, Ru; Pan, Min; Han, Jin; Zhen, Li; Yang, Xin; Zhang, Yongling; Li, Fatao; Liao, Can

    2014-12-01

    To explore the clinical value of genome-wide high resolution chromosomal microarray analysis (CMA) in etiological study of fetuses with congenital heart disease (CHD) diagnosed by fetal echocardiography. A total of 176 fetuses diagnosed CHD by fetal echocardiography were analyzed, and invasive prenatal diagnosis was performed at Guangzhou Women and Children's Medical Center from January 2012 to January 2014. Among them, 158 fetuses were proved to have normal karyotype, and 88 fetuses (50.0%, 88/176) underwent CMA testing. The parental blood specimens were also collected for assisting the diagnosis of variants of uncertain clinical significance (VOUS). The 88 fetuses were divided into two groups: isolated CHD (n = 68) and CHD with extra-cardiac structural abnormalities (n = 20). The phenotypes of the two groups were subclassified. Copy number variations (CNV) were classified as benign CNV, pathogenic CNV (pCNV) or VOUS. (1) 58 fetuses (66%, 58/88) were with simple CHD and 30 fetuses were with complicated CHD (34%, 30/88). In the 45 fetuses with isolated and simple CHD, the pCNV detection rate was 11% (5/45). In the 23 fetuses with isolated and complicated CHD, the pCNV detection rate was 17% (4/23). In the 13 fetuses with simple CHD and extra-cardiac structural abnormalities, the pCNV detection rate was 5/13. In the 7 fetuses with complicated CHD and extra-cardiac structural abnormalities, the pCNV detection rate was 0. (2) The total detection rate for pCNV detection was 16% (14/88) in the 88 fetuses. The pCNV detection rates for isolated CHD and CHD with extra-cardiac structural abnormalities were 13% (9/68) and 25% (5/20), respectively (P > 0.05). The pCNV detection rates for simple and complicated CHD were 17% (10/58) and 13% (4/30), respectively (P > 0.05). (3) Eighteen fetuses (10.2%, 18/176) had abnormal karyotype results. (4) CMA test was performed in 88 fetuses. CNV detected in 8 fetuses were classified as VOUS initially. After parental microarray analysis

  15. β-Defensin genomic copy number does not influence the age of onset in Huntington's Disease.

    PubMed

    Vittori, Angelica; Orth, Michael; Roos, Raymund A C; Outeiro, Tiago F; Giorgini, Flaviano; Hollox, Edward J

    2013-01-01

    Huntington's disease (HD) is an autosomal dominant neurodegenerative disorder caused by the abnormal expansion of a CAG triplet repeat tract in the huntingtin gene. While the length of this CAG expansion is the major determinant of the age of onset (AO), other genetic factors have also been shown to play a modulatory role. Recent evidence suggests that neuroinflammations is a pivotal factor in the pathogenesis of HD, and that targeting this process may have important therapeutic ramifications. The human β-defensin 2 (hBD2)- encoded by DEFB4- is an antimicrobial peptide that exhibits inducible expression in astrocytes during inflammation and is an important regulator of innate and adaptive immune response. Therefore, DEFB4 may contribute to the neuroinflammatory processes observed in HD. In this study we tested the hypothesis that copy number variation (CNV) of the β-defensin region, including DEFB4, modifies the AO in HD. We genotyped β-defensin CNV in 490 HD individuals using the paralogue ratio test and found no association between β-defensin CNV and onset of HD. We conclude that it is unlikely that DEFB4 plays a role in HD pathogenesis.

  16. Genomic copy number variations in three Southeast Asian populations.

    PubMed

    Ku, Chee-Seng; Pawitan, Yudi; Sim, Xueling; Ong, Rick T H; Seielstad, Mark; Lee, Edmund J D; Teo, Yik-Ying; Chia, Kee-Seng; Salim, Agus

    2010-07-01

    Research on the role of copy number variations (CNVs) in the genetic risk of diseases in Asian populations has been hampered by a relative lack of reference CNV maps for Asian populations outside the East Asians. In this article, we report the population characteristics of CNVs in Chinese, Malay, and Asian Indian populations in Singapore. Using the Illumina Human 1M Beadchip array, we identify 1,174 CNV loci in these populations that corroborated with findings when the same samples were typed on the Affymetrix 6.0 platform. We identify 441 novel loci not previously reported in the Database of Genomic Variations (DGV). We observe a considerable number of loci that span all three populations and were previously unreported, as well as population-specific loci that are quite common in the respective populations. From this we observe the distribution of CNVs in the Asian Indian population to be considerably different from the Chinese and Malay populations. About half of the deletion loci and three-quarters of duplication loci overlap UCSC genes. Tens of loci show population differentiation and overlap with genes previously known to be associated with genetic risk of diseases. One of these loci is the CYP2A6 deletion, previously linked to reduced susceptibility to lung cancer. (c) 2010 Wiley-Liss, Inc.

  17. Complex Copy Number Variation of AMY1 does not Associate with Obesity in two East Asian Cohorts.

    PubMed

    Yong, Rita Y Y; Mustaffa, Su'Aidah B; Wasan, Pavandip S; Sheng, Liang; Marshall, Christian R; Scherer, Stephen W; Teo, Yik-Ying; Yap, Eric P H

    2016-07-01

    The human amylase gene locus at chromosome 1p21.1 is structurally complex. This region contains two pancreatic amylase genes, AMY2B, AMY2A, and a salivary gene AMY1. The AMY1 gene harbors extensive copy number variation (CNV), and recent studies have implicated this variation in adaptation to starch-rich diets and in association to obesity for European and Asian populations. In this study, we showed that by combining quantitative PCR and digital PCR, coupled with careful experimental design and calibration, we can improve the resolution of genotyping CNV with high copy numbers (CNs). In two East Asian populations of Chinese and Malay ethnicity studied, we observed a unique non-normal distribution of AMY1 diploid CN genotypes with even:odd CNs ratio of 4.5 (3.3-4.7), and an association between the common AMY2A CN = 2 genotype and odd CNs of AMY1, that could be explained by the underlying haplotypic structure. In two further case-control cohorts (n = 932 and 145, for Chinese and Malays, respectively), we did not observe the previously reported association between AMY1 and obesity or body mass index. Improved methods for accurately genotyping multiallelic CNV loci and understanding the haplotype complexity at the AMY1 locus are necessary for population genetics and association studies. © 2016 WILEY PERIODICALS, INC.

  18. Rare variants and cardiovascular disease.

    PubMed

    Wain, Louise V

    2014-09-01

    Cardiovascular disease (CVD) is a leading cause of mortality and morbidity in the Western world. Large genome-wide association studies (GWASs) of coronary artery disease, myocardial infarction, stroke and dilated cardiomyopathy have identified a number of common genetic variants with modest effects on disease risk. Similarly, studies of important modifiable risk factors of CVD have identified a large number of predominantly common variant associations, for example, with blood pressure and blood lipid levels. In each case, despite the often large numbers of loci identified, only a small proportion of the phenotypic variance is explained. It has been hypothesised that rare variants with large effects may account for some of the missing variance but large-scale studies of rare variation are in their infancy for cardiovascular traits and have yet to produce fruitful results. Studies of monogenic CVDs, inherited disorders believed to be entirely driven by individual rare mutations, have highlighted genes that play a key role in disease aetiology. In this review, we discuss how findings from studies of rare variants in monogenic disease and GWAS of predominantly common variants are converging to provide further insight into biological disease mechanisms. © The Author 2014. Published by Oxford University Press. All rights reserved. For permissions, please email: journals.permissions@oup.com.

  19. Whole Exome Sequencing of Pediatric Gastric Adenocarcinoma Reveals an Atypical Presentation of Li-Fraumeni Syndrome

    PubMed Central

    Chang, Vivian Y.; Federman, Noah; Martinez-Agosto, Julian; Tatishchev, Sergei F.; Nelson, Stanley F.

    2014-01-01

    Background Gastric adenocarcinoma is a rare diagnosis in childhood. A 14-year old male patient presented with metastatic gastric adenocarcinoma, and a strong family history of colon cancer. Clinical sequencing of CDH1 and APC were negative. Whole exome sequencing was therefore applied to capture the majority of protein-coding regions for the identification of single-nucleotide variants, small insertion/deletions, and copy number abnormalities in the patient’s germline as well as primary tumor. Materials and Methods DNA was extracted from the patient’s blood, primary tumor, and the unaffected mother’s blood. DNA libraries were constructed and sequenced on Illumina HiSeq2000. Data were post-processed using Picard and Samtools, then analyzed with the Genome Analysis Toolkit. Variants were annotated using an in-house Ensembl-based program. Copy number was assessed using ExomeCNV. Results Each sample was sequenced to a mean depth of coverage of greater than 120×. A rare non-synonymous coding SNV in TP53 was identified in the germline. There were 10 somatic cancer protein-damaging variants that were not observed in the unaffected mother genome. ExomeCNV comparing tumor to the patient’s germline, identified abnormal copy number, spanning 6,946 genes. Conclusion We present an unusual case of Li-Fraumeni detected by whole exome sequencing. There were also likely driver somatic mutations in the gastric adenocarcinoma. These results highlight the need for more thorough and broad scale germline and cancer analyses to accurately inform patients of inherited risk to cancer and to identify somatic mutations. PMID:23015295

  20. Characterization of the Copy Number and Variants of Deformed Wing Virus (DWV) in the Pairs of Honey Bee Pupa and Infesting Varroa destructor or Tropilaelaps mercedesae.

    PubMed

    Wu, Yunfei; Dong, Xiaofeng; Kadowaki, Tatsuhiko

    2017-01-01

    Recent honey bee colony losses, particularly during the winter, have been shown to be associated with the presence of both ectoparasitic mites and Deformed Wing Virus (DWV). Whilst the role of Varroa destructor mites as a viral vector is well established, the role of Tropilaelaps mercedesae mites in viral transmission has not been fully investigated. In this study, we tested the effects that V. destructor and T. mercedesae infestation have on fluctuation of the DWV copy number and alteration of the virus variants in honey bees by characterizing individual pupae and their infesting mites. We observed that both mite species were associated with increased viral copy number in honey bee pupae. We found a positive correlation between DWV copy number in pupae and copy number in infesting mites, and the same DWV type A variant was present in either low or high copy number in both honey bee pupae and infesting V. destructor . These data also suggest that variant diversity is similar between honey bee pupae and the mites that infest them. These results support a previously proposed hypothesis that DWV suppresses the honey bee immune system when virus copy number reaches a specific threshold, promoting greater replication.

  1. Identification of copy number variations and translocations in cancer cells from Hi-C data.

    PubMed

    Chakraborty, Abhijit; Ay, Ferhat

    2017-10-18

    Eukaryotic chromosomes adapt a complex and highly dynamic three-dimensional (3D) structure, which profoundly affects different cellular functions and outcomes including changes in epigenetic landscape and in gene expression. Making the scenario even more complex, cancer cells harbor chromosomal abnormalities (e.g., copy number variations (CNVs) and translocations) altering their genomes both at the sequence level and at the level of 3D organization. High-throughput chromosome conformation capture techniques (e.g., Hi-C), which are originally developed for decoding the 3D structure of the chromatin, provide a great opportunity to simultaneously identify the locations of genomic rearrangements and to investigate the 3D genome organization in cancer cells. Even though Hi-C data has been used for validating known rearrangements, computational methods that can distinguish rearrangement signals from the inherent biases of Hi-C data and from the actual 3D conformation of chromatin, and can precisely detect rearrangement locations de novo have been missing. In this work, we characterize how intra and inter-chromosomal Hi-C contacts are distributed for normal and rearranged chromosomes to devise a new set of algorithms (i) to identify genomic segments that correspond to CNV regions such as amplifications and deletions (HiCnv), (Nurtdinov et al.) to call inter-chromosomal translocations and their boundaries (HiCtrans) from Hi-C experiments, and (iii) to simulate Hi-C data from genomes with desired rearrangements and abnormalities (AveSim) in order to select optimal parameters for and to benchmark the accuracy of our methods. Our results on 10 different cancer cell lines with Hi-C data show that we identify a total number of 105 amplifications and 45 deletions together with 90 translocations, whereas we identify virtually no such events for two karyotypically normal cell lines. Our CNV predictions correlate very well with whole genome sequencing (WGS) data among chromosomes

  2. Copy Number Variation of TLR-7 Gene and its Association with the Development of Systemic Lupus Erythematosus in Female Patients from Yucatan Mexico

    PubMed Central

    Pacheco, Guillermo Valencia; Cruz, Darig Cámara; González Herrera, Lizbeth J; Pérez Mendoza, Gerardo J; Adrián Amaro, Guadalupe I; Nakazawa Ueji, Yumi E; Angulo Ramírez, Angélica V

    2014-01-01

    Systemic lupus erythematosus (SLE) is a systemic autoimmune disease characterized by the production of autoantibodies against self-antigens, which occurs most often in women between 15 and 40 years of age. The innate immunity is involved in the pathogenesis of SLE through TLR- 7. Genetic factors such as copy number variation (CNV) of target genes may contribute to disease development, but this possible risk has not yet been studied in SLE patients from Yucatan, Mexico. The CNV of TLR-7 gene was determined by quantitative polymerase chain reaction assay using TaqMan probes in 80 SLE women and 150 control subjects. The results showed that 10% of SLE patients exhibited more than two copies of TLR-7 gene, whereas no mRNA overexpression was detected. These data suggested that increased CNV of the TLR-7 gene in Yucatan SLE women can be a risk factor for this disease. PMID:25512712

  3. Modeling read counts for CNV detection in exome sequencing data.

    PubMed

    Love, Michael I; Myšičková, Alena; Sun, Ruping; Kalscheuer, Vera; Vingron, Martin; Haas, Stefan A

    2011-11-08

    Varying depth of high-throughput sequencing reads along a chromosome makes it possible to observe copy number variants (CNVs) in a sample relative to a reference. In exome and other targeted sequencing projects, technical factors increase variation in read depth while reducing the number of observed locations, adding difficulty to the problem of identifying CNVs. We present a hidden Markov model for detecting CNVs from raw read count data, using background read depth from a control set as well as other positional covariates such as GC-content. The model, exomeCopy, is applied to a large chromosome X exome sequencing project identifying a list of large unique CNVs. CNVs predicted by the model and experimentally validated are then recovered using a cross-platform control set from publicly available exome sequencing data. Simulations show high sensitivity for detecting heterozygous and homozygous CNVs, outperforming normalization and state-of-the-art segmentation methods.

  4. The joint effect of air pollution exposure and copy number variation on risk for autism.

    PubMed

    Kim, Dokyoon; Volk, Heather; Girirajan, Santhosh; Pendergrass, Sarah; Hall, Molly A; Verma, Shefali S; Schmidt, Rebecca J; Hansen, Robin L; Ghosh, Debashis; Ludena-Rodriguez, Yunin; Kim, Kyoungmi; Ritchie, Marylyn D; Hertz-Picciotto, Irva; Selleck, Scott B

    2017-09-01

    Autism spectrum disorder is a complex trait with a high degree of heritability as well as documented susceptibility from environmental factors. In this study the contributions of copy number variation, exposure to air pollutants, and the interaction between the two on autism risk, were evaluated in the population-based case-control Childhood Autism Risks from Genetics and Environment (CHARGE) Study. For the current investigation, we included only those CHARGE children (a) who met criteria for autism or typical development and (b) for whom our team had conducted both genetic evaluation of copy number burden and determination of environmental air pollution exposures based on mapping addresses from the pregnancy and early childhood. This sample consisted of 158 cases of children with autism and 147 controls with typical development. Multiple logistic regression models were fit with and without environmental variable-copy number burden interactions. We found no correlation between average air pollution exposure from conception to age 2 years and the child's CNV burden. We found a significant interaction in which a 1SD increase in duplication burden combined with a 1SD increase in ozone exposure was associated with an elevated autism risk (OR 3.4, P < 0.005) much greater than the increased risks associated with either genomic duplication (OR 1.85, 95% CI 1.25-2.73) or ozone (OR 1.20, 95% CI 0.93-1.54) alone. Similar results were obtained when CNV and ozone were dichotomized to compare those in the top quartile relative to those having a smaller CNV burden and lower exposure to ozone, and when exposures were assessed separately for pregnancy, the first year of life, and the second year of life. No interactions were observed for other air pollutants, even those that demonstrated main effects; ozone tends to be negatively correlated with the other pollutants examined. While earlier work has demonstrated interactions between the presence of a pathogenic CNV and an

  5. Identification of copy number variations associated with congenital heart disease by chromosomal microarray analysis and next-generation sequencing.

    PubMed

    Zhu, Xiangyu; Li, Jie; Ru, Tong; Wang, Yaping; Xu, Yan; Yang, Ying; Wu, Xing; Cram, David S; Hu, Yali

    2016-04-01

    To determine the type and frequency of pathogenic chromosomal abnormalities in fetuses diagnosed with congenital heart disease (CHD) using chromosomal microarray analysis (CMA) and validate next-generation sequencing as an alternative diagnostic method. Chromosomal aneuploidies and submicroscopic copy number variations (CNVs) were identified in amniocytes DNA samples from CHD fetuses using high-resolution CMA and copy number variation sequencing (CNV-Seq). Overall, 21 of 115 CHD fetuses (18.3%) referred for CMA had a pathogenic chromosomal anomaly. In six of 73 fetuses (8.2%) with an isolated CHD, CMA identified two cases of DiGeorge syndrome, and one case each of 1q21.1 microdeletion, 16p11.2 microdeletion and Angelman/Prader Willi syndromes, and 22q11.21 microduplication syndrome. In 12 of 42 fetuses (28.6%) with CHD and additional structural abnormalities, CMA identified eight whole or partial trisomies (19.0%), five CNVs (11.9%) associated with DiGeorge, Wolf-Hirschhorn, Miller-Dieker, Cri du Chat and Blepharophimosis, Ptosis, and Epicanthus Inversus syndromes and four other rare pathogenic CNVs (9.5%). Overall, there was a 100% diagnostic concordance between CMA and CNV-Seq for detecting all 21 pathogenic chromosomal abnormalities associated with CHD. CMA and CNV-Seq are reliable and accurate prenatal techniques for identifying pathogenic fetal chromosomal abnormalities associated with cardiac defects. © 2016 John Wiley & Sons, Ltd. © 2016 John Wiley & Sons, Ltd.

  6. Genome-wide copy number variant analysis for congenital ventricular septal defects in Chinese Han population.

    PubMed

    An, Yu; Duan, Wenyuan; Huang, Guoying; Chen, Xiaoli; Li, Li; Nie, Chenxia; Hou, Jia; Gui, Yonghao; Wu, Yiming; Zhang, Feng; Shen, Yiping; Wu, Bailin; Wang, Hongyan

    2016-01-08

    Ventricular septal defects (VSDs) constitute the most prevalent congenital heart disease (CHD), occurs either in isolation (isolated VSD) or in combination with other cardiac defects (complex VSD). Copy number variation (CNV) has been highlighted as a possible contributing factor to the etiology of many congenital diseases. However, little is known concerning the involvement of CNVs in either isolated or complex VSDs. We analyzed 154 unrelated Chinese individuals with VSD by chromosomal microarray analysis. The subjects were recruited from four hospitals across China. Each case underwent clinical assessment to define the type of VSD, either isolated or complex VSD. CNVs detected were categorized into syndrom related CNVs, recurrent CNVs and rare CNVs. Genes encompassed by the CNVs were analyzed using enrichment and pathway analysis. Among 154 probands, we identified 29 rare CNVs in 26 VSD patients (16.9 %, 26/154) and 8 syndrome-related CNVs in 8 VSD patients (5.2 %, 8/154). 12 of the detected 29 rare CNVs (41.3 %) were recurrently reported in DECIPHER or ISCA database as associated with either VSD or general heart disease. Fifteen genes (5 %, 15/285) within CNVs were associated with a broad spectrum of complicated CHD. Among these15 genes, 7 genes were in "abnormal interventricular septum morphology" derived from the MGI (mouse genome informatics) database, and nine genes were associated with cardiovascular system development (GO:0072538).We also found that these VSD-related candidate genes are enriched in chromatin binding and transcription regulation, which are the biological processes underlying heart development. Our study demonstrates the potential clinical diagnostic utility of genomic imbalance profiling in VSD patients. Additionally, gene enrichment and pathway analysis helped us to implicate VSD related candidate genes.

  7. DNA Methylation Patterns in Normal Tissue Correlate more Strongly with Breast Cancer Status than Copy-Number Variants.

    PubMed

    Gao, Yang; Widschwendter, Martin; Teschendorff, Andrew E

    2018-05-04

    Normal tissue at risk of neoplastic transformation is characterized by somatic mutations, copy-number variation and DNA methylation changes. It is unclear however, which type of alteration may be more informative of cancer risk. We analyzed genome-wide DNA methylation and copy-number calls from the same DNA assay in a cohort of healthy breast samples and age-matched normal samples collected adjacent to breast cancer. Using statistical methods to adjust for cell type heterogeneity, we show that DNA methylation changes can discriminate normal-adjacent from normal samples better than somatic copy-number variants. We validate this important finding in an independent dataset. These results suggest that DNA methylation alterations in the normal cell of origin may offer better cancer risk prediction and early detection markers than copy-number changes. Copyright © 2018. Published by Elsevier B.V.

  8. Global characterization of copy number variants in epilepsy patients from whole genome sequencing

    PubMed Central

    Meloche, Caroline; Andrade, Danielle M.; Lafreniere, Ron G.; Gravel, Micheline; Spiegelman, Dan; Dionne-Laporte, Alexandre; Boelman, Cyrus; Hamdan, Fadi F.; Michaud, Jacques L.; Rouleau, Guy; Minassian, Berge A.; Bourque, Guillaume; Cossette, Patrick

    2018-01-01

    Epilepsy will affect nearly 3% of people at some point during their lifetime. Previous copy number variants (CNVs) studies of epilepsy have used array-based technology and were restricted to the detection of large or exonic events. In contrast, whole-genome sequencing (WGS) has the potential to more comprehensively profile CNVs but existing analytic methods suffer from limited accuracy. We show that this is in part due to the non-uniformity of read coverage, even after intra-sample normalization. To improve on this, we developed PopSV, an algorithm that uses multiple samples to control for technical variation and enables the robust detection of CNVs. Using WGS and PopSV, we performed a comprehensive characterization of CNVs in 198 individuals affected with epilepsy and 301 controls. For both large and small variants, we found an enrichment of rare exonic events in epilepsy patients, especially in genes with predicted loss-of-function intolerance. Notably, this genome-wide survey also revealed an enrichment of rare non-coding CNVs near previously known epilepsy genes. This enrichment was strongest for non-coding CNVs located within 100 Kbp of an epilepsy gene and in regions associated with changes in the gene expression, such as expression QTLs or DNase I hypersensitive sites. Finally, we report on 21 potentially damaging events that could be associated with known or new candidate epilepsy genes. Our results suggest that comprehensive sequence-based profiling of CNVs could help explain a larger fraction of epilepsy cases. PMID:29649218

  9. Characterization of the Copy Number and Variants of Deformed Wing Virus (DWV) in the Pairs of Honey Bee Pupa and Infesting Varroa destructor or Tropilaelaps mercedesae

    PubMed Central

    Wu, Yunfei; Dong, Xiaofeng; Kadowaki, Tatsuhiko

    2017-01-01

    Recent honey bee colony losses, particularly during the winter, have been shown to be associated with the presence of both ectoparasitic mites and Deformed Wing Virus (DWV). Whilst the role of Varroa destructor mites as a viral vector is well established, the role of Tropilaelaps mercedesae mites in viral transmission has not been fully investigated. In this study, we tested the effects that V. destructor and T. mercedesae infestation have on fluctuation of the DWV copy number and alteration of the virus variants in honey bees by characterizing individual pupae and their infesting mites. We observed that both mite species were associated with increased viral copy number in honey bee pupae. We found a positive correlation between DWV copy number in pupae and copy number in infesting mites, and the same DWV type A variant was present in either low or high copy number in both honey bee pupae and infesting V. destructor. These data also suggest that variant diversity is similar between honey bee pupae and the mites that infest them. These results support a previously proposed hypothesis that DWV suppresses the honey bee immune system when virus copy number reaches a specific threshold, promoting greater replication. PMID:28878743

  10. MixHMM: Inferring Copy Number Variation and Allelic Imbalance Using SNP Arrays and Tumor Samples Mixed with Stromal Cells

    PubMed Central

    Schulz, Vincent; Chen, Min; Tuck, David

    2010-01-01

    Background Genotyping platforms such as single nucleotide polymorphism (SNP) arrays are powerful tools to study genomic aberrations in cancer samples. Allele specific information from SNP arrays provides valuable information for interpreting copy number variation (CNV) and allelic imbalance including loss-of-heterozygosity (LOH) beyond that obtained from the total DNA signal available from array comparative genomic hybridization (aCGH) platforms. Several algorithms based on hidden Markov models (HMMs) have been designed to detect copy number changes and copy-neutral LOH making use of the allele information on SNP arrays. However heterogeneity in clinical samples, due to stromal contamination and somatic alterations, complicates analysis and interpretation of these data. Methods We have developed MixHMM, a novel hidden Markov model using hidden states based on chromosomal structural aberrations. MixHMM allows CNV detection for copy numbers up to 7 and allows more complete and accurate description of other forms of allelic imbalance, such as increased copy number LOH or imbalanced amplifications. MixHMM also incorporates a novel sample mixing model that allows detection of tumor CNV events in heterogeneous tumor samples, where cancer cells are mixed with a proportion of stromal cells. Conclusions We validate MixHMM and demonstrate its advantages with simulated samples, clinical tumor samples and a dilution series of mixed samples. We have shown that the CNVs of cancer cells in a tumor sample contaminated with up to 80% of stromal cells can be detected accurately using Illumina BeadChip and MixHMM. Availability The MixHMM is available as a Python package provided with some other useful tools at http://genecube.med.yale.edu:8080/MixHMM. PMID:20532221

  11. Concerted copy number variation balances ribosomal DNA dosage in human and mouse genomes

    PubMed Central

    Gibbons, John G.; Branco, Alan T.; Godinho, Susana A.; Yu, Shoukai; Lemos, Bernardo

    2015-01-01

    Tandemly repeated ribosomal DNA (rDNA) arrays are among the most evolutionary dynamic loci of eukaryotic genomes. The loci code for essential cellular components, yet exhibit extensive copy number (CN) variation within and between species. CN might be partly determined by the requirement of dosage balance between the 5S and 45S rDNA arrays. The arrays are nonhomologous, physically unlinked in mammals, and encode functionally interdependent RNA components of the ribosome. Here we show that the 5S and 45S rDNA arrays exhibit concerted CN variation (cCNV). Despite 5S and 45S rDNA elements residing on different chromosomes and lacking sequence similarity, cCNV between these loci is strong, evolutionarily conserved in humans and mice, and manifested across individual genotypes in natural populations and pedigrees. Finally, we observe that bisphenol A induces rapid and parallel modulation of 5S and 45S rDNA CN. Our observations reveal a novel mode of genome variation, indicate that natural selection contributed to the evolution and conservation of cCNV, and support the hypothesis that 5S CN is partly determined by the requirement of dosage balance with the 45S rDNA array. We suggest that human disease variation might be traced to disrupted rDNA dosage balance in the genome. PMID:25583482

  12. The contribution of de novo and rare inherited copy number changes to congenital heart disease in an unselected sample of children with conotruncal defects or hypoplastic left heart disease

    PubMed Central

    Ronemus, Michael; Kline, Jennie; Jobanputra, Vaidehi; Williams, Ismee; Anyane-Yeboa, Kwame; Chung, Wendy; Yu, Lan; Wong, Nancy; Awad, Danielle; Yu, Chih-yu; Leotta, Anthony; Kendall, Jude; Yamrom, Boris; Lee, Yoon-ha; Wigler, Michael; Levy, Dan

    2013-01-01

    Congenital heart disease (CHD) is the most common congenital malformation, with evidence of a strong genetic component. We analyzed data from 223 consecutively ascertained families, each consisting of at least one child affected by a conotruncal defect (CNT) or hypoplastic left heart disease (HLHS) and both parents. The NimbleGen HD2-2.1 comparative genomic hybridization platform was used to identify de novo and rare inherited copy number variants (CNVs). Excluding 10 cases with 22q11.2 DiGeorge deletions, we validated de novo CNVs in 8 % of 148 probands with CNTs, 12.7 % of 71 probands with HLHS and none in 4 probands with both. Only 2 % of control families showed a de novo CNV. We also identified a group of ultra-rare inherited CNVs that occurred de novo in our sample, contained a candidate gene for CHD, recurred in our sample or were present in an affected sibling. We confirmed the contribution to CHD of copy number changes in genes such as GATA4 and NODAL and identified several genes in novel recurrent CNVs that may point to novel CHD candidate loci. We also found CNVs previously associated with highly variable pheno-types and reduced penetrance, such as dup 1q21.1, dup 16p13.11, dup 15q11.2-13, dup 22q11.2, and del 2q23.1. We found that the presence of extra-cardiac anomalies was not related to the frequency of CNVs, and that there was no significant difference in CNV frequency or specificity between the probands with CNT and HLHS. In agreement with other series, we identified likely causal CNVs in 5.6 % of our total sample, half of which were de novo. PMID:23979609

  13. The contribution of de novo and rare inherited copy number changes to congenital heart disease in an unselected sample of children with conotruncal defects or hypoplastic left heart disease.

    PubMed

    Warburton, Dorothy; Ronemus, Michael; Kline, Jennie; Jobanputra, Vaidehi; Williams, Ismee; Anyane-Yeboa, Kwame; Chung, Wendy; Yu, Lan; Wong, Nancy; Awad, Danielle; Yu, Chih-Yu; Leotta, Anthony; Kendall, Jude; Yamrom, Boris; Lee, Yoon-Ha; Wigler, Michael; Levy, Dan

    2014-01-01

    Congenital heart disease (CHD) is the most common congenital malformation, with evidence of a strong genetic component. We analyzed data from 223 consecutively ascertained families, each consisting of at least one child affected by a conotruncal defect (CNT) or hypoplastic left heart disease (HLHS) and both parents. The NimbleGen HD2-2.1 comparative genomic hybridization platform was used to identify de novo and rare inherited copy number variants (CNVs). Excluding 10 cases with 22q11.2 DiGeorge deletions, we validated de novo CNVs in 8 % of 148 probands with CNTs, 12.7 % of 71 probands with HLHS and none in 4 probands with both. Only 2 % of control families showed a de novo CNV. We also identified a group of ultra-rare inherited CNVs that occurred de novo in our sample, contained a candidate gene for CHD, recurred in our sample or were present in an affected sibling. We confirmed the contribution to CHD of copy number changes in genes such as GATA4 and NODAL and identified several genes in novel recurrent CNVs that may point to novel CHD candidate loci. We also found CNVs previously associated with highly variable phenotypes and reduced penetrance, such as dup 1q21.1, dup 16p13.11, dup 15q11.2-13, dup 22q11.2, and del 2q23.1. We found that the presence of extra-cardiac anomalies was not related to the frequency of CNVs, and that there was no significant difference in CNV frequency or specificity between the probands with CNT and HLHS. In agreement with other series, we identified likely causal CNVs in 5.6 % of our total sample, half of which were de novo.

  14. Identification of both copy number variation-type and constant-type core elements in a large segmental duplication region of the mouse genome

    PubMed Central

    2013-01-01

    Background Copy number variation (CNV), an important source of diversity in genomic structure, is frequently found in clusters called CNV regions (CNVRs). CNVRs are strongly associated with segmental duplications (SDs), but the composition of these complex repetitive structures remains unclear. Results We conducted self-comparative-plot analysis of all mouse chromosomes using the high-speed and large-scale-homology search algorithm SHEAP. For eight chromosomes, we identified various types of large SD as tartan-checked patterns within the self-comparative plots. A complex arrangement of diagonal split lines in the self-comparative-plots indicated the presence of large homologous repetitive sequences. We focused on one SD on chromosome 13 (SD13M), and developed SHEPHERD, a stepwise ab initio method, to extract longer repetitive elements and to characterize repetitive structures in this region. Analysis using SHEPHERD showed the existence of 60 core elements, which were expected to be the basic units that form SDs within the repetitive structure of SD13M. The demonstration that sequences homologous to the core elements (>70% homology) covered approximately 90% of the SD13M region indicated that our method can characterize the repetitive structure of SD13M effectively. Core elements were composed largely of fragmented repeats of a previously identified type, such as long interspersed nuclear elements (LINEs), together with partial genic regions. Comparative genome hybridization array analysis showed that whereas 42 core elements were components of CNVR that varied among mouse strains, 8 did not vary among strains (constant type), and the status of the others could not be determined. The CNV-type core elements contained significantly larger proportions of long terminal repeat (LTR) types of retrotransposon than the constant-type core elements, which had no CNV. The higher divergence rates observed in the CNV-type core elements than in the constant type indicate that the

  15. A feasibility study of colorectal cancer diagnosis via circulating tumor DNA derived CNV detection.

    PubMed

    Molparia, Bhuvan; Oliveira, Glenn; Wagner, Jennifer L; Spencer, Emily G; Torkamani, Ali

    2018-01-01

    Circulating tumor DNA (ctDNA) has shown great promise as a biomarker for early detection of cancer. However, due to the low abundance of ctDNA, especially at early stages, it is hard to detect at high accuracies while keeping sequencing costs low. Here we present a pilot stage study to detect large scale somatic copy numbers variations (CNVs), which contribute more molecules to ctDNA signal compared to point mutations, via cell free DNA sequencing. We show that it is possible to detect somatic CNVs in early stage colorectal cancer (CRC) patients and subsequently discriminate them from normal patients. With 25 normal and 24 CRC samples, we achieve 100% specificity (lower bound confidence interval: 86%) and ~79% sensitivity (95% confidence interval: 63% - 95%,), though the performance should be considered with caution given the limited sample size. We report a lack of concordance between the CNVs detected via cfDNA sequencing and CNVs identified in parent tissue samples. However, recent findings suggest that a lack of concordance is expected for CNVs in CRC because of their sub-clonal nature. Finally, the CNVs we detect very likely contribute to cancer progression as they lie in functionally important regions, and have been shown to be associated with CRC specifically. This study paves the path for a larger scale exploration of the potential of CNV detection for both diagnoses and prognoses of cancer.

  16. A rare duplication on chromosome 16p11.2 is identified in patients with psychosis in Alzheimer's disease.

    PubMed

    Zheng, Xiaojing; Demirci, F Yesim; Barmada, M Michael; Richardson, Gale A; Lopez, Oscar L; Sweet, Robert A; Kamboh, M Ilyas; Feingold, Eleanor

    2014-01-01

    Epidemiological and genetic studies suggest that schizophrenia and autism may share genetic links. Besides common single nucleotide polymorphisms, recent data suggest that some rare copy number variants (CNVs) are risk factors for both disorders. Because we have previously found that schizophrenia and psychosis in Alzheimer's disease (AD+P) share some genetic risk, we investigated whether CNVs reported in schizophrenia and autism are also linked to AD+P. We searched for CNVs associated with AD+P in 7 recurrent CNV regions that have been previously identified across autism and schizophrenia, using the Illumina HumanOmni1-Quad BeadChip. A chromosome 16p11.2 duplication CNV (chr16: 29,554,843-30,105,652) was identified in 2 of 440 AD+P subjects, but not in 136 AD subjects without psychosis, or in 593 AD subjects with intermediate psychosis status, or in 855 non-AD individuals. The frequency of this duplication CNV in AD+P (0.46%) was similar to that reported previously in schizophrenia (0.46%). This duplication CNV was further validated using the NanoString nCounter CNV Custom CodeSets. The 16p11.2 duplication has been associated with developmental delay, intellectual disability, behavioral problems, autism, schizophrenia (SCZ), and bipolar disorder. These two AD+P patients had no personal of, nor any identified family history of, SCZ, bipolar disorder and autism. To the best of our knowledge, our case report is the first suggestion that 16p11.2 duplication is also linked to AD+P. Although rare, this CNV may have an important role in the development of psychosis.

  17. Molecular subtypes in stage II-III colon cancer defined by genomic instability: early recurrence-risk associated with a high copy-number variation and loss of RUNX3 and CDKN2A.

    PubMed

    Berg, Marianne; Nordgaard, Oddmund; Kørner, Hartwig; Oltedal, Satu; Smaaland, Rune; Søreide, Jon Arne; Søreide, Kjetil

    2015-01-01

    We sought to investigate various molecular subtypes defined by genomic instability that may be related to early death and recurrence in colon cancer. We sought to investigate various molecular subtypes defined by instability at microsatellites (MSI), changes in methylation patterns (CpG island methylator phenotype, CIMP) or copy number variation (CNV) in 8 genes. Stage II-III colon cancers (n = 64) were investigated by methylation-specific multiplex ligated probe amplification (MS-MLPA). Correlation of CNV, CIMP and MSI, with mutations in KRAS and BRAFV600E were assessed for overlap in molecular subtypes and early recurrence risk by uni- and multivariate regression. The CIMP phenotype occurred in 34% (22/64) and MSI in 27% (16/60) of the tumors, with noted CIMP/MSI overlap. Among the molecular subtypes, a high CNV phenotype had an associated odds ratio (OR) for recurrence of 3.2 (95% CI 1.1-9.3; P = 0.026). Losses of CACNA1G (OR of 2.9, 95% CI 1.4-6.0; P = 0.001), IGF2 (OR of 4.3, 95% CI 1.1-15.8; P = 0.007), CDKN2A (p16) (OR of 2.0, 95% CI 1.1-3.6; P = 0.024), and RUNX3 (OR of 3.4, 95% CI 1.3-8.7; P = 0.002) were associated with early recurrence, while MSI, CIMP, KRAS or BRAF V600E mutations were not. The CNV was significantly higher in deceased patients (CNV in 6 of 8) compared to survivors (CNV in 3 of 8). Only stage and loss of RUNX3 and CDKN2A were significant in the multivariable risk-model for early recurrence. A high copy number variation phenotype is a strong predictor of early recurrence and death, and may indicate a dose-dependent relationship between genetic instability and outcome. Loss of tumor suppressors RUNX3 and CDKN2A were related to recurrence-risk and warrants further investigation.

  18. Assessing the Role of Copy Number Variants in Prostate Cancer Risk and Progression Using a Novel Genome-Wide Screening Method

    DTIC Science & Technology

    2013-10-01

    role of copy number variants in prostate cancer risk and progression using a novel genome-wide screening method. 5a. CONTRACT NUMBER 5b. GRANT ...Prostate; Cancer; Risk; Deletion; Prognosismatter Published by Elsevier Inc. .urolonc.2013.06.004 d in part by DOD grant PC081025, by grant arly...Detection Research Network of the National CTRC at UTHSCSA grant P30CA054174. Data omics Core Shared Resource, which is supported CI P30CA054174 (CTRC of

  19. Impact of parental Bos taurus and Bos indicus origins on copy number variation in traditional Chinese cattle breeds

    USDA-ARS?s Scientific Manuscript database

    Copy number variation (CNV) is an important component of genomic structural variation and plays a role not only in evolutionary diversification but also domestication. Chinese cattle were derived from Bos taurus and Bos indicus, and several breeds presumably are of hybrid origin, but the evolution o...

  20. Variant pathogenicity evaluation in the community-driven Inherited Neuropathy Variant Browser.

    PubMed

    Saghira, Cima; Bis, Dana M; Stanek, David; Strickland, Alleene; Herrmann, David N; Reilly, Mary M; Scherer, Steven S; Shy, Michael E; Züchner, Stephan

    2018-05-01

    Charcot-Marie-Tooth disease (CMT) is an umbrella term for inherited neuropathies affecting an estimated one in 2,500 people. Over 120 CMT and related genes have been identified and clinical gene panels often contain more than 100 genes. Such a large genomic space will invariantly yield variants of uncertain clinical significance (VUS) in nearly any person tested. This rise in number of VUS creates major challenges for genetic counseling. Additionally, fewer individual variants in known genes are being published as the academic merit is decreasing, and most testing now happens in clinical laboratories, which typically do not correlate their variants with clinical phenotypes. For CMT, we aim to encourage and facilitate the global capture of variant data to gain a large collection of alleles in CMT genes, ideally in conjunction with phenotypic information. The Inherited Neuropathy Variant Browser provides user-friendly open access to currently reported variation in CMT genes. Geneticists, physicians, and genetic counselors can enter variants detected by clinical tests or in research studies in addition to genetic variation gathered from published literature, which are then submitted to ClinVar biannually. Active participation of the broader CMT community will provide an advance over existing resources for interpretation of CMT genetic variation. © 2018 Wiley Periodicals, Inc.

  1. β-Defensin Genomic Copy Number Does Not Influence the Age of Onset in Huntington’s Disease

    PubMed Central

    Vittori, Angelica; Orth, Michael; Roos, Raymund A. C.; Outeiro, Tiago F.; Giorgini, Flaviano; Hollox, Edward J.

    2014-01-01

    Background Huntington’s disease (HD) is an autosomal dominant neurodegenerative disorder caused by the abnormal expansion of a CAG triplet repeat tract in the huntingtin gene. While the length of this CAG expansion is the major determinant of the age of onset (AO), other genetic factors have also been shown to play a modulatory role. Recent evidence suggests that neuroinflammation is a pivotal factor in the pathogenesis of HD, and that targeting this process may have important therapeutic ramifications. The human β-defensin 2 (hBD2) – encoded by DEFB4 – is an antimicrobial peptide that exhibits inducible expression in astrocytes during inflammation and is an important regulator of innate and adaptive immune response. Therefore, DEFB4 may contribute to the neuroinflammatory processes observed in HD. Objective In this study we tested the hypothesis that copy number variation (CNV) of the β-defensin region, including DEFB4, modifies the AO in HD. Methods and results We genotyped β-defensin CNV in 490 HD individuals using the paralogue ratio test and found no association between β-defensin CNV and onset of HD. Conclusions We conclude that it is unlikely that DEFB4 plays a role in HD pathogenesis. PMID:24587836

  2. Genome-wide identification of copy number variations between two chicken lines that differ in genetic resistance to Marek’s disease

    USDA-ARS?s Scientific Manuscript database

    Background: Copy number variation (CNV) is a major source of genome polymorphism that directly contributes to phenotypic variation such as resistance to infectious diseases. Lines 63 and 72 are two highly inbred experimental chicken lines that differ greatly in susceptibility to Marek’s disease (MD)...

  3. Copy Number Variation across European Populations

    PubMed Central

    Chen, Wanting; Hayward, Caroline; Wright, Alan F.; Hicks, Andrew A.; Vitart, Veronique; Knott, Sara; Wild, Sarah H.; Pramstaller, Peter P.; Wilson, James F.; Rudan, Igor; Porteous, David J.

    2011-01-01

    Genome analysis provides a powerful approach to test for evidence of genetic variation within and between geographical regions and local populations. Copy number variants which comprise insertions, deletions and duplications of genomic sequence provide one such convenient and informative source. Here, we investigate copy number variants from genome wide scans of single nucleotide polymorphisms in three European population isolates, the island of Vis in Croatia, the islands of Orkney in Scotland and the South Tyrol in Italy. We show that whereas the overall copy number variant frequencies are similar between populations, their distribution is highly specific to the population of origin, a finding which is supported by evidence for increased kinship correlation for specific copy number variants within populations. PMID:21829696

  4. Emergence of a Homo sapiens-specific gene family and chromosome 16p11.2 CNV susceptibility.

    PubMed

    Nuttle, Xander; Giannuzzi, Giuliana; Duyzend, Michael H; Schraiber, Joshua G; Narvaiza, Iñigo; Sudmant, Peter H; Penn, Osnat; Chiatante, Giorgia; Malig, Maika; Huddleston, John; Benner, Chris; Camponeschi, Francesca; Ciofi-Baffoni, Simone; Stessman, Holly A F; Marchetto, Maria C N; Denman, Laura; Harshman, Lana; Baker, Carl; Raja, Archana; Penewit, Kelsi; Janke, Nicolette; Tang, W Joyce; Ventura, Mario; Banci, Lucia; Antonacci, Francesca; Akey, Joshua M; Amemiya, Chris T; Gage, Fred H; Reymond, Alexandre; Eichler, Evan E

    2016-08-11

    Genetic differences that specify unique aspects of human evolution have typically been identified by comparative analyses between the genomes of humans and closely related primates, including more recently the genomes of archaic hominins. Not all regions of the genome, however, are equally amenable to such study. Recurrent copy number variation (CNV) at chromosome 16p11.2 accounts for approximately 1% of cases of autism and is mediated by a complex set of segmental duplications, many of which arose recently during human evolution. Here we reconstruct the evolutionary history of the locus and identify bolA family member 2 (BOLA2) as a gene duplicated exclusively in Homo sapiens. We estimate that a 95-kilobase-pair segment containing BOLA2 duplicated across the critical region approximately 282 thousand years ago (ka), one of the latest among a series of genomic changes that dramatically restructured the locus during hominid evolution. All humans examined carried one or more copies of the duplication, which nearly fixed early in the human lineage--a pattern unlikely to have arisen so rapidly in the absence of selection (P < 0.0097). We show that the duplication of BOLA2 led to a novel, human-specific in-frame fusion transcript and that BOLA2 copy number correlates with both RNA expression (r = 0.36) and protein level (r = 0.65), with the greatest expression difference between human and chimpanzee in experimentally derived stem cells. Analyses of 152 patients carrying a chromosome 16p11. rearrangement show that more than 96% of breakpoints occur within the H. sapiens-specific duplication. In summary, the duplicative transposition of BOLA2 at the root of the H. sapiens lineage about 282 ka simultaneously increased copy number of a gene associated with iron homeostasis and predisposed our species to recurrent rearrangements associated with disease.

  5. aCNViewer: Comprehensive genome-wide visualization of absolute copy number and copy neutral variations.

    PubMed

    Renault, Victor; Tost, Jörg; Pichon, Fabien; Wang-Renault, Shu-Fang; Letouzé, Eric; Imbeaud, Sandrine; Zucman-Rossi, Jessica; Deleuze, Jean-François; How-Kit, Alexandre

    2017-01-01

    Copy number variations (CNV) include net gains or losses of part or whole chromosomal regions. They differ from copy neutral loss of heterozygosity (cn-LOH) events which do not induce any net change in the copy number and are often associated with uniparental disomy. These phenomena have long been reported to be associated with diseases and particularly in cancer. Losses/gains of genomic regions are often correlated with lower/higher gene expression. On the other hand, loss of heterozygosity (LOH) and cn-LOH are common events in cancer and may be associated with the loss of a functional tumor suppressor gene. Therefore, identifying recurrent CNV and cn-LOH events can be important as they may highlight common biological components and give insights into the development or mechanisms of a disease. However, no currently available tools allow a comprehensive whole-genome visualization of recurrent CNVs and cn-LOH in groups of samples providing absolute quantification of the aberrations leading to the loss of potentially important information. To overcome these limitations, we developed aCNViewer (Absolute CNV Viewer), a visualization tool for absolute CNVs and cn-LOH across a group of samples. aCNViewer proposes three graphical representations: dendrograms, bi-dimensional heatmaps showing chromosomal regions sharing similar abnormality patterns, and quantitative stacked histograms facilitating the identification of recurrent absolute CNVs and cn-LOH. We illustrated aCNViewer using publically available hepatocellular carcinomas (HCCs) Affymetrix SNP Array data (Fig 1A). Regions 1q and 8q present a similar percentage of total gains but significantly different copy number gain categories (p-value of 0.0103 with a Fisher exact test), validated by another cohort of HCCs (p-value of 5.6e-7) (Fig 2B). aCNViewer is implemented in python and R and is available with a GNU GPLv3 license on GitHub https://github.com/FJD-CEPH/aCNViewer and Docker https

  6. aCNViewer: Comprehensive genome-wide visualization of absolute copy number and copy neutral variations

    PubMed Central

    Wang-Renault, Shu-Fang; Letouzé, Eric; Imbeaud, Sandrine; Zucman-Rossi, Jessica; Deleuze, Jean-François; How-Kit, Alexandre

    2017-01-01

    Motivation Copy number variations (CNV) include net gains or losses of part or whole chromosomal regions. They differ from copy neutral loss of heterozygosity (cn-LOH) events which do not induce any net change in the copy number and are often associated with uniparental disomy. These phenomena have long been reported to be associated with diseases and particularly in cancer. Losses/gains of genomic regions are often correlated with lower/higher gene expression. On the other hand, loss of heterozygosity (LOH) and cn-LOH are common events in cancer and may be associated with the loss of a functional tumor suppressor gene. Therefore, identifying recurrent CNV and cn-LOH events can be important as they may highlight common biological components and give insights into the development or mechanisms of a disease. However, no currently available tools allow a comprehensive whole-genome visualization of recurrent CNVs and cn-LOH in groups of samples providing absolute quantification of the aberrations leading to the loss of potentially important information. Results To overcome these limitations, we developed aCNViewer (Absolute CNV Viewer), a visualization tool for absolute CNVs and cn-LOH across a group of samples. aCNViewer proposes three graphical representations: dendrograms, bi-dimensional heatmaps showing chromosomal regions sharing similar abnormality patterns, and quantitative stacked histograms facilitating the identification of recurrent absolute CNVs and cn-LOH. We illustrated aCNViewer using publically available hepatocellular carcinomas (HCCs) Affymetrix SNP Array data (Fig 1A). Regions 1q and 8q present a similar percentage of total gains but significantly different copy number gain categories (p-value of 0.0103 with a Fisher exact test), validated by another cohort of HCCs (p-value of 5.6e-7) (Fig 2B). Availability and implementation aCNViewer is implemented in python and R and is available with a GNU GPLv3 license on GitHub https

  7. Whole-genome sequencing analysis of phenotypic heterogeneity and anticipation in Li-Fraumeni cancer predisposition syndrome.

    PubMed

    Ariffin, Hany; Hainaut, Pierre; Puzio-Kuter, Anna; Choong, Soo Sin; Chan, Adelyne Sue Li; Tolkunov, Denis; Rajagopal, Gunaretnam; Kang, Wenfeng; Lim, Leon Li Wen; Krishnan, Shekhar; Chen, Kok-Siong; Achatz, Maria Isabel; Karsa, Mawar; Shamsani, Jannah; Levine, Arnold J; Chan, Chang S

    2014-10-28

    The Li-Fraumeni syndrome (LFS) and its variant form (LFL) is a familial predisposition to multiple forms of childhood, adolescent, and adult cancers associated with germ-line mutation in the TP53 tumor suppressor gene. Individual disparities in tumor patterns are compounded by acceleration of cancer onset with successive generations. It has been suggested that this apparent anticipation pattern may result from germ-line genomic instability in TP53 mutation carriers, causing increased DNA copy-number variations (CNVs) with successive generations. To address the genetic basis of phenotypic disparities of LFS/LFL, we performed whole-genome sequencing (WGS) of 13 subjects from two generations of an LFS kindred. Neither de novo CNV nor significant difference in total CNV was detected in relation with successive generations or with age at cancer onset. These observations were consistent with an experimental mouse model system showing that trp53 deficiency in the germ line of father or mother did not increase CNV occurrence in the offspring. On the other hand, individual records on 1,771 TP53 mutation carriers from 294 pedigrees were compiled to assess genetic anticipation patterns (International Agency for Research on Cancer TP53 database). No strictly defined anticipation pattern was observed. Rather, in multigeneration families, cancer onset was delayed in older compared with recent generations. These observations support an alternative model for apparent anticipation in which rare variants from noncarrier parents may attenuate constitutive resistance to tumorigenesis in the offspring of TP53 mutation carriers with late cancer onset.

  8. Copy-number variations are enriched for neurodevelopmental genes in children with developmental coordination disorder.

    PubMed

    Mosca, Stephen J; Langevin, Lisa Marie; Dewey, Deborah; Innes, A Micheil; Lionel, Anath C; Marshall, Christian C; Scherer, Stephen W; Parboosingh, Jillian S; Bernier, Francois P

    2016-12-01

    Developmental coordination disorder is a common neurodevelopment disorder that frequently co-occurs with other neurodevelopmental disorders including attention-deficit hyperactivity disorder (ADHD). Copy-number variations (CNVs) have been implicated in a number of neurodevelopmental and psychiatric disorders; however, the proportion of heritability in developmental coordination disorder (DCD) attributed to CNVs has not been explored. This study aims to investigate how CNVs may contribute to the genetic architecture of DCD. CNV analysis was performed on 82 extensively phenotyped Canadian children with DCD, with or without co-occurring ADHD and/or reading disorder, and 2988 healthy European controls using identical genome-wide SNP microarrays and CNV calling algorithms. An increased rate of large and rare genic CNVs (p=0.009) was detected, and there was an enrichment of duplications spanning brain-expressed genes (p=0.039) and genes previously implicated in other neurodevelopmental disorders (p=0.043). Genes and loci of particular interest in this group included: GAP43, RBFOX1, PTPRN2, SHANK3, 16p11.2 and distal 22q11.2. Although no recurrent CNVs were identified, 26% of DCD cases, where sample availability permitted segregation analysis, were found to have a de novo rare CNV. Of the inherited CNVs, 64% were from a parent who also had a neurodevelopmental disorder. These findings suggest that there may be shared susceptibility genes for DCD and other neurodevelopmental disorders and highlight the need for thorough phenotyping when investigating the genetics of neurodevelopmental disorders. Furthermore, these data provide compelling evidence supporting a genetic basis for DCD, and further implicate rare CNVs in the aetiology of neurodevelopmental disorders. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://www.bmj.com/company/products-services/rights-and-licensing/.

  9. Clinical detection of deletion structural variants in whole-genome sequences

    PubMed Central

    Noll, Aaron C; Miller, Neil A; Smith, Laurie D; Yoo, Byunggil; Fiedler, Stephanie; Cooley, Linda D; Willig, Laurel K; Petrikin, Josh E; Cakici, Julie; Lesko, John; Newton, Angela; Detherage, Kali; Thiffault, Isabelle; Saunders, Carol J; Farrow, Emily G; Kingsmore, Stephen F

    2016-01-01

    Optimal management of acutely ill infants with monogenetic diseases requires rapid identification of causative haplotypes. Whole-genome sequencing (WGS) has been shown to identify pathogenic nucleotide variants in such infants. Deletion structural variants (DSVs, >50 nt) are implicated in many genetic diseases, and tools have been designed to identify DSVs using short-read WGS. Optimisation and integration of these tools into a WGS pipeline could improve diagnostic sensitivity and specificity of WGS. In addition, it may improve turnaround time when compared with current CNV assays, enhancing utility in acute settings. Here we describe DSV detection methods for use in WGS for rapid diagnosis in acutely ill infants: SKALD (Screening Konsensus and Annotation of Large Deletions) combines calls from two tools (Breakdancer and GenomeStrip) with calibrated filters and clinical interpretation rules. In four WGS runs, the average analytic precision (positive predictive value) of SKALD was 78%, and recall (sensitivity) was 27%, when compared with validated reference DSV calls. When retrospectively applied to a cohort of 36 families with acutely ill infants SKALD identified causative DSVs in two. The first was heterozygous deletion of exons 1–3 of MMP21 in trans with a heterozygous frame-shift deletion in two siblings with transposition of the great arteries and heterotaxy. In a newborn female with dysmorphic features, ventricular septal defect and persistent pulmonary hypertension, SKALD identified the breakpoints of a heterozygous, de novo 1p36.32p36.13 deletion. In summary, consensus DSV calling, implemented in an 8-h computational pipeline with parameterised filtering, has the potential to increase the diagnostic yield of WGS in acutely ill neonates and discover novel disease genes. PMID:29263817

  10. Structure and function of neonatal social communication in a genetic mouse model of autism.

    PubMed

    Takahashi, T; Okabe, S; Broin, P Ó; Nishi, A; Ye, K; Beckert, M V; Izumi, T; Machida, A; Kang, G; Abe, S; Pena, J L; Golden, A; Kikusui, T; Hiroi, N

    2016-09-01

    A critical step toward understanding autism spectrum disorder (ASD) is to identify both genetic and environmental risk factors. A number of rare copy number variants (CNVs) have emerged as robust genetic risk factors for ASD, but not all CNV carriers exhibit ASD and the severity of ASD symptoms varies among CNV carriers. Although evidence exists that various environmental factors modulate symptomatic severity, the precise mechanisms by which these factors determine the ultimate severity of ASD are still poorly understood. Here, using a mouse heterozygous for Tbx1 (a gene encoded in 22q11.2 CNV), we demonstrate that a genetically triggered neonatal phenotype in vocalization generates a negative environmental loop in pup-mother social communication. Wild-type pups used individually diverse sequences of simple and complicated call types, but heterozygous pups used individually invariable call sequences with less complicated call types. When played back, representative wild-type call sequences elicited maternal approach, but heterozygous call sequences were ineffective. When the representative wild-type call sequences were randomized, they were ineffective in eliciting vigorous maternal approach behavior. These data demonstrate that an ASD risk gene alters the neonatal call sequence of its carriers and this pup phenotype in turn diminishes maternal care through atypical social communication. Thus, an ASD risk gene induces, through atypical neonatal call sequences, less than optimal maternal care as a negative neonatal environmental factor.

  11. Structure and function of neonatal social communication in a genetic mouse model of autism

    PubMed Central

    Takahashi, Tomohisa; Okabe, Shota; Ó Broin, Pilib; Nishi, Akira; Ye, Kenny; Beckert, Michael V.; Izumi, Takeshi; Machida, Akihiro; Kang, Gina; Abe, Seiji; Pena, Jose L.; Golden, Aaron; Kikusui, Takefumi; Hiroi, Noboru

    2015-01-01

    A critical step toward understanding autism spectrum disorder (ASD) is to identify both genetic and environmental risk factors. A number of rare copy number variants (CNVs) have emerged as robust genetic risk factors for ASD, but not all CNV carriers exhibit ASD and the severity of ASD symptoms varies among CNV carriers. Although evidence exists that various environmental factors modulate symptomatic severity, the precise mechanisms by which these factors determine the ultimate severity of ASD are still poorly understood. Here, using a mouse heterozygous for Tbx1 (a gene encoded in 22q11.2 CNV), we demonstrate that a genetically-triggered neonatal phenotype in vocalization generates a negative environmental loop in pup-mother social communication. Wild-type pups used individually diverse sequences of simple and complicated call types, but heterozygous pups used individually invariable call sequences with less complicated call types. When played back, representative wild-type call sequences elicited maternal approach, but heterozygous call sequences were ineffective. When the representative wild-type call sequences were randomized, they were ineffective in eliciting vigorous maternal approach behavior. These data demonstrate that an ASD risk gene alters the neonatal call sequence of its carriers and this pup phenotype in turn diminishes maternal care through atypical social communication. Thus, an ASD risk gene induces, through atypical neonatal call sequences, less than optimal maternal care as a negative neonatal environmental factor. PMID:26666205

  12. A Novel Method to Detect Early Colorectal Cancer Based on Chromosome Copy Number Variation in Plasma.

    PubMed

    Xu, Jun-Feng; Kang, Qian; Ma, Xing-Yong; Pan, Yuan-Ming; Yang, Lang; Jin, Peng; Wang, Xin; Li, Chen-Guang; Chen, Xiao-Chen; Wu, Chao; Jiao, Shao-Zhuo; Sheng, Jian-Qiu

    2018-01-01

    Colonoscopy screening has been accepted broadly to evaluate the risk and incidence of colorectal cancer (CRC) during health examination in outpatients. However, the intrusiveness, complexity and discomfort of colonoscopy may limit its application and the compliance of patients. Thus, more reliable and convenient diagnostic methods are necessary for CRC screening. Genome instability, especially copy-number variation (CNV), is a hallmark of cancer and has been proved to have potential in clinical application. We determined the diagnostic potential of chromosomal CNV at the arm level by whole-genome sequencing of CRC plasma samples (n = 32) and healthy controls (n = 38). Arm level CNV was determined and the consistence of arm-level CNV between plasma and tissue was further analyzed. Two methods including regular z score and trained Support Vector Machine (SVM) classifier were applied for detection of colorectal cancer. In plasma samples of CRC patients, the most frequent deletions were detected on chromosomes 6, 8p, 14q and 1p, and the most frequent amplifications occurred on chromosome 19, 5, 2, 9p and 20p. These arm-level alterations detected in plasma were also observed in tumor tissues. We showed that the specificity of regular z score analysis for the detection of colorectal cancer was 86.8% (33/38), whereas its sensitivity was only 56.3% (18/32). Applying a trained SVM classifier (n = 40 in trained group) as the standard to detect colorectal cancer relevance ratio in the test samples (n = 30), a sensitivity of 91.7% (11/12) and a specificity 88.9% (16/18) were finally reached. Furthermore, all five early CRC patients in stages I and II were successfully detected. Trained SVM classifier based on arm-level CNVs can be used as a promising method to screen early-stage CRC. © 2018 The Author(s). Published by S. Karger AG, Basel.

  13. CONAN: copy number variation analysis software for genome-wide association studies

    PubMed Central

    2010-01-01

    Background Genome-wide association studies (GWAS) based on single nucleotide polymorphisms (SNPs) revolutionized our perception of the genetic regulation of complex traits and diseases. Copy number variations (CNVs) promise to shed additional light on the genetic basis of monogenic as well as complex diseases and phenotypes. Indeed, the number of detected associations between CNVs and certain phenotypes are constantly increasing. However, while several software packages support the determination of CNVs from SNP chip data, the downstream statistical inference of CNV-phenotype associations is still subject to complicated and inefficient in-house solutions, thus strongly limiting the performance of GWAS based on CNVs. Results CONAN is a freely available client-server software solution which provides an intuitive graphical user interface for categorizing, analyzing and associating CNVs with phenotypes. Moreover, CONAN assists the evaluation process by visualizing detected associations via Manhattan plots in order to enable a rapid identification of genome-wide significant CNV regions. Various file formats including the information on CNVs in population samples are supported as input data. Conclusions CONAN facilitates the performance of GWAS based on CNVs and the visual analysis of calculated results. CONAN provides a rapid, valid and straightforward software solution to identify genetic variation underlying the 'missing' heritability for complex traits that remains unexplained by recent GWAS. The freely available software can be downloaded at http://genepi-conan.i-med.ac.at. PMID:20546565

  14. Copy number variations and genetic admixtures in three Xinjiang ethnic minority groups

    PubMed Central

    Lou, Haiyi; Li, Shilin; Jin, Wenfei; Fu, Ruiqing; Lu, Dongsheng; Pan, Xinwei; Zhou, Huaigu; Ping, Yuan; Jin, Li; Xu, Shuhua

    2015-01-01

    Xinjiang is geographically located in central Asia, and it has played an important historical role in connecting eastern Eurasian (EEA) and western Eurasian (WEA) people. However, human population genomic studies in this region have been largely underrepresented, especially with respect to studies of copy number variations (CNVs). Here we constructed the first CNV map of the three major ethnic minority groups, the Uyghur, Kazakh and Kirgiz, using Affymetrix Genome-Wide Human SNP Array 6.0. We systematically compared the properties of CNVs we identified in the three groups with the data from representatives of EEA and WEA. The analyses indicated a typical genetic admixture pattern in all three groups with ancestries from both EEA and WEA. We also identified several CNV regions showing significant deviation of allele frequency from the expected genome-wide distribution, which might be associated with population-specific phenotypes. Our study provides the first genome-wide perspective on the CNVs of three major Xinjiang ethnic minority groups and has implications for both evolutionary and medical studies. PMID:25026903

  15. Copy number variations and genetic admixtures in three Xinjiang ethnic minority groups.

    PubMed

    Lou, Haiyi; Li, Shilin; Jin, Wenfei; Fu, Ruiqing; Lu, Dongsheng; Pan, Xinwei; Zhou, Huaigu; Ping, Yuan; Jin, Li; Xu, Shuhua

    2015-04-01

    Xinjiang is geographically located in central Asia, and it has played an important historical role in connecting eastern Eurasian (EEA) and western Eurasian (WEA) people. However, human population genomic studies in this region have been largely underrepresented, especially with respect to studies of copy number variations (CNVs). Here we constructed the first CNV map of the three major ethnic minority groups, the Uyghur, Kazakh and Kirgiz, using Affymetrix Genome-Wide Human SNP Array 6.0. We systematically compared the properties of CNVs we identified in the three groups with the data from representatives of EEA and WEA. The analyses indicated a typical genetic admixture pattern in all three groups with ancestries from both EEA and WEA. We also identified several CNV regions showing significant deviation of allele frequency from the expected genome-wide distribution, which might be associated with population-specific phenotypes. Our study provides the first genome-wide perspective on the CNVs of three major Xinjiang ethnic minority groups and has implications for both evolutionary and medical studies.

  16. panelcn.MOPS: Copy-number detection in targeted NGS panel data for clinical diagnostics.

    PubMed

    Povysil, Gundula; Tzika, Antigoni; Vogt, Julia; Haunschmid, Verena; Messiaen, Ludwine; Zschocke, Johannes; Klambauer, Günter; Hochreiter, Sepp; Wimmer, Katharina

    2017-07-01

    Targeted next-generation-sequencing (NGS) panels have largely replaced Sanger sequencing in clinical diagnostics. They allow for the detection of copy-number variations (CNVs) in addition to single-nucleotide variants and small insertions/deletions. However, existing computational CNV detection methods have shortcomings regarding accuracy, quality control (QC), incidental findings, and user-friendliness. We developed panelcn.MOPS, a novel pipeline for detecting CNVs in targeted NGS panel data. Using data from 180 samples, we compared panelcn.MOPS with five state-of-the-art methods. With panelcn.MOPS leading the field, most methods achieved comparably high accuracy. panelcn.MOPS reliably detected CNVs ranging in size from part of a region of interest (ROI), to whole genes, which may comprise all ROIs investigated in a given sample. The latter is enabled by analyzing reads from all ROIs of the panel, but presenting results exclusively for user-selected genes, thus avoiding incidental findings. Additionally, panelcn.MOPS offers QC criteria not only for samples, but also for individual ROIs within a sample, which increases the confidence in called CNVs. panelcn.MOPS is freely available both as R package and standalone software with graphical user interface that is easy to use for clinical geneticists without any programming experience. panelcn.MOPS combines high sensitivity and specificity with user-friendliness rendering it highly suitable for routine clinical diagnostics. © 2017 The Authors. Human Mutation published by Wiley Periodicals, Inc.

  17. panelcn.MOPS: Copy‐number detection in targeted NGS panel data for clinical diagnostics

    PubMed Central

    Povysil, Gundula; Tzika, Antigoni; Vogt, Julia; Haunschmid, Verena; Messiaen, Ludwine; Zschocke, Johannes; Klambauer, Günter; Wimmer, Katharina

    2017-01-01

    Abstract Targeted next‐generation‐sequencing (NGS) panels have largely replaced Sanger sequencing in clinical diagnostics. They allow for the detection of copy‐number variations (CNVs) in addition to single‐nucleotide variants and small insertions/deletions. However, existing computational CNV detection methods have shortcomings regarding accuracy, quality control (QC), incidental findings, and user‐friendliness. We developed panelcn.MOPS, a novel pipeline for detecting CNVs in targeted NGS panel data. Using data from 180 samples, we compared panelcn.MOPS with five state‐of‐the‐art methods. With panelcn.MOPS leading the field, most methods achieved comparably high accuracy. panelcn.MOPS reliably detected CNVs ranging in size from part of a region of interest (ROI), to whole genes, which may comprise all ROIs investigated in a given sample. The latter is enabled by analyzing reads from all ROIs of the panel, but presenting results exclusively for user‐selected genes, thus avoiding incidental findings. Additionally, panelcn.MOPS offers QC criteria not only for samples, but also for individual ROIs within a sample, which increases the confidence in called CNVs. panelcn.MOPS is freely available both as R package and standalone software with graphical user interface that is easy to use for clinical geneticists without any programming experience. panelcn.MOPS combines high sensitivity and specificity with user‐friendliness rendering it highly suitable for routine clinical diagnostics. PMID:28449315

  18. Copy number variation in CEP57L1 predisposes to congenital absence of bilateral ACL and PCL ligaments.

    PubMed

    Liu, Yichuan; Li, Yun; March, Michael E; Nguyen, Kenny; Kenny, Nguyen; Xu, Kexiang; Wang, Fengxiang; Guo, Yiran; Keating, Brendan; Glessner, Joseph; Li, Jiankang; Ganley, Theodore J; Zhang, Jianguo; Deardorff, Matthew A; Xu, Xun; Hakonarson, Hakon

    2015-11-11

    Absence of the anterior (ACL) or posterior cruciate ligament (PCL) are rare congenital malformations that result in knee joint instability, with a prevalence of 1.7 per 100,000 live births and can be associated with other lower-limb abnormalities such as ACL agnesia and absence of the menisci of the knee. While a few cases of absence of ACL/PCL are reported in the literature, a number of large familial case series of related conditions such as ACL agnesia suggest a potential underlying monogenic etiology. We performed whole exome sequencing of a family with two individuals affected by ACL/PCL. We identified copy number variation (CNV) deletion impacting the exon sequences of CEP57L1, present in the affected mother and her affected daughter based on the exome sequencing data. The deletion was validated using quantitative PCR (qPCR), and the gene was confirmed to be expressed in ACL ligament tissue. Interestingly, we detected reduced expression of CEP57L1 in Epstein-Barr virus (EBV) cells from the two patients in comparison with healthy controls. Evaluation of 3D protein structure showed that the helix-binding sites of the protein remain intact with the deletion, but other functional binding sites related to microtubule attachment are missing. The specificity of the CNV deletion was confirmed by showing that it was absent in ~700 exome sequencing samples as well as in the database of genomic variations (DGV), a database containing large numbers of annotated CNVs from previous scientific reports. We identified a novel CNV deletion that was inherited through an autosomal dominant transmission from an affected mother to her affected daughter, both of whom suffered from the absence of the anterior and posterior cruciate ligaments of the knees.

  19. Functional effects of CCL3L1 copy number

    PubMed Central

    Carpenter, Danielle; McIntosh, Richard S; Pleass, Richard J; Armour, John AL

    2012-01-01

    Copy number variation (CNV) is becoming increasingly important as a feature of human variation in disease susceptibility studies. However, the consequences of copy number variation are not so well understood. Here we present data exploring the functional consequences of copy number variation of CCL3L1 in 55 independent UK samples with no known clinical phenotypes. Copy number of CCL3L1 was determined by the paralogue ratio test (PRT), and expression levels of MIP-1α and mRNA from stimulated monocytes were measured and analysed. The data show no statistically significant association of MIP-1α protein levels with copy number. However, there was a significant correlation between copy number and CCL3L1:CCL3 mRNA ratio. The data also provide evidence that expression of CCL3 predominates in both protein and mRNA, and therefore the observed variation of CCL3 is potentially more important biologically than that of copy number variation of CCL3L1. PMID:22476153

  20. Association between genome-wide copy number variation and arsenic-induced skin lesions: a prospective study.

    PubMed

    Kibriya, Muhammad G; Jasmine, Farzana; Parvez, Faruque; Argos, Maria; Roy, Shantanu; Paul-Brutus, Rachelle; Islam, Tariqul; Ahmed, Alauddin; Rakibuz-Zaman, Muhammad; Shinkle, Justin; Slavkovich, Vesna; Graziano, Joseph H; Ahsan, Habibul

    2017-07-18

    Exposure to arsenic in drinking water is a global health problem and arsenic-induced skin lesions are hallmark of chronic arsenic toxicity. We and others have reported germline genetic variations as risk factors for such skin lesions. The role of copy number variation (CNV) in the germline DNA in this regard is unknown. From a large prospectively followed-up cohort, exposed to arsenic, we randomly selected 2171 subjects without arsenic-induced skin lesions at enrollment and genotyped their whole blood DNA samples on Illumina Cyto12v2.1 SNP chips to generate DNA copy number. Participants were followed up every 2 years for a total of 8 years, especially for the development of skin lesions. In Cox regression models, each CNV segment was used as a predictor, accounting for other potential covariates, for incidence of skin lesions. The presence of genomic deletion(s) in a number of genes (OR5J2, GOLGA6L7P, APBA2, GALNTL5, VN1R31P, PHKG1P2, SGCZ, ZNF658) and lincRNA genes (RP11-76I14.1, CTC-535 M15.2, RP11-73B2.2) were associated with higher risk [HR between 1.67 (CI 1.3-2.1) and 2.15 (CI 1.5-2.9) for different CNVs] for development of skin lesions independent of gender, age, and arsenic exposure. Some deletions had stronger effect in a specific gender (ZNF658 in males, SGCZ in females) and some had stronger effect in higher arsenic exposure (lincRNA CTD-3179P9.1) suggesting a possible gene-environment interaction. This first genome-wide CNV study in a prospectively followed-up large cohort, exposed to arsenic, suggests that DNA deletion in several genes and lincRNA genes may predispose an individual to a higher risk of development of arsenic-induced skin lesions.

  1. tRNAomics: tRNA gene copy number variation and codon use provide bioinformatic evidence of a new anticodon:codon wobble pair in a eukaryote

    PubMed Central

    Iben, James R.; Maraia, Richard J.

    2012-01-01

    tRNA genes are interspersed throughout eukaryotic DNA, contributing to genome architecture and evolution in addition to translation of the transcriptome. Codon use correlates with tRNA gene copy number in noncomplex organisms including yeasts. Synonymous codons impact translation with various outcomes, dependent on relative tRNA abundances. Availability of whole-genome sequences allowed us to examine tRNA gene copy number variation (tgCNV) and codon use in four Schizosaccharomyces species and Saccharomyces cerevisiae. tRNA gene numbers vary from 171 to 322 in the four Schizosaccharomyces despite very high similarity in other features of their genomes. In addition, we performed whole-genome sequencing of several related laboratory strains of Schizosaccharomyces pombe and found tgCNV at a cluster of tRNA genes. We examined for the first time effects of wobble rules on correlation of tRNA gene number and codon use and showed improvement for S. cerevisiae and three of the Schizosaccharomyces species. In contrast, correlation in Schizosaccharomyces japonicus is poor due to markedly divergent tRNA gene content, and much worsened by the wobble rules. In japonicus, some tRNA iso-acceptor genes are absent and others are greatly reduced relative to the other yeasts, while genes for synonymous wobble iso-acceptors are amplified, indicating wobble use not apparent in any other eukaryote. We identified a subset of japonicus-specific wobbles that improves correlation of codon use and tRNA gene content in japonicus. We conclude that tgCNV is high among Schizo species and occurs in related laboratory strains of S. pombe (and expectedly other species), and tRNAome-codon analyses can provide insight into species-specific wobble decoding. PMID:22586155

  2. Copy Number Variants and Congenital Anomalies Surveillance: A Suggested Coding Strategy Using the Royal College of Paediatrics and Child Health Version of ICD-10.

    PubMed

    Bedard, Tanya; Lowry, R Brian; Sibbald, Barbara; Thomas, Mary Ann; Innes, A Micheil

    2016-01-01

    The use of array-based comparative genomic hybridization to assess DNA copy number is increasing in many jurisdictions. Such technology identifies more genetic causes of congenital anomalies; however, the clinical significance of some results may be challenging to interpret. A coding strategy to address cases with copy number variants has recently been implemented by the Alberta Congenital Anomalies Surveillance System and is described.

  3. Genome-wide copy number variation study associates metabotropic glutamate receptor gene networks with attention deficit hyperactivity disorder

    PubMed Central

    Elia, Josephine; Glessner, Joseph T; Wang, Kai; Takahashi, Nagahide; Shtir, Corina J; Hadley, Dexter; Sleiman, Patrick M A; Zhang, Haitao; Kim, Cecilia E; Robison, Reid; Lyon, Gholson J; Flory, James H; Bradfield, Jonathan P; Imielinski, Marcin; Hou, Cuiping; Frackelton, Edward C; Chiavacci, Rosetta M; Sakurai, Takeshi; Rabin, Cara; Middleton, Frank A; Thomas, Kelly A; Garris, Maria; Mentch, Frank; Freitag, Christine M; Steinhausen, Hans-Christoph; Todorov, Alexandre A; Reif, Andreas; Rothenberger, Aribert; Franke, Barbara; Mick, Eric O; Roeyers, Herbert; Buitelaar, Jan; Lesch, Klaus-Peter; Banaschewski, Tobias; Ebstein, Richard P; Mulas, Fernando; Oades, Robert D; Sergeant, Joseph; Sonuga-Barke, Edmund; Renner, Tobias J; Romanos, Marcel; Romanos, Jasmin; Warnke, Andreas; Walitza, Susanne; Meyer, Jobst; Pálmason, Haukur; Seitz, Christiane; Loo, Sandra K; Smalley, Susan L; Biederman, Joseph; Kent, Lindsey; Asherson, Philip; Anney, Richard J L; Gaynor, J William; Shaw, Philip; Devoto, Marcella; White, Peter S; Grant, Struan F A; Buxbaum, Joseph D; Rapoport, Judith L; Williams, Nigel M; Nelson, Stanley F; Faraone, Stephen V; Hakonarson, Hakon

    2014-01-01

    Attention deficit hyperactivity disorder (ADHD) is a common, heritable neuropsychiatric disorder of unknown etiology. We performed a whole-genome copy number variation (CNV) study on 1,013 cases with ADHD and 4,105 healthy children of European ancestry using 550,000 SNPs. We evaluated statistically significant findings in multiple independent cohorts, with a total of 2,493 cases with ADHD and 9,222 controls of European ancestry, using matched platforms. CNVs affecting metabotropic glutamate receptor genes were enriched across all cohorts (P = 2.1 × 10−9). We saw GRM5 (encoding glutamate receptor, metabotropic 5) deletions in ten cases and one control (P = 1.36 × 10−6). We saw GRM7 deletions in six cases, and we saw GRM8 deletions in eight cases and no controls. GRM1 was duplicated in eight cases. We experimentally validated the observed variants using quantitative RT-PCR. A gene network analysis showed that genes interacting with the genes in the GRM family are enriched for CNVs in ~10% of the cases (P = 4.38 × 10−10) after correction for occurrence in the controls. We identified rare recurrent CNVs affecting glutamatergic neurotransmission genes that were overrepresented in multiple ADHD cohorts. PMID:22138692

  4. Chromosomal microarray analysis as the first-tier test for the identification of pathogenic copy number variants in chromosome 9 pericentric regions and its challenge.

    PubMed

    Wang, Jia-Chi; Boyar, Fatih Z

    2016-01-01

    Chromosomal microarray analysis (CMA) has been recommended and practiced routinely in the large reference laboratories of U.S.A. as the first-tier test for the postnatal evaluation of individuals with intellectual disability, autism spectrum disorders, and/or multiple congenital anomalies. Using CMA as a diagnostic tool and without a routine setting of fluorescence in situ hybridization with labeled bacterial artificial chromosome probes (BAC-FISH) in the large reference laboratories becomes a challenge in the characterization of chromosome 9 pericentric region. This region has a very complex genomic structure and contains a variety of heterochromatic and euchromatic polymorphic variants. These variants were usually studied by G-banding, C-banding and BAC-FISH analysis. Chromosomal microarray analysis (CMA) was not recommended since it may lead to false positive results. Here, we presented a cohort of four cases, in which high-resolution CMA was used as the first-tier test or simultaneously with G-banding analysis on the proband to identify pathogenic copy number variants (CNVs) in the whole genome. CMA revealed large pathogenic CNVs from chromosome 9 in 3 cases which also revealed different G-banding patterns between the two chromosome 9 homologues. Although we demonstrated that high-resolution CMA played an important role in the identification of pathogenic copy number variants in chromosome 9 pericentric regions, the lack of BAC-FISH analysis or other useful tools renders significant challenges in the characterization of chromosome 9 pericentric regions. None; it is not a clinical trial, and the cases were retrospectively collected and analyzed.

  5. cn.FARMS: a latent variable model to detect copy number variations in microarray data with a low false discovery rate.

    PubMed

    Clevert, Djork-Arné; Mitterecker, Andreas; Mayr, Andreas; Klambauer, Günter; Tuefferd, Marianne; De Bondt, An; Talloen, Willem; Göhlmann, Hinrich; Hochreiter, Sepp

    2011-07-01

    Cost-effective oligonucleotide genotyping arrays like the Affymetrix SNP 6.0 are still the predominant technique to measure DNA copy number variations (CNVs). However, CNV detection methods for microarrays overestimate both the number and the size of CNV regions and, consequently, suffer from a high false discovery rate (FDR). A high FDR means that many CNVs are wrongly detected and therefore not associated with a disease in a clinical study, though correction for multiple testing takes them into account and thereby decreases the study's discovery power. For controlling the FDR, we propose a probabilistic latent variable model, 'cn.FARMS', which is optimized by a Bayesian maximum a posteriori approach. cn.FARMS controls the FDR through the information gain of the posterior over the prior. The prior represents the null hypothesis of copy number 2 for all samples from which the posterior can only deviate by strong and consistent signals in the data. On HapMap data, cn.FARMS clearly outperformed the two most prevalent methods with respect to sensitivity and FDR. The software cn.FARMS is publicly available as a R package at http://www.bioinf.jku.at/software/cnfarms/cnfarms.html.

  6. Chromosomal microarray analysis of consecutive individuals with autism spectrum disorders or learning disability presenting for genetic services.

    PubMed

    Roberts, Jennifer L; Hovanes, Karine; Dasouki, Majed; Manzardo, Ann M; Butler, Merlin G

    2014-02-01

    Chromosomal microarray analysis is now commonly used in clinical practice to identify copy number variants (CNVs) in the human genome. We report our experience with the use of the 105 K and 180K oligonucleotide microarrays in 215 consecutive patients referred with either autism or autism spectrum disorders (ASD) or developmental delay/learning disability for genetic services at the University of Kansas Medical Center during the past 4 years (2009-2012). Of the 215 patients [140 males and 75 females (male/female ratio=1.87); 65 with ASD and 150 with learning disability], abnormal microarray results were seen in 45 individuals (21%) with a total of 49 CNVs. Of these findings, 32 represented a known diagnostic CNV contributing to the clinical presentation and 17 represented non-diagnostic CNVs (variants of unknown significance). Thirteen patients with ASD had a total of 14 CNVs, 6 CNVs recognized as diagnostic and 8 as non-diagnostic. The most common chromosome involved in the ASD group was chromosome 15. For those with a learning disability, 32 patients had a total of 35 CNVs. Twenty-six of the 35 CNVs were classified as a known diagnostic CNV, usually a deletion (n=20). Nine CNVs were classified as an unknown non-diagnostic CNV, usually a duplication (n=8). For the learning disability subgroup, chromosomes 2 and 22 were most involved. Thirteen out of 65 patients (20%) with ASD had a CNV compared with 32 out of 150 patients (21%) with a learning disability. The frequency of chromosomal microarray abnormalities compared by subject group or gender was not statistically different. A higher percentage of individuals with a learning disability had clinical findings of seizures, dysmorphic features and microcephaly, but not statistically significant. While both groups contained more males than females, a significantly higher percentage of males were present in the ASD group. © 2013 Elsevier B.V. All rights reserved.

  7. Rare Copy Number Deletions Predict Individual Variation in Intelligence

    PubMed Central

    Yeo, Ronald A.; Gangestad, Steven W.; Liu, Jingyu; Calhoun, Vince D.; Hutchison, Kent E.

    2011-01-01

    Phenotypic variation in human intellectual functioning shows substantial heritability, as demonstrated by a long history of behavior genetic studies. Many recent molecular genetic studies have attempted to uncover specific genetic variations responsible for this heritability, but identified effects capture little variance and have proven difficult to replicate. The present study, motivated an interest in “mutation load” emerging from evolutionary perspectives, examined the importance of the number of rare (or infrequent) copy number variations (CNVs), and the total number of base pairs included in such deletions, for psychometric intelligence. Genetic data was collected using the Illumina 1MDuoBeadChip Array from a sample of 202 adult individuals with alcohol dependence, and a subset of these (N = 77) had been administered the Wechsler Abbreviated Scale of Intelligence (WASI). After removing CNV outliers, the impact of rare genetic deletions on psychometric intelligence was investigated in 74 individuals. The total length of the rare deletions significantly and negatively predicted intelligence (r = −.30, p = .01). As prior studies have indicated greater heritability in individuals with relatively higher parental socioeconomic status (SES), we also examined the impact of ethnicity (Anglo/White vs. Other), as a proxy measure of SES; these groups did not differ on any genetic variable. This categorical variable significantly moderated the effect of length of deletions on intelligence, with larger effects being noted in the Anglo/White group. Overall, these results suggest that rare deletions (between 5% and 1% population frequency or less) adversely affect intellectual functioning, and that pleotropic effects might partly account for the association of intelligence with health and mental health status. Significant limitations of this research, including issues of generalizability and CNV measurement, are discussed. PMID:21298096

  8. Copy number variations of obesity relevant loci associated with body mass index in young Chinese.

    PubMed

    Sun, Chen; Cao, Min; Shi, Juan; Li, Lijuan; Miao, Lin; Hong, Jie; Cui, Bin; Ning, Guang

    2013-03-10

    Obesity is one of the most complex human diseases that are widely concerned and studied. More recently, copy number variations (CNVs) emerge as another important genetic marker to influence various human diseases. To elucidate the relationship between obesity and CNVs, this current study selected obesity-related candidate CNVs and analyzed their association with body mass index (BMI). Results showed that a CNV locus, 8q24.3, was significantly different (P=0.0070) in CNV frequency between the obese and healthy controls in a young eastern Chinese cohort, while no statistical significance was observed in other seven candidate loci including well reported 10q11.22 and 16p11.2 loci. The association of 8q24.3 CNVs with BMI of the subjects only showed marginal significance, while the copy number (CN) of 5p15.33 had a significant correlation with the BMI of the subject. These results suggested that 8q24.3 CN gains was associated with obesity, and 5p15.33 might also contribute to obesity pathogenesis, highlighting the importance of these CNVs for obesity risks, as well as providing new evidence for CNVs in the pathology of common diseases. Copyright © 2012 Elsevier B.V. All rights reserved.

  9. "Something Extra on Chromosome 5": Parents' Understanding of Positive Prenatal Chromosomal Microarray Analysis (CMA) Results.

    PubMed

    Walser, Sarah A; Werner-Lin, Allison; Russell, Amita; Wapner, Ronald J; Bernhardt, Barbara A

    2016-10-01

    This study aims to explore how couples' understanding of the nature and consequences of positive prenatal chromosomal microarray analysis (CMA) results impacts decision-making and concern about pregnancy. We interviewed 28 women and 12 male partners after receiving positive results and analyzed the transcripts to assess their understanding and level of concern about the expected clinical implications of results. Participant descriptions were compared to the original laboratory interpretation. When diagnosed prenatally, couples' understanding of the nature and consequences of copy number variants (CNVs) impacts decision-making and concern. Findings suggest women, but less so partners, generally understand the nature and clinical implications of prenatal CMA results. Couples feel reassured, perhaps sometimes falsely so, when a CNV is inherited from a "normal" parent and experience considerable uncertainty when a CNV is de novo, frequently precipitating a search for additional information and guidance. Five factors influenced participants' concern including: the pattern of inheritance, type of possible phenotypic involvement, perceived manageability of outcomes, availability and strength of evidence about outcomes associated with the CNV, and provider messages about continuing the pregnancy. A good understanding of results is vital as couples decide whether or not to continue with their pregnancy and seek additional information to assist in pregnancy decision-making.

  10. Identification of missing variants by combining multiple analytic pipelines.

    PubMed

    Ren, Yingxue; Reddy, Joseph S; Pottier, Cyril; Sarangi, Vivekananda; Tian, Shulan; Sinnwell, Jason P; McDonnell, Shannon K; Biernacka, Joanna M; Carrasquillo, Minerva M; Ross, Owen A; Ertekin-Taner, Nilüfer; Rademakers, Rosa; Hudson, Matthew; Mainzer, Liudmila Sergeevna; Asmann, Yan W

    2018-04-16

    After decades of identifying risk factors using array-based genome-wide association studies (GWAS), genetic research of complex diseases has shifted to sequencing-based rare variants discovery. This requires large sample sizes for statistical power and has brought up questions about whether the current variant calling practices are adequate for large cohorts. It is well-known that there are discrepancies between variants called by different pipelines, and that using a single pipeline always misses true variants exclusively identifiable by other pipelines. Nonetheless, it is common practice today to call variants by one pipeline due to computational cost and assume that false negative calls are a small percent of total. We analyzed 10,000 exomes from the Alzheimer's Disease Sequencing Project (ADSP) using multiple analytic pipelines consisting of different read aligners and variant calling strategies. We compared variants identified by using two aligners in 50,100, 200, 500, 1000, and 1952 samples; and compared variants identified by adding single-sample genotyping to the default multi-sample joint genotyping in 50,100, 500, 2000, 5000 and 10,000 samples. We found that using a single pipeline missed increasing numbers of high-quality variants correlated with sample sizes. By combining two read aligners and two variant calling strategies, we rescued 30% of pass-QC variants at sample size of 2000, and 56% at 10,000 samples. The rescued variants had higher proportions of low frequency (minor allele frequency [MAF] 1-5%) and rare (MAF < 1%) variants, which are the very type of variants of interest. In 660 Alzheimer's disease cases with earlier onset ages of ≤65, 4 out of 13 (31%) previously-published rare pathogenic and protective mutations in APP, PSEN1, and PSEN2 genes were undetected by the default one-pipeline approach but recovered by the multi-pipeline approach. Identification of the complete variant set from sequencing data is the prerequisite of genetic

  11. iCopyDAV: Integrated platform for copy number variations—Detection, annotation and visualization

    PubMed Central

    Vogeti, Sriharsha

    2018-01-01

    Discovery of copy number variations (CNVs), a major category of structural variations, have dramatically changed our understanding of differences between individuals and provide an alternate paradigm for the genetic basis of human diseases. CNVs include both copy gain and copy loss events and their detection genome-wide is now possible using high-throughput, low-cost next generation sequencing (NGS) methods. However, accurate detection of CNVs from NGS data is not straightforward due to non-uniform coverage of reads resulting from various systemic biases. We have developed an integrated platform, iCopyDAV, to handle some of these issues in CNV detection in whole genome NGS data. It has a modular framework comprising five major modules: data pre-treatment, segmentation, variant calling, annotation and visualization. An important feature of iCopyDAV is the functional annotation module that enables the user to identify and prioritize CNVs encompassing various functional elements, genomic features and disease-associations. Parallelization of the segmentation algorithms makes the iCopyDAV platform even accessible on a desktop. Here we show the effect of sequencing coverage, read length, bin size, data pre-treatment and segmentation approaches on accurate detection of the complete spectrum of CNVs. Performance of iCopyDAV is evaluated on both simulated data and real data for different sequencing depths. It is an open-source integrated pipeline available at https://github.com/vogetihrsh/icopydav and as Docker’s image at http://bioinf.iiit.ac.in/icopydav/. PMID:29621297

  12. A scan statistic to extract causal gene clusters from case-control genome-wide rare CNV data.

    PubMed

    Nishiyama, Takeshi; Takahashi, Kunihiko; Tango, Toshiro; Pinto, Dalila; Scherer, Stephen W; Takami, Satoshi; Kishino, Hirohisa

    2011-05-26

    Several statistical tests have been developed for analyzing genome-wide association data by incorporating gene pathway information in terms of gene sets. Using these methods, hundreds of gene sets are typically tested, and the tested gene sets often overlap. This overlapping greatly increases the probability of generating false positives, and the results obtained are difficult to interpret, particularly when many gene sets show statistical significance. We propose a flexible statistical framework to circumvent these problems. Inspired by spatial scan statistics for detecting clustering of disease occurrence in the field of epidemiology, we developed a scan statistic to extract disease-associated gene clusters from a whole gene pathway. Extracting one or a few significant gene clusters from a global pathway limits the overall false positive probability, which results in increased statistical power, and facilitates the interpretation of test results. In the present study, we applied our method to genome-wide association data for rare copy-number variations, which have been strongly implicated in common diseases. Application of our method to a simulated dataset demonstrated the high accuracy of this method in detecting disease-associated gene clusters in a whole gene pathway. The scan statistic approach proposed here shows a high level of accuracy in detecting gene clusters in a whole gene pathway. This study has provided a sound statistical framework for analyzing genome-wide rare CNV data by incorporating topological information on the gene pathway.

  13. Glyoxalase 1 copy number variation in patients with well differentiated gastro-entero-pancreatic neuroendocrine tumours (GEP-NET)

    PubMed Central

    Xue, Mingzhan; Shafie, Alaa; Qaiser, Talha; Rajpoot, Nasir M.; Kaltsas, Gregory; James, Sean; Gopalakrishnan, Kishore; Fisk, Adrian; Dimitriadis, Georgios K.; Grammatopoulos, Dimitris K.; Rabbani, Naila; Thornalley, Paul J.; Weickert, Martin O.

    2017-01-01

    Background The glyoxalase-1 gene (GLO1) is a hotspot for copy-number variation (CNV) in human genomes. Increased GLO1 copy-number is associated with multidrug resistance in tumour chemotherapy, but prevalence of GLO1 CNV in gastro-entero-pancreatic neuroendocrine tumours (GEP-NET) is unknown. Methods GLO1 copy-number variation was measured in 39 patients with GEP-NET (midgut NET, n = 25; pancreatic NET, n = 14) after curative or debulking surgical treatment. Primary tumour tissue, surrounding healthy tissue and, where applicable, additional metastatic tumour tissue were analysed, using real time qPCR. Progression and survival following surgical treatment were monitored over 4.2 ± 0.5 years. Results In the pooled GEP-NET cohort, GLO1 copy-number in healthy tissue was 2.0 in all samples but significantly increased in primary tumour tissue in 43% of patients with pancreatic NET and in 72% of patients with midgut NET, mainly driven by significantly higher GLO1 copy-number in midgut NET. In tissue from additional metastases resection (18 midgut NET and one pancreatic NET), GLO1 copy number was also increased, compared with healthy tissue; but was not significantly different compared with primary tumour tissue. During mean 3 - 5 years follow-up, 8 patients died and 16 patients showed radiological progression. In midgut NET, a high GLO1 copy-number was associated with earlier progression. In NETs with increased GLO1 copy number, there was increased Glo1 protein expression compared to non-malignant tissue. Conclusions GLO1 copy-number was increased in a large percentage of patients with GEP-NET and correlated positively with increased Glo1 protein in tumour tissue. Analysis of GLO1 copy-number variation particularly in patients with midgut NET could be a novel prognostic marker for tumour progression. PMID:29100361

  14. Incidence of numerical variants and transitional lumbosacral vertebrae on whole-spine MRI.

    PubMed

    Tins, Bernhard J; Balain, Birender

    2016-04-01

    This study sets out to prospectively investigate the incidence of transitional vertebrae and numerical variants of the spine. Over a period of 28 months, MRIs of the whole spine were prospectively evaluated for the presence of transitional lumbosacral vertebrae and numerical variants of the spine. MRI of the whole spine was evaluated in 420 patients, comprising 211 female and 209 male subjects. Two patients had more complex anomalies. Lumbosacral transitional vertebrae were seen in 12 patients: eight sacralised L5 (3 male, 5 female) and four lumbarised S1 (3 male, 1 female). The incidence of transitional vertebrae was approximately 3.3. % (14/418). Thirty-two (7.7 %) of 418 patients had numerical variants of mobile vertebrae of the spine without transitional vertebrae. The number of mobile vertebrae was increased by one in 18 patients (12 male, 6 female), and the number was decreased by one in 14 patients (4 male, 10 female). Numerical variants of the spine are common, and were found to be almost 2.5 times as frequent as transitional lumbosacral vertebrae in the study population. Only whole-spine imaging can identify numerical variants and the anatomical nature of transitional vertebrae. The tendency is toward an increased number of mobile vertebrae in men and a decreased number in women. Main messages • Numerical variants of the spine are more common than transitional vertebrae. • Spinal numerical variants can be reliably identified only with whole-spine imaging. • Increased numbers of vertebrae are more common in men than women. • Transitional lumbosacral vertebrae occurred in about 3.3 % of the study population. • The incidence of numerical variants of the spine was about 7.7 %.

  15. 15q11.2 CNV affects cognitive, structural and functional correlates of dyslexia and dyscalculia.

    PubMed

    Ulfarsson, M O; Walters, G B; Gustafsson, O; Steinberg, S; Silva, A; Doyle, O M; Brammer, M; Gudbjartsson, D F; Arnarsdottir, S; Jonsdottir, G A; Gisladottir, R S; Bjornsdottir, G; Helgason, H; Ellingsen, L M; Halldorsson, J G; Saemundsen, E; Stefansdottir, B; Jonsson, L; Eiriksdottir, V K; Eiriksdottir, G R; Johannesdottir, G H; Unnsteinsdottir, U; Jonsdottir, B; Magnusdottir, B B; Sulem, P; Thorsteinsdottir, U; Sigurdsson, E; Brandeis, D; Meyer-Lindenberg, A; Stefansson, H; Stefansson, K

    2017-04-25

    Several copy number variants have been associated with neuropsychiatric disorders and these variants have been shown to also influence cognitive abilities in carriers unaffected by psychiatric disorders. Previously, we associated the 15q11.2(BP1-BP2) deletion with specific learning disabilities and a larger corpus callosum. Here we investigate, in a much larger sample, the effect of the 15q11.2(BP1-BP2) deletion on cognitive, structural and functional correlates of dyslexia and dyscalculia. We report that the deletion confers greatest risk of the combined phenotype of dyslexia and dyscalculia. We also show that the deletion associates with a smaller left fusiform gyrus. Moreover, tailored functional magnetic resonance imaging experiments using phonological lexical decision and multiplication verification tasks demonstrate altered activation in the left fusiform and the left angular gyri in carriers. Thus, by using convergent evidence from neuropsychological testing, and structural and functional neuroimaging, we show that the 15q11.2(BP1-BP2) deletion affects cognitive, structural and functional correlates of both dyslexia and dyscalculia.

  16. 15q11.2 CNV affects cognitive, structural and functional correlates of dyslexia and dyscalculia

    PubMed Central

    Ulfarsson, M O; Walters, G B; Gustafsson, O; Steinberg, S; Silva, A; Doyle, O M; Brammer, M; Gudbjartsson, D F; Arnarsdottir, S; Jonsdottir, G A; Gisladottir, R S; Bjornsdottir, G; Helgason, H; Ellingsen, L M; Halldorsson, J G; Saemundsen, E; Stefansdottir, B; Jonsson, L; Eiriksdottir, V K; Eiriksdottir, G R; Johannesdottir, G H; Unnsteinsdottir, U; Jonsdottir, B; Magnusdottir, B B; Sulem, P; Thorsteinsdottir, U; Sigurdsson, E; Brandeis, D; Meyer-Lindenberg, A; Stefansson, H; Stefansson, K

    2017-01-01

    Several copy number variants have been associated with neuropsychiatric disorders and these variants have been shown to also influence cognitive abilities in carriers unaffected by psychiatric disorders. Previously, we associated the 15q11.2(BP1–BP2) deletion with specific learning disabilities and a larger corpus callosum. Here we investigate, in a much larger sample, the effect of the 15q11.2(BP1–BP2) deletion on cognitive, structural and functional correlates of dyslexia and dyscalculia. We report that the deletion confers greatest risk of the combined phenotype of dyslexia and dyscalculia. We also show that the deletion associates with a smaller left fusiform gyrus. Moreover, tailored functional magnetic resonance imaging experiments using phonological lexical decision and multiplication verification tasks demonstrate altered activation in the left fusiform and the left angular gyri in carriers. Thus, by using convergent evidence from neuropsychological testing, and structural and functional neuroimaging, we show that the 15q11.2(BP1–BP2) deletion affects cognitive, structural and functional correlates of both dyslexia and dyscalculia. PMID:28440815

  17. Invited review DNA copy number changes as diagnostic tools for lung cancer.

    PubMed

    Bowcock, Anne M

    2014-05-01

    Lung cancer usually presents as advanced stage disease and there is a need for early diagnosis so that appropriate treatments can be provided prior to tumour progression. Copy number variation is frequently detected in tumours and can contribute to tumour progression. This is because regions harbouring DNA imbalance can contain genes encoding critical proteins whose altered dosage contributes to the neoplastic process. Three copy number variations (CNVs) from chromosomes 3p26-p11.1 (loss), 3q26.2-29 (gain) and 6q25.3-24.3 (loss) have previously been described in individuals presenting with endobronchial squamous metaplasia. These CNVs were predictors of cancer diagnosed within 44 months with 97% accuracy. An evaluation of this CNV-based classifier with an independent set of 12 samples (10 men and 2 women), each with a carcinoma in situ or invasive carcinoma at the same site at follow-up demonstrated 92% prediction accuracy. The negative predictive value of this classifier was 89%. The gain at 3q26.2-q29 contributed the most to the classification, being present in virtually all lesions. This region harbours the PIK3CA gene and evaluation of the number of copies of this gene gave very similar results to those from array comparative genomic hybridisation. This type of test can be performed on sputum or bronchial brushings. Larger cohorts now need to be examined to confirm this finding and to possibly refine the regions of CNV. This type of approach paves the way for future molecular analyses to assist in selecting subjects with endobronchial squamous metaplastic or dysplastic lesions who might benefit from more aggressive therapeutic intervention or surveillance.

  18. Identifying Mendelian disease genes with the Variant Effect Scoring Tool

    PubMed Central

    2013-01-01

    Background Whole exome sequencing studies identify hundreds to thousands of rare protein coding variants of ambiguous significance for human health. Computational tools are needed to accelerate the identification of specific variants and genes that contribute to human disease. Results We have developed the Variant Effect Scoring Tool (VEST), a supervised machine learning-based classifier, to prioritize rare missense variants with likely involvement in human disease. The VEST classifier training set comprised ~ 45,000 disease mutations from the latest Human Gene Mutation Database release and another ~45,000 high frequency (allele frequency >1%) putatively neutral missense variants from the Exome Sequencing Project. VEST outperforms some of the most popular methods for prioritizing missense variants in carefully designed holdout benchmarking experiments (VEST ROC AUC = 0.91, PolyPhen2 ROC AUC = 0.86, SIFT4.0 ROC AUC = 0.84). VEST estimates variant score p-values against a null distribution of VEST scores for neutral variants not included in the VEST training set. These p-values can be aggregated at the gene level across multiple disease exomes to rank genes for probable disease involvement. We tested the ability of an aggregate VEST gene score to identify candidate Mendelian disease genes, based on whole-exome sequencing of a small number of disease cases. We used whole-exome data for two Mendelian disorders for which the causal gene is known. Considering only genes that contained variants in all cases, the VEST gene score ranked dihydroorotate dehydrogenase (DHODH) number 2 of 2253 genes in four cases of Miller syndrome, and myosin-3 (MYH3) number 2 of 2313 genes in three cases of Freeman Sheldon syndrome. Conclusions Our results demonstrate the potential power gain of aggregating bioinformatics variant scores into gene-level scores and the general utility of bioinformatics in assisting the search for disease genes in large-scale exome sequencing studies. VEST is

  19. Robust detection of EGFR copy number changes and EGFR variant III: technical aspects and relevance for glioma diagnostics.

    PubMed

    Jeuken, Judith; Sijben, Angelique; Alenda, Cristina; Rijntjes, Jos; Dekkers, Marieke; Boots-Sprenger, Sandra; McLendon, Roger; Wesseling, Pieter

    2009-10-01

    Epidermal growth factor receptor (EGFR) is commonly affected in cancer, generally in the form of an increase in DNA copy number and/or as mutation variants [e.g., EGFR variant III (EGFRvIII), an in-frame deletion of exons 2-7]. While detection of EGFR aberrations can be expected to be relevant for glioma patients, such analysis has not yet been implemented in a routine setting, also because feasible and robust assays were lacking. We evaluated multiplex ligation-dependent probe amplification (MLPA) for detection of EGFR amplification and EGFRvIII in DNA of a spectrum of 216 diffuse gliomas. EGFRvIII detection was verified at the protein level by immunohistochemistry and at the RNA level using the conventionally used endpoint RT-PCR as well as a newly developed quantitative RT-PCR. Compared to these techniques, the DNA-based MLPA assay for EGFR/EGFRvIII analysis tested showed 100% sensitivity and specificity. We conclude that MLPA is a robust assay for detection of EGFR/EGFRvIII aberrations. While the exact diagnostic, prognostic and predictive value of such EGFR testing remains to be seen, MLPA has great potential as it can reliably and relatively easily be performed on routinely processed (formalin-fixed, paraffin-embedded) tumor tissue in combination with testing for other relevant glioma markers.

  20. OVA: integrating molecular and physical phenotype data from multiple biomedical domain ontologies with variant filtering for enhanced variant prioritization.

    PubMed

    Antanaviciute, Agne; Watson, Christopher M; Harrison, Sally M; Lascelles, Carolina; Crinnion, Laura; Markham, Alexander F; Bonthron, David T; Carr, Ian M

    2015-12-01

    Exome sequencing has become a de facto standard method for Mendelian disease gene discovery in recent years, yet identifying disease-causing mutations among thousands of candidate variants remains a non-trivial task. Here we describe a new variant prioritization tool, OVA (ontology variant analysis), in which user-provided phenotypic information is exploited to infer deeper biological context. OVA combines a knowledge-based approach with a variant-filtering framework. It reduces the number of candidate variants by considering genotype and predicted effect on protein sequence, and scores the remainder on biological relevance to the query phenotype.We take advantage of several ontologies in order to bridge knowledge across multiple biomedical domains and facilitate computational analysis of annotations pertaining to genes, diseases, phenotypes, tissues and pathways. In this way, OVA combines information regarding molecular and physical phenotypes and integrates both human and model organism data to effectively prioritize variants. By assessing performance on both known and novel disease mutations, we show that OVA performs biologically meaningful candidate variant prioritization and can be more accurate than another recently published candidate variant prioritization tool. OVA is freely accessible at http://dna2.leeds.ac.uk:8080/OVA/index.jsp. Supplementary data are available at Bioinformatics online. umaan@leeds.ac.uk. © The Author 2015. Published by Oxford University Press.

  1. Copy number variation profile in the placental and parental genomes of recurrent pregnancy loss families

    PubMed Central

    Kasak, Laura; Rull, Kristiina; Sõber, Siim; Laan, Maris

    2017-01-01

    We have previously shown an extensive load of somatic copy number variations (CNVs) in the human placental genome with the highest fraction detected in normal term pregnancies. Hereby, we hypothesized that insufficient promotion of CNVs may impair placental development and lead to recurrent pregnancy loss (RPL). RPL affects ~3% of couples aiming at childbirth and idiopathic RPL represents ~50% of cases. We analysed placental and parental CNV profiles of idiopathic RPL trios (mother-father-placenta) and duos (mother-placenta). Consistent with the hypothesis, the placental genomes of RPL cases exhibited 2-fold less CNVs compared to uncomplicated 1st trimester pregnancies (P = 0.02). This difference mainly arose from lower number of duplications. Overall, 1st trimester control placentas shared only 5.3% of identified CNV regions with RPL cases, whereas the respective fraction with term placentas was 35.1% (P = 1.1 × 10−9). Disruption of the genes NUP98 (embryonic stem cell development) and MTRR (folate metabolism) was detected exclusively in RPL placentas, potentially indicative to novel loci implicated in RPL. Interestingly, genes with higher overall expression were prone to deletions (>3-fold higher median expression compared to genes unaffected by CNVs, P = 6.69 × 10−20). Additionally, large pericentromeric and subtelomeric CNVs in parental genomes emerged as a risk factor for RPL. PMID:28345611

  2. Single-variant and multi-variant trend tests for genetic association with next-generation sequencing that are robust to sequencing error.

    PubMed

    Kim, Wonkuk; Londono, Douglas; Zhou, Lisheng; Xing, Jinchuan; Nato, Alejandro Q; Musolf, Anthony; Matise, Tara C; Finch, Stephen J; Gordon, Derek

    2012-01-01

    As with any new technology, next-generation sequencing (NGS) has potential advantages and potential challenges. One advantage is the identification of multiple causal variants for disease that might otherwise be missed by SNP-chip technology. One potential challenge is misclassification error (as with any emerging technology) and the issue of power loss due to multiple testing. Here, we develop an extension of the linear trend test for association that incorporates differential misclassification error and may be applied to any number of SNPs. We call the statistic the linear trend test allowing for error, applied to NGS, or LTTae,NGS. This statistic allows for differential misclassification. The observed data are phenotypes for unrelated cases and controls, coverage, and the number of putative causal variants for every individual at all SNPs. We simulate data considering multiple factors (disease mode of inheritance, genotype relative risk, causal variant frequency, sequence error rate in cases, sequence error rate in controls, number of loci, and others) and evaluate type I error rate and power for each vector of factor settings. We compare our results with two recently published NGS statistics. Also, we create a fictitious disease model based on downloaded 1000 Genomes data for 5 SNPs and 388 individuals, and apply our statistic to those data. We find that the LTTae,NGS maintains the correct type I error rate in all simulations (differential and non-differential error), while the other statistics show large inflation in type I error for lower coverage. Power for all three methods is approximately the same for all three statistics in the presence of non-differential error. Application of our statistic to the 1000 Genomes data suggests that, for the data downloaded, there is a 1.5% sequence misclassification rate over all SNPs. Finally, application of the multi-variant form of LTTae,NGS shows high power for a number of simulation settings, although it can have

  3. Single variant and multi-variant trend tests for genetic association with next generation sequencing that are robust to sequencing error

    PubMed Central

    Kim, Wonkuk; Londono, Douglas; Zhou, Lisheng; Xing, Jinchuan; Nato, Andrew; Musolf, Anthony; Matise, Tara C.; Finch, Stephen J.; Gordon, Derek

    2013-01-01

    As with any new technology, next generation sequencing (NGS) has potential advantages and potential challenges. One advantage is the identification of multiple causal variants for disease that might otherwise be missed by SNP-chip technology. One potential challenge is misclassification error (as with any emerging technology) and the issue of power loss due to multiple testing. Here, we develop an extension of the linear trend test for association that incorporates differential misclassification error and may be applied to any number of SNPs. We call the statistic the linear trend test allowing for error, applied to NGS, or LTTae,NGS. This statistic allows for differential misclassification. The observed data are phenotypes for unrelated cases and controls, coverage, and the number of putative causal variants for every individual at all SNPs. We simulate data considering multiple factors (disease mode of inheritance, genotype relative risk, causal variant frequency, sequence error rate in cases, sequence error rate in controls, number of loci, and others) and evaluate type I error rate and power for each vector of factor settings. We compare our results with two recently published NGS statistics. Also, we create a fictitious disease model, based on downloaded 1000 Genomes data for 5 SNPs and 388 individuals, and apply our statistic to that data. We find that the LTTae,NGS maintains the correct type I error rate in all simulations (differential and non-differential error), while the other statistics show large inflation in type I error for lower coverage. Power for all three methods is approximately the same for all three statistics in the presence of non-differential error. Application of our statistic to the 1000 Genomes data suggests that, for the data downloaded, there is a 1.5% sequence misclassification rate over all SNPs. Finally, application of the multi-variant form of LTTae,NGS shows high power for a number of simulation settings, although it can have

  4. Copy number variation in 19 Italian multiplex families with autism spectrum disorder: Importance of synaptic and neurite elongation genes.

    PubMed

    Lintas, Carla; Picinelli, Chiara; Piras, Ignazio Stefano; Sacco, Roberto; Brogna, Claudia; Persico, Antonio M

    2017-03-17

    Autism Spectrum Disorder (ASD) is endowed with impressive heritability estimates and high recurrence rates. Its genetic underpinnings are nonetheless very heterogeneous, with common, and rare contributing variants located in hundreds of different loci, each characterized by variable levels of penetrance. Multiplex families from single ethnic groups represent a useful means to reduce heterogeneity and enhance genetic load. We screened 19 Italian ASD multiplex families (3 triplets and 16 duplets, total N = 41 ASD subjects), using array-CGH (Agilent 180 K). Causal or ASD-relevant CNVs were detected in 36.6% (15/41) of ASD probands, corresponding to 36.8% (7/19) multiplex families with at least one affected sibling genetically positive. However, only in less than half (3/7) of positive families, affected siblings share the same causal or ASD-relevant CNV. Even in these three families, additional potentially relevant CNVs not shared by affected sib pairs were also detected. These results provide further evidence of genetic heterogeneity in ASD even within multiplex families belonging to a single ethnic group. Differences in CNV burden may likely contribute to the substantial clinical heterogeneity observed between affected siblings. In addition, Gene Ontology enrichment analysis indicates that most potentially causal or relevant ASD genes detected in our cohort belong to nervous system-specific categories, especially involved in neurite elongation and synaptic structure/function. These findings point toward the existence of genomic instability in these families, whose underlying genetic and epigenetic mechanisms deserve further scrutiny. © 2017 Wiley Periodicals, Inc.

  5. Mobile Laser Indirect Ophthalmoscope: For the Induction of Choroidal Neovascularization in a Mouse Model.

    PubMed

    Weinberger, Dov; Bor-Shavit, Elite; Barliya, Tilda; Dahbash, Mor; Kinrot, Opher; Gaton, Dan D; Nisgav, Yael; Livnat, Tami

    2017-11-01

    This study aims to evaluate and standardize the reliability of a mobile laser indirect ophthalmoscope in the induction of choroidal neovascularization (CNV) in a mouse model. A diode laser indirect ophthalmoscope was used to induce CNV in pigmented male C57BL/6J mice. Standardization of spot size and laser intensity was determined using different aspheric lenses with increasing laser intensities applied around the optic disc. Development of CNV was evaluated 1, 5, and 14 days post laser application using fluorescein angiography (FA), histology, and choroidal flat mounts stained for the endothelial marker CD31 and FITC-dextran. Correlation between the number of laser hits to the number and size of developed CNV lesions was determined using flat mount choroid staining. The ability of intravitreally injected anti-human and anti-mouse VEGF antibodies to inhibit CNV induced by the mobile laser was evaluated. Laser parameters were standardized on 350 mW for 100 msec, using the 90 diopter lens to accomplish the highest incidence of Bruch's membrane rupture. CNV lesions' formation was validated on days 5 and 14 post laser injury, though FA showed leakage on as early as day 1. The number of laser hits was significantly correlated with the CNV area. CNV growth was successfully inhibited by both anti-human and mouse VEGF antibodies. The mobile laser indirect ophthalmoscope can serve as a feasible and a reliable alternative method for the CNV induction in a mouse model.

  6. Leapfrog variants of iterative methods for linear algebra equations

    NASA Technical Reports Server (NTRS)

    Saylor, Paul E.

    1988-01-01

    Two iterative methods are considered, Richardson's method and a general second order method. For both methods, a variant of the method is derived for which only even numbered iterates are computed. The variant is called a leapfrog method. Comparisons between the conventional form of the methods and the leapfrog form are made under the assumption that the number of unknowns is large. In the case of Richardson's method, it is possible to express the final iterate in terms of only the initial approximation, a variant of the iteration called the grand-leap method. In the case of the grand-leap variant, a set of parameters is required. An algorithm is presented to compute these parameters that is related to algorithms to compute the weights and abscissas for Gaussian quadrature. General algorithms to implement the leapfrog and grand-leap methods are presented. Algorithms for the important special case of the Chebyshev method are also given.

  7. Fcγ receptor IIIa single-nucleotide polymorphisms and haplotypes affect human IgG binding and are associated with lupus nephritis in African Americans.

    PubMed

    Dong, Chaoling; Ptacek, Travis S; Redden, David T; Zhang, Kui; Brown, Elizabeth E; Edberg, Jeffrey C; McGwin, Gerald; Alarcón, Graciela S; Ramsey-Goldman, Rosalind; Reveille, John D; Vilá, Luis M; Petri, Michelle; Qin, Aijian; Wu, Jianming; Kimberly, Robert P

    2014-05-01

    To investigate whether the Fcγ receptor IIIa-66L/R/H (FcγRIIIa-66L/R/H) polymorphism influences net effective receptor function and to assess if the FCGR3A combined genotypes formed by FcγRIIIa-66L/R/H and FcγRIIIa-176F/V, as well as copy number variation (CNV), confer risk of developing systemic lupus erythematosus (SLE) and lupus nephritis. FcγRIIIa variants, expressed on A20 IIA1.6 cells, were used in flow cytometry-based human IgG-binding assays. Using Pyrosequencing methodology, FCGR3A single-nucleotide polymorphism and CNV genotypes were determined in a cohort of 1,728 SLE patients and 2,404 healthy controls. The FcγRIIIa-66L/R/H (rs10127939) polymorphism influenced ligand binding capacity in the presence of the FcγRIIIa-176V (rs396991) allele. There was a trend toward an association of the low-binding FcγRIIIa-176F allele with lupus nephritis among African Americans (P = 0.0609) but not among European Americans (P > 0.10). Nephritis among African American patients with SLE was associated with FcγRIIIa low-binding haplotypes containing the 66L/R/H and 176F variants (P = 0.03) and with low-binding genotype combinations (P = 0.002). No association was observed among European American patients with SLE. The distribution of FCGR3A CNV was not significantly different among controls and SLE patients with or without nephritis. FcγRIIIa-66L/R/H influences ligand binding. The low-binding haplotypes formed by 66L/R/H and 176F confer enhanced risk of lupus nephritis in African Americans. FCGR3A CNVs are not associated with SLE or lupus nephritis in either African Americans or European Americans. Copyright © 2014 by the American College of Rheumatology.

  8. FcγRIIIa SNPs and haplotypes affect human IgG binding and association with lupus nephritis in African Americans

    PubMed Central

    Dong, Chaoling; Ptacek, Travis S; Redden, David T; Zhang, Kui; Brown, Elizabeth E.; Edberg, Jeffrey C.; McGwin, Gerald; Alarcón, Graciela S.; Ramsey-Goldman, Rosalind; Reveille, John D.; Vilá, Luis M.; Petri, Michelle; Qin, Aijian; Wu, Jianming; Kimberly, Robert P.

    2014-01-01

    Objective To investigate whether the FcγRIIIa-66R/H/L polymorphism influences net effective receptor function and to assess if the FCGR3A combined genotypes formed by FcγRIIIa-66R/H/L and FcγRIIIa-176F/V as well as copy number variation (CNV) confer risk for development of SLE and lupus nephritis. Methods FcγRIIIa variants, expressed on A20 IIA1.6 cells, were used in flow cytometry-based human IgG binding assays. FCGR3A SNP and CNV genotypes were determined by Pyrosequencing methodology in a cohort of 1728 SLE patients and 2404 healthy controls. Results The FcγRIIIa-66L/H/R (rs10127939) polymorphism influences ligand binding capacity in the context of the FcγRIIIa-176V (rs396991) allele. The low binding FcγRIIIa-176F allele was associated with SLE nephritis (p = 0.0609) in African Americans but not in European Americans (p > 0.10). Nephritis among African American SLE subjects was associated with FcγRIIIa low binding haplotypes containing the 66R/H/L and 176F variants (p = 0.03) and with low binding genotype combinations (p = 0.002). No association was observed in European American SLE patients. The distribution of FCGR3A CNV was not significantly different between controls and SLE patients with or without nephritis. Conclusion FcγRIIIa-66R/H/L influences ligand binding. The low binding haplotypes formed by 66R/H/L and 176F confer enhanced risk for lupus nephritis in African Americans. FCGR3A CNVs are not associated with SLE or SLE nephritis in either African Americans or European Americans. PMID:24782186

  9. The distribution and impact of common copy-number variation in the genome of the domesticated apple, Malus x domestica Borkh.

    PubMed

    Boocock, James; Chagné, David; Merriman, Tony R; Black, Michael A

    2015-10-23

    Copy number variation (CNV) is a common feature of eukaryotic genomes, and a growing body of evidence suggests that genes affected by CNV are enriched in processes that are associated with environmental responses. Here we use next generation sequence (NGS) data to detect copy-number variable regions (CNVRs) within the Malus x domestica genome, as well as to examine their distribution and impact. CNVRs were detected using NGS data derived from 30 accessions of M. x domestica analyzed using the read-depth method, as implemented in the CNVrd2 software. To improve the reliability of our results, we developed a quality control and analysis procedure that involved checking for organelle DNA, not repeat masking, and the determination of CNVR identity using a permutation testing procedure. Overall, we identified 876 CNVRs, which spanned 3.5 % of the apple genome. To verify that detected CNVRs were not artifacts, we analyzed the B- allele-frequencies (BAF) within a single nucleotide polymorphism (SNP) array dataset derived from a screening of 185 individual apple accessions and found the CNVRs were enriched for SNPs having aberrant BAFs (P < 1e-13, Fisher's Exact test). Putative CNVRs overlapped 845 gene models and were enriched for resistance (R) gene models (P < 1e-22, Fisher's exact test). Of note was a cluster of resistance gene models on chromosome 2 near a region containing multiple major gene loci conferring resistance to apple scab. We present the first analysis and catalogue of CNVRs in the M. x domestica genome. The enrichment of the CNVRs with R gene models and their overlap with gene loci of agricultural significance draw attention to a form of unexplored genetic variation in apple. This research will underpin further investigation of the role that CNV plays within the apple genome.

  10. Chromosomal microarray in clinical diagnosis: a study of 337 patients with congenital anomalies and developmental delays or intellectual disability.

    PubMed

    Sansović, Ivona; Ivankov, Ana-Maria; Bobinec, Adriana; Kero, Mijana; Barišić, Ingeborg

    2017-06-14

    To determine the diagnostic yield and criteria that could help to classify and interpret the copy number variations (CNVs) detected by chromosomal microarray (CMA) technique in patients with congenital and developmental abnormalities including dysmorphia, developmental delay (DD) or intellectual disability (ID), autism spectrum disorders (ASD) and congenital anomalies (CA). CMA analysis was performed in 337 patients with DD/ID with or without dysmorphism, ASD, and/or CA. In 30 of 337 patients, chromosomal imbalances had previously been detected by classical cytogenetic and molecular cytogenetic methods. In 73 of 337 patients, clinically relevant variants were detected and better characterized. Most of them were >1 Mb. Variants of unknown clinical significance (VOUS) were discovered in 35 patients. The most common VOUS size category was <300 kb (40.5%). Deletions and de novo imbalances were more frequent in pathogenic CNV than in VOUS category. CMA had a high diagnostic yield of 43/307, excluding patients previously detected by other methods. CMA was valuable in establishing the diagnosis in a high proportion of patients. Criteria for classification and interpretation of CNVs include CNV size and type, mode of inheritance, and genotype-phenotype correlation. Agilent ISCA v2 Human Genome 8x60 K oligonucleotide microarray format proved to be reasonable resolution for clinical use, particularly in the regions that are recommended by the International Standard Cytogenomic Array (ISCA) Consortium and associated with well-established syndromes.

  11. CBH1 homologs and variant CBH1 cellulases

    DOEpatents

    Goedegebuur, Frits [Rozenlaan, NL; Gualfetti, Peter [San Francisco, CA; Mitchinson, Colin [Half Moon Bay, CA; Neefe, Paulien [Zoetermeer, NL

    2011-05-31

    Disclosed are a number of homologs and variants of Hypocrea jecorina Cel7A (formerly Trichoderma reesei cellobiohydrolase I or CBH1), nucleic acids encoding the same and methods for producing the same. The homologs and variant cellulases have the amino acid sequence of a glycosyl hydrolase of family 7A wherein one or more amino acid residues are substituted and/or deleted.

  12. Opposing brain differences in 16p11.2 deletion and duplication carriers.

    PubMed

    Qureshi, Abid Y; Mueller, Sophia; Snyder, Abraham Z; Mukherjee, Pratik; Berman, Jeffrey I; Roberts, Timothy P L; Nagarajan, Srikantan S; Spiro, John E; Chung, Wendy K; Sherr, Elliott H; Buckner, Randy L

    2014-08-20

    Deletions and duplications of the recurrent ~600 kb chromosomal BP4-BP5 region of 16p11.2 are associated with a broad variety of neurodevelopmental outcomes including autism spectrum disorder. A clue to the pathogenesis of the copy number variant (CNV)'s effect on the brain is that the deletion is associated with a head size increase, whereas the duplication is associated with a decrease. Here we analyzed brain structure in a clinically ascertained group of human deletion (N = 25) and duplication (N = 17) carriers from the Simons Variation in Individuals Project compared with age-matched controls (N = 29 and 33, respectively). Multiple brain measures showed increased size in deletion carriers and reduced size in duplication carriers. The effects spanned global measures of intracranial volume, brain size, compartmental measures of gray matter and white matter, subcortical structures, and the cerebellum. Quantitatively, the largest effect was on the thalamus, but the collective results suggest a pervasive rather than a selective effect on the brain. Detailed analysis of cortical gray matter revealed that cortical surface area displays a strong dose-dependent effect of CNV (deletion > control > duplication), whereas average cortical thickness is less affected. These results suggest that the CNV may exert its opposing influences through mechanisms that influence early stages of embryonic brain development. Copyright © 2014 the authors 0270-6474/14/3411199-13$15.00/0.

  13. Opposing Brain Differences in 16p11.2 Deletion and Duplication Carriers

    PubMed Central

    Qureshi, Abid Y.; Mueller, Sophia; Snyder, Abraham Z.; Mukherjee, Pratik; Berman, Jeffrey I.; Roberts, Timothy P.L.; Nagarajan, Srikantan S.; Spiro, John E.; Chung, Wendy K.; Sherr, Elliott H.

    2014-01-01

    Deletions and duplications of the recurrent ∼600 kb chromosomal BP4–BP5 region of 16p11.2 are associated with a broad variety of neurodevelopmental outcomes including autism spectrum disorder. A clue to the pathogenesis of the copy number variant (CNV)'s effect on the brain is that the deletion is associated with a head size increase, whereas the duplication is associated with a decrease. Here we analyzed brain structure in a clinically ascertained group of human deletion (N = 25) and duplication (N = 17) carriers from the Simons Variation in Individuals Project compared with age-matched controls (N = 29 and 33, respectively). Multiple brain measures showed increased size in deletion carriers and reduced size in duplication carriers. The effects spanned global measures of intracranial volume, brain size, compartmental measures of gray matter and white matter, subcortical structures, and the cerebellum. Quantitatively, the largest effect was on the thalamus, but the collective results suggest a pervasive rather than a selective effect on the brain. Detailed analysis of cortical gray matter revealed that cortical surface area displays a strong dose-dependent effect of CNV (deletion > control > duplication), whereas average cortical thickness is less affected. These results suggest that the CNV may exert its opposing influences through mechanisms that influence early stages of embryonic brain development. PMID:25143601

  14. CNVcaller: highly efficient and widely applicable software for detecting copy number variations in large populations

    PubMed Central

    Wang, Xihong; Zheng, Zhuqing; Cai, Yudong; Chen, Ting; Li, Chao; Fu, Weiwei

    2017-01-01

    Abstract Background The increasing amount of sequencing data available for a wide variety of species can be theoretically used for detecting copy number variations (CNVs) at the population level. However, the growing sample sizes and the divergent complexity of nonhuman genomes challenge the efficiency and robustness of current human-oriented CNV detection methods. Results Here, we present CNVcaller, a read-depth method for discovering CNVs in population sequencing data. The computational speed of CNVcaller was 1–2 orders of magnitude faster than CNVnator and Genome STRiP for complex genomes with thousands of unmapped scaffolds. CNV detection of 232 goats required only 1.4 days on a single compute node. Additionally, the Mendelian consistency of sheep trios indicated that CNVcaller mitigated the influence of high proportions of gaps and misassembled duplications in the nonhuman reference genome assembly. Furthermore, multiple evaluations using real sheep and human data indicated that CNVcaller achieved the best accuracy and sensitivity for detecting duplications. Conclusions The fast generalized detection algorithms included in CNVcaller overcome prior computational barriers for detecting CNVs in large-scale sequencing data with complex genomic structures. Therefore, CNVcaller promotes population genetic analyses of functional CNVs in more species. PMID:29220491

  15. Using whole-exome sequencing to identify variants inherited from mosaic parents

    PubMed Central

    Rios, Jonathan J; Delgado, Mauricio R

    2015-01-01

    Whole-exome sequencing (WES) has allowed the discovery of genes and variants causing rare human disease. This is often achieved by comparing nonsynonymous variants between unrelated patients, and particularly for sporadic or recessive disease, often identifies a single or few candidate genes for further consideration. However, despite the potential for this approach to elucidate the genetic cause of rare human disease, a majority of patients fail to realize a genetic diagnosis using standard exome analysis methods. Although genetic heterogeneity contributes to the difficulty of exome sequence analysis between patients, it remains plausible that rare human disease is not caused by de novo or recessive variants. Multiple human disorders have been described for which the variant was inherited from a phenotypically normal mosaic parent. Here we highlight the potential for exome sequencing to identify a reasonable number of candidate genes when dominant disease variants are inherited from a mosaic parent. We show the power of WES to identify a limited number of candidate genes using this disease model and how sequence coverage affects identification of mosaic variants by WES. We propose this analysis as an alternative to discover genetic causes of rare human disorders for which typical WES approaches fail to identify likely pathogenic variants. PMID:24986828

  16. affy2sv: an R package to pre-process Affymetrix CytoScan HD and 750K arrays for SNP, CNV, inversion and mosaicism calling.

    PubMed

    Hernandez-Ferrer, Carles; Quintela Garcia, Ines; Danielski, Katharina; Carracedo, Ángel; Pérez-Jurado, Luis A; González, Juan R

    2015-05-20

    The well-known Genome-Wide Association Studies (GWAS) had led to many scientific discoveries using SNP data. Even so, they were not able to explain the full heritability of complex diseases. Now, other structural variants like copy number variants or DNA inversions, either germ-line or in mosaicism events, are being studies. We present the R package affy2sv to pre-process Affymetrix CytoScan HD/750k array (also for Genome-Wide SNP 5.0/6.0 and Axiom) in structural variant studies. We illustrate the capabilities of affy2sv using two different complete pipelines on real data. The first one performing a GWAS and a mosaic alterations detection study, and the other detecting CNVs and performing an inversion calling. Both examples presented in the article show up how affy2sv can be used as part of more complex pipelines aimed to analyze Affymetrix SNP arrays data in genetic association studies, where different types of structural variants are considered.

  17. A common copy number variation polymorphism in the CNTNAP2 gene: sexual dimorphism in association with healthy aging and disease.

    PubMed

    Iakoubov, Leonid; Mossakowska, Malgorzata; Szwed, Malgorzata; Puzianowska-Kuznicka, Monika

    2015-01-01

    New therapeutic targets are needed to fight aging-related diseases and increase life span. A new female-specific association with diseases and limited survival past 80 years was recently reported for a copy number variation (CNV) in the CNTNAP4 gene from the neurexin superfamily. We asked whether there are CNVs that are associated with aging phenotypes within other genes from the neurexin superfamily and whether this association is sex specific. Select CNV polymorphisms were genotyped with proprietary TaqMan qPCR assays. A case/control study, in which a group of 81- to 90-year-old community-dwelling Caucasians with no chronic diseases (case) was compared to a similar control group of 65- to 75-year-olds, revealed a negative association with healthy aging for the ins allele of common esv11910 CNV in the CNTNAP2 gene (n = 388; OR = 0.29, 95% CI: 0.14-0.59, p = 0.0004 for males, and OR = 0.82, 95% CI: 0.42-1.57, p = 0.625 for females). This male-specific association was validated in a study of an independent group of 76- to 80-year-olds. To look for a corresponding positive association of the allele with aging-related diseases, two case subgroups of 81- to 90-year-olds, one composed of individuals with cognitive impairment and the other with various diseases not directly related to the nervous system, such as cardiovascular diseases, etc., were compared to a healthy control subgroup of the same age. A positive male-specific association was found for both cases (OR = 2.75, p = 0.008 for association with cognitive impairment, and OR = 3.18, p = 0.002 for other diseases combined). A new male-specific association with aging is reported for a CNV in the CNTNAP2 gene. The polymorphism might be useful for diagnosing individual genetic predispositions to healthy aging versus aging complicated by chronic diseases. © 2014 S. Karger AG, Basel.

  18. Analysis of the Saccharomyces cerevisiae pan-genome reveals a pool of copy number variants distributed in diverse yeast strains from differing industrial environments.

    PubMed

    Dunn, Barbara; Richter, Chandra; Kvitek, Daniel J; Pugh, Tom; Sherlock, Gavin

    2012-05-01

    Although the budding yeast Saccharomyces cerevisiae is arguably one of the most well-studied organisms on earth, the genome-wide variation within this species--i.e., its "pan-genome"--has been less explored. We created a multispecies microarray platform containing probes covering the genomes of several Saccharomyces species: S. cerevisiae, including regions not found in the standard laboratory S288c strain, as well as the mitochondrial and 2-μm circle genomes-plus S. paradoxus, S. mikatae, S. kudriavzevii, S. uvarum, S. kluyveri, and S. castellii. We performed array-Comparative Genomic Hybridization (aCGH) on 83 different S. cerevisiae strains collected across a wide range of habitats; of these, 69 were commercial wine strains, while the remaining 14 were from a diverse set of other industrial and natural environments. We observed interspecific hybridization events, introgression events, and pervasive copy number variation (CNV) in all but a few of the strains. These CNVs were distributed throughout the strains such that they did not produce any clear phylogeny, suggesting extensive mating in both industrial and wild strains. To validate our results and to determine whether apparently similar introgressions and CNVs were identical by descent or recurrent, we also performed whole-genome sequencing on nine of these strains. These data may help pinpoint genomic regions involved in adaptation to different industrial milieus, as well as shed light on the course of domestication of S. cerevisiae.

  19. Analysis of the Saccharomyces cerevisiae pan-genome reveals a pool of copy number variants distributed in diverse yeast strains from differing industrial environments

    PubMed Central

    Dunn, Barbara; Richter, Chandra; Kvitek, Daniel J.; Pugh, Tom; Sherlock, Gavin

    2012-01-01

    Although the budding yeast Saccharomyces cerevisiae is arguably one of the most well-studied organisms on earth, the genome-wide variation within this species—i.e., its “pan-genome”—has been less explored. We created a multispecies microarray platform containing probes covering the genomes of several Saccharomyces species: S. cerevisiae, including regions not found in the standard laboratory S288c strain, as well as the mitochondrial and 2-μm circle genomes–plus S. paradoxus, S. mikatae, S. kudriavzevii, S. uvarum, S. kluyveri, and S. castellii. We performed array-Comparative Genomic Hybridization (aCGH) on 83 different S. cerevisiae strains collected across a wide range of habitats; of these, 69 were commercial wine strains, while the remaining 14 were from a diverse set of other industrial and natural environments. We observed interspecific hybridization events, introgression events, and pervasive copy number variation (CNV) in all but a few of the strains. These CNVs were distributed throughout the strains such that they did not produce any clear phylogeny, suggesting extensive mating in both industrial and wild strains. To validate our results and to determine whether apparently similar introgressions and CNVs were identical by descent or recurrent, we also performed whole-genome sequencing on nine of these strains. These data may help pinpoint genomic regions involved in adaptation to different industrial milieus, as well as shed light on the course of domestication of S. cerevisiae. PMID:22369888

  20. Method of generating ploynucleotides encoding enhanced folding variants

    DOEpatents

    Bradbury, Andrew M.; Kiss, Csaba; Waldo, Geoffrey S.

    2017-05-02

    The invention provides directed evolution methods for improving the folding, solubility and stability (including thermostability) characteristics of polypeptides. In one aspect, the invention provides a method for generating folding and stability-enhanced variants of proteins, including but not limited to fluorescent proteins, chromophoric proteins and enzymes. In another aspect, the invention provides methods for generating thermostable variants of a target protein or polypeptide via an internal destabilization baiting strategy. Internally destabilization a protein of interest is achieved by inserting a heterologous, folding-destabilizing sequence (folding interference domain) within DNA encoding the protein of interest, evolving the protein sequences adjacent to the heterologous insertion to overcome the destabilization (using any number of mutagenesis methods), thereby creating a library of variants. The variants in the library are expressed, and those with enhanced folding characteristics selected.

  1. Rare-Variant Association Analysis: Study Designs and Statistical Tests

    PubMed Central

    Lee, Seunggeung; Abecasis, Gonçalo R.; Boehnke, Michael; Lin, Xihong

    2014-01-01

    Despite the extensive discovery of trait- and disease-associated common variants, much of the genetic contribution to complex traits remains unexplained. Rare variants can explain additional disease risk or trait variability. An increasing number of studies are underway to identify trait- and disease-associated rare variants. In this review, we provide an overview of statistical issues in rare-variant association studies with a focus on study designs and statistical tests. We present the design and analysis pipeline of rare-variant studies and review cost-effective sequencing designs and genotyping platforms. We compare various gene- or region-based association tests, including burden tests, variance-component tests, and combined omnibus tests, in terms of their assumptions and performance. Also discussed are the related topics of meta-analysis, population-stratification adjustment, genotype imputation, follow-up studies, and heritability due to rare variants. We provide guidelines for analysis and discuss some of the challenges inherent in these studies and future research directions. PMID:24995866

  2. DNA methylation and copy number variation analyses of human embryonic stem cell-derived neuroprogenitors after low-dose decabromodiphenyl ether and/or bisphenol A exposure.

    PubMed

    Du, L; Sun, W; Li, X M; Li, X Y; Liu, W; Chen, D

    2018-05-01

    The polybrominated diphenyl ether flame retardants decabromodiphenyl ether (BDE-209) and bisphenol A (BPA) are environmental contaminants that can cross the placenta and exert toxicity in the developing fetal nervous system. Copy number variants (CNVs) play a role in a number of genetic disorders and may be implicated in BDE-209/BPA teratogenicity. In this study, we found that BDE-209 and/or BPA exposure decreased neural differentiation efficiency of human embryonic stem cells (hESCs), although there was a >90% induction of neuronal progenitor cells (NPCs) from exposed hESCs. However, the mean of CNV numbers in the NPCs with BDE-209 + BPA treatment was significantly higher compared to the other groups, whereas DNA methylation was lower and DNA methyltransferase(DNMT1 and DNMT3A) expression were significantly decreased in all of the BDE-209 and/or BPA treatment groups compared with the control groups. The number of CNVs in chromosomes 3, 4, 11, 22, and X in NPCs with BDE-209 and/or BPA exposure was higher compared to the control group. In addition, CNVs in chromosomes 7, 8, 14, and 16 were stable in hESCs and hESCs-derived NPCs irrespective of BDE-209/BPA exposure, and CNVs in chromosomes 20 q11.21 and 16 p13.11 might be induced by neural differentiation. Thus, BDE-209/BPA exposure emerges as a potential source of CNVs distinct from neural differentiation by itself. BDE-209 and/or BPA exposure may cause genomic instability in cultured stem cells via reduced activity of DNA methyltransferase, suggesting a new mechanism of human embryonic neurodevelopmental toxicity caused by this class of environmental toxins.

  3. Korean Variant Archive (KOVA): a reference database of genetic variations in the Korean population.

    PubMed

    Lee, Sangmoon; Seo, Jihae; Park, Jinman; Nam, Jae-Yong; Choi, Ahyoung; Ignatius, Jason S; Bjornson, Robert D; Chae, Jong-Hee; Jang, In-Jin; Lee, Sanghyuk; Park, Woong-Yang; Baek, Daehyun; Choi, Murim

    2017-06-27

    Despite efforts to interrogate human genome variation through large-scale databases, systematic preference toward populations of Caucasian descendants has resulted in unintended reduction of power in studying non-Caucasians. Here we report a compilation of coding variants from 1,055 healthy Korean individuals (KOVA; Korean Variant Archive). The samples were sequenced to a mean depth of 75x, yielding 101 singleton variants per individual. Population genetics analysis demonstrates that the Korean population is a distinct ethnic group comparable to other discrete ethnic groups in Africa and Europe, providing a rationale for such independent genomic datasets. Indeed, KOVA conferred 22.8% increased variant filtering power in addition to Exome Aggregation Consortium (ExAC) when used on Korean exomes. Functional assessment of nonsynonymous variant supported the presence of purifying selection in Koreans. Analysis of copy number variants detected 5.2 deletions and 10.3 amplifications per individual with an increased fraction of novel variants among smaller and rarer copy number variable segments. We also report a list of germline variants that are associated with increased tumor susceptibility. This catalog can function as a critical addition to the pre-existing variant databases in pursuing genetic studies of Korean individuals.

  4. The clinical significance of small copy number variants in neurodevelopmental disorders.

    PubMed

    Asadollahi, Reza; Oneda, Beatrice; Joset, Pascal; Azzarello-Burri, Silvia; Bartholdi, Deborah; Steindl, Katharina; Vincent, Marie; Cobilanschi, Joana; Sticht, Heinrich; Baldinger, Rosa; Reissmann, Regina; Sudholt, Irene; Thiel, Christian T; Ekici, Arif B; Reis, André; Bijlsma, Emilia K; Andrieux, Joris; Dieux, Anne; FitzPatrick, David; Ritter, Susanne; Baumer, Alessandra; Latal, Beatrice; Plecko, Barbara; Jenni, Oskar G; Rauch, Anita

    2014-10-01

    Despite abundant evidence for pathogenicity of large copy number variants (CNVs) in neurodevelopmental disorders (NDDs), the individual significance of genome-wide rare CNVs <500 kb has not been well elucidated in a clinical context. By high-resolution chromosomal microarray analysis, we investigated the clinical significance of all rare non-polymorphic exonic CNVs sizing 1-500 kb in a cohort of 714 patients with undiagnosed NDDs. We detected 96 rare CNVs <500 kb affecting coding regions, of which 58 (60.4%) were confirmed. 6 of 14 confirmed de novo, one of two homozygous and four heterozygous inherited CNVs affected the known microdeletion regions 17q21.31, 16p11.2 and 2p21 or OMIM morbid genes (CASK, CREBBP, PAFAH1B1, SATB2; AUTS2, NRXN3, GRM8). Two further de novo CNVs affecting single genes (MED13L, CTNND2) were instrumental in delineating novel recurrent conditions. For the first time, we here report exonic deletions of CTNND2 causing low normal IQ with learning difficulties with or without autism spectrum disorder. Additionally, we discovered a homozygous out-of-frame deletion of ACOT7 associated with features comparable to the published mouse model. In total, 24.1% of the confirmed small CNVs were categorised as pathogenic or likely pathogenic (median size 130 kb), 17.2% as likely benign, 3.4% represented incidental findings and 55.2% remained unclear. These results verify the diagnostic relevance of genome-wide rare CNVs <500 kb, which were found pathogenic in ∼2% (14/714) of cases (1.1% de novo, 0.3% homozygous, 0.6% inherited) and highlight their inherent potential for discovery of new conditions. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.

  5. Chromosome copy number variation in telomerized human bone marrow stromal cells; insights for monitoring safe ex-vivo expansion of adult stem cells.

    PubMed

    Burns, Jorge S; Harkness, Linda; Aldahmash, Abdullah; Gautier, Laurent; Kassem, Moustapha

    2017-12-01

    Adult human bone marrow stromal cells (hBMSC) cultured for cell therapy require evaluation of potency and stability for safe use. Chromosomal aberrations upsetting genomic integrity in such cells have been contrastingly described as "Limited" or "Significant". Previously reported stepwise acquisition of a spontaneous neoplastic phenotype during three-year continuous culture of telomerized cells (hBMSC-TERT20) didn't alter a diploid karyotype measured by spectral karyotype analysis (SKY). Such screening may not adequately monitor abnormal and potentially tumorigenic hBMSC in clinical scenarios. We here used array comparative genomic hybridization (aCGH) to more stringently compare non-tumorigenic parental hBMSC-TERT strains with their tumorigenic subcloned populations. Confirmation of a known chromosome 9p21 microdeletion at locus CDKN2A/B, showed it also impinged upon the adjacent MTAP gene. Compared to reference diploid human fibroblast genomic DNA, the non-tumorigenic hBMSC-TERT4 cells had a copy number variation (CNV) in at least 14 independent loci. The pre-tumorigenic hBMSC-TERT20 cell strain had further CNV including 1q44 gain enhancing SMYD3 expression and 11q13.1 loss downregulating MUS81 expression. Bioinformatic analysis of gene products reflecting 11p15.5 CNV gain in tumorigenic hBMSC-TERT20 cells highlighted networks implicated in tumorigenic progression involving cell cycle control and mis-match repair. We provide novel biomarkers for prospective risk assessment of expanded stem cell cultures. Copyright © 2017. Published by Elsevier B.V.

  6. Variability extraction and modeling for product variants.

    PubMed

    Linsbauer, Lukas; Lopez-Herrejon, Roberto Erick; Egyed, Alexander

    2017-01-01

    Fast-changing hardware and software technologies in addition to larger and more specialized customer bases demand software tailored to meet very diverse requirements. Software development approaches that aim at capturing this diversity on a single consolidated platform often require large upfront investments, e.g., time or budget. Alternatively, companies resort to developing one variant of a software product at a time by reusing as much as possible from already-existing product variants. However, identifying and extracting the parts to reuse is an error-prone and inefficient task compounded by the typically large number of product variants. Hence, more disciplined and systematic approaches are needed to cope with the complexity of developing and maintaining sets of product variants. Such approaches require detailed information about the product variants, the features they provide and their relations. In this paper, we present an approach to extract such variability information from product variants. It identifies traces from features and feature interactions to their implementation artifacts, and computes their dependencies. This work can be useful in many scenarios ranging from ad hoc development approaches such as clone-and-own to systematic reuse approaches such as software product lines. We applied our variability extraction approach to six case studies and provide a detailed evaluation. The results show that the extracted variability information is consistent with the variability in our six case study systems given by their variability models and available product variants.

  7. Increased de novo copy number variants in the offspring of older males

    PubMed Central

    Flatscher-Bader, T; Foldi, C J; Chong, S; Whitelaw, E; Moser, R J; Burne, T H J; Eyles, D W; McGrath, J J

    2011-01-01

    The offspring of older fathers have an increased risk of neurodevelopmental disorders, such as schizophrenia and autism. In light of the evidence implicating copy number variants (CNVs) with schizophrenia and autism, we used a mouse model to explore the hypothesis that the offspring of older males have an increased risk of de novo CNVs. C57BL/6J sires that were 3- and 12–16-months old were mated with 3-month-old dams to create control offspring and offspring of old sires, respectively. Applying genome-wide microarray screening technology, 7 distinct CNVs were identified in a set of 12 offspring and their parents. Competitive quantitative PCR confirmed these CNVs in the original set and also established their frequency in an independent set of 77 offspring and their parents. On the basis of the combined samples, six de novo CNVs were detected in the offspring of older sires, whereas none were detected in the control group. Two of the CNVs were associated with behavioral and/or neuroanatomical phenotypic features. One of the de novo CNVs involved Auts2 (autism susceptibility candidate 2), and other CNVs included genes linked to schizophrenia, autism and brain development. This is the first experimental demonstration that the offspring of older males have an increased risk of de novo CNVs. Our results support the hypothesis that the offspring of older fathers have an increased risk of neurodevelopmental disorders such as schizophrenia and autism by generation of de novo CNVs in the male germline. PMID:22832608

  8. BlackOPs: increasing confidence in variant detection through mappability filtering.

    PubMed

    Cabanski, Christopher R; Wilkerson, Matthew D; Soloway, Matthew; Parker, Joel S; Liu, Jinze; Prins, Jan F; Marron, J S; Perou, Charles M; Hayes, D Neil

    2013-10-01

    Identifying variants using high-throughput sequencing data is currently a challenge because true biological variants can be indistinguishable from technical artifacts. One source of technical artifact results from incorrectly aligning experimentally observed sequences to their true genomic origin ('mismapping') and inferring differences in mismapped sequences to be true variants. We developed BlackOPs, an open-source tool that simulates experimental RNA-seq and DNA whole exome sequences derived from the reference genome, aligns these sequences by custom parameters, detects variants and outputs a blacklist of positions and alleles caused by mismapping. Blacklists contain thousands of artifact variants that are indistinguishable from true variants and, for a given sample, are expected to be almost completely false positives. We show that these blacklist positions are specific to the alignment algorithm and read length used, and BlackOPs allows users to generate a blacklist specific to their experimental setup. We queried the dbSNP and COSMIC variant databases and found numerous variants indistinguishable from mapping errors. We demonstrate how filtering against blacklist positions reduces the number of potential false variants using an RNA-seq glioblastoma cell line data set. In summary, accounting for mapping-caused variants tuned to experimental setups reduces false positives and, therefore, improves genome characterization by high-throughput sequencing.

  9. Histone H3 Variants in Trichomonas vaginalis

    PubMed Central

    Zubáčová, Zuzana; Hostomská, Jitka

    2012-01-01

    The parabasalid protist Trichomonas vaginalis is a widespread parasite that affects humans, frequently causing vaginitis in infected women. Trichomonad mitosis is marked by the persistence of the nuclear membrane and the presence of an asymmetric extranuclear spindle with no obvious direct connection to the chromosomes. No centromeric markers have been described in T. vaginalis, which has prevented a detailed analysis of mitotic events in this organism. In other eukaryotes, nucleosomes of centromeric chromatin contain the histone H3 variant CenH3. The principal aim of this work was to identify a CenH3 homolog in T. vaginalis. We performed a screen of the T. vaginalis genome to retrieve sequences of canonical and variant H3 histones. Three variant histone H3 proteins were identified, and the subcellular localization of their epitope-tagged variants was determined. The localization of the variant TVAG_185390 could not be distinguished from that of the canonical H3 histone. The sequence of the variant TVAG_087830 closely resembled that of histone H3. The tagged protein colocalized with sites of active transcription, indicating that the variant TVAG_087830 represented H3.3 in T. vaginalis. The third H3 variant (TVAG_224460) was localized to 6 or 12 distinct spots at the periphery of the nucleus, corresponding to the number of chromosomes in G1 phase and G2 phase, respectively. We propose that this variant represents the centromeric marker CenH3 and thus can be employed as a tool to study mitosis in T. vaginalis. Furthermore, we suggest that the peripheral distribution of CenH3 within the nucleus results from the association of centromeres with the nuclear envelope throughout the cell cycle. PMID:22408228

  10. Poisson Approximation-Based Score Test for Detecting Association of Rare Variants.

    PubMed

    Fang, Hongyan; Zhang, Hong; Yang, Yaning

    2016-07-01

    Genome-wide association study (GWAS) has achieved great success in identifying genetic variants, but the nature of GWAS has determined its inherent limitations. Under the common disease rare variants (CDRV) hypothesis, the traditional association analysis methods commonly used in GWAS for common variants do not have enough power for detecting rare variants with a limited sample size. As a solution to this problem, pooling rare variants by their functions provides an efficient way for identifying susceptible genes. Rare variant typically have low frequencies of minor alleles, and the distribution of the total number of minor alleles of the rare variants can be approximated by a Poisson distribution. Based on this fact, we propose a new test method, the Poisson Approximation-based Score Test (PAST), for association analysis of rare variants. Two testing methods, namely, ePAST and mPAST, are proposed based on different strategies of pooling rare variants. Simulation results and application to the CRESCENDO cohort data show that our methods are more powerful than the existing methods. © 2016 John Wiley & Sons Ltd/University College London.

  11. Copy number variation analysis implicates the cell polarity gene glypican 5 as a human spina bifida candidate gene

    PubMed Central

    Bassuk, Alexander G.; Muthuswamy, Lakshmi B.; Boland, Riley; Smith, Tiffany L.; Hulstrand, Alissa M.; Northrup, Hope; Hakeman, Matthew; Dierdorff, Jason M.; Yung, Christina K.; Long, Abby; Brouillette, Rachel B.; Au, Kit Sing; Gurnett, Christina; Houston, Douglas W.; Cornell, Robert A.; Manak, J. Robert

    2013-01-01

    Neural tube defects (NTDs) are common birth defects of complex etiology. Family and population-based studies have confirmed a genetic component to NTDs. However, despite more than three decades of research, the genes involved in human NTDs remain largely unknown. We tested the hypothesis that rare copy number variants (CNVs), especially de novo germline CNVs, are a significant risk factor for NTDs. We used array-based comparative genomic hybridization (aCGH) to identify rare CNVs in 128 Caucasian and 61 Hispanic patients with non-syndromic lumbar-sacral myelomeningocele. We also performed aCGH analysis on the parents of affected individuals with rare CNVs where parental DNA was available (42 sets). Among the eight de novo CNVs that we identified, three generated copy number changes of entire genes. One large heterozygous deletion removed 27 genes, including PAX3, a known spina bifida-associated gene. A second CNV altered genes (PGPD8, ZC3H6) for which little is known regarding function or expression. A third heterozygous deletion removed GPC5 and part of GPC6, genes encoding glypicans. Glypicans are proteoglycans that modulate the activity of morphogens such as Sonic Hedgehog (SHH) and bone morphogenetic proteins (BMPs), both of which have been implicated in NTDs. Additionally, glypicans function in the planar cell polarity (PCP) pathway, and several PCP genes have been associated with NTDs. Here, we show that GPC5 orthologs are expressed in the neural tube, and that inhibiting their expression in frog and fish embryos results in NTDs. These results implicate GPC5 as a gene required for normal neural tube development. PMID:23223018

  12. Inferring causal genomic alterations in breast cancer using gene expression data

    PubMed Central

    2011-01-01

    Background One of the primary objectives in cancer research is to identify causal genomic alterations, such as somatic copy number variation (CNV) and somatic mutations, during tumor development. Many valuable studies lack genomic data to detect CNV; therefore, methods that are able to infer CNVs from gene expression data would help maximize the value of these studies. Results We developed a framework for identifying recurrent regions of CNV and distinguishing the cancer driver genes from the passenger genes in the regions. By inferring CNV regions across many datasets we were able to identify 109 recurrent amplified/deleted CNV regions. Many of these regions are enriched for genes involved in many important processes associated with tumorigenesis and cancer progression. Genes in these recurrent CNV regions were then examined in the context of gene regulatory networks to prioritize putative cancer driver genes. The cancer driver genes uncovered by the framework include not only well-known oncogenes but also a number of novel cancer susceptibility genes validated via siRNA experiments. Conclusions To our knowledge, this is the first effort to systematically identify and validate drivers for expression based CNV regions in breast cancer. The framework where the wavelet analysis of copy number alteration based on expression coupled with the gene regulatory network analysis, provides a blueprint for leveraging genomic data to identify key regulatory components and gene targets. This integrative approach can be applied to many other large-scale gene expression studies and other novel types of cancer data such as next-generation sequencing based expression (RNA-Seq) as well as CNV data. PMID:21806811

  13. SCN5A (NaV1.5) Variant Functional Perturbation and Clinical Presentation: Variants of a Certain Significance.

    PubMed

    Kroncke, Brett M; Glazer, Andrew M; Smith, Derek K; Blume, Jeffrey D; Roden, Dan M

    2018-05-01

    Accurately predicting the impact of rare nonsynonymous variants on disease risk is an important goal in precision medicine. Variants in the cardiac sodium channel SCN5A (protein Na V 1.5; voltage-dependent cardiac Na+ channel) are associated with multiple arrhythmia disorders, including Brugada syndrome and long QT syndrome. Rare SCN5A variants also occur in ≈1% of unaffected individuals. We hypothesized that in vitro electrophysiological functional parameters explain a statistically significant portion of the variability in disease penetrance. From a comprehensive literature review, we quantified the number of carriers presenting with and without disease for 1712 reported SCN5A variants. For 356 variants, data were also available for 5 Na V 1.5 electrophysiological parameters: peak current, late/persistent current, steady-state V1/2 of activation and inactivation, and recovery from inactivation. We found that peak and late current significantly associate with Brugada syndrome ( P <0.001; ρ=-0.44; Spearman rank test) and long QT syndrome disease penetrance ( P <0.001; ρ=0.37). Steady-state V1/2 activation and recovery from inactivation associate significantly with Brugada syndrome and long QT syndrome penetrance, respectively. Continuous estimates of disease penetrance align with the current American College of Medical Genetics classification paradigm. Na V 1.5 in vitro electrophysiological parameters are correlated with Brugada syndrome and long QT syndrome disease risk. Our data emphasize the value of in vitro electrophysiological characterization and incorporating counts of affected and unaffected carriers to aid variant classification. This quantitative analysis of the electrophysiological literature should aid the interpretation of Na V 1.5 variant electrophysiological abnormalities and help improve Na V 1.5 variant classification. © 2018 American Heart Association, Inc.

  14. Identifying Causal Variants at Loci with Multiple Signals of Association

    PubMed Central

    Hormozdiari, Farhad; Kostem, Emrah; Kang, Eun Yong; Pasaniuc, Bogdan; Eskin, Eleazar

    2014-01-01

    Although genome-wide association studies have successfully identified thousands of risk loci for complex traits, only a handful of the biologically causal variants, responsible for association at these loci, have been successfully identified. Current statistical methods for identifying causal variants at risk loci either use the strength of the association signal in an iterative conditioning framework or estimate probabilities for variants to be causal. A main drawback of existing methods is that they rely on the simplifying assumption of a single causal variant at each risk locus, which is typically invalid at many risk loci. In this work, we propose a new statistical framework that allows for the possibility of an arbitrary number of causal variants when estimating the posterior probability of a variant being causal. A direct benefit of our approach is that we predict a set of variants for each locus that under reasonable assumptions will contain all of the true causal variants with a high confidence level (e.g., 95%) even when the locus contains multiple causal variants. We use simulations to show that our approach provides 20–50% improvement in our ability to identify the causal variants compared to the existing methods at loci harboring multiple causal variants. We validate our approach using empirical data from an expression QTL study of CHI3L2 to identify new causal variants that affect gene expression at this locus. CAVIAR is publicly available online at http://genetics.cs.ucla.edu/caviar/. PMID:25104515

  15. Burden of Common Complex Disease Variants in the Exomes of Two Healthy Centenarian Brothers.

    PubMed

    Tindale, Lauren C; Zeng, Andy; Bretherick, Karla L; Leach, Stephen; Thiessen, Nina; Brooks-Wilson, Angela R

    2015-01-01

    It is not understood whether long-term good health is promoted by the absence of disease risk variants, the presence of protective variants, or both. We characterized the exomes of two exceptionally healthy centenarian brothers aged 106 and 109 years who had never been diagnosed with cancer, cardiovascular disease, diabetes, Alzheimer's disease, or major pulmonary disease. The aim of this study was to gain insight into whether exceptional health and longevity are a result of carrying fewer disease-associated variants than typical individuals. We compared the number of disease-associated alleles, and the proportion of alleles predicted to be functionally damaging, between the centenarian brothers and published population data. Mitochondrial sequence reads were extracted from the exome data in order to analyze mitochondrial variants. The brothers carry a similar number of common disease-associated variants and predicted damaging variants compared to reference groups. They did not carry any high-penetrance clinically actionable variants. They carry mitochondrial haplogroup T, and one brother has a single heteroplasmic variant. Although our small sample size does not allow for definitive conclusions, a healthy aging and longevity phenotype is not necessarily due to a decreased burden of common disease-associated variants. Instead, it may be rare 'positive' variants that play a role in this desirable phenotype. © 2015 S. Karger AG, Basel.

  16. Cellulase variants

    DOEpatents

    Blazej, Robert; Toriello, Nicholas; Emrich, Charles; Cohen, Richard N.; Koppel, Nitzan

    2015-07-14

    This invention provides novel variant cellulolytic enzymes having improved activity and/or stability. In certain embodiments the variant cellulotyic enzymes comprise a glycoside hydrolase with or comprising a substitution at one or more positions corresponding to one or more of residues F64, A226, and/or E246 in Thermobifida fusca Cel9A enzyme. In certain embodiments the glycoside hydrolase is a variant of a family 9 glycoside hydrolase. In certain embodiments the glycoside hydrolase is a variant of a theme B family 9 glycoside hydrolase.

  17. Identifying genetic variants that affect viability in large cohorts

    PubMed Central

    Berisa, Tomaz; Day, Felix R.; Perry, John R. B.

    2017-01-01

    A number of open questions in human evolutionary genetics would become tractable if we were able to directly measure evolutionary fitness. As a step towards this goal, we developed a method to examine whether individual genetic variants, or sets of genetic variants, currently influence viability. The approach consists in testing whether the frequency of an allele varies across ages, accounting for variation in ancestry. We applied it to the Genetic Epidemiology Research on Adult Health and Aging (GERA) cohort and to the parents of participants in the UK Biobank. Across the genome, we found only a few common variants with large effects on age-specific mortality: tagging the APOE ε4 allele and near CHRNA3. These results suggest that when large, even late-onset effects are kept at low frequency by purifying selection. Testing viability effects of sets of genetic variants that jointly influence 1 of 42 traits, we detected a number of strong signals. In participants of the UK Biobank of British ancestry, we found that variants that delay puberty timing are associated with a longer parental life span (P~6.2 × 10−6 for fathers and P~2.0 × 10−3 for mothers), consistent with epidemiological studies. Similarly, variants associated with later age at first birth are associated with a longer maternal life span (P~1.4 × 10−3). Signals are also observed for variants influencing cholesterol levels, risk of coronary artery disease (CAD), body mass index, as well as risk of asthma. These signals exhibit consistent effects in the GERA cohort and among participants of the UK Biobank of non-British ancestry. We also found marked differences between males and females, most notably at the CHRNA3 locus, and variants associated with risk of CAD and cholesterol levels. Beyond our findings, the analysis serves as a proof of principle for how upcoming biomedical data sets can be used to learn about selection effects in contemporary humans. PMID:28873088

  18. Using high-resolution variant frequencies to empower clinical genome interpretation.

    PubMed

    Whiffin, Nicola; Minikel, Eric; Walsh, Roddy; O'Donnell-Luria, Anne H; Karczewski, Konrad; Ing, Alexander Y; Barton, Paul J R; Funke, Birgit; Cook, Stuart A; MacArthur, Daniel; Ware, James S

    2017-10-01

    PurposeWhole-exome and whole-genome sequencing have transformed the discovery of genetic variants that cause human Mendelian disease, but discriminating pathogenic from benign variants remains a daunting challenge. Rarity is recognized as a necessary, although not sufficient, criterion for pathogenicity, but frequency cutoffs used in Mendelian analysis are often arbitrary and overly lenient. Recent very large reference datasets, such as the Exome Aggregation Consortium (ExAC), provide an unprecedented opportunity to obtain robust frequency estimates even for very rare variants.MethodsWe present a statistical framework for the frequency-based filtering of candidate disease-causing variants, accounting for disease prevalence, genetic and allelic heterogeneity, inheritance mode, penetrance, and sampling variance in reference datasets.ResultsUsing the example of cardiomyopathy, we show that our approach reduces by two-thirds the number of candidate variants under consideration in the average exome, without removing true pathogenic variants (false-positive rate<0.001).ConclusionWe outline a statistically robust framework for assessing whether a variant is "too common" to be causative for a Mendelian disorder of interest. We present precomputed allele frequency cutoffs for all variants in the ExAC dataset.

  19. Analysis of copy number variation in Alzheimer's disease in a cohort of clinically characterized and neuropathologically verified individuals.

    PubMed

    Swaminathan, Shanker; Huentelman, Matthew J; Corneveaux, Jason J; Myers, Amanda J; Faber, Kelley M; Foroud, Tatiana; Mayeux, Richard; Shen, Li; Kim, Sungeun; Turk, Mari; Hardy, John; Reiman, Eric M; Saykin, Andrew J

    2012-01-01

    Copy number variations (CNVs) are genomic regions that have added (duplications) or deleted (deletions) genetic material. They may overlap genes affecting their function and have been shown to be associated with disease. We previously investigated the role of CNVs in late-onset Alzheimer's disease (AD) and mild cognitive impairment using Alzheimer's Disease Neuroimaging Initiative (ADNI) and National Institute of Aging-Late Onset AD/National Cell Repository for AD (NIA-LOAD/NCRAD) Family Study participants, and identified a number of genes overlapped by CNV calls. To confirm the findings and identify other potential candidate regions, we analyzed array data from a unique cohort of 1617 Caucasian participants (1022 AD cases and 595 controls) who were clinically characterized and whose diagnosis was neuropathologically verified. All DNA samples were extracted from brain tissue. CNV calls were generated and subjected to quality control (QC). 728 cases and 438 controls who passed all QC measures were included in case/control association analyses including candidate gene and genome-wide approaches. Rates of deletions and duplications did not significantly differ between cases and controls. Case-control association identified a number of previously reported regions (CHRFAM7A, RELN and DOPEY2) as well as a new gene (HLA-DRA). Meta-analysis of CHRFAM7A indicated a significant association of the gene with AD and/or MCI risk (P = 0.006, odds ratio = 3.986 (95% confidence interval 1.490-10.667)). A novel APP gene duplication was observed in one case sample. Further investigation of the identified genes in independent and larger samples is warranted.

  20. Linkage disequilibrium among commonly genotyped SNP and variants detected from bull sequence

    USDA-ARS?s Scientific Manuscript database

    Genomic prediction utilizing causal variants could increase selection accuracy above that achieved with SNP genotyped by commercial assays. A number of variants detected from sequencing influential sires are likely to be causal, but noticable improvements in prediction accuracy using imputed sequen...

  1. CNVcaller: highly efficient and widely applicable software for detecting copy number variations in large populations.

    PubMed

    Wang, Xihong; Zheng, Zhuqing; Cai, Yudong; Chen, Ting; Li, Chao; Fu, Weiwei; Jiang, Yu

    2017-12-01

    The increasing amount of sequencing data available for a wide variety of species can be theoretically used for detecting copy number variations (CNVs) at the population level. However, the growing sample sizes and the divergent complexity of nonhuman genomes challenge the efficiency and robustness of current human-oriented CNV detection methods. Here, we present CNVcaller, a read-depth method for discovering CNVs in population sequencing data. The computational speed of CNVcaller was 1-2 orders of magnitude faster than CNVnator and Genome STRiP for complex genomes with thousands of unmapped scaffolds. CNV detection of 232 goats required only 1.4 days on a single compute node. Additionally, the Mendelian consistency of sheep trios indicated that CNVcaller mitigated the influence of high proportions of gaps and misassembled duplications in the nonhuman reference genome assembly. Furthermore, multiple evaluations using real sheep and human data indicated that CNVcaller achieved the best accuracy and sensitivity for detecting duplications. The fast generalized detection algorithms included in CNVcaller overcome prior computational barriers for detecting CNVs in large-scale sequencing data with complex genomic structures. Therefore, CNVcaller promotes population genetic analyses of functional CNVs in more species. © The Authors 2017. Published by Oxford University Press.

  2. Copy number variants are frequent in genetic generalized epilepsy with intellectual disability

    PubMed Central

    Mullen, Saul A.; Carvill, Gemma L.; Bellows, Susannah; Bayly, Marta A.; Berkovic, Samuel F.; Dibbens, Leanne M.

    2013-01-01

    Objective: We examined whether copy number variants (CNVs) were more common in those with a combination of intellectual disability (ID) and genetic generalized epilepsy (GGE) than in those with either phenotype alone via a case-control study. Methods: CNVs contribute to the genetics of multiple neurodevelopmental disorders with complex inheritance, including GGE and ID. Three hundred fifty-nine probands with GGE and 60 probands with ID-GGE were screened for GGE-associated recurrent microdeletions at 15q13.3, 15q11.2, and 16p13.11 via quantitative PCR or loss of heterozygosity. Deletions were confirmed by comparative genomic hybridization (CGH). ID-GGE probands also had genome-wide CGH. Results: ID-GGE probands showed a significantly higher rate of CNVs compared with probands with GGE alone, with 17 of 60 (28%) ID-GGE probands having one or more potentially causative CNVs. The patients with ID-GGE had a 3-fold-higher rate of the 3 GGE-associated recurrent microdeletions than probands with GGE alone (10% vs 3%, p = 0.02). They also showed a high rate (13/60, 22%) of rare CNVs identified using genome-wide CGH. Conclusions: This study shows that CNVs are common in those with ID-GGE with recurrent deletions at 15q13.3, 15q11.2, and 16p13.11, particularly enriched compared with individuals with GGE or ID alone. Recurrent CNVs are likely to act as risk factors for multiple phenotypes not just at the population level, but also in any given individual. Testing for CNVs in ID-GGE will have a high diagnostic yield in a clinical setting and will inform genetic counseling. PMID:24068782

  3. CHEK2*1100DELC Variant and Breast Cancer Risk

    DTIC Science & Technology

    2006-10-01

    AD_________________ Award Number: DAMD17-03-1-0774 TITLE: CHEK2 *1100DELC Variant and Breast...01-10-2006 2. REPORT TYPE Final 3. DATES COVERED (From - To) 15 Sep 03 – 14 Sep 06 4. TITLE AND SUBTITLE CHEK2 *1100DELC...SUPPLEMENTARY NOTES 14. ABSTRACT: We propose to examine the association between the CHEK2 *1100delC gene variant and breast cancer among BRCA1/2

  4. Identifying causal variants at loci with multiple signals of association.

    PubMed

    Hormozdiari, Farhad; Kostem, Emrah; Kang, Eun Yong; Pasaniuc, Bogdan; Eskin, Eleazar

    2014-10-01

    Although genome-wide association studies have successfully identified thousands of risk loci for complex traits, only a handful of the biologically causal variants, responsible for association at these loci, have been successfully identified. Current statistical methods for identifying causal variants at risk loci either use the strength of the association signal in an iterative conditioning framework or estimate probabilities for variants to be causal. A main drawback of existing methods is that they rely on the simplifying assumption of a single causal variant at each risk locus, which is typically invalid at many risk loci. In this work, we propose a new statistical framework that allows for the possibility of an arbitrary number of causal variants when estimating the posterior probability of a variant being causal. A direct benefit of our approach is that we predict a set of variants for each locus that under reasonable assumptions will contain all of the true causal variants with a high confidence level (e.g., 95%) even when the locus contains multiple causal variants. We use simulations to show that our approach provides 20-50% improvement in our ability to identify the causal variants compared to the existing methods at loci harboring multiple causal variants. We validate our approach using empirical data from an expression QTL study of CHI3L2 to identify new causal variants that affect gene expression at this locus. CAVIAR is publicly available online at http://genetics.cs.ucla.edu/caviar/. Copyright © 2014 by the Genetics Society of America.

  5. Differences in antimicrobial susceptibility of pigmented and unpigmented colonial variants of Mycobacterium avium.

    PubMed Central

    Stormer, R S; Falkinham, J O

    1989-01-01

    Unpigmented colonial variants were isolated from pigmented Mycobacterium avium isolates recovered from patients with acquired immunodeficiency syndrome and the environment. The variants were interconvertible: the rate of transition from unpigmented to pigmented type was 4.0 x 10(-5) variants per cell per generation. The unpigmented variants were more tolerant to antibiotics, especially beta-lactams, and Cd2+ and Cu2+ salts than were their pigmented parents. Both pigmented and unpigmented variants of the strains produced beta-lactamase, although beta-lactamase did not appear to be a determinant of beta-lactam susceptibility. Pigmented variants grew more rapidly in a number of commonly used mycobacterial media, were more hydrophobic, and had higher carotenoid contents than their unpigmented segregants. PMID:2808669

  6. How important are rare variants in common disease?

    PubMed

    Saint Pierre, Aude; Génin, Emmanuelle

    2014-09-01

    Genome-wide association studies have uncovered hundreds of common genetic variants involved in complex diseases. However, for most complex diseases, these common genetic variants only marginally contribute to disease susceptibility. It is now argued that rare variants located in different genes could in fact play a more important role in disease susceptibility than common variants. These rare genetic variants were not captured by genome-wide association studies using single nucleotide polymorphism-chips but with the advent of next-generation sequencing technologies, they have become detectable. It is now possible to study their contribution to common disease by resequencing samples of cases and controls or by using new genotyping exome arrays that cover rare alleles. In this review, we address the question of the contribution of rare variants in common disease by taking the examples of different diseases for which some resequencing studies have already been performed, and by summarizing the results of simulation studies conducted so far to investigate the genetic architecture of complex traits in human. So far, empirical data have not allowed the exclusion of many models except the most extreme ones involving only a small number of rare variants with large effects contributing to complex disease. To unravel the genetic architecture of complex disease, case-control data will not be sufficient, and alternative study designs need to be proposed together with methodological developments. © The Author 2014. Published by Oxford University Press. All rights reserved. For permissions, please email: journals.permissions@oup.com.

  7. Genotype and phenotype spectrum of NRAS germline variants.

    PubMed

    Altmüller, Franziska; Lissewski, Christina; Bertola, Debora; Flex, Elisabetta; Stark, Zornitza; Spranger, Stephanie; Baynam, Gareth; Buscarilli, Michelle; Dyack, Sarah; Gillis, Jane; Yntema, Helger G; Pantaleoni, Francesca; van Loon, Rosa LE; MacKay, Sara; Mina, Kym; Schanze, Ina; Tan, Tiong Yang; Walsh, Maie; White, Susan M; Niewisch, Marena R; García-Miñaúr, Sixto; Plaza, Diego; Ahmadian, Mohammad Reza; Cavé, Hélène; Tartaglia, Marco; Zenker, Martin

    2017-06-01

    RASopathies comprise a group of disorders clinically characterized by short stature, heart defects, facial dysmorphism, and varying degrees of intellectual disability and cancer predisposition. They are caused by germline variants in genes encoding key components or modulators of the highly conserved RAS-MAPK signalling pathway that lead to dysregulation of cell signal transmission. Germline changes in the genes encoding members of the RAS subfamily of GTPases are rare and associated with variable phenotypes of the RASopathy spectrum, ranging from Costello syndrome (HRAS variants) to Noonan and Cardiofaciocutaneous syndromes (KRAS variants). A small number of RASopathy cases with disease-causing germline NRAS alterations have been reported. Affected individuals exhibited features fitting Noonan syndrome, and the observed germline variants differed from the typical oncogenic NRAS changes occurring as somatic events in tumours. Here we describe 19 new cases with RASopathy due to disease-causing variants in NRAS. Importantly, four of them harbored missense changes affecting Gly12, which was previously described to occur exclusively in cancer. The phenotype in our cohort was variable but well within the RASopathy spectrum. Further, one of the patients (c.35G>A; p.(Gly12Asp)) had a myeloproliferative disorder, and one subject (c.34G>C; p.(Gly12Arg)) exhibited an uncharacterized brain tumour. With this report, we expand the genotype and phenotype spectrum of RASopathy-associated germline NRAS variants and provide evidence that NRAS variants do not spare the cancer-associated mutation hotspots.

  8. Systematic comparison of variant calling pipelines using gold standard personal exome variants

    PubMed Central

    Hwang, Sohyun; Kim, Eiru; Lee, Insuk; Marcotte, Edward M.

    2015-01-01

    The success of clinical genomics using next generation sequencing (NGS) requires the accurate and consistent identification of personal genome variants. Assorted variant calling methods have been developed, which show low concordance between their calls. Hence, a systematic comparison of the variant callers could give important guidance to NGS-based clinical genomics. Recently, a set of high-confident variant calls for one individual (NA12878) has been published by the Genome in a Bottle (GIAB) consortium, enabling performance benchmarking of different variant calling pipelines. Based on the gold standard reference variant calls from GIAB, we compared the performance of thirteen variant calling pipelines, testing combinations of three read aligners—BWA-MEM, Bowtie2, and Novoalign—and four variant callers—Genome Analysis Tool Kit HaplotypeCaller (GATK-HC), Samtools mpileup, Freebayes and Ion Proton Variant Caller (TVC), for twelve data sets for the NA12878 genome sequenced by different platforms including Illumina2000, Illumina2500, and Ion Proton, with various exome capture systems and exome coverage. We observed different biases toward specific types of SNP genotyping errors by the different variant callers. The results of our study provide useful guidelines for reliable variant identification from deep sequencing of personal genomes. PMID:26639839

  9. Comparison of Constitutional and Replication Stress-Induced Genome Structural Variation by SNP Array and Mate-Pair Sequencing

    PubMed Central

    Arlt, Martin F.; Ozdemir, Alev Cagla; Birkeland, Shanda R.; Lyons, Robert H.; Glover, Thomas W.; Wilson, Thomas E.

    2011-01-01

    Copy-number variants (CNVs) are a major source of genetic variation in human health and disease. Previous studies have implicated replication stress as a causative factor in CNV formation. However, existing data are technically limited in the quality of comparisons that can be made between human CNVs and experimentally induced variants. Here, we used two high-resolution strategies—single nucleotide polymorphism (SNP) arrays and mate-pair sequencing—to compare CNVs that occur constitutionally to those that arise following aphidicolin-induced DNA replication stress in the same human cells. Although the optimized methods provided complementary information, sequencing was more sensitive to small variants and provided superior structural descriptions. The majority of constitutional and all aphidicolin-induced CNVs appear to be formed via homology-independent mechanisms, while aphidicolin-induced CNVs were of a larger median size than constitutional events even when mate-pair data were considered. Aphidicolin thus appears to stimulate formation of CNVs that closely resemble human pathogenic CNVs and the subset of larger nonhomologous constitutional CNVs. PMID:21212237

  10. CDKL5 variants

    PubMed Central

    Kalscheuer, Vera M.; Hennig, Friederike; Leonard, Helen; Downs, Jenny; Clarke, Angus; Benke, Tim A.; Armstrong, Judith; Pineda, Mercedes; Bailey, Mark E.S.; Cobb, Stuart R.

    2017-01-01

    Objective: To provide new insights into the interpretation of genetic variants in a rare neurologic disorder, CDKL5 deficiency, in the contexts of population sequencing data and an updated characterization of the CDKL5 gene. Methods: We analyzed all known potentially pathogenic CDKL5 variants by combining data from large-scale population sequencing studies with CDKL5 variants from new and all available clinical cohorts and combined this with computational methods to predict pathogenicity. Results: The study has identified several variants that can be reclassified as benign or likely benign. With the addition of novel CDKL5 variants, we confirm that pathogenic missense variants cluster in the catalytic domain of CDKL5 and reclassify a purported missense variant as having a splicing consequence. We provide further evidence that missense variants in the final 3 exons are likely to be benign and not important to disease pathology. We also describe benign splicing and nonsense variants within these exons, suggesting that isoform hCDKL5_5 is likely to have little or no neurologic significance. We also use the available data to make a preliminary estimate of minimum incidence of CDKL5 deficiency. Conclusions: These findings have implications for genetic diagnosis, providing evidence for the reclassification of specific variants previously thought to result in CDKL5 deficiency. Together, these analyses support the view that the predominant brain isoform in humans (hCDKL5_1) is crucial for normal neurodevelopment and that the catalytic domain is the primary functional domain. PMID:29264392

  11. Protein variants in Hiroshima and Nagasaki: tales of two cities.

    PubMed Central

    Neel, J V; Satoh, C; Smouse, P; Asakawa, J; Takahashi, N; Goriki, K; Fujita, M; Kageoka, T; Hazama, R

    1988-01-01

    The results of 1,465,423 allele product determinations based on blood samples from Hiroshima and Nagasaki, involving 30 different proteins representing 32 different gene products, are analyzed in a variety of ways, with the following conclusions: (1) Sibships and their parents are included in the sample. Our analysis reveals that statistical procedures designed to reduce the sample to equivalent independent genomes do not in population comparisons compensate for the familial cluster effect of rare variants. Accordingly, the data set was reduced to one representative of each sibship (937,427 allele products). (2) Both chi 2-type contrasts and a genetic distance measure (delta) reveal that rare variants (P less than .01) are collectively as effective as polymorphisms in establishing genetic differences between the two cities. (3) We suggest that rare variants that individually exhibit significant intercity differences are probably the legacy of tribal private polymorphisms that occurred during prehistoric times. (4) Despite the great differences in the known histories of the two cities, both the overall frequency of rare variants and the number of different rare variants are essentially identical in the two cities. (5) The well-known differences in locus variability are confirmed, now after adjustment for sample size differences for the various locus products; in this large series we failed to detect variants at only three of 29 loci for which sample size exceeded 23,000. (6) The number of alleles identified per locus correlates positively with subunit molecular weight. (7) Loci supporting genetic polymorphisms are characterized by more rare variants than are loci at which polymorphisms were not encountered. (8) Loci whose products do not appear to be essential for health support more variants than do loci the absence of whose product is detrimental to health. (9) There is a striking excess of rare variants over the expectation under the neutral mutation

  12. Protein variants in Hiroshima and Nagasaki: tales of two cities.

    PubMed

    Neel, J V; Satoh, C; Smouse, P; Asakawa, J; Takahashi, N; Goriki, K; Fujita, M; Kageoka, T; Hazama, R

    1988-12-01

    The results of 1,465,423 allele product determinations based on blood samples from Hiroshima and Nagasaki, involving 30 different proteins representing 32 different gene products, are analyzed in a variety of ways, with the following conclusions: (1) Sibships and their parents are included in the sample. Our analysis reveals that statistical procedures designed to reduce the sample to equivalent independent genomes do not in population comparisons compensate for the familial cluster effect of rare variants. Accordingly, the data set was reduced to one representative of each sibship (937,427 allele products). (2) Both chi 2-type contrasts and a genetic distance measure (delta) reveal that rare variants (P less than .01) are collectively as effective as polymorphisms in establishing genetic differences between the two cities. (3) We suggest that rare variants that individually exhibit significant intercity differences are probably the legacy of tribal private polymorphisms that occurred during prehistoric times. (4) Despite the great differences in the known histories of the two cities, both the overall frequency of rare variants and the number of different rare variants are essentially identical in the two cities. (5) The well-known differences in locus variability are confirmed, now after adjustment for sample size differences for the various locus products; in this large series we failed to detect variants at only three of 29 loci for which sample size exceeded 23,000. (6) The number of alleles identified per locus correlates positively with subunit molecular weight. (7) Loci supporting genetic polymorphisms are characterized by more rare variants than are loci at which polymorphisms were not encountered. (8) Loci whose products do not appear to be essential for health support more variants than do loci the absence of whose product is detrimental to health. (9) There is a striking excess of rare variants over the expectation under the neutral mutation

  13. Probable Chemical Hypoxia Effects on Progress of CNV Through Induction of Promoter CpG Demethylation and Overexpression of IL17RC in Human RPE Cells.

    PubMed

    Alivand, Mohammad Reza; Sabouni, Farzaneh; Soheili, Zahra-Soheila

    2016-09-01

    To survey the changes of promoter CpG methylation status and mRNA expression of IL17RC (interleukin 17 receptor C) gene in retinal pigment epithelium (RPE) cells under chemical hypoxia condition for choroidal neovascularization (CNV) modeling in vitro. RPE cells were cultured in both untreated as a control group and treated by cobalt chloride media as a hypoxia group for various concentrations (100-150μM) and times (24-36 hrs.) To confirm chemical hypoxia condition, mRNA expression of HIF (Hypoxia Inducible Factor) -1α, -2α, and Vascular Endothelial Growth Factor (VEGF) was compared between two groups by Real-time PCR. Also, in normoxia and hypoxia conditions, IL17RC expression changes and promoter CpG methylation status were evaluated by Real-time PCR and methylation-specific PCR (MSP) techniques, respectively. Overexpression of HIF-1α, HIF-2α, and VEGF was significant in hypoxia versus normoxia conditions. Our data showed overexpression of IL17RC (2.1- to 6.3-fold) and decreasing of its promoter methylation in comparison with hypoxia and normoxia conditions. It was found that there are significant association between promoter methylation status and expression of IL17RC in chemical hypoxia condition. Therefore, methylation of IL17RC could play as a marker in CNV and degeneration of RPE cells in vitro. Additionally, HIF-α and methylation phenomena may be considered as critical targets for blocking in angiogenesis of age-related degeneration in future studies.

  14. The UCL low-density lipoprotein receptor gene variant database: pathogenicity update

    PubMed Central

    Futema, Marta; Whittall, Ros; Taylor-Beadling, Alison; Williams, Maggie; den Dunnen, Johan T; Humphries, Steve E

    2017-01-01

    Background Familial hypercholesterolaemia (OMIM 143890) is most frequently caused by variations in the low-density lipoprotein receptor (LDLR) gene. Predicting whether novel variants are pathogenic may not be straightforward, especially for missense and synonymous variants. In 2013, the Association of Clinical Genetic Scientists published guidelines for the classification of variants, with categories 1 and 2 representing clearly not or unlikely pathogenic, respectively, 3 representing variants of unknown significance (VUS), and 4 and 5 representing likely to be or clearly pathogenic, respectively. Here, we update the University College London (UCL) LDLR variant database according to these guidelines. Methods PubMed searches and alerts were used to identify novel LDLR variants for inclusion in the database. Standard in silico tools were used to predict potential pathogenicity. Variants were designated as class 4/5 only when the predictions from the different programs were concordant and as class 3 when predictions were discordant. Results The updated database (http://www.lovd.nl/LDLR) now includes 2925 curated variants, representing 1707 independent events. All 129 nonsense variants, 337 small frame-shifting and 117/118 large rearrangements were classified as 4 or 5. Of the 795 missense variants, 115 were in classes 1 and 2, 605 in class 4 and 75 in class 3. 111/181 intronic variants, 4/34 synonymous variants and 14/37 promoter variants were assigned to classes 4 or 5. Overall, 112 (7%) of reported variants were class 3. Conclusions This study updates the LDLR variant database and identifies a number of reported VUS where additional family and in vitro studies will be required to confirm or refute their pathogenicity. PMID:27821657

  15. Quadruplex MAPH: improvement of throughput in high-resolution copy number screening.

    PubMed

    Tyson, Jess; Majerus, Tamsin Mo; Walker, Susan; Armour, John Al

    2009-09-28

    Copy number variation (CNV) in the human genome is recognised as a widespread and important source of human genetic variation. Now the challenge is to screen for these CNVs at high resolution in a reliable, accurate and cost-effective way. Multiplex Amplifiable Probe Hybridisation (MAPH) is a sensitive, high-resolution technology appropriate for screening for CNVs in a defined region, for a targeted population. We have developed MAPH to a highly multiplexed format ("QuadMAPH") that allows the user a four-fold increase in the number of loci tested simultaneously. We have used this method to analyse a genomic region of 210 kb, including the MSH2 gene and 120 kb of flanking DNA. We show that the QuadMAPH probes report copy number with equivalent accuracy to simplex MAPH, reliably demonstrating diploid copy number in control samples and accurately detecting deletions in Hereditary Non-Polyposis Colorectal Cancer (HNPCC) samples. QuadMAPH is an accurate, high-resolution method that allows targeted screening of large numbers of subjects without the expense of genome-wide approaches. Whilst we have applied this technique to a region of the human genome, it is equally applicable to the genomes of other organisms.

  16. Analysis of X chromosome genomic DNA sequence copy number variation associated with premature ovarian failure (POF)

    PubMed Central

    Quilter, C.R.; Karcanias, A.C.; Bagga, M.R.; Duncan, S.; Murray, A.; Conway, G.S.; Sargent, C.A.; Affara, N.A.

    2013-01-01

    BACKGROUND Premature ovarian failure (POF) is a heterogeneous disease defined as amenorrhoea for >6 months before age 40, with an FSH serum level >40 mIU/ml (menopausal levels). While there is a strong genetic association with POF, familial studies have also indicated that idiopathic POF may also be genetically linked. Conventional cytogenetic analyses have identified regions of the X chromosome that are strongly associated with ovarian function, as well as several POF candidate genes. Cryptic chromosome abnormalities that have been missed might be detected by array comparative genomic hybridization. METHODS In this study, samples from 42 idiopathic POF patients were subjected to a complete end-to-end X/Y chromosome tiling path array to achieve a detailed copy number variation (CNV) analysis of X chromosome involvement in POF. The arrays also contained a 1 Mb autosomal tiling path as a reference control. Quantitative PCR for selected genes contained within the CNVs was used to confirm the majority of the changes detected. The expression pattern of some of these genes in human tissue RNA was examined by reverse transcription (RT)–PCR. RESULTS A number of CNVs were identified on both Xp and Xq, with several being shared among the POF cases. Some CNVs fall within known polymorphic CNV regions, and others span previously identified POF candidate regions and genes. CONCLUSIONS The new data reported in this study reveal further discrete X chromosome intervals not previously associated with the disease and therefore implicate new clusters of candidate genes. Further studies will be required to elucidate their involvement in POF. PMID:20570974

  17. A de novo whole gene deletion of XIAP detected by exome sequencing analysis in very early onset inflammatory bowel disease: a case report.

    PubMed

    Kelsen, Judith R; Dawany, Noor; Martinez, Alejandro; Martinez, Alejuandro; Grochowski, Christopher M; Maurer, Kelly; Rappaport, Eric; Piccoli, David A; Baldassano, Robert N; Mamula, Petar; Sullivan, Kathleen E; Devoto, Marcella

    2015-11-18

    Children with very early-onset inflammatory bowel disease (VEO-IBD), those diagnosed at less than 5 years of age, are a unique population. A subset of these patients present with a distinct phenotype and more severe disease than older children and adults. Host genetics is thought to play a more prominent role in this young population, and monogenic defects in genes related to primary immunodeficiencies are responsible for the disease in a small subset of patients with VEO-IBD. We report a child who presented at 3 weeks of life with very early-onset inflammatory bowel disease (VEO-IBD). He had a complicated disease course and remained unresponsive to medical and surgical therapy. The refractory nature of his disease, together with his young age of presentation, prompted utilization of whole exome sequencing (WES) to detect an underlying monogenic primary immunodeficiency and potentially target therapy to the identified defect. Copy number variation analysis (CNV) was performed using the eXome-Hidden Markov Model. Whole exome sequencing revealed 1,380 nonsense and missense variants in the patient. Plausible candidate variants were not detected following analysis of filtered variants, therefore, we performed CNV analysis of the WES data, which led us to identify a de novo whole gene deletion in XIAP. This is the first reported whole gene deletion in XIAP, the causal gene responsible for XLP2 (X-linked lymphoproliferative Disease 2). XLP2 is a syndrome resulting in VEO-IBD and can increase susceptibility to hemophagocytic lymphohistocytosis (HLH). This identification allowed the patient to be referred for bone marrow transplantation, potentially curative for his disease and critical to prevent the catastrophic sequela of HLH. This illustrates the unique etiology of VEO-IBD, and the subsequent effects on therapeutic options. This cohort requires careful and thorough evaluation for monogenic defects and primary immunodeficiencies.

  18. Identification of pathogen genomic variants through an integrated pipeline

    PubMed Central

    2014-01-01

    Background Whole-genome sequencing represents a powerful experimental tool for pathogen research. We present methods for the analysis of small eukaryotic genomes, including a streamlined system (called Platypus) for finding single nucleotide and copy number variants as well as recombination events. Results We have validated our pipeline using four sets of Plasmodium falciparum drug resistant data containing 26 clones from 3D7 and Dd2 background strains, identifying an average of 11 single nucleotide variants per clone. We also identify 8 copy number variants with contributions to resistance, and report for the first time that all analyzed amplification events are in tandem. Conclusions The Platypus pipeline provides malaria researchers with a powerful tool to analyze short read sequencing data. It provides an accurate way to detect SNVs using known software packages, and a novel methodology for detection of CNVs, though it does not currently support detection of small indels. We have validated that the pipeline detects known SNVs in a variety of samples while filtering out spurious data. We bundle the methods into a freely available package. PMID:24589256

  19. HFE gene variants affect iron in the brain.

    PubMed

    Nandar, Wint; Connor, James R

    2011-04-01

    Iron accumulation in the brain and increased oxidative stress are consistent observations in many neurodegenerative diseases. Thus, we have begun examination into gene mutations or allelic variants that could be associated with loss of iron homeostasis. One of the mechanisms leading to iron overload is a mutation in the HFE gene, which is involved in iron metabolism. The 2 most common HFE gene variants are C282Y (1.9%) and H63D (8.9%). The C282Y HFE variant is more commonly associated with hereditary hemochromatosis, which is an autosomal recessive disorder, characterized by iron overload in a number of systemic organs. The H63D HFE variant appears less frequently associated with hemochromatosis, but its role in the neurodegenerative diseases has received more attention. At the cellular level, the HFE mutant protein resulting from the H63D HFE gene variant is associated with iron dyshomeostasis, increased oxidative stress, glutamate release, tau phosphorylation, and alteration in inflammatory response, each of which is under investigation as a contributing factor to neurodegenerative diseases. Therefore, the HFE gene variants are proposed to be genetic modifiers or a risk factor for neurodegenerative diseases by establishing an enabling milieu for pathogenic agents. This review will discuss the current knowledge of the association of the HFE gene variants with neurodegenerative diseases: amyotrophic lateral sclerosis, Alzheimer's disease, Parkinson's disease, and ischemic stroke. Importantly, the data herein also begin to dispel the long-held view that the brain is protected from iron accumulation associated with the HFE mutations.

  20. Identification of De Novo and Rare Inherited Copy Number Variants in Children with Syndromic Congenital Heart Defects.

    PubMed

    Hussein, Ibtessam R; Bader, Rima S; Chaudhary, Adeel G; Bassiouni, Randa; Alquaiti, Maha; Ashgan, Fai; Schulten, Hans-Juergen; Al Qahtani, Mohammad H

    2018-06-01

    Congenital heart defects (CHDs) are the most common birth defects in neonatal life. CHDs could be presented as isolated defects or associated with developmental delay (DD) and/or other congenital malformations. A small proportion of cardiac defects are caused by chromosomal abnormalities or single gene defects; however, in a large proportion of cases no genetic diagnosis could be achieved by clinical examination and conventional genetic analysis. The development of genome wide array-Comparative Genomic Hybridization technique (array-CGH) allowed for the detection of cryptic chromosomal imbalances and pathogenic copy number variants (CNVs) not detected by conventional techniques. We investigated 94 patients having CHDs associated with other malformations and/or DD. Clinical examination and Echocardiography was done to all patients to evaluate the type of CHD. To investigate for genome defects we applied high-density array-CGH 2 × 400K (41 patients) and CGH/SNP microarray 2 × 400K (Agilent) for 53 patients. Confirmation of results was done using Fluorescent in situ hybridization (FISH) or qPCR techniques in certain cases. Chromosomal abnormalities such as trisomy 18, 13, 21, microdeletions: del22q11.2, del7q11.23, del18 (p11.32; p11.21), tetrasomy 18p, trisomy 9p, del11q24-q25, add 15p, add(18)(q21.3), and der 9, 15 (q34.2; q11.2) were detected in 21/94 patients (22%) using both conventional cytogenetics methods and array-CGH technique. Cryptic chromosomal anomalies and pathogenic variants were detected in 15/73 (20.5%) cases. CNVs were observed in a large proportion of the studied samples (27/56) (48%). Clustering of variants was observed in chromosome 1p36, 1p21.1, 2q37, 3q29, 5p15, 7p22.3, 8p23, 11p15.5, 14q11.2, 15q11.2, 16p13.3, 16p11.2, 18p11, 21q22, and 22q11.2. CGH/SNP array could detect loss of heterozygosity (LOH) in different chromosomal loci in 10/25 patients. Array-CGH technique allowed for detection of cryptic chromosomal imbalances that

  1. Human papillomavirus variants among Inuit women in northern Quebec, Canada.

    PubMed

    Gauthier, Barbara; Coutlée, Francois; Franco, Eduardo L; Brassard, Paul

    2015-01-01

    Inuit communities in northern Quebec have high rates of human papillomavirus (HPV) infection, cervical cancer and cervical cancer-related mortality as compared to the Canadian population. HPV types can be further classified as intratypic variants based on the extent of homology in their nucleotide sequences. There is limited information on the distribution of intratypic variants in circumpolar areas. Our goal was to describe the HPV intratypic variants and associated baseline characteristics. We collected cervical cell samples in 2002-2006 from 676 Inuit women between the ages of 15 and 69 years in Nunavik. DNA isolates from high-risk HPVs were sequenced to determine the intratypic variant. There were 149 women that were positive for HPVs 16, 18, 31, 33, 35, 45, 52, 56 or 58 during follow-up. There were 5 different HPV16 variants, all of European lineage, among the 57 women positive for this type. There were 8 different variants of HPV18 present and all were of European lineage (n=21). The majority of samples of HPV31 (n=52) were of lineage B. The number of isolates and diversity of the other HPV types was low. Age was the only covariate associated with HPV16 variant category. These frequencies are similar to what was seen in another circumpolar region of Canada, although there appears to be less diversity as only European variants were detected. This study shows that most variants were clustered in one lineage for each HPV type.

  2. Hundreds of variants clustered in genomic loci and biological pathways affect human height

    PubMed Central

    Lango Allen, Hana; Estrada, Karol; Lettre, Guillaume; Berndt, Sonja I.; Weedon, Michael N.; Rivadeneira, Fernando; Willer, Cristen J.; Jackson, Anne U.; Vedantam, Sailaja; Raychaudhuri, Soumya; Ferreira, Teresa; Wood, Andrew R.; Weyant, Robert J.; Segrè, Ayellet V.; Speliotes, Elizabeth K.; Wheeler, Eleanor; Soranzo, Nicole; Park, Ju-Hyun; Yang, Jian; Gudbjartsson, Daniel; Heard-Costa, Nancy L.; Randall, Joshua C.; Qi, Lu; Smith, Albert Vernon; Mägi, Reedik; Pastinen, Tomi; Liang, Liming; Heid, Iris M.; Luan, Jian'an; Thorleifsson, Gudmar; Winkler, Thomas W.; Goddard, Michael E.; Lo, Ken Sin; Palmer, Cameron; Workalemahu, Tsegaselassie; Aulchenko, Yurii S.; Johansson, Åsa; Zillikens, M.Carola; Feitosa, Mary F.; Esko, Tõnu; Johnson, Toby; Ketkar, Shamika; Kraft, Peter; Mangino, Massimo; Prokopenko, Inga; Absher, Devin; Albrecht, Eva; Ernst, Florian; Glazer, Nicole L.; Hayward, Caroline; Hottenga, Jouke-Jan; Jacobs, Kevin B.; Knowles, Joshua W.; Kutalik, Zoltán; Monda, Keri L.; Polasek, Ozren; Preuss, Michael; Rayner, Nigel W.; Robertson, Neil R.; Steinthorsdottir, Valgerdur; Tyrer, Jonathan P.; Voight, Benjamin F.; Wiklund, Fredrik; Xu, Jianfeng; Zhao, Jing Hua; Nyholt, Dale R.; Pellikka, Niina; Perola, Markus; Perry, John R.B.; Surakka, Ida; Tammesoo, Mari-Liis; Altmaier, Elizabeth L.; Amin, Najaf; Aspelund, Thor; Bhangale, Tushar; Boucher, Gabrielle; Chasman, Daniel I.; Chen, Constance; Coin, Lachlan; Cooper, Matthew N.; Dixon, Anna L.; Gibson, Quince; Grundberg, Elin; Hao, Ke; Junttila, M. Juhani; Kaplan, Lee M.; Kettunen, Johannes; König, Inke R.; Kwan, Tony; Lawrence, Robert W.; Levinson, Douglas F.; Lorentzon, Mattias; McKnight, Barbara; Morris, Andrew P.; Müller, Martina; Ngwa, Julius Suh; Purcell, Shaun; Rafelt, Suzanne; Salem, Rany M.; Salvi, Erika; Sanna, Serena; Shi, Jianxin; Sovio, Ulla; Thompson, John R.; Turchin, Michael C.; Vandenput, Liesbeth; Verlaan, Dominique J.; Vitart, Veronique; White, Charles C.; Ziegler, Andreas; Almgren, Peter; Balmforth, Anthony J.; Campbell, Harry; Citterio, Lorena; De Grandi, Alessandro; Dominiczak, Anna; Duan, Jubao; Elliott, Paul; Elosua, Roberto; Eriksson, Johan G.; Freimer, Nelson B.; Geus, Eco J.C.; Glorioso, Nicola; Haiqing, Shen; Hartikainen, Anna-Liisa; Havulinna, Aki S.; Hicks, Andrew A.; Hui, Jennie; Igl, Wilmar; Illig, Thomas; Jula, Antti; Kajantie, Eero; Kilpeläinen, Tuomas O.; Koiranen, Markku; Kolcic, Ivana; Koskinen, Seppo; Kovacs, Peter; Laitinen, Jaana; Liu, Jianjun; Lokki, Marja-Liisa; Marusic, Ana; Maschio, Andrea; Meitinger, Thomas; Mulas, Antonella; Paré, Guillaume; Parker, Alex N.; Peden, John F.; Petersmann, Astrid; Pichler, Irene; Pietiläinen, Kirsi H.; Pouta, Anneli; Ridderstråle, Martin; Rotter, Jerome I.; Sambrook, Jennifer G.; Sanders, Alan R.; Schmidt, Carsten Oliver; Sinisalo, Juha; Smit, Jan H.; Stringham, Heather M.; Walters, G.Bragi; Widen, Elisabeth; Wild, Sarah H.; Willemsen, Gonneke; Zagato, Laura; Zgaga, Lina; Zitting, Paavo; Alavere, Helene; Farrall, Martin; McArdle, Wendy L.; Nelis, Mari; Peters, Marjolein J.; Ripatti, Samuli; van Meurs, Joyce B.J.; Aben, Katja K.; Ardlie, Kristin G; Beckmann, Jacques S.; Beilby, John P.; Bergman, Richard N.; Bergmann, Sven; Collins, Francis S.; Cusi, Daniele; den Heijer, Martin; Eiriksdottir, Gudny; Gejman, Pablo V.; Hall, Alistair S.; Hamsten, Anders; Huikuri, Heikki V.; Iribarren, Carlos; Kähönen, Mika; Kaprio, Jaakko; Kathiresan, Sekar; Kiemeney, Lambertus; Kocher, Thomas; Launer, Lenore J.; Lehtimäki, Terho; Melander, Olle; Mosley, Tom H.; Musk, Arthur W.; Nieminen, Markku S.; O'Donnell, Christopher J.; Ohlsson, Claes; Oostra, Ben; Palmer, Lyle J.; Raitakari, Olli; Ridker, Paul M.; Rioux, John D.; Rissanen, Aila; Rivolta, Carlo; Schunkert, Heribert; Shuldiner, Alan R.; Siscovick, David S.; Stumvoll, Michael; Tönjes, Anke; Tuomilehto, Jaakko; van Ommen, Gert-Jan; Viikari, Jorma; Heath, Andrew C.; Martin, Nicholas G.; Montgomery, Grant W.; Province, Michael A.; Kayser, Manfred; Arnold, Alice M.; Atwood, Larry D.; Boerwinkle, Eric; Chanock, Stephen J.; Deloukas, Panos; Gieger, Christian; Grönberg, Henrik; Hall, Per; Hattersley, Andrew T.; Hengstenberg, Christian; Hoffman, Wolfgang; Lathrop, G.Mark; Salomaa, Veikko; Schreiber, Stefan; Uda, Manuela; Waterworth, Dawn; Wright, Alan F.; Assimes, Themistocles L.; Barroso, Inês; Hofman, Albert; Mohlke, Karen L.; Boomsma, Dorret I.; Caulfield, Mark J.; Cupples, L.Adrienne; Erdmann, Jeanette; Fox, Caroline S.; Gudnason, Vilmundur; Gyllensten, Ulf; Harris, Tamara B.; Hayes, Richard B.; Jarvelin, Marjo-Riitta; Mooser, Vincent; Munroe, Patricia B.; Ouwehand, Willem H.; Penninx, Brenda W.; Pramstaller, Peter P.; Quertermous, Thomas; Rudan, Igor; Samani, Nilesh J.; Spector, Timothy D.; Völzke, Henry; Watkins, Hugh; Wilson, James F.; Groop, Leif C.; Haritunians, Talin; Hu, Frank B.; Kaplan, Robert C.; Metspalu, Andres; North, Kari E.; Schlessinger, David; Wareham, Nicholas J.; Hunter, David J.; O'Connell, Jeffrey R.; Strachan, David P.; Wichmann, H.-Erich; Borecki, Ingrid B.; van Duijn, Cornelia M.; Schadt, Eric E.; Thorsteinsdottir, Unnur; Peltonen, Leena; Uitterlinden, André; Visscher, Peter M.; Chatterjee, Nilanjan; Loos, Ruth J.F.; Boehnke, Michael; McCarthy, Mark I.; Ingelsson, Erik; Lindgren, Cecilia M.; Abecasis, Gonçalo R.; Stefansson, Kari; Frayling, Timothy M.; Hirschhorn, Joel N

    2010-01-01

    Most common human traits and diseases have a polygenic pattern of inheritance: DNA sequence variants at many genetic loci influence phenotype. Genome-wide association (GWA) studies have identified >600 variants associated with human traits1, but these typically explain small fractions of phenotypic variation, raising questions about the utility of further studies. Here, using 183,727 individuals, we show that hundreds of genetic variants, in at least 180 loci, influence adult height, a highly heritable and classic polygenic trait2,3. The large number of loci reveals patterns with important implications for genetic studies of common human diseases and traits. First, the 180 loci are not random, but instead are enriched for genes that are connected in biological pathways (P=0.016), and that underlie skeletal growth defects (P<0.001). Second, the likely causal gene is often located near the most strongly associated variant: in 13 of 21 loci containing a known skeletal growth gene, that gene was closest to the associated variant. Third, at least 19 loci have multiple independently associated variants, suggesting that allelic heterogeneity is a frequent feature of polygenic traits, that comprehensive explorations of already-discovered loci should discover additional variants, and that an appreciable fraction of associated loci may have been identified. Fourth, associated variants are enriched for likely functional effects on genes, being over-represented amongst variants that alter amino acid structure of proteins and expression levels of nearby genes. Our data explain ∼10% of the phenotypic variation in height, and we estimate that unidentified common variants of similar effect sizes would increase this figure to ∼16% of phenotypic variation (∼20% of heritable variation). Although additional approaches are needed to fully dissect the genetic architecture of polygenic human traits, our findings indicate that GWA studies can identify large numbers of loci that

  3. CHEK2*1100delC Variant and BRCA1/2-Negative Familial Breast Cancer - A Family-Based Genetic Association Study

    DTIC Science & Technology

    2007-10-01

    AD_________________ Award Number: DAMD17-03-1-0774 TITLE: CHEK2 *1100delC Variant and BRCA1/2...NUMBER CHEK2 *1100delC Variant and BRCA1/2-Negative Familial Breast Cancer - A Family- Based Genetic Association Study 5b. GRANT NUMBER DAMD17...association between the CHEK2 *1100delC gene variant and breast cancer among BRCA1/2-negative families. Vital to DNA replication and normal growth of breast

  4. Influence of renal artery variants, number, location, and degree of renal artery stenoses on the atherosclerotic burden of the aorta.

    PubMed

    Petersen, Johannes; Plaikner, Michaela; Nasseri, Parinaz; Rehder, Peter; Koppelstätter, Christian; Pauli, Guido F; Glodny, Bernhard

    2012-10-01

    To determine the assumed influence of the number of renal arteries, the distribution and extent of renal artery stenosis (RAS), and the kidney length on calcified aortic atherosclerotic plaque burden. The computed tomographic angiographies of 1381 patients were analyzed retrospectively using a volumetric aortic calcium scoring method. The Spearman method was used to calculate the correlation between kidney length, number and diameter of renal arteries, as well as number, degree, and location of RASs on main or additional renal arteries with the extent of aortic atherosclerosis. Logistic regression analyses were conducted with the target variable "calcification present or absent." Patients with multiple renal arteries (38.3%) had lower plaque volumes than patients without such variants (0.55 ± 0.97 vs 0.64 ± 1.06 mL; P < 0.05). Renal artery stenoses affected all renal vessels with equal frequency. The aortic calcium score correlated with the number of RASs (P < 0.0001) and the maximum degree of RAS up to a threshold of 60%. Location of an RAS in the various renal arteries was irrelevant. In regression analyses, the presence of RAS (Wald = 5.523), the degree of RAS (Wald = 6.251), and age (Wald = 223.1) were positive predictors of the aortic calcium score, whereas kidney length (Wald = 9.564) proved to be a negative predictor. The aortic calcium score correlates with both the number of RASs and the maximum degree of RAS up to a threshold of 60% but correlates inversely with the number of renal arteries. Renal artery stenosis affects all renal vessels with equal frequency, and this finding should be considered in screening procedures.

  5. Toll-Like Receptor-3 and Geographic Atrophy in Age-Related Macular Degeneration

    PubMed Central

    Yang, Zhenglin; Stratton, Charity; Francis, Peter J.; Kleinman, Mark E.; Tan, Perciliz L.; Gibbs, Daniel; Tong, Zongzhong; Chen, Haoyu; Constantine, Ryan; Yang, Xian; Chen, Yuhong; Zeng, Jiexi; Davey, Lisa; Ma, Xiang; Hau, Vincent S.; Wang, Chi; Harmon, Jennifer; Buehler, Jeanette; Pearson, Erik; Patel, Shrena; Kaminoh, Yuuki; Watkins, Scott; Luo, Ling; Zabriskie, Norman A.; Bernstein, Paul S.; Cho, Wongil; Schwager, Andrea; Hinton, David R; Klein, Michael L; Hamon, Sara C.; Simmons, Emily; Yu, Beifeng; Campochiaro, Betsy; Sunness, Janet S.; Campochiaro, Peter; Jorde, Lynn; Parmigiani, Giovanni; Zack, Donald J.; Katsanis, Nicholas; Ambati, Jayakrishna; Zhang, Kang

    2008-01-01

    BACKGROUND Age-related macular degeneration (AMD) is the most common cause of irreversible visual impairment in the developed world. Advanced AMD is comprised of geographic atrophy (GA) and choroidal neovascularization (CNV). Specific genetic variants that predispose for GA are largely unknown. METHODS We tested (i) for association between the functional toll-like receptor-3 (TLR3) variant rs3775291 (L412F) and AMD in European Americans and (ii) the effect of TLR3 L and F variants on the viability of human retinal pigment epithelium (RPE) cells in vitro and on RPE cell apoptosis in wildtype and Tlr3−/− mice. RESULTS The F variant (or T allele at single nucleotide polymorphism at rs3775291) was associated with protection against GA (P=0.005); this association was replicated in two independent GA case-control series (P=5.43×10−4 and P=0.002, respectively. We observed no association between TLR3 variants and CNV. The rs377291 variant is probably critical to the function of TLR3, because a prototypic TLR3 ligand induced cell death and apoptosis in human RPE cells with the LL genotype to a greater extent than it did RPE cells with the LF genotype. Moreover, the ligand induced more RPE cell death and apoptosis in wild-type than in Tlr3−/− mice. CONCLUSIONS The TLR3 412F variant confers protection against GA, probably by suppressing RPE cell death. Given that double stranded RNA can activate TLR3-mediated apoptosis, our results suggest a possible role for viral dsRNA transcripts in the development of GA and raise awareness of potential toxicity induced by short interfering RNA (siRNA) therapeutics in the eye. PMID:18753640

  6. Impact of Parental Bos taurus and Bos indicus Origins on Copy Number Variation in Traditional Chinese Cattle Breeds

    PubMed Central

    Zhang, Liangzhi; Jia, Shangang; Plath, Martin; Huang, Yongzhen; Li, Congjun; Lei, Chuzhao; Zhao, Xin; Chen, Hong

    2015-01-01

    Copy number variation (CNV) is an important component of genomic structural variation and plays a role not only in evolutionary diversification but also in domestication. Chinese cattle were derived from Bos taurus and Bos indicus, and several breeds presumably are of hybrid origin, but the evolution of CNV regions (CNVRs) has not yet been examined in this context. Here, we of CNVRs, mtDNA D-loop sequence variation, and Y-chromosomal single nucleotide polymorphisms to assess the impact of maternal and paternal B. taurus and B. indicus origins on the distribution of CNVRs in 24 Chinese domesticated bulls. We discovered 470 genome-wide CNVRs, only 72 of which were shared by all three Y-lineages (B. taurus: Y1, Y2; B. indicus: Y3), whereas 265 were shared by inferred taurine or indicine paternal lineages, and 228 when considering their maternal taurine or indicine origins. Phylogenetic analysis uncovered eight taurine/indicine hybrids, and principal component analysis on CNVs corroborated genomic exchange during hybridization. The distribution patterns of CNVRs tended to be lineage-specific, and correlation analysis revealed significant positive or negative co-occurrences of CNVRs across lineages. Our study suggests that CNVs in Chinese cattle partly result from selective breeding during domestication, but also from hybridization and introgression. PMID:26260653

  7. Are there meaningful differences between major depressive disorder, dysthymic disorder, and their subthreshold variants?

    PubMed

    Moore, Michael T; Brown, Timothy A

    2012-09-01

    A number of researchers have proposed adding an increasing number of subthreshold variants of major depressive disorder (MDD) as new mood disorder. However, this research has suffered from a number of theoretical and methodological flaws that the current investigation has attempted to address. Individuals with MDD (n = 470) were compared with individuals with subthreshold MDD (n = 57). Individuals with MDD reported consistently more severe symptoms, albeit of small magnitude, as well as differences in comorbidity with only two disorders. Results also indicated that diagnosis did not significantly predict rate of symptom change when MDD was compared with its subthreshold variant. Taken together, the aforementioned evidence suggests that small differences exist between MDD and its subthreshold variant. In addition, the extent to which the latter serves as useful analogs for the former may depend upon the variables under study.

  8. FAVR (Filtering and Annotation of Variants that are Rare): methods to facilitate the analysis of rare germline genetic variants from massively parallel sequencing datasets

    PubMed Central

    2013-01-01

    Background Characterising genetic diversity through the analysis of massively parallel sequencing (MPS) data offers enormous potential to significantly improve our understanding of the genetic basis for observed phenotypes, including predisposition to and progression of complex human disease. Great challenges remain in resolving genetic variants that are genuine from the millions of artefactual signals. Results FAVR is a suite of new methods designed to work with commonly used MPS analysis pipelines to assist in the resolution of some of the issues related to the analysis of the vast amount of resulting data, with a focus on relatively rare genetic variants. To the best of our knowledge, no equivalent method has previously been described. The most important and novel aspect of FAVR is the use of signatures in comparator sequence alignment files during variant filtering, and annotation of variants potentially shared between individuals. The FAVR methods use these signatures to facilitate filtering of (i) platform and/or mapping-specific artefacts, (ii) common genetic variants, and, where relevant, (iii) artefacts derived from imbalanced paired-end sequencing, as well as annotation of genetic variants based on evidence of co-occurrence in individuals. We applied conventional variant calling applied to whole-exome sequencing datasets, produced using both SOLiD and TruSeq chemistries, with or without downstream processing by FAVR methods. We demonstrate a 3-fold smaller rare single nucleotide variant shortlist with no detected reduction in sensitivity. This analysis included Sanger sequencing of rare variant signals not evident in dbSNP131, assessment of known variant signal preservation, and comparison of observed and expected rare variant numbers across a range of first cousin pairs. The principles described herein were applied in our recent publication identifying XRCC2 as a new breast cancer risk gene and have been made publically available as a suite of software

  9. Randomized trial of the ForeseeHome monitoring device for early detection of neovascular age-related macular degeneration. The HOme Monitoring of the Eye (HOME) study design - HOME Study report number 1.

    PubMed

    Chew, Emily Y; Clemons, Traci E; Bressler, Susan B; Elman, Michael J; Danis, Ronald P; Domalpally, Amitha; Heier, Jeffrey S; Kim, Judy E; Garfinkel, Richard A

    2014-03-01

    To evaluate the effects of a home-monitoring device with tele-monitoring compared with standard care in detection of progression to choroidal neovascularization (CNV) associated with age-related macular degeneration (AMD), the leading cause of blindness in the US. Participants, aged 55 to 90 years, at high risk of developing CNV associated with AMD were recruited to the HOme Monitoring of Eye (HOME) Study, an unmasked, multi-center, randomized trial of the ForeseeHome (FH) device plus standard care vs. standard care alone. The FH device utilizes preferential hyperacuity perimetry and tele-monitoring to detect changes in vision function associated with development of CNV, potentially prior to symptom and visual acuity loss. After establishing baseline measurements, subsequent changes on follow-up are detected by the device, causing the monitoring center to alert the clinical center to recall participants for an exam. Standard care consists of instructions for self-monitoring visual changes with subsequent self-report to the clinical center. The primary objective of this study is to determine whether home monitoring plus standard care in comparison with standard care alone, results in earlier detection of incident CNV with better present visual acuity. The primary outcome is the decline in visual acuity at CNV diagnosis from baseline. Detection of CNV prior to substantial vision loss is critical as vision outcome following anti-angiogenic therapy is dependent on the visual acuity at initiation of treatment. HOME Study is the first large scale study to test the use of home tele-monitoring system in the management of AMD patients. Published by Elsevier Inc.

  10. Genetic compendium of 1511 human brains available through the UK Medical Research Council Brain Banks Network Resource.

    PubMed

    Keogh, Michael J; Wei, Wei; Wilson, Ian; Coxhead, Jon; Ryan, Sarah; Rollinson, Sara; Griffin, Helen; Kurzawa-Akanbi, Marzena; Santibanez-Koref, Mauro; Talbot, Kevin; Turner, Martin R; McKenzie, Chris-Anne; Troakes, Claire; Attems, Johannes; Smith, Colin; Al Sarraj, Safa; Morris, Chris M; Ansorge, Olaf; Pickering-Brown, Stuart; Ironside, James W; Chinnery, Patrick F

    2017-01-01

    Given the central role of genetic factors in the pathogenesis of common neurodegenerative disorders, it is critical that mechanistic studies in human tissue are interpreted in a genetically enlightened context. To address this, we performed exome sequencing and copy number variant analysis on 1511 frozen human brains with a diagnosis of Alzheimer's disease (AD, n = 289), frontotemporal dementia/amyotrophic lateral sclerosis (FTD/ALS, n = 252), Creutzfeldt-Jakob disease (CJD, n = 239), Parkinson's disease (PD, n = 39), dementia with Lewy bodies (DLB, n = 58), other neurodegenerative, vascular, or neurogenetic disorders (n = 266), and controls with no significant neuropathology (n = 368). Genomic DNA was extracted from brain tissue in all cases before exome sequencing (Illumina Nextera 62 Mb capture) with variants called by FreeBayes; copy number variant (CNV) analysis (Illumina HumanOmniExpress-12 BeadChip); C9orf72 repeat expansion detection; and APOE genotyping. Established or likely pathogenic heterozygous, compound heterozygous, or homozygous variants, together with the C9orf72 hexanucleotide repeat expansions and a copy number gain of APP, were found in 61 brains. In addition to known risk alleles in 349 brains (23.9% of 1461 undergoing exome sequencing), we saw an association between rare variants in GRN and DLB. Rare CNVs were found in <1.5% of brains, including copy number gains of PRPH that were overrepresented in AD. Clinical, pathological, and genetic data are available, enabling the retrieval of specific frozen brains through the UK Medical Research Council Brain Banks Network. This allows direct access to pathological and control human brain tissue based on an individual's genetic architecture, thus enabling the functional validation of known genetic risk factors and potentially pathogenic alleles identified in future studies. © 2017 Keogh et al.; Published by Cold Spring Harbor Laboratory Press.

  11. Histone variant innovation in a rapidly evolving chordate lineage.

    PubMed

    Moosmann, Alexandra; Campsteijn, Coen; Jansen, Pascal Wtc; Nasrallah, Carole; Raasholm, Martina; Stunnenberg, Henk G; Thompson, Eric M

    2011-07-15

    Histone variants alter the composition of nucleosomes and play crucial roles in transcription, chromosome segregation, DNA repair, and sperm compaction. Modification of metazoan histone variant lineages occurs on a background of genome architecture that shows global similarities from sponges to vertebrates, but the urochordate, Oikopleura dioica, a member of the sister group to vertebrates, exhibits profound modification of this ancestral architecture. We show that a histone complement of 47 gene loci encodes 31 histone variants, grouped in distinct sets of developmental expression profiles throughout the life cycle. A particularly diverse array of 15 male-specific histone variants was uncovered, including a testes-specific H4t, the first metazoan H4 sequence variant reported. Universal histone variants H3.3, CenH3, and H2A.Z are present but O. dioica lacks homologs of macroH2A and H2AX. The genome encodes many H2A and H2B variants and the repertoire of H2A.Z isoforms is expanded through alternative splicing, incrementally regulating the number of acetylatable lysine residues in the functionally important N-terminal "charge patch". Mass spectrometry identified 40 acetylation, methylation and ubiquitylation posttranslational modifications (PTMs) and showed that hallmark PTMs of "active" and "repressive" chromatin were present in O. dioica. No obvious reduction in silent heterochromatic marks was observed despite high gene density in this extraordinarily compacted chordate genome. These results show that histone gene complements and their organization differ considerably even over modest phylogenetic distances. Substantial innovation among all core and linker histone variants has evolved in concert with adaptation of specific life history traits in this rapidly evolving chordate lineage.

  12. VIPER: a web application for rapid expert review of variant calls.

    PubMed

    Wöste, Marius; Dugas, Martin

    2018-06-01

    With the rapid development in next-generation sequencing, cost and time requirements for genomic sequencing are decreasing, enabling applications in many areas such as cancer research. Many tools have been developed to analyze genomic variation ranging from single nucleotide variants to whole chromosomal aberrations. As sequencing throughput increases, the number of variants called by such tools also grows. Often employed manual inspection of such calls is thus becoming a time-consuming procedure. We developed the Variant InsPector and Expert Rating tool (VIPER) to speed up this process by integrating the Integrative Genomics Viewer into a web application. Analysts can then quickly iterate through variants, apply filters and make decisions based on the generated images and variant metadata. VIPER was successfully employed in analyses with manual inspection of more than 10 000 calls. VIPER is implemented in Java and Javascript and is freely available at https://github.com/MarWoes/viper. marius.woeste@uni-muenster.de. Supplementary data are available at Bioinformatics online.

  13. Two-Year Outcomes of a Treat-and-Extend Regimen Using Intravitreal Aflibercept Injections for Typical Age-Related Macular Degeneration.

    PubMed

    Ito, Arisa; Matsumoto, Hidetaka; Morimoto, Masahiro; Mimura, Kensuke; Akiyama, Hideo

    2017-01-01

    The aim of this study was to evaluate the efficacy of a treat-and-extend (TAE) regimen using intravitreal injection of aflibercept (IVA) for typical age-related macular degeneration (tAMD). We retrospectively studied 61 treatment-naïve eyes with tAMD. Best-corrected visual acuity (BCVA), central macular thickness (CMT), central choroidal thickness (CCT), number of injections, and complications during 2 years were evaluated. BCVA significantly improved by on average 0.13 logMAR units, and CMT and CCT significantly decreased after 2 years. The number of injections was on average 13.6. In the second year, eyes with classic choroidal neovascularization (CNV) needed significantly fewer treatments than eyes with occult CNV. Fourteen eyes, which developed subfoveal fibrosis, showed significantly poorer BCVA after 2 years. Subfoveal fibrosis was significantly common in classic CNV. A TAE regimen using IVA for tAMD might be effective for improving BCVA and exudative changes. The exudation may be suppressed with fewer treatments in classic CNV compared to occult CNV. © 2017 S. Karger AG, Basel.

  14. Planning of visually guided reach-to-grasp movements: inference from reaction time and contingent negative variation (CNV).

    PubMed

    Zaepffel, Manuel; Brochier, Thomas

    2012-01-01

    We performed electroencephalogram (EEG) recording in a precuing task to investigate the planning processes of reach-to-grasp movements in human. In this reaction time (RT) task, subjects had to reach, grasp, and pull an object as fast as possible after a visual GO signal. We manipulated two parameters: the hand shape for grasping (precision grip or side grip) and the force required to pull the object (high or low). Three seconds before the GO onset, a cue provided advance information about force, grip, both parameters, or no information at all. EEG data show that reach-to-grasp movements generate differences in the topographic distribution of the late Contingent Negative Variation (ICNV) amplitude between the 4 precuing conditions. Along with RT data, it confirms that two distinct functional networks are involved with different time courses in the planning of grip and force. Finally, we outline the composite nature of the lCNV that might reflect both high- and low-level planning processes. Copyright © 2011 Society for Psychophysiological Research.

  15. Influence of Hydrogen and Number of Particle Variants on Ordinary and Two-Way Shape Memory Effects in Ti-Ni Single Crystals

    NASA Astrophysics Data System (ADS)

    Kireeva, I. V.; Platonova, Yu. N.; Chumlyakov, Yu. I.

    2017-02-01

    The ordinary and two-way shape memory effects (SMEs) are investigated for [ overline{1} 12] single crystals of Ti-51.3Ni (at.%) alloy aged at 823 K for 1.5 h in free state and under tensile stress of 150 MPa without hydrogen and after saturation by hydrogen. It is established that without hydrogen in [ overline{1} 12] single crystals with one and four variants of Ti3Ni4 particles the maximum magnitude of the ordinary SME is 1.9-2.6% under the external stress σext = 250 MPa. Under σext > 250 MPa, crystals are destroyed. The magnitude of the two-way SME caused by the B2- R- B19' MT equal to 1.1% at σext = 0 is observed in [ overline{1} 12] single crystals with one variant of Ti3Ni4 particles. The physical reason for the observed two-way SME is the internal compressive stresses oriented along the [ overline{1} 12] directions arising from one variant of Ti3Ni4 particles as a result of aging under tensile stress of 150 MPa. It is established that hydrogen does not influence the TR temperature, reduces the plasticity, and suppresses the two-way SME. The suppression of two-way SME in the [ overline{1} 12] single crystals of the Ti-51.3Ni (at.%) alloy with one variant of Ti3Ni4 particles is caused by shielding of stress fields from one variant of Ti3Ni4 particles and multiple nucleation of R- and B19' martensite variants under loading with saturation by hydrogen.

  16. Homozygous and hemizygous CNV detection from exome sequencing data in a Mendelian disease cohort

    PubMed Central

    Gambin, Tomasz; Akdemir, Zeynep C.; Yuan, Bo; Gu, Shen; Chiang, Theodore; Carvalho, Claudia M.B.; Shaw, Chad; Jhangiani, Shalini; Boone, Philip M.; Eldomery, Mohammad K.; Karaca, Ender; Bayram, Yavuz; Stray-Pedersen, Asbjørg; Muzny, Donna; Charng, Wu-Lin; Bahrambeigi, Vahid; Belmont, John W.; Boerwinkle, Eric; Beaudet, Arthur L.; Gibbs, Richard A.

    2017-01-01

    Abstract We developed an algorithm, HMZDelFinder, that uses whole exome sequencing (WES) data to identify rare and intragenic homozygous and hemizygous (HMZ) deletions that may represent complete loss-of-function of the indicated gene. HMZDelFinder was applied to 4866 samples in the Baylor–Hopkins Center for Mendelian Genomics (BHCMG) cohort and detected 773 HMZ deletion calls (567 homozygous or 206 hemizygous) with an estimated sensitivity of 86.5% (82% for single-exonic and 88% for multi-exonic calls) and precision of 78% (53% single-exonic and 96% for multi-exonic calls). Out of 773 HMZDelFinder-detected deletion calls, 82 were subjected to array comparative genomic hybridization (aCGH) and/or breakpoint PCR and 64 were confirmed. These include 18 single-exon deletions out of which 8 were exclusively detected by HMZDelFinder and not by any of seven other CNV detection tools examined. Further investigation of the 64 validated deletion calls revealed at least 15 pathogenic HMZ deletions. Of those, 7 accounted for 17–50% of pathogenic CNVs in different disease cohorts where 7.1–11% of the molecular diagnosis solved rate was attributed to CNVs. In summary, we present an algorithm to detect rare, intragenic, single-exon deletion CNVs using WES data; this tool can be useful for disease gene discovery efforts and clinical WES analyses. PMID:27980096

  17. The Association between Pediatric NAFLD and Common Genetic Variants

    PubMed Central

    Umano, Giuseppina Rosaria; Martino, Mariangela; Santoro, Nicola

    2017-01-01

    Non-alcoholic fatty liver disease (NAFLD) is one of the most common complications of obesity. Several studies have shown that genetic predisposition probably plays an important role in its pathogenesis. In fact, in the last few years a large number of genetic studies have provided compelling evidence that some gene variants, especially those in genes encoding proteins regulating lipid metabolism, are associated with intra-hepatic fat accumulation. Here we provide a comprehensive review of the gene variants that have affected the natural history of the disease. PMID:28629152

  18. VCFR: A package to manipulate and visualize variant call format data in R

    USDA-ARS?s Scientific Manuscript database

    Software to call single nucleotide polymorphisms or related genetic variants has converged on the variant call format (vcf) as their output format of choice. This has created a need for tools to work with vcf files. While an increasing number of software exists to read vcf data, many of them only ex...

  19. Copy number variants analysis in a cohort of isolated and syndromic developmental delay/intellectual disability reveals novel genomic disorders, position effects and candidate disease genes.

    PubMed

    Di Gregorio, E; Riberi, E; Belligni, E F; Biamino, E; Spielmann, M; Ala, U; Calcia, A; Bagnasco, I; Carli, D; Gai, G; Giordano, M; Guala, A; Keller, R; Mandrile, G; Arduino, C; Maffè, A; Naretto, V G; Sirchia, F; Sorasio, L; Ungari, S; Zonta, A; Zacchetti, G; Talarico, F; Pappi, P; Cavalieri, S; Giorgio, E; Mancini, C; Ferrero, M; Brussino, A; Savin, E; Gandione, M; Pelle, A; Giachino, D F; De Marchi, M; Restagno, G; Provero, P; Cirillo Silengo, M; Grosso, E; Buxbaum, J D; Pasini, B; De Rubeis, S; Brusco, A; Ferrero, G B

    2017-10-01

    Array-comparative genomic hybridization (array-CGH) is a widely used technique to detect copy number variants (CNVs) associated with developmental delay/intellectual disability (DD/ID). Identification of genomic disorders in DD/ID. We performed a comprehensive array-CGH investigation of 1,015 consecutive cases with DD/ID and combined literature mining, genetic evidence, evolutionary constraint scores, and functional information in order to assess the pathogenicity of the CNVs. We identified non-benign CNVs in 29% of patients. Amongst the pathogenic variants (11%), detected with a yield consistent with the literature, we found rare genomic disorders and CNVs spanning known disease genes. We further identified and discussed 51 cases with likely pathogenic CNVs spanning novel candidate genes, including genes encoding synaptic components and/or proteins involved in corticogenesis. Additionally, we identified two deletions spanning potential Topological Associated Domain (TAD) boundaries probably affecting the regulatory landscape. We show how phenotypic and genetic analyses of array-CGH data allow unraveling complex cases, identifying rare disease genes, and revealing unexpected position effects. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  20. A systematic approach to assessing the clinical significance of genetic variants.

    PubMed

    Duzkale, H; Shen, J; McLaughlin, H; Alfares, A; Kelly, M A; Pugh, T J; Funke, B H; Rehm, H L; Lebo, M S

    2013-11-01

    Molecular genetic testing informs diagnosis, prognosis, and risk assessment for patients and their family members. Recent advances in low-cost, high-throughput DNA sequencing and computing technologies have enabled the rapid expansion of genetic test content, resulting in dramatically increased numbers of DNA variants identified per test. To address this challenge, our laboratory has developed a systematic approach to thorough and efficient assessments of variants for pathogenicity determination. We first search for existing data in publications and databases including internal, collaborative and public resources. We then perform full evidence-based assessments through statistical analyses of observations in the general population and disease cohorts, evaluation of experimental data from in vivo or in vitro studies, and computational predictions of potential impacts of each variant. Finally, we weigh all evidence to reach an overall conclusion on the potential for each variant to be disease causing. In this report, we highlight the principles of variant assessment, address the caveats and pitfalls, and provide examples to illustrate the process. By sharing our experience and providing a framework for variant assessment, including access to a freely available customizable tool, we hope to help move towards standardized and consistent approaches to variant assessment. © 2013 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  1. Quadruplex MAPH: improvement of throughput in high-resolution copy number screening

    PubMed Central

    Tyson, Jess; Majerus, Tamsin MO; Walker, Susan; Armour, John AL

    2009-01-01

    Background Copy number variation (CNV) in the human genome is recognised as a widespread and important source of human genetic variation. Now the challenge is to screen for these CNVs at high resolution in a reliable, accurate and cost-effective way. Results Multiplex Amplifiable Probe Hybridisation (MAPH) is a sensitive, high-resolution technology appropriate for screening for CNVs in a defined region, for a targeted population. We have developed MAPH to a highly multiplexed format ("QuadMAPH") that allows the user a four-fold increase in the number of loci tested simultaneously. We have used this method to analyse a genomic region of 210 kb, including the MSH2 gene and 120 kb of flanking DNA. We show that the QuadMAPH probes report copy number with equivalent accuracy to simplex MAPH, reliably demonstrating diploid copy number in control samples and accurately detecting deletions in Hereditary Non-Polyposis Colorectal Cancer (HNPCC) samples. Conclusion QuadMAPH is an accurate, high-resolution method that allows targeted screening of large numbers of subjects without the expense of genome-wide approaches. Whilst we have applied this technique to a region of the human genome, it is equally applicable to the genomes of other organisms. PMID:19785739

  2. Extensive Copy-Number Variation of Young Genes across Stickleback Populations

    PubMed Central

    Eizaguirre, Christophe; Samonte, Irene E.; Kalbe, Martin; Lenz, Tobias L.; Stoll, Monika; Bornberg-Bauer, Erich; Milinski, Manfred; Reusch, Thorsten B. H.

    2014-01-01

    Duplicate genes emerge as copy-number variations (CNVs) at the population level, and remain copy-number polymorphic until they are fixed or lost. The successful establishment of such structural polymorphisms in the genome plays an important role in evolution by promoting genetic diversity, complexity and innovation. To characterize the early evolutionary stages of duplicate genes and their potential adaptive benefits, we combine comparative genomics with population genomics analyses to evaluate the distribution and impact of CNVs across natural populations of an eco-genomic model, the three-spined stickleback. With whole genome sequences of 66 individuals from populations inhabiting three distinct habitats, we find that CNVs generally occur at low frequencies and are often only found in one of the 11 populations surveyed. A subset of CNVs, however, displays copy-number differentiation between populations, showing elevated within-population frequencies consistent with local adaptation. By comparing teleost genomes to identify lineage-specific genes and duplications in sticklebacks, we highlight rampant gene content differences among individuals in which over 30% of young duplicate genes are CNVs. These CNV genes are evolving rapidly at the molecular level and are enriched with functional categories associated with environmental interactions, depicting the dynamic early copy-number polymorphic stage of genes during population differentiation. PMID:25474574

  3. Functional Assessment of Genetic Variants with Outcomes Adapted to Clinical Decision-Making

    PubMed Central

    Thouvenot, Pierre; Ben Yamin, Barbara; Fourrière, Lou; Lescure, Aurianne; Boudier, Thomas; Del Nery, Elaine; Chauchereau, Anne; Goldgar, David E.; Stoppa-Lyonnet, Dominique; Nicolas, Alain; Millot, Gaël A.

    2016-01-01

    Understanding the medical effect of an ever-growing number of human variants detected is a long term challenge in genetic counseling. Functional assays, based on in vitro or in vivo evaluations of the variant effects, provide essential information, but they require robust statistical validation, as well as adapted outputs, to be implemented in the clinical decision-making process. Here, we assessed 25 pathogenic and 15 neutral missense variants of the BRCA1 breast/ovarian cancer susceptibility gene in four BRCA1 functional assays. Next, we developed a novel approach that refines the variant ranking in these functional assays. Lastly, we developed a computational system that provides a probabilistic classification of variants, adapted to clinical interpretation. Using this system, the best functional assay exhibits a variant classification accuracy estimated at 93%. Additional theoretical simulations highlight the benefit of this ready-to-use system in the classification of variants after functional assessment, which should facilitate the consideration of functional evidences in the decision-making process after genetic testing. Finally, we demonstrate the versatility of the system with the classification of siRNAs tested for human cell growth inhibition in high throughput screening. PMID:27272900

  4. Novel CCR3 Antagonists Are Effective Mono- and Combination Inhibitors of Choroidal Neovascular Growth and Vascular Permeability.

    PubMed

    Nagai, Nori; Ju, Meihua; Izumi-Nagai, Kanako; Robbie, Scott J; Bainbridge, James W; Gale, David C; Pierre, Esaie; Krauss, Achim H P; Adamson, Peter; Shima, David T; Ng, Yin-Shan

    2015-09-01

    Choroidal neovascularization (CNV) is a defining feature of wet age-related macular degeneration. We examined the functional role of CCR3 in the development of CNV in mice and primates. CCR3 was associated with spontaneous CNV lesions in the newly described JR5558 mice, whereas CCR3 ligands localized to CNV-associated macrophages and the retinal pigment epithelium/choroid complex. Intravitreal injection of neutralizing antibodies against vascular endothelial growth factor receptor 2, CCR3, CC chemokine ligand 11/eotaxin-1, and CC chemokine ligand 24/eotaxin-2 all reduced CNV area and lesion number in these mice. Systemic administration of the CCR3 antagonists GW766994X and GW782415X reduced spontaneous CNV in JR5558 mice and laser-induced CNV in mouse and primate models in a dose-dependent fashion. Combination treatment with antivascular endothelial growth factor receptor 2 antibody and GW766994X yielded additive reductions in CNV area and hyperpermeability in mice. Interestingly, topical GW766994X and intravitreal anti-CCR3 antibody yielded strong systemic effects, reducing CNV in the untreated, contralateral eye. Contrarily, ocular administration of GW782415X in primates failed to substantially elevate plasma drug levels or to reduce the development of grade IV CNV lesions. These findings suggest that CCR3 signaling may be an attractive therapeutic target for CNV, utilizing a pathway that is at least partly distinct from that of vascular endothelial growth factor receptor. The findings also demonstrate that systemic exposure to CCR3 antagonists may be crucial for CNV-targeted activity. Copyright © 2015 American Society for Investigative Pathology. Published by Elsevier Inc. All rights reserved.

  5. Reducing Communication in Algebraic Multigrid Using Additive Variants

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Vassilevski, Panayot S.; Yang, Ulrike Meier

    Algebraic multigrid (AMG) has proven to be an effective scalable solver on many high performance computers. However, its increasing communication complexity on coarser levels has shown to seriously impact its performance on computers with high communication cost. Moreover, additive AMG variants provide not only increased parallelism as well as decreased numbers of messages per cycle but also generally exhibit slower convergence. Here we present various new additive variants with convergence rates that are significantly improved compared to the classical additive algebraic multigrid method and investigate their potential for decreased communication, and improved communication-computation overlap, features that are essential for goodmore » performance on future exascale architectures.« less

  6. Reducing Communication in Algebraic Multigrid Using Additive Variants

    DOE PAGES

    Vassilevski, Panayot S.; Yang, Ulrike Meier

    2014-02-12

    Algebraic multigrid (AMG) has proven to be an effective scalable solver on many high performance computers. However, its increasing communication complexity on coarser levels has shown to seriously impact its performance on computers with high communication cost. Moreover, additive AMG variants provide not only increased parallelism as well as decreased numbers of messages per cycle but also generally exhibit slower convergence. Here we present various new additive variants with convergence rates that are significantly improved compared to the classical additive algebraic multigrid method and investigate their potential for decreased communication, and improved communication-computation overlap, features that are essential for goodmore » performance on future exascale architectures.« less

  7. Waardenburg syndrome: Novel mutations in a large Brazilian sample.

    PubMed

    Bocángel, Magnolia Astrid Pretell; Melo, Uirá Souto; Alves, Leandro Ucela; Pardono, Eliete; Lourenço, Naila Cristina Vilaça; Marcolino, Humberto Vicente Cezar; Otto, Paulo Alberto; Mingroni-Netto, Regina Célia

    2018-06-01

    This paper deals with the molecular investigation of Waardenburg syndrome (WS) in a sample of 49 clinically diagnosed probands (most from southeastern Brazil), 24 of them having the type 1 (WS1) variant (10 familial and 14 isolated cases) and 25 being affected by the type 2 (WS2) variant (five familial and 20 isolated cases). Sequential Sanger sequencing of all coding exons of PAX3, MITF, EDN3, EDNRB, SOX10 and SNAI2 genes, followed by CNV detection by MLPA of PAX3, MITF and SOX10 genes in selected cases revealed many novel pathogenic variants. Molecular screening, performed in all patients, revealed 19 causative variants (19/49 = 38.8%), six of them being large whole-exon deletions detected by MLPA, seven (four missense and three nonsense substitutions) resulting from single nucleotide substitutions (SNV), and six representing small indels. A pair of dizygotic affected female twins presented the c.430delC variant in SOX10, but the mutation, imputed to gonadal mosaicism, was not found in their unaffected parents. At least 10 novel causative mutations, described in this paper, were found in this Brazilian sample. Copy-number-variation detected by MLPA identified the causative mutation in 12.2% of our cases, corresponding to 31.6% of all causative mutations. In the majority of cases, the deletions were sporadic, since they were not present in the parents of isolated cases. Our results, as a whole, reinforce the fact that the screening of copy-number-variants by MLPA is a powerful tool to identify the molecular cause in WS patients. Copyright © 2018 Elsevier Masson SAS. All rights reserved.

  8. Prevalence of Pathogenic Copy Number Variation in Adults With Pediatric-Onset Epilepsy and Intellectual Disability.

    PubMed

    Borlot, Felippe; Regan, Brigid M; Bassett, Anne S; Stavropoulos, D James; Andrade, Danielle M

    2017-11-01

    Copy number variation (CNV) is an important cause of neuropsychiatric disorders. Little is known about the role of CNV in adults with epilepsy and intellectual disability. To evaluate the prevalence of pathogenic CNVs and identify possible candidate CNVs and genes in patients with epilepsy and intellectual disability. In this cross-sectional study, genome-wide microarray was used to evaluate a cohort of 143 adults with unexplained childhood-onset epilepsy and intellectual disability who were recruited from the Toronto Western Hospital epilepsy outpatient clinic from January 1, 2012, through December 31, 2014. The inclusion criteria were (1) pediatric seizure onset with ongoing seizure activity in adulthood, (2) intellectual disability of any degree, and (3) no structural brain abnormalities or metabolic conditions that could explain the seizures. DNA screening was performed using genome-wide microarray platforms. Pathogenicity of CNVs was assessed based on the American College of Medical Genetics guidelines. The Residual Variation Intolerance Score was used to evaluate genes within the identified CNVs that could play a role in each patient's phenotype. Of the 2335 patients, 143 probands were investigated (mean [SD] age, 24.6 [10.8] years; 69 male and 74 female). Twenty-three probands (16.1%) and 4 affected relatives (2.8%) (mean [SD] age, 24.1 [6.1] years; 11 male and 16 female) presented with pathogenic or likely pathogenic CNVs (0.08-18.9 Mb). Five of the 23 probands with positive results (21.7%) had more than 1 CNV reported. Parental testing revealed de novo CNVs in 11 (47.8%), with CNVs inherited from a parent in 4 probands (17.4%). Sixteen of 23 probands (69.6%) presented with previously cataloged human genetic disorders and/or defined CNV hot spots in epilepsy. Eight nonrecurrent rare CNVs that overlapped 1 or more genes associated with intellectual disability, autism, and/or epilepsy were identified: 2p16.1-p15 duplication, 6p25.3-p25.1 duplication, 8p23.3p

  9. The Clinical Next-Generation Sequencing Database: A Tool for the Unified Management of Clinical Information and Genetic Variants to Accelerate Variant Pathogenicity Classification.

    PubMed

    Nishio, Shin-Ya; Usami, Shin-Ichi

    2017-03-01

    Recent advances in next-generation sequencing (NGS) have given rise to new challenges due to the difficulties in variant pathogenicity interpretation and large dataset management, including many kinds of public population databases as well as public or commercial disease-specific databases. Here, we report a new database development tool, named the "Clinical NGS Database," for improving clinical NGS workflow through the unified management of variant information and clinical information. This database software offers a two-feature approach to variant pathogenicity classification. The first of these approaches is a phenotype similarity-based approach. This database allows the easy comparison of the detailed phenotype of each patient with the average phenotype of the same gene mutation at the variant or gene level. It is also possible to browse patients with the same gene mutation quickly. The other approach is a statistical approach to variant pathogenicity classification based on the use of the odds ratio for comparisons between the case and the control for each inheritance mode (families with apparently autosomal dominant inheritance vs. control, and families with apparently autosomal recessive inheritance vs. control). A number of case studies are also presented to illustrate the utility of this database. © 2016 The Authors. **Human Mutation published by Wiley Periodicals, Inc.

  10. HGVS Recommendations for the Description of Sequence Variants: 2016 Update.

    PubMed

    den Dunnen, Johan T; Dalgleish, Raymond; Maglott, Donna R; Hart, Reece K; Greenblatt, Marc S; McGowan-Jordan, Jean; Roux, Anne-Francoise; Smith, Timothy; Antonarakis, Stylianos E; Taschner, Peter E M

    2016-06-01

    The consistent and unambiguous description of sequence variants is essential to report and exchange information on the analysis of a genome. In particular, DNA diagnostics critically depends on accurate and standardized description and sharing of the variants detected. The sequence variant nomenclature system proposed in 2000 by the Human Genome Variation Society has been widely adopted and has developed into an internationally accepted standard. The recommendations are currently commissioned through a Sequence Variant Description Working Group (SVD-WG) operating under the auspices of three international organizations: the Human Genome Variation Society (HGVS), the Human Variome Project (HVP), and the Human Genome Organization (HUGO). Requests for modifications and extensions go through the SVD-WG following a standard procedure including a community consultation step. Version numbers are assigned to the nomenclature system to allow users to specify the version used in their variant descriptions. Here, we present the current recommendations, HGVS version 15.11, and briefly summarize the changes that were made since the 2000 publication. Most focus has been on removing inconsistencies and tightening definitions allowing automatic data processing. An extensive version of the recommendations is available online, at http://www.HGVS.org/varnomen. © 2016 WILEY PERIODICALS, INC.

  11. VariantBam: filtering and profiling of next-generational sequencing data using region-specific rules.

    PubMed

    Wala, Jeremiah; Zhang, Cheng-Zhong; Meyerson, Matthew; Beroukhim, Rameen

    2016-07-01

    We developed VariantBam, a C ++ read filtering and profiling tool for use with BAM, CRAM and SAM sequencing files. VariantBam provides a flexible framework for extracting sequencing reads or read-pairs that satisfy combinations of rules, defined by any number of genomic intervals or variant sites. We have implemented filters based on alignment data, sequence motifs, regional coverage and base quality. For example, VariantBam achieved a median size reduction ratio of 3.1:1 when applied to 10 lung cancer whole genome BAMs by removing large tags and selecting for only high-quality variant-supporting reads and reads matching a large dictionary of sequence motifs. Thus VariantBam enables efficient storage of sequencing data while preserving the most relevant information for downstream analysis. VariantBam and full documentation are available at github.com/jwalabroad/VariantBam rameen@broadinstitute.org Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  12. Amyotrophic lateral sclerosis onset is influenced by the burden of rare variants in known amyotrophic lateral sclerosis genes.

    PubMed

    Cady, Janet; Allred, Peggy; Bali, Taha; Pestronk, Alan; Goate, Alison; Miller, Timothy M; Mitra, Robi D; Ravits, John; Harms, Matthew B; Baloh, Robert H

    2015-01-01

    To define the genetic landscape of amyotrophic lateral sclerosis (ALS) and assess the contribution of possible oligogenic inheritance, we aimed to comprehensively sequence 17 known ALS genes in 391 ALS patients from the United States. Targeted pooled-sample sequencing was used to identify variants in 17 ALS genes. Fragment size analysis was used to define ATXN2 and C9ORF72 expansion sizes. Genotype-phenotype correlations were made with individual variants and total burden of variants. Rare variant associations for risk of ALS were investigated at both the single variant and gene level. A total of 64.3% of familial and 27.8% of sporadic subjects carried potentially pathogenic novel or rare coding variants identified by sequencing or an expanded repeat in C9ORF72 or ATXN2; 3.8% of subjects had variants in >1 ALS gene, and these individuals had disease onset 10 years earlier (p = 0.0046) than subjects with variants in a single gene. The number of potentially pathogenic coding variants did not influence disease duration or site of onset. Rare and potentially pathogenic variants in known ALS genes are present in >25% of apparently sporadic and 64% of familial patients, significantly higher than previous reports using less comprehensive sequencing approaches. A significant number of subjects carried variants in >1 gene, which influenced the age of symptom onset and supports oligogenic inheritance as relevant to disease pathogenesis. © 2014 American Neurological Association.

  13. Positional bias in variant calls against draft reference assemblies.

    PubMed

    Briskine, Roman V; Shimizu, Kentaro K

    2017-03-28

    Whole genome resequencing projects may implement variant calling using draft reference genomes assembled de novo from short-read libraries. Despite lower quality of such assemblies, they allowed researchers to extend a wide range of population genetic and genome-wide association analyses to non-model species. As the variant calling pipelines are complex and involve many software packages, it is important to understand inherent biases and limitations at each step of the analysis. In this article, we report a positional bias present in variant calling performed against draft reference assemblies constructed from de Bruijn or string overlap graphs. We assessed how frequently variants appeared at each position counted from ends of a contig or scaffold sequence, and discovered unexpectedly high number of variants at the positions related to the length of either k-mers or reads used for the assembly. We detected the bias in both publicly available draft assemblies from Assemblathon 2 competition as well as in the assemblies we generated from our simulated short-read data. Simulations confirmed that the bias causing variants are predominantly false positives induced by reads from spatially distant repeated sequences. The bias is particularly strong in contig assemblies. Scaffolding does not eliminate the bias but tends to mitigate it because of the changes in variants' relative positions and alterations in read alignments. The bias can be effectively reduced by filtering out the variants that reside in repetitive elements. Draft genome sequences generated by several popular assemblers appear to be susceptible to the positional bias potentially affecting many resequencing projects in non-model species. The bias is inherent to the assembly algorithms and arises from their particular handling of repeated sequences. It is recommended to reduce the bias by filtering especially if higher-quality genome assembly cannot be achieved. Our findings can help other researchers to

  14. Public variant databases: liability?

    PubMed

    Thorogood, Adrian; Cook-Deegan, Robert; Knoppers, Bartha Maria

    2017-07-01

    Public variant databases support the curation, clinical interpretation, and sharing of genomic data, thus reducing harmful errors or delays in diagnosis. As variant databases are increasingly relied on in the clinical context, there is concern that negligent variant interpretation will harm patients and attract liability. This article explores the evolving legal duties of laboratories, public variant databases, and physicians in clinical genomics and recommends a governance framework for databases to promote responsible data sharing.Genet Med advance online publication 15 December 2016.

  15. Decision Variants for the Automatic Determination of Optimal Feature Subset in RF-RFE.

    PubMed

    Chen, Qi; Meng, Zhaopeng; Liu, Xinyi; Jin, Qianguo; Su, Ran

    2018-06-15

    Feature selection, which identifies a set of most informative features from the original feature space, has been widely used to simplify the predictor. Recursive feature elimination (RFE), as one of the most popular feature selection approaches, is effective in data dimension reduction and efficiency increase. A ranking of features, as well as candidate subsets with the corresponding accuracy, is produced through RFE. The subset with highest accuracy (HA) or a preset number of features (PreNum) are often used as the final subset. However, this may lead to a large number of features being selected, or if there is no prior knowledge about this preset number, it is often ambiguous and subjective regarding final subset selection. A proper decision variant is in high demand to automatically determine the optimal subset. In this study, we conduct pioneering work to explore the decision variant after obtaining a list of candidate subsets from RFE. We provide a detailed analysis and comparison of several decision variants to automatically select the optimal feature subset. Random forest (RF)-recursive feature elimination (RF-RFE) algorithm and a voting strategy are introduced. We validated the variants on two totally different molecular biology datasets, one for a toxicogenomic study and the other one for protein sequence analysis. The study provides an automated way to determine the optimal feature subset when using RF-RFE.

  16. Antigen Loss Variants: Catching Hold of Escaping Foes.

    PubMed

    Vyas, Maulik; Müller, Rolf; Pogge von Strandmann, Elke

    2017-01-01

    Since mid-1990s, the field of cancer immunotherapy has seen steady growth and selected immunotherapies are now a routine and preferred therapeutic option of certain malignancies. Both active and passive cancer immunotherapies exploit the fact that tumor cells express specific antigens on the cell surface, thereby mounting an immune response specifically against malignant cells. It is well established that cancer cells typically lose surface antigens following natural or therapy-induced selective pressure and these antigen-loss variants are often the population that causes therapy-resistant relapse. CD19 and CD20 antigen loss in acute lymphocytic leukemia and chronic lymphocytic leukemia, respectively, and lineage switching in leukemia associated with mixed lineage leukemia (MLL) gene rearrangements are well-documented evidences in this regard. Although increasing number of novel immunotherapies are being developed, majority of these do not address the control of antigen loss variants. Here, we review the occurrence of antigen loss variants in leukemia and discuss the therapeutic strategies to tackle the same. We also present an approach of dual-targeting immunoligand effectively retargeting NK cells against antigen loss variants in MLL-associated leukemia. Novel immunotherapies simultaneously targeting more than one tumor antigen certainly hold promise to completely eradicate tumor and prevent therapy-resistant relapses.

  17. Public variant databases: liability?

    PubMed Central

    Thorogood, Adrian; Cook-Deegan, Robert; Knoppers, Bartha Maria

    2017-01-01

    Public variant databases support the curation, clinical interpretation, and sharing of genomic data, thus reducing harmful errors or delays in diagnosis. As variant databases are increasingly relied on in the clinical context, there is concern that negligent variant interpretation will harm patients and attract liability. This article explores the evolving legal duties of laboratories, public variant databases, and physicians in clinical genomics and recommends a governance framework for databases to promote responsible data sharing. Genet Med advance online publication 15 December 2016 PMID:27977006

  18. Impact of Parental Bos taurus and Bos indicus Origins on Copy Number Variation in Traditional Chinese Cattle Breeds.

    PubMed

    Zhang, Liangzhi; Jia, Shangang; Plath, Martin; Huang, Yongzhen; Li, Congjun; Lei, Chuzhao; Zhao, Xin; Chen, Hong

    2015-08-10

    Copy number variation (CNV) is an important component of genomic structural variation and plays a role not only in evolutionary diversification but also in domestication. Chinese cattle were derived from Bos taurus and Bos indicus, and several breeds presumably are of hybrid origin, but the evolution of CNV regions (CNVRs) has not yet been examined in this context. Here, we of CNVRs, mtDNA D-loop sequence variation, and Y-chromosomal single nucleotide polymorphisms to assess the impact of maternal and paternal B. taurus and B. indicus origins on the distribution of CNVRs in 24 Chinese domesticated bulls. We discovered 470 genome-wide CNVRs, only 72 of which were shared by all three Y-lineages (B. taurus: Y1, Y2; B. indicus: Y3), whereas 265 were shared by inferred taurine or indicine paternal lineages, and 228 when considering their maternal taurine or indicine origins. Phylogenetic analysis uncovered eight taurine/indicine hybrids, and principal component analysis on CNVs corroborated genomic exchange during hybridization. The distribution patterns of CNVRs tended to be lineage-specific, and correlation analysis revealed significant positive or negative co-occurrences of CNVRs across lineages. Our study suggests that CNVs in Chinese cattle partly result from selective breeding during domestication, but also from hybridization and introgression. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  19. Homozygous and hemizygous CNV detection from exome sequencing data in a Mendelian disease cohort.

    PubMed

    Gambin, Tomasz; Akdemir, Zeynep C; Yuan, Bo; Gu, Shen; Chiang, Theodore; Carvalho, Claudia M B; Shaw, Chad; Jhangiani, Shalini; Boone, Philip M; Eldomery, Mohammad K; Karaca, Ender; Bayram, Yavuz; Stray-Pedersen, Asbjørg; Muzny, Donna; Charng, Wu-Lin; Bahrambeigi, Vahid; Belmont, John W; Boerwinkle, Eric; Beaudet, Arthur L; Gibbs, Richard A; Lupski, James R

    2017-02-28

    We developed an algorithm, HMZDelFinder, that uses whole exome sequencing (WES) data to identify rare and intragenic homozygous and hemizygous (HMZ) deletions that may represent complete loss-of-function of the indicated gene. HMZDelFinder was applied to 4866 samples in the Baylor-Hopkins Center for Mendelian Genomics (BHCMG) cohort and detected 773 HMZ deletion calls (567 homozygous or 206 hemizygous) with an estimated sensitivity of 86.5% (82% for single-exonic and 88% for multi-exonic calls) and precision of 78% (53% single-exonic and 96% for multi-exonic calls). Out of 773 HMZDelFinder-detected deletion calls, 82 were subjected to array comparative genomic hybridization (aCGH) and/or breakpoint PCR and 64 were confirmed. These include 18 single-exon deletions out of which 8 were exclusively detected by HMZDelFinder and not by any of seven other CNV detection tools examined. Further investigation of the 64 validated deletion calls revealed at least 15 pathogenic HMZ deletions. Of those, 7 accounted for 17-50% of pathogenic CNVs in different disease cohorts where 7.1-11% of the molecular diagnosis solved rate was attributed to CNVs. In summary, we present an algorithm to detect rare, intragenic, single-exon deletion CNVs using WES data; this tool can be useful for disease gene discovery efforts and clinical WES analyses. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  20. Postnatally-transmitted HIV-1 Envelope variants have similar neutralization-sensitivity and function to that of nontransmitted breast milk variants

    PubMed Central

    2013-01-01

    Background Breastfeeding is a leading cause of infant HIV-1 infection in the developing world, yet only a minority of infants exposed to HIV-1 via breastfeeding become infected. As a genetic bottleneck severely restricts the number of postnatally-transmitted variants, genetic or phenotypic properties of the virus Envelope (Env) could be important for the establishment of infant infection. We examined the efficiency of virologic functions required for initiation of infection in the gastrointestinal tract and the neutralization sensitivity of HIV-1 Env variants isolated from milk of three postnatally-transmitting mothers (n=13 viruses), five clinically-matched nontransmitting mothers (n=16 viruses), and seven postnatally-infected infants (n = 7 postnatally-transmitted/founder (T/F) viruses). Results There was no difference in the efficiency of epithelial cell interactions between Env virus variants from the breast milk of transmitting and nontransmitting mothers. Moreover, there was similar efficiency of DC-mediated trans-infection, CCR5-usage, target cell fusion, and infectivity between HIV-1 Env-pseudoviruses from nontransmitting mothers and postnatal T/F viruses. Milk Env-pseudoviruses were generally sensitive to neutralization by autologous maternal plasma and resistant to breast milk neutralization. Infant T/F Env-pseudoviruses were equally sensitive to neutralization by broadly-neutralizing monoclonal and polyclonal antibodies as compared to nontransmitted breast milk Env variants. Conclusion Postnatally-T/F Env variants do not appear to possess a superior ability to interact with and cross a mucosal barrier or an exceptional resistance to neutralization that define their capability to initiate infection across the infant gastrointestinal tract in the setting of preexisting maternal antibodies. PMID:23305422